topic badge

9.08 Compare data sets

Worksheet
Compare data sets
1

The residents of two blocks of townhouses were asked the number of pets they own. The frequency of various responses are presented in the following dot plots:

a

Is the pet ownership a little lower or higher in Block A than Block B?

b

In Block A, how many pets do most households have?

c

In Block B, how many pets do most households have?

d

Describe the shape of the data for Block A.

e

Find the range of the number of pets in Block A.

f

Which block has more variability in the the number of pets?

g

Do either sets of scores have an outlier?

2

Consider the following graph:

a

Which city has the highest average daily sunshine hours?

b

Which city has the month with the least average sunshine hours?

c

Which city has the greatest variation in sunshine hours over the year?

3

Consider the following side-by-side column graphs which compare the perceived and actual number of immigrants for 9 countries:

a

Which country has the greatest difference between actual and perceived number of immigrants?

b

Which country has the smallest difference between actual and perceived number of immigrants?

c

Which data set (perceived or actual) has the biggest range?

4

Consider the column graph that shows the records of a stationery shop on the number of pens and notebooks sold in one week:

a

Find the number of pens sold on Tuesday.

b

Find the number of notebooks sold on Friday.

c

Find the percentage of pens sold on Thursday correct to two decimal places.

d

Find the total number of pens and notebooks sold during the week.

e

Find the average average number of pens and notebooks sold per day.

f

Which is the better selling product?

5

Consider the column graph that shows the number of blood donations per month in a given year:

a

Which state had the most donations?

b

Both states show a period with a lower number of donations due to cold and flu symptoms preventing donors being eligible. Which period is this?

A

Autumn months

B

Summer months

C

Spring months

D

Winter months

c

Which month had the highest monthly donations?

d

If the total donated over a year across all states is 763\,542 units and 2\% is used for trauma and road accidents, how many units of blood are required in a year for these incidents?

6

Consider the column graph that shows the distribution of blood types in Australia, Egypt and the world, as of 2019:

a

Which blood type is most common?

b

Which blood type is the rarest?

c

In which of the blood types does Australia have a significant proportion less than the general world population?

d

Consider the blood type distributions given below:

O+A+B+ABO-A-B-AB-
Australia40\%31\%8\%2\%9\%7\%2\%1\%
Egypt52\%24\%12.4\%3.8\%5\%2\%0.6\%0.2\%

Find the highest percentage difference between the proportion of a particular blood type between Egypt and Australia.

e

If, at the time that this data was collected, Australia had a population of 24\,642\,693 and Egypt had a population of 95\,220\,838, find the difference in the number of people with the blood type in part (d). Round your answer to the nearest whole number.

7

Consider the back-to-back column graph below which compares the ages of male and female students in a primary school:

a

Which distribution is positively skewed?

b
Which distribution has the highest mode?
c

Which distribution has the highest mean?

8

Consider the back-to-back column graph below which compares the unemployment rates of Texas and California over a 30-year period:

a
Which state had the highest range of unemployment rates?
b

Which state had rates most strongly skewed in a positive direction?

c

Which state had the highest modal rate?

d

Which state has a more symmetrical distribution of unemployment rates?

9

Consider the back-to-back column graphs below which compare the grades scored by females and males on a mathematics examination:

a

Which gender had the biggest range in scores?

b

Which gender was bi-modal?

c

Which gender had more students?

d

Which gender had more symmetrical results?

e

Which gender was more strongly skewed positively?

10

Consider the back-to-back histograms which compares the heights of female and male standard poodles:

a

Which data set is bi-modal?

b

Which data set has the highest range?

c

Which data set has the highest mean?

d

Which set is more positively skewed?

11

Consider the two histograms below which show the grade distributions in two university courses:

a

Which distribution has a clear mode?

b

Which distribution would be described as “uniform”?

c

Which distribution has the highest range?

d

Compare the mean and median of Section 1.

e

Which distribution would have the highest standard deviation?

12
Consider the back-to-back histograms below which compare the populations of males and females in Australia in 2017:
a
Is the data for both genders skewed negatively or positively?
b

State the modal class for males.

c

State the modal class for females.

d

Which gender has the highest numbers over 80 years old?

13

Two Science classes, each with 20 students, were given a 10-question True/False test. The results for each class are shown below:

a

Do you think Class 1 studied for their test? Justify your answer.

b

Do you think Class 2 studied for their test? Justify your answer.

c

Which statistic is the same for each class?

Back-to-back stem plots
14

The weight (in kilograms) of two groups, A and B, were recorded in a stem plot as shown:

a

Find the mean weight of Group A.

b

Find the mean weight of Group B.

c

Which group contains individuals that are generally heavier?

d

Calculate the population standard deviation for Group A correct to one decimal place.

e

Calculate the sample standard deviation for Group B correct to one decimal place.

f

Which group had more consistent weights?

Group AGroup B
50\ 1\ 1\ 2\ 3
7\ 6\ 5\ 3\ 060\ 0\ 2\ 3
2\ 2\ 2\ 1\ 070

Key: 1|8|3 = 81 \text{ and } 83

15

Ten participants had their pulse measured before and after exercise with results shown in the stem-and-leaf plot:

a

What is the mode pulse rate after exercise?

b

Find the range of pulse rates before exercise.

c

Find the range of pulse rates after exercise.

d

Find the mean pulse rate before exercise.

e

Find the mean pulse rate after exercise.

f

Hence, what can you conclude about exercise and pulse rate from the measures of centre and spread?

Pulse Rate Before ExercisePulse Rate After Exercise
5\ 5\ 05
9\ 9\ 7\ 46
4\ 37
084
95\ 7\ 8
103
113\ 5\ 5
120\ 1
\text{ Key: } 6|1|2 =12 \text{ and } 16
16

The following stem-and-leaf plot shows the length (in minutes) of a random sample of phone calls made by Elizabeth and Kathleen:

a

Who made a 13-minute phone call?

b

Is Elizabeth’s median higher than Kathleen’s median?

c

Is Elizabeth's mean greater than her median?

ElizabethKathleen
310\ 7
5\ 3\ 2\ 2\ 2\ 023\ 8\ 8
931\ 1
8\ 547
6\ 252\ 4\ 8

Key: 2 \vert 2 \vert 6 = 22 \text{ and } 26

17

The back-to-back stem-and-leaf plot shows the number of desserts ordered at Hotel A and Hotel B over several randomly chosen days:

a

Interpret the lowest score for Hotel A.

b

Which hotel's median is higher?

c

Is the mean greater than the median in both groups?

Hotel AHotel B
30
4\ 3\ 213\ 4
7\ 627
4\ 333\ 4
646\ 7
252\ 3\ 4

Key: 2 \vert 1 \vert 3 = 12 \text{ and }13

18

The back-to-back stem plot shows the total number of pieces of paper used over several days in two classes:

Determine whether the following statements are true or false:

a

Roald's students did not use 7 pieces of paper on any day.

b

James's median is higher than Roald’s median.

c

The median is greater than the mean in both groups.

Roald's classJames's class
707
311\ 2\ 3
828
4\ 332\ 3\ 4
7\ 6\ 549
3\ 252

\text{Key: } 2 \vert 6 \vert 0 = 62 \text{ and }60

19

The stem-and-leaf plot shows the batting scores of two cricket teams, Team A and Team B:

a

Find the median score of Team A and the median for Team B.

b

Find the range of scores for Team A and the range for Team B.

c

Find the interquartile range for Team A and also for Team B.

d

Find the population standard deviation for Team A and also for Team B, to two decimal places.

e

Which team had more varied scores?

Team ATeam B
7\ 6\ 262\ 6\ 8
8\ 6\ 5\ 2\ 271\ 5\ 7
8\ 481\ 4\ 7\ 9
94\ 7

Key: 6|1|2 = 12 \text{ and } 16

20

The Cancer Council surveyed 60 random people, asking them approximately how many hours they spent in the sun in the last month. The responders were split up into two groups, tourists and local residents and the results are shown below:

a

What is the median number of hours that each group spent in the sun?

b

If the two groups were combined, what would be the median number of hours spent in the sun?

c

If the two groups were combined, what would be the range of responses?

TouristsLocals
9\ 8\ 7\ 7\ 6\ 5\ 4\ 4\ 3\ 1\ 112\ 4\ 4\ 5\ 5\ 5\ 6\ 8\ 9
9\ 5\ 1\ 022\ 3\ 5\ 9
9\ 7\ 6\ 130\ 2\ 5
7\ 6\ 6\ 5\ 4\ 3\ 140\ 0\ 2\ 3\ 6\ 6\ 7\ 8
9\ 6\ 3\ 052\ 2\ 3\ 5\ 8\ 9

\text{Key: }6 \vert 1 \vert 2 = 12 \text{ and } 16

21

The stem-and-leaf plot shows the batting scores of two cricket teams, A and B:

a

What is the highest score in Team A?

b

What is the highest score in Team B?

c

Find the mean score of Team A.

AB
5\ 232\ 3\ 5\ 7\ 9
9\ 8\ 5\ 4\ 2\ 142\ 9
8\ 253\ 6
64

Key: 6 | 1 | 2 = 12 \text{ and } 16

22

The data below shows the results of a survey conducted on the price of concert tickets locally and the price of the same concerts at an international venue:

a

What was the most expensive ticket price at the international venue?

b

What was the median ticket price at the international venue?

c

What percentage of local ticket prices were cheaper than the international median?

d

At the international venue, what percentage of tickets cost between \$90 and \$110 (inclusive)?

e

At the local venue, what percentage of tickets cost between \$90 and \$100 (inclusive)?

LocalInternational
7\ 5\ 2\ 260\ 5
9\ 6\ 5\ 4\ 072\ 3\ 8\ 8
9\ 6\ 5\ 3\ 082\ 3\ 7\ 8
8\ 7\ 4\ 3\ 190\ 1\ 6\ 7\ 9
5100\ 2\ 3\ 5\ 8

Key: 6|1|2 = \$16 \text{ and }\$12

23

The stem-and-leaf plot shows the test scores of two Year 11 classes, A and B:

a

Find the highest score in Class A.

b

Find the highest score in Class B.

c

Find the mean score of Class A, to two decimal places.

d

Find the mean score of Class B, to two decimal places.

e

Calculate the overall mean of all of the Year 11 students, to two decimal places.

Class AClass B
8\ 3\ 062\ 4\ 6
9\ 7\ 6\ 3\ 173\ 5\ 8
8\ 281\ 3\ 6\ 8
92\ 5

Key: 6 | 1 | 2 = 12 \text{ and } 16

24

The data below shows the results of a survey conducted on the price of concert tickets locally and the price of the same concert at an international venue:

a

Find the five number summary for the price of concert tickets at local venues.

b

Find the five number summary for the price of concert tickets at international venues.

c

Draw parallel box plots for this data.

LocalInternational
7\ 6\ 3\ 061\ 8
8\ 6\ 4\ 3\ 273\ 5\ 5\ 9
9\ 6\ 5\ 1\ 181\ 5\ 7\ 9
8\ 7\ 5\ 2\ 091\ 3\ 4\ 6\ 8
1101\ 2\ 4\ 7\ 8

Key: 2 \vert 6 \vert 0 = 62 \text{ and }60

25

The data below represents how long each student in two different classes could hold their breath for, measured to the nearest second:

  • Mrs Nguyen's class: \, 55,\, 59,\, 61,\, 66,\, 71,\, 75,\, 80,\, 89,\, 91,\, 95,\, 101,\, 103,\, 103,\, 109,\, 111

  • Miss Humphreys's class: \, 51,\, 66,\, 67,\, 68,\, 77,\, 78,\, 79,\, 81,\, 83,\, 85,\, 85,\, 86,\, 92,\, 101,\, 110

a

Display the data in a back-to-back stem plot.

b

Who is the teacher of the student who can hold their breath the longest?

c

If you want to determine which class, in general, has the stronger breath hold capacity, which measure would be most appropriate to use?

d

If you want to determine which class, in general, has the more consistent breath hold capacity, which measure would be most appropriate to use?

26

Two friends have been growing sunflowers. They have measured the height of their sunflowers to the nearest centimetre, with their results shown below:

  • Tricia: \, 39,\, 18,\, 14,\, 44,\, 37,\, 18,\, 23,\, 28

  • Quentin: \, 49,\, 25,\, 42,\, 5,\, 47,\, 12,\, 15,\, 8,\, 35,\, 22,\, 28,\, 6,\, 21

a

Display the data in a back-to-back stem plot.

b

Find the median height of Tricia's sunflowers.

c

Find the median height of Quentin's sunflowers.

d

Find the mean height of Tricia's sunflowers.

e

Find the mean height of Quentin's sunflowers. Round your answer to two decimal places.

f

Which friend generally grows taller plants?

Sign up to access Worksheet
Get full access to our content with a Mathspace account

Outcomes

2.1.14

compare back to back stem plots for different data sets

2.1.17

compare the characteristics of the shape of histograms using symmetry, skewness and bimodality

What is Mathspace

About Mathspace