We have seen a useful way to compare numerical data sets is parallel box plots as these compare the two sets on the same scale. We can create a similar comparison by creating a back-to-back stem plot. Comparing the shape of histograms also provides a visual way to compare two or more sets of data.
The advantage of using a back-to-back stem plot to compare two sets of data is that the original data is retained and we can calculate the mean, mode and other statistics exactly. A disadvantage is that it is only suitable for small to medium data sets and we can only compare two sets of data at a time.
Two sets of data can be displayed side-by-side using a back-to-back stem plot. In the example below, the pulse rates of $18$18 students were recorded before and after exercise.
Reading a back-to-back stem plot is very similar to reading a regular stem plot.
Referring to the example above:
To create a stem plot, it is usually easier to arrange all of the data values in ascending order, before ordering them in the plot.
10 participants had their pulse measured before and after exercise with results shown in the stem and leaf plot below.
Key: | 6 | 1 | 2 | $=$= | 12 and 16 |
What is the mode pulse rate after exercise?
$\editable{}$
What is the range of pulse rates before exercise?
What is the range of pulse rates after exercise?
Calculate the mean pulse rate before exercise.
What is the mean pulse rate after exercise?
What can you conclude from the measures of centre and spread that you have just calculated?
The mode pulse rate is the best comparison of pulse rates before and after exercise.
The range of pulse rates decreases after exercise.
The range of pulse rates increasing after exercise shows that some people are fitter than others.
The range of pulse rates and the mean pulse rate increase after exercise.
The data below shows the results of a survey conducted on the price of concert tickets locally and the price of the same concerts at an international venue.
Local | International | ||||||||||||||||||||||||||||||
|
|
Key: $1$1$\mid$∣$2$2$=$=$12$12 |
What was the most expensive ticket price at the international venue?
$\editable{}$ dollars
What was the median ticket price at the international venue? Leave your answer to two decimal places if needed.
What percentage of local ticket prices were cheaper than the international median?
At the international venue, what percentage of tickets cost between $\$90$$90 and $\$110$$110 (inclusive)?
At the local venue, what percentage of tickets cost between $\$90$$90 and $\$100$$100 (inclusive)?
Two friends have been growing sunflowers. They have measured the height of their sunflowers to the nearest cm, with their results shown below:
Quentin$=$=$39,18,14,44,37,18,23,28$39,18,14,44,37,18,23,28
Tricia$=$=$49,25,42,5,47,12,15,8,35,22,28,6,21$49,25,42,5,47,12,15,8,35,22,28,6,21
Display the data on the stem-and-leaf plot.
Quentin | Tricia | |
0 | $\editable{}$ $\editable{}$ $\editable{}$ | |
$\editable{}$ $\editable{}$ $\editable{}$ | 1 | $\editable{}$ $\editable{}$ |
$\editable{}$ $\editable{}$ | 2 | $\editable{}$ $\editable{}$ $\editable{}$ $\editable{}$ |
$\editable{}$ $\editable{}$ | 3 | $\editable{}$ |
$\editable{}$ | 4 | $\editable{}$ $\editable{}$ $\editable{}$ |
What is the median length of Tricia's sunflowers?
What is the median length of Quentin's sunflowers?
Which of these statements is true?
Quentin's flowers have a higher median length and larger range of lengths, which shows that Quentin has taller flowers overall.
Tricia's flowers have a higher median length and larger range of lengths, which shows that Tricia has taller flowers overall.
Tricia's flowers have a higher median length and smaller range of lengths, which shows that Tricia has taller flowers overall.
Quentin's flowers have a higher median length and smaller range of lengths, which shows that Quentin has taller flowers overall.
Histograms and column graphs can be compared in a number of ways. The columns can be drawn side by side or the entire graphs can be drawn back to back as shown in the diagrams below. When comparing histograms we often comment on differences in features such as location, spread, skewness and modality.
Side by side column graph | Back to back histogram |
---|---|
Two Science classes, each with $20$20 students, were given a $10$10 question True/False test. The results for each class are shown below.
Do you think Class 1 studied for their test?
Yes
No
Which statistical piece of evidence supports your answer?
The mean.
The range.
The positive skew of the graph.
Do you think Class 2 studied for their test?
Yes
No
Which statistical piece of evidence supports your answer?
The mean.
The range.
The graph is negatively skewed.
Which statistic is the same for each class?
The mode.
The range.
Calculate the mean for class 1 correct to one decimal place:
Calculate the mean for class 2 correct to one decimal place.
Consider the column graph that shows the number of blood donations per month in a given year.
Which state had the most donations?
NSW
VIC
Both states show a period with a lower number of donations due to cold and flu symptoms preventing donors being eligible. Which period is this?
Autumn months
Summer months
Spring months
Winter months
Which month had the highest monthly donations?
May
January
November
April
If the total donated over a year across all states is $763542$763542 units and $2%$2% is used for trauma and road accidents, how many units of blood is required in a year for these incidents?
Which distribution is positively skewed?
Which distribution has the highest mean?