topic badge

11.05 Comparisons using standard deviation

Lesson

Comparisons using standard deviation

Standard deviation (\sigma) is a measure of spread, which helps give us a meaningful estimate of the variability in a data set. It is a weighted average of the distance of each data point from the mean. A small standard deviation indicates that most scores are close to the mean, while a large standard deviation indicates that the scores are more spread out away from the mean value.

Note that the mean will determine where the scores are clustered, while the standard deviation tells us how tightly they are clustered. The two sets of data above have a similar mean but different standard deviations.

When comparing data sets, to find which data set had higher scores, we would choose the set with the higher mean. To find the set with the more consistent scores, we would choose the set with the lower standard deviation.

Examples

Example 1

Two machines A and B are producing chocolate bars with the following mean and standard deviation for the weight of the bars.

MachineMean (g) Standard deviation (g)
\text{A}521.5
\text{B}560.65
a

What does a comparison of the mean of the two machines tell us?

Worked Solution
Create a strategy

Compare the means from the table.

Apply the idea

We can see that Machine B has a greater mean weight than Machine A. This means that Machine B generally produces heavier chocolate bars.

b

What does a comparison of the standard deviation of the two machines tell us?

Worked Solution
Create a strategy

Compare the standard deviations from the table.

Apply the idea

Machine B has a smaller standard deviation than Machine A. This means that Machine B produces chocolate bars with a more consistent weight.

Example 2

Han, a cricketer, has made scores of 52,\,20,\,68,\,70, and 150 in all his innings this season. In his next innings, he scores no runs.

a

What is the change in his season batting average before and after the sixth inning?

Worked Solution
Create a strategy

Subtract the old batting mean from the new batting mean.

To find the mean, we use the formula: \text{Mean}=\dfrac{\text{Sum of the scores}}{\text{Number of scores}}

Apply the idea

The old list is: 52,\,20,\,68,\,70,\,150.

The new list is: 52,\,20,\,68,\,70,\,150,\,0.

\displaystyle \text{Old mean}\displaystyle =\displaystyle \dfrac{52+20+68+70+150}{5}Find the mean for 5 innings
\displaystyle =\displaystyle 72Evaluate
\displaystyle \text{New mean}\displaystyle =\displaystyle \dfrac{52+20+68+70+150+0}{6}Find the mean for 6 innings
\displaystyle =\displaystyle 60Evaluate
\displaystyle \text{Change in mean}\displaystyle =\displaystyle 60 - 72Subtract the means
\displaystyle =\displaystyle -12Evaluate

This means that the batting average dropped by 12.

b

What is the change in his standard deviation before and after his sixth innings? Give your answer correct to two decimal places.

Worked Solution
Create a strategy

Subtract the old standard deviation from the new standard deviation.

Apply the idea

We use the same lists from part (a) to find each standard deviation.

\displaystyle \text{Change in } \sigma\displaystyle =\displaystyle \sigma_{\text{new}}-\sigma_{\text{old}}Subtract the standard deviations
\displaystyle =\displaystyle 47.48-42.91Calculate the standard deviations
\displaystyle =\displaystyle 4.57Evaluate the subtraction
Idea summary

When comparing data sets, to find which data set had higher scores, we would choose the set with the higher mean. To find the set with the more consistent scores, we would choose the set with the lower standard deviation.

Outcomes

VCMSP372 (10a)

Calculate and interpret the mean and standard deviation of data and use these to compare data sets. Investigate the effect of individual data values including outliers, on the standard deviation

What is Mathspace

About Mathspace