topic badge

9.05 Standard deviation

Lesson

Standard deviation is a measure of spread, which helps give us a meaningful estimate of the variability in a data set. While the quartiles gave us a measure of spread about the median, the standard deviation gives us a measure of spread with respect to the mean. It is a weighted average of the distance of each data point from the mean. A small standard deviation indicates that most scores are close to the mean, while a large standard deviation indicates that the scores are more spread out away from the mean value.

The symbol used for the standard deviation of a population is:

$\text{Population standard deviation}$Population standard deviation $=$= $\sigma$σ (lowercase sigma)

In statistics mode on a calculator, the following symbol might be used:

$\text{Population standard deviation}$Population standard deviation $=$= $\sigma_n$σn

Standard deviation is a very powerful way of comparing the spread of different data sets, particularly if there are different means and population numbers.

Standard deviation can be calculated using a formula. However, as this process is time consuming we will be using our calculator to find the standard deviation. 

Remember!

The three main measures of spread are:

  • Range–the size of the interval the data is spread over:

$\text{Range}=\text{Highest score}-\text{Lowest score}$Range=Highest scoreLowest score

The range is simple to calculate but only takes into account two values. The range is also significantly impacted by outliers.

  • Interquartile range–the range of the middle $50%$50% of data:

$IQR=Q_3-Q_1$IQR=Q3Q1

The interquartile range is relatively simple to calculate but only takes into account two values. It is not significantly affected by outliers.

  • Standard deviation–a weighted average of how far each piece of data varies from the mean:

The standard deviation is a more complex calculation but takes every data point into account. The standard deviation is significantly impacted by outliers.

For each measure of spread:

  • larger value indicates a wider spread (more variable) data set.
  • A smaller value indicates a more tightly packed (less variable) data set.

 

Practice questions

Question 1

The mean income of people in Country A is $\$19069$$19069. This is the same as the mean income of people in Country B. The standard deviation of Country A is greater than the standard deviation of Country B. In which country is there likely to be the greatest difference between the incomes of the rich and poor?

  1. Country A

    A

    Country B

    B

Question 2

Find the population standard deviation of the following set of scores, to two decimal places, by using the statistics mode on the calculator:

$8,20,9,9,8,19,9,18,5,10$8,20,9,9,8,19,9,18,5,10

Question 3

The scores of five diving attempts by a professional diver are recorded below.

$5.6,6.6,6.3,5.9,6.4$5.6,6.6,6.3,5.9,6.4

  1. Calculate the population standard deviation of the scores to two decimal places if necessary.

  2. On the sixth attempt, the diver scores $8.8$8.8. This score will:

    decrease the mean and decrease the population standard deviation

    A

    decrease the mean and increase the population standard deviation

    B

    increase the mean and increase the population standard deviation

    C

    increase the mean and decrease the population standard deviation

    D

Question 4

Two machines $A$A and $B$B are producing chocolate bars with the following mean and standard deviation for the weight of the bars.

Machine Mean (g) Standard deviation (g)
$A$A $52$52 $1.5$1.5
$B$B $56$56 $0.65$0.65
  1. What does a comparison of the mean of the two machines tell us?

    Machine $A$A produces chocolate bars with a more consistent weight.

    A

    Machine $B$B produces chocolate bars with a more consistent weight.

    B

    Machine $A$A generally produces heavier chocolate bars.

    C

    Machine $B$B generally produces heavier chocolate bars.

    D
  2. What does a comparison of the standard deviation of the two machines tell us?

    Machine $B$B generally produces heavier chocolate bars.

    A

    Machine $A$A generally produces heavier chocolate bars.

    B

    Machine $B$B produces chocolate bars with a more consistent weight.

    C

    Machine $A$A produces chocolate bars with a more consistent weight.

    D

Outcomes

2.1.12

interpret statistical measures of spread, such as: the range, interquartile range and standard deviation

What is Mathspace

About Mathspace