topic badge

11.04 Standard deviation

Lesson

Standard deviation

Standard deviation is a measure of spread, which helps give us a meaningful estimate of the variability in a data set. It is a weighted average of the distance of each data point from the mean. A small standard deviation indicates that most scores are close to the mean, while a large standard deviation indicates that the scores are more spread out away from the mean value.

Larger standard deviation - more spread out:

A histogram showing a spread out data set. Ask your teacher for more information.

Smaller standard deviation - closer to the mean:

A histogram showing a data set where most values are close to the mean. Ask your teacher for more information.

The symbol used for the standard deviation of a population is:

\displaystyle \text{Standard deviation}\displaystyle =\displaystyle \sigma(pronounced sigma)

In statistics mode on a calculator, the symbol \sigma_{n} or \sigma_{x} may be used.

Standard deviation can be calculated using a formula. However, as this process is time consuming we will be using our calculator to find the standard deviation.

Standard deviation is a weighted average of how far each piece of data varies from the mean.

The standard deviation is a more complex calculation but takes every data point into account. The standard deviation is significantly impacted by outliers.

There is a second type of standard deviation for if you are working with a sample and not a population. This is the sample standard deviation, with the symbol s_{x}. This will normally have a value very close to \sigma_{x}. In this course, when standard deviation is mentioned it will only be referring to population standard deviation and represented by \sigma_{x}. Be careful when using your calculator or online tools that you are finding the population standard deviation and not the sample standard deviation.

Examples

Example 1

The test results for four Geography classes are shown. The classes are labelled as \text{W},\,\text{X},\,\text{Y}, and \text{Z}.

The image shows four histograms for  classes W, X, Y, and Z. Ask your teacher for more information.

Select the the option that correctly lists the classes in order from largest standard deviation to smallest standard deviation.

A
Y,\,W,\,Z,\,X
B
W,\,X,\,Y,\,Z
C
X,\,W,\,Z,\,Y
D
X,\,Z,\,W,\,Y
Worked Solution
Create a strategy

Compare how spread out the data is for each class.

Apply the idea

Class Y has all the data on the mean, so it will have the lowest standard deviation.

Class W has most of the data near the mean, so it will have the second lowest standard deviation.

Class X has all its data away from the mean so it will have the largest standard deviation.

So the order is: X,\,Z,\,W,\,Y or option D.

Reflect and check

Using a calculator, we can compute the standard deviation of each graph given the score and its frequency.

Example 2

The number of runs scored by Mario in each of his innings is listed below. 33,\,32,\,32,\,32,\,31,\,32,\,32,\,32,\,32,\,32

What was his standard deviation? Round your answer to two decimal places.

Worked Solution
Create a strategy

Use the population deviation function, \sigma_{x} on your calculator.

Apply the idea

Using Statistics mode enter each data point into your calculator then press the \sigma_{x} button. \sigma_{x} \approx 0.45

Example 3

Use technology to determine the standard deviation for the data represented by the frequency table.

ScoreFrequency
1513
169
1723
1819
198
2013

Round your answer to two decimal places.

Worked Solution
Create a strategy

Use the population deviation function, \sigma_{x} on your calculator.

Apply the idea

Using Statistics mode enter each score along with its frequency into a data table on your calculator. Then find the standard deviation.\sigma_{x} \approx 1.58

Example 4

Use technology to determine the standard deviation for the data represented by the grouped frequency table, using the class centres.

ClassClass centreFrequency
40 \leq x \lt 4542.54
45 \leq x \lt 5047.511
50 \leq x \lt 5552.516
55 \leq x \lt 6057.517
60 \leq x \lt 6562.57
65 \leq x \lt 7067.512
70 \leq x \lt 7572.511
75 \leq x \lt 8077.55

Round your answer to two decimal places.

Worked Solution
Create a strategy

Enter the class centres as the scores and frequencies in your calculator to find the standard deviation.

Apply the idea

Since we are given grouped data, we can only get an estimate of the standard deviation since we are using the class centres rather than the actual scores.

Using Statistics mode we can enter the class centres as the values alongside the frequencies, and then find the standard deviation.

\sigma_{x} \approx 9.76

Idea summary

Standard deviation is a weighted average of how far each piece of data varies from the mean. It takes every data point into account and is significantly impacted by outliers.

  • A larger value indicates a wider spread (more variable) data set.

  • A smaller value indicates a more tightly packed (less variable) data set.

For this course you should always use the \sigma_{x} value for standard deviation.

Outcomes

VCMSP372 (10a)

Calculate and interpret the mean and standard deviation of data and use these to compare data sets. Investigate the effect of individual data values including outliers, on the standard deviation

What is Mathspace

About Mathspace