Univariate Data

Hong Kong

Stage 4 - Stage 5

Lesson

Standard deviation is a measure of spread, which helps give us a meaningful estimate of the variability in a data set. A small standard deviation means most scores are close to the mean. Conversely, a large standard deviation means the scores are very spread out.

The standard deviation is found by calculating the square root of the variance.

Variance is the average of the squared differences from the mean. Here is its formula.

$\sigma^2=\frac{1}{n}\Sigma\left(x_i-\mu\right)^2$`σ`2=1`n`Σ(`x``i`−`μ`)2

The following are the steps required. It is clear that for data sets of only moderate size, the amount of calculation needed is quite large and this makes calculations time-consuming and error-prone. For this reason, modern statistics depends heavily on automation by computer software and by hand-held calculators.

This is the formula by which a calculator calculates the standard deviation of a data set from a full population. That is, it is the formula used for *census* data rather than *sample data.*

$\sigma=\sqrt{\frac{1}{n}\Sigma\left(x_i-\mu\right)^2}$`σ`=√1`n`Σ(`x``i`−`μ`)2

In this formula, the numbers $x_i$`x``i` are the values in the data set. There is one value for each subscript $i$`i`.

There are $n$`n` numbers $x_i$`x``i` in the data set. So, $i$`i` goes from $1$1 to $n$`n` in the summation.

The symbol $\mu$`μ` (Greek letter 'mu') is the population mean.

The Greek letter $\sigma$`σ` (sigma) is used for the population standard deviation.

The symbol $\Sigma$Σ (upper case sigma) is the summation symbol.

- Calculate the mean. $\mu=\frac{1}{n}\Sigma_{i=1}^n\ x_i$
`μ`=1`n`Σ`n``i`=1`x``i` - Find the difference from the mean for each score. $x_i-\mu$
`x``i`−`μ` - Square each of the differences. $\left(x_i-\mu\right)^2$(
`x``i`−`μ`)2 - Sum the squared differences. $\Sigma\left(x_i-\mu\right)^2$Σ(
`x``i`−`μ`)2 - Divide the sum by the number of scores. $\frac{1}{n}\Sigma\left(x_i-\mu\right)^2$1
`n`Σ(`x``i`−`μ`)2 - Take the square root. $\sigma=\sqrt{\frac{1}{n}\Sigma\left(x_i-\mu\right)^2}$
`σ`=√1`n`Σ(`x``i`−`μ`)2

Find the following based on this set of scores:

$19,18,14,19,10$19,18,14,19,10

Find the mean.

Complete the following table.

Score($x$ `x`)$(x-$( `x`−mean$)$)$(x-$( `x`−mean$)^2$)2$19$19 $\editable{}$ $\editable{}$ $18$18 $\editable{}$ $\editable{}$ $14$14 $\editable{}$ $\editable{}$ $19$19 $\editable{}$ $\editable{}$ $10$10 $\editable{}$ $\editable{}$ Thus, find the standard deviation, correct to 2 decimal places.

Find the range of the set of scores.

The mean income of people in Country A is $\$19069$$19069. This is the same as the mean income of people in Country B. The standard deviation of Country A is greater than the standard deviation of Country B. In which country is there likely to be the greatest difference between the incomes of the rich and poor?

Country A

ACountry B

B