
3.05 Outliers

Worksheet
Identify outliers
1

Identify any outliers in each of the following data sets:

a
73,\, 77,\, 81,\, 86,\, 131
b
7,\, 25,\, 28,\, 35,\, 42
c
69,\, 79,\, 86,\, 72,\, 86,\, 77,\, 73,\, 82,\, 81,\, 76,\, 83,\, 47,\, 87,\, 70,\, 80,\, 85
d
58,\, 63,\, 58,\, 59,\, 64,\, 68,\, 68,\, 30,\, 73,\, 25,\, 72,\, 61,\, 65,\, 69,\, 75,\, 72
e
Stem | Leaf
1 | 2
2 | 7 7 9 9
3 | 1 3 3 3 3 5 8
4 | 4 4 5

Key: 5 | 2 = 52 hours

f
2

For each of the following data sets, calculate:

i

The interquartile range.

ii

The value of the lower fence.

iii

The value of the upper fence.

a
Minimum | 5
Q1 | 6
Median | 12
Q3 | 17
Maximum | 28
b
[Figure not reproduced: plot drawn on a number line marked from 2 to 18 in steps of 2]
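
The three calculations in this question can be checked with a short script. This is a minimal sketch, assuming the usual 1.5 \times IQR rule (lower fence = Q1 - 1.5 \times IQR, upper fence = Q3 + 1.5 \times IQR). The function name `fences` and the sample quartiles are illustrative and are not taken from this worksheet.

```python
def fences(q1, q3, k=1.5):
    """Return (iqr, lower_fence, upper_fence) under the k * IQR rule.

    Assumption: k = 1.5, the multiplier used later in this worksheet.
    """
    iqr = q3 - q1
    return iqr, q1 - k * iqr, q3 + k * iqr


# Hypothetical quartiles, chosen only for illustration.
iqr, lower, upper = fences(10, 20)
print("IQR:", iqr)
print("Lower fence:", lower)
print("Upper fence:", upper)
```

Any value below the lower fence or above the upper fence would then be treated as an outlier.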
3

Consider the given dot plot:

a

Find the:

i

Median

ii

Lower quartile

iii

Upper quartile

iv

Interquartile range

v

Value of the lower fence

vi

Value of the upper fence

b

Identify any outliers.

4

For each of the following sets of data:

i

Construct the five-number summary.

ii

Calculate the interquartile range.

iii

Calculate the value of the lower fence.

iv

Calculate the value of the upper fence.

v

Would the value -5 be considered an outlier?

vi

Would the value 16 be considered an outlier?

a

9,\, 5,\, 3,\, 2,\, 6,\, 1

b

3,\, 10,\, 9,\, 2,\, 7,\, 5,\, 6

c

12,\, 5,\, 11,\, 1,\, 9,\, 8,\, 5,\, 6
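
Parts i to vi follow one fixed procedure, which can be sketched in Python. This is a rough sketch under stated assumptions: the quartiles are taken as the medians of the lower and upper halves of the sorted data, leaving the middle value out of both halves when the count is odd. Other quartile conventions exist and can give slightly different fences. The helper names and the sample data are hypothetical and are not one of the sets above.

```python
from statistics import median


def five_number_summary(data):
    """Return (minimum, Q1, median, Q3, maximum).

    Assumption: Q1 and Q3 are the medians of the lower and upper halves,
    with the middle value excluded from both halves when the count is odd.
    """
    values = sorted(data)
    n = len(values)
    half = n // 2
    lower_half = values[:half]
    upper_half = values[half + 1:] if n % 2 else values[half:]
    return values[0], median(lower_half), median(values), median(upper_half), values[-1]


def is_outlier(value, data, k=1.5):
    """True if value lies strictly outside the k * IQR fences of data."""
    _, q1, _, q3, _ = five_number_summary(data)
    iqr = q3 - q1
    return value < q1 - k * iqr or value > q3 + k * iqr


# Hypothetical data set, not one of the sets in this worksheet.
sample = [4, 7, 8, 10, 11, 13, 15]
print(five_number_summary(sample))
print(is_outlier(30, sample))
print(is_outlier(20, sample))
```

The candidate value is tested against fences built from the original data only, which matches how parts v and vi are worded.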

5

For each of the following sets of data:

i

Construct the five-number summary.

ii

Would the value -3 be considered an outlier?

iii

Would the value 15 be considered an outlier?

a

1,\, 4,\, 8,\, 10,\, 6,\, 2,\, 5

b

9,\, 4,\, 6,\, 11,\, 10,\, 8,\, 10

6

A group of Year 12 students were asked how many hours they spend on Hashtagram per day. The results are given below:

1.9,\, 1.1,\, 2.4,\, 2.3,\, 2.1,\, 1.2,\, 1.3,\, 1.6,\, 1.5,\, 1.8

a

Construct the five-number summary.

b

Another girl, Naylaa, spends 3.6 hours per day using Hashtagram. If her score was added to this group, would it be considered an outlier?

7

The heights (in metres) of certain karri trees, which grow in the south west of Australia, are shown below:

74, \, 77, \, 76, \, 81, \, 71, \, 72, \, 78, \, 75, \, 73, \, 84, \, 79

a

Construct the five-number summary.

b

A tree is measured to be 66 \text{ m} tall. Would this tree be considered an outlier?

8

The data point 5 is below the lower fence and is considered an outlier. The interquartile range is 12.

Find the smallest integer value the lower quartile can be.

9

The data point 37 is above the upper fence and is considered an outlier. The interquartile range is 10.

Find the largest integer value the upper quartile can be.

10

\text{VO}_{2} Max is a measure of how efficiently your body uses oxygen during exercise. The more physically fit you are, the higher your \text{VO}_{2} Max.

Here are some people’s results when their \text{VO}_{2} Max was measured:

46,\, 27,\, 32,\, 46,\, 30,\, 25,\, 41,\, 24,\, 26,\, 29,\, 21,\, 21,\, 26,\, 47,\, 21,\, 30,\, 41,\, 26,\, 28,\, 26,\, 76

a

Sort the values into ascending order.

b

Find the median \text{VO}_{2} Max.

c

Find the upper quartile value.

d

Find the lower quartile value.

e

Calculate 1.5 \times IQR, where IQR is the interquartile range.

f

Identify any outliers using upper and lower fences.

g

Create a box plot of the data with the outlier displayed separately.

h

An average untrained healthy person has a \text{VO}_{2} Max between 30 and 40.

Using the box plot, what level of exercise is likely to describe the majority of people in this group?

Effects of outliers
11

The numbers of three-pointers scored in a basketball game are shown in the dot plot:

a

Find the range of the data.

b

If the outlier is removed, what is the new range?

12

The numbers of three-pointers scored in a basketball game are shown in the dot plot:

The mode is 2. If the outlier is removed, what is the new mode?

13

Consider the given stem plot:

If the outlier is removed, what is the new mean? Round your answer to two decimal places.

Stem | Leaf
3 | 4 4 9
4 | 6 6 8 9
5 | 1 4
6 |
7 |
8 | 4

Key: 2 | 3 = 23

14

Consider the given stem plot:

If the outlier is removed, find the new range.

Stem | Leaf
2 | 5
3 |
4 | 9 9
5 | 0 0 4 5 7
6 | 2 6

Key: 1 | 2 = 12

15

Consider the following frequency table:

If the outlier is removed, what is the new mean? Round your answer to two decimal places if needed.

Weight in kilograms | Frequency
12 | 2
13 | 5
14 | 1
15 | 2
16 | 0
17 | 0
18 | 1
16

Consider the following frequency table:

If the outlier is removed, what is the new mode?

Weight in kilograms | Frequency
14 | 1
15 | 0
16 | 0
17 | 3
18 | 6
19 | 4
20 | 2
17

The glass windows for an airplane are rolled to a certain thickness, but machine production means there is some variation. The thickness of each pane of glass produced is measured (in millimetres), and the dot plot shows the results:

a

The current median is 11.15. If the outlier is removed, what is the new median?

b

The current mean is 11.1. If the outlier is removed, what is the new mean? Round your answer to two decimal places.

18

For each of the following sets of data:

i

Find the mean, median, mode, and range. Round your answers to two decimal places where necessary.

ii

Identify the outlier.

iii

Remove the outlier from the set and recalculate the values found in part (i).

iv

Describe how each of the four statistics changed after removing the outlier.

a
53, \, 46,\, 25,\, 50,\, 30,\, 30,\, 40,\, 30,\, 47,\, 109
b
4.7,\, 2.8,\, 1.9,\, 0.9,\, 0.9,\, 2.2,\, 2.2,\, 1.2,\, 1.5,\, 0.9
c
4700,\, 4700,\, 4700,\, 4500,\, 5300,\, 4900,\, 5200,\, 4800,\, 1500,\, 5100
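
The before-and-after comparison asked for in parts i to iv can be sketched with Python's standard `statistics` module. This is an illustrative sketch only: the data set `scores` is made up rather than taken from this worksheet, the outlier is picked by eye instead of by a fence test, and `summarise` is a hypothetical helper name.

```python
from statistics import mean, median, multimode


def summarise(data):
    """Return the mean (rounded to two decimal places), median, mode(s) and range."""
    return {
        "mean": round(mean(data), 2),
        "median": median(data),
        "modes": multimode(data),
        "range": max(data) - min(data),
    }


# Hypothetical data with one unusually large value (not from this worksheet).
scores = [12, 15, 15, 17, 18, 20, 61]
outlier = 61  # chosen by inspection; assumed to appear only once in the data

before = summarise(scores)
after = summarise([x for x in scores if x != outlier])

# Print each statistic before and after removing the outlier.
for name in before:
    print(name, before[name], "->", after[name])
```

Comparing the two summaries side by side shows which statistics move a lot and which barely change once the outlier is dropped.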
19

True or False: When the outlier is removed from a set of data, the range will always decrease.

20

For each of the following scenarios, determine whether the outlier that was removed must have had a value smaller or larger than the values that remain:

a

A set of data has an outlier removed and the mean lowers.

b

A set of data has an outlier removed and the mean rises.

c

A set of data has an outlier removed and the median lowers.

d

A set of data has an outlier removed and the median rises.

21

When an outlier is removed from a data set, describe the effect on the following:

a
Mode
b
Range
c
Mean
d
Median
22

The selling prices of recently sold houses are:

\$467\,000,\, \$413\,000,\, \$410\,000,\, \$456\,000,\, \$487\,000,\, \$929\,000

a

Find the mean selling price, to the nearest thousand dollars.

b

Which of the selling prices raises the mean so that it is not reflective of most of the prices?

c

Recalculate the mean selling price excluding this outlier.


Outcomes

MA12-8

solves problems using appropriate statistical processes
