topic badge

11.04 Effects of outliers

Worksheet
Quantify outliers
1

Identify any outliers in each of the following data sets:

a

73,\, 77,\, 81,\, 86,\, 131

b

69,\, 79,\, 86,\, 72,\, 86,\, 77,\, 73,\, 82,\, 81,\, 76,\, 83,\, 47,\, 87,\, 70,\, 80,\, 85

c

58,\, 63,\, 58,\, 59,\, 64,\, 68,\, 68,\, 30,\, 73,\, 25,\, 72,\, 61,\, 65,\, 69,\, 75,\, 72

e
Leaf
12
27\ 7\ 9\ 9
31\ 3\ 3\ 3\ 3\ 5\ 8
44\ 4\ 5

Key: 5 | 2 \ = \ 52 hours

f
2

State the coordinates of the outlier on the following graph:

3
6
9
12
15
18
x
3
6
9
12
15
18
21
y
3

The table shows the average temperature (\degree \text{C}) in a particular city over several years. State the year in which the temperature is an outlier.

Year2 0022 0032 0042 0052 0062 0072 008
Temperature (°C)31.726.522.622.524.223.024.1
4

Consider the given dot plot:

a

Find the:

i

Median

ii

Lower quartile

iii

Upper quartile

iv

Interquartile range

v

Value of the lower fence

vi

Value of the upper fence

b

Identify any outliers.

5

For each of the following data sets, calculate:

i

The interquartile range

ii

The value of the lower fence

iii

The value of the upper fence

a
\text{Minimum}5
\text{Q}16
\text{Median}12
\text{Q}317
\text{Maximum}28
b
2
4
6
8
10
12
14
16
18
6

For each of the following sets of data:

i

Construct the five number summary.

ii

Calculate the interquartile range.

iii

Calculate the value of the lower fence.

iv

Calculate the value of the upper fence.

v

Would the value -5 be considered an outlier?

vi

Would the value 16 be considered an outlier?

a

9,\, 5,\, 3,\, 2,\, 6,\, 1

b

3,\, 10,\, 9,\, 2,\, 7,\, 5,\, 6

c

12,\, 5,\, 11,\, 1,\, 9,\, 8,\, 5,\, 6

7

For each of the following sets of data:

i

Construct the five number summary.

ii

Would the value -3 be considered an outlier?

iii

Would the value 15 be considered an outlier?

a

1,\, 4,\, 8,\, 10,\, 6,\, 2,\, 5

b

9,\, 4,\, 6,\, 11,\, 10,\, 8,\, 10

8

A group of Year 12 students were asked how many hours they spend on Hashtagram per day. The results are given below:

1.9, \, 1.1, \, 2.4, \, 2.3, \, 2.1, \, 1.2, \, 1.3, \, 1.6, \, 1.5, \, 1.8

a

Determine the five number summary for this data set.

b

Another girl, Naylaa spends 3.6 hours using Hashtagram. If her score was added to this group, would it be considered an outlier? Explain your answer.

9

The height (in metres) of certain karri trees, which grow in the south west of Australia, are shown below:

74, \, 77, \, 76, \, 81, \, 71, \, 72, \, 78, \, 75, \, 73, \, 84, \, 79

a

Construct the five number summary.

b

A tree is measured to be 66 \text{ m} tall. Would this tree be considered an outlier?

10

The data point 37 is above the upper fence and is considered an outlier. The interquartile range is 10.

Find the largest integer value the upper quartile can be.

11

The data point 5 is below the lower fence and is considered an outlier. The interquartile range is 12.

Find the smallest integer value the lower quartile can be.

12

\text{VO}_{2} Max is a measure of how efficiently your body uses oxygen during exercise. The more physically fit you are, the higher your \text{VO}_{2} Max.

Here are some people’s results when their \text{VO}_{2} Max was measured:

46,\, 27,\, 32,\, 46,\, 30,\, 25,\, 41,\, 24,\, 26,\, 29,\, 21,\, 21,\, 26,\, 47,\, 21,\, 30,\, 41,\, 26,\, 28,\, 26,\, 76

a

Sort the values into ascending order.

b

Find the median \text{VO}_{2} Max.

c

Find the upper quartile value.

d

Find the lower quartile value.

e

Calculate 1.5 \times IQR, where IQR is the interquartile range.

f

Identify any outliers using upper and lower fences.

g

Create a box plot of the data with the outlier displayed separately.

h

An average untrained healthy person has a \text{VO}_{2} Max between 30 and 40.

Using the boxplot, what level of exercise is likely to describe the majority of people in this group?

Effects of outliers
13

For each of the following sets of data:

i

Find the mean, median, mode, and range. Round your answers to two decimal places where necessary.

ii

Identify the outlier.

iii

Remove the outlier from the set and recalculate the values found in part (i).

iv

Describe how each of the four statistics changed after removing the outlier.

a

53, \, 46,\, 25,\, 50,\, 30,\, 30,\, 40,\, 30,\, 47,\, 109

b

4.7,\, 2.8,\, 1.9,\, 0.9,\, 0.9,\, 2.2,\, 2.2,\, 1.2,\, 1.5,\, 0.9

c

4700,\, 4700,\, 4700,\, 4500,\, 5300,\, 4900,\, 5200,\, 4800,\, 1500,\, 5100

14

For each of the following scenarios, determine whether the outlier that was removed must have had a value smaller or larger than the values that remain:

a

A set of data has an outlier removed and the mean lowers.

b

A set of data has an outlier removed and the mean rises.

c

A set of data has an outlier removed and the median lowers.

d

A set of data has an outlier removed and the median rises.

15

When an outlier is removed from a data set, describe the effect on the following:

a

Mode

b

Range

c

Mean

d

Median

16

The number of three-pointers scored in a basketball game are shown in the dot plot:

The mode is 2. If the outlier is removed, what is the new mode?

17

The number of three-pointers scored in a basketball game are shown in the dot plot:

a

Find the range of the data.

b

If the outlier is removed, what is the new range?

18

Consider the given stem plot:

If the outlier is removed, what is the new mean? Round your answer to two decimal places.

Leaf
34\ 4\ 9
46\ 6\ 8\ 9
51\ 4
6
7
84

Key: 2 \vert 3 = 23

19

Consider the following frequency table:

If the outlier is removed, what is the new mode?

Weight in kilogramsFrequency
141
150
160
173
186
194
202
20

Consider the following frequency table:

If the outlier is removed, what is the new mean? Round your answer to two decimal places if needed.

Weight in kilogramsFrequency
122
135
141
152
160
170
181
21

The glass windows for an airplane are rolled to a certain thickness, but machine production means there is some variation. The thickness of each pane of glass produced is measured (in millimetres), and the dot plot shows the results:

a

The current median is 11.15. If the outlier is removed, what is the new median?

b

The current mean is 11.1. If the outlier is removed, what is the new mean? Round your answer to two decimal places.

Sign up to access Worksheet
Get full access to our content with a Mathspace account

Outcomes

MS11-7

develops and carries out simple statistical processes to answer questions posed

What is Mathspace

About Mathspace