topic badge
AustraliaVIC
VCE 11 General 2023

1.08 Outliers

Worksheet
Identify outliers
1

Find the outlier(s) in the following data sets:

a
73, 77, 81, 86, 131
b
69, 79, 86, 72, 86, 77, 73, 82, 81, 76, 83, 47, 87, 70, 80, 85
c
58, 63, 58, 59, 64, 68, 68, 30, 73, 25, 72, 61, 65, 69, 75, 72
2

The stem and leaf plot shows the number of hours worked per week by a group of people. State the outlier.

Number of hours
12
27\ 7\ 9\ 9
31\ 3\ 3\ 3\ 3\ 5\ 8
44\ 4\ 5
Key: 5 | 2 = 52
3

The dot plot shows the temperature \degree C in a town over a several week period. State the temperature that is an outlier.

4

The table shows the average temperature \degree C in a particular city over several years. State the year in which the temperature is an outlier.

Year2 0022 0032 0042 0052 0062 0072 008
Temperature (°C)31.726.522.622.524.223.024.1
5

State the coordinates of the outlier on the following graph:

6

For each of the following data sets, calculate:

i

The interquartile range

ii

The value of the lower fence

iii

The value of the upper fence

a
\text{Minimum}5
\text{Q}16
\text{Median}12
\text{Q}317
\text{Maximum}28
b
2
4
6
8
10
12
14
16
18
7

Consider the given dot plot:

a

Find the:

i

Median

ii

Lower quartile

iii

Upper quartile

iv

Interquartile range

v

Value of the lower fence

vi

Value of the upper fence

b

Identify any outliers.

8

For each of the following sets of data:

i

Construct the five-number summary.

ii

Calculate the interquartile range.

iii

Calculate the value of the lower fence.

iv

Calculate the value of the upper fence.

v

Would the value -5 be considered an outlier?

vi

Would the value 16 be considered an outlier?

a

9,\, 5,\, 3,\, 2,\, 6,\, 1

b

3,\, 10,\, 9,\, 2,\, 7,\, 5,\, 6

c

12,\, 5,\, 11,\, 1,\, 9,\, 8,\, 5,\, 6

9

For each of the following sets of data:

i

Construct the five-number summary.

ii

Would the value -3 be considered an outlier?

iii

Would the value 15 be considered an outlier?

a

1,\, 4,\, 8,\, 10,\, 6,\, 2,\, 5

b

9,\, 4,\, 6,\, 11,\, 10,\, 8,\, 10

10

For each of the data sets below:

i

Construct the five-number summary.

ii

Calculate the value of the lower fence.

iii

Calculate the value of the upper fence.

iv

Identify any outliers.

v

Create a box plot of the data with the outlier(s) displayed separately.

a
6.8,\, 4.0,\, 3.5,\, 5.1,\, 2.4,\, 1.6,\, 3.9,\, 3.5,\, 3.1,\, 3.6,\, 7.6,\, 3.7,\, 4.0,\, 5.1,\, 3.6,\, 3.8,\, 3.6,\, 6.7
b
10,\, 15,\, 12,\, 26,\, 18,\, 15,\, 11,\, 38,\, 25,\, 12,\, 19,\, 17,\, 16,\, 17,\, 11,\, 36,\, 9,\, 2,\, 21,\, 18,\, 16
c
82,\, 87,\, 92,\, 76,\, 80,\, 85,\, 71,\, 84,\, 61,\, 79,\, 81,\, 81,\, 86,\, 97,\, 101,\, 80,\, 71,\, 76,\, 78,\, 86,\, 84
11

There is a test to measure the Emotional Quotient (EQ) of an individual. Here are the EQ results for 21 people listed in ascending order:

58, 90, 91, 92, 93, 94, 95, 95, 95, 97, 99, 100, 108, 114, 116, 116, 117, 118, 118, 122, 129

a

Find the median EQ score.

b

Find the Upper Quartile score.

c

Find the Lower Quartile score.

d

Find the interquartile range.

e

Find the lower fence.

f

Find the upper fence.

g

Use the fences to state any outliers.

12

Consider the data sets below:

  • Set A: \, 14,\, 18,\, 21,\, 19,\, 12,\, 16,\, 22,\, 20,\, 19,\, 13,\, 21,\, 20,\, 16,\, 7,\, 18,\, 20,\, 11,\, 19,\, 17,\, 24

  • Set B: \, 17,\, 9,\, 15,\, 24,\, 14,\, 13,\, 16,\, 10,\, 21,\, 14,\, 15,\, 17,\, 16,\, 13,\, 9,\, 19,\, 14,\, 18,\, 15,\, 12

a

Construct the five-number summary for each set.

b

Identify any outliers and use statistical calculations to justify your answer.

c

Create a parallel box plot of the data sets with the outlier(s) displayed separately.

13

The data point 5 is below the lower fence and is considered an outlier. The interquartile range is 12.

Find the smallest integer value the lower quartile can be.

14

The data point 37 is above the upper fence and is considered an outlier. The interquartile range is 10.

Find the largest integer value the upper quartile can be.

15

A group in a study take a test to assess their reaction time. The participants clicked a button as soon as they heard a sound which was played at random intervals. The reaction time, in milliseconds, of each participant is shown below:

220,\, 280,\, 210,\, 220,\, 215,\, 180,\, 185,\, 190,\, 190,\, 195,\, 150 \, 190,\, 195,\, 195
a

Construct the five-number summary.

b

Identify any outliers and use statistical calculations to justify your answer.

c

Create a box plot of the data with the outlier displayed separately.

d

Give a possible explanation for the outliers present.

16

\text{VO}_{2} Max is a measure of how efficiently your body uses oxygen during exercise. The more physically fit you are, the higher your \text{VO}_{2} Max.

Here are some people’s results when their \text{VO}_{2} Max was measured:

46,\, 27,\, 32,\, 46,\, 30,\, 25,\, 41,\, 24,\, 26,\, 29,\, 21,\, 21,\, 26,\, 47,\, 21,\, 30,\, 41,\, 26,\, 28,\, 26,\, 76

a

Sort the values into ascending order.

b

Determine the median \text{VO}_{2} Max.

c

Determine the upper quartile value.

d

Determine the lower quartile value.

e

Calculate 1.5 \times IQR, where IQR is the interquartile range.

f

Identify any outliers using upper and lower fences.

g

Create a box plot of the data with the outlier displayed separately.

h

An average untrained healthy person has a \text{VO}_{2} Max between 30 and 40.

Using the boxplot, what level of exercise is likely to describe the majority of people in this group?

Effect of outliers
17

The number of three-pointers scored in a basketball game are shown in the given dot plot. The current mode is 2. If the outlier is removed, find the new mode.

18

For the given dot plot, the current range is 11. If the outlier is removed, find the new range.

19

Consider the given stem plot:

If the outlier is removed what is the new mean? Round your answer to two decimal places.

Leaf
34\ 4\ 9
46\ 6\ 8\ 9
51\ 4
6
7
84

Key: 2 \vert 3 = 23

20

Consider the given stem plot:

If the outlier is removed find the new range.

Leaf
25
3
49\ 9
50\ 0\ 4\ 5\ 7
62\ 6

Key: 1 | 2 \ = \ 12

21

Consider the following frequency table:

If the outlier is removed what is the new mean? Round your answer to two decimal places if needed.

Weight in kilogramsFrequency
122
135
141
152
160
170
181
22

Consider the following frequency table:

If the outlier is removed what is the new mode?

Weight in kilogramsFrequency
141
150
160
173
186
194
202
23

The glass windows for an airplane are rolled to a certain thickness, but machine production means there is some variation. The thickness of each pane of glass produced is measured (in millimetres), and the dot plot shows the results.

a

The current median is 11.15. If the outlier is removed what is the new median?

b

The current mean is 11.1. If the outlier is removed what is the new mean? Round your answer to two decimal places.

24

For each of the following sets of data:

i

Find the mean, median, mode, and range. Round your answers to two decimal places where necessary.

ii

Identify the outlier.

iii

Remove the outlier from the set and recalculate the values found in part (i).

iv

Describe how each of the four statistics changed after removing the outlier.

a
53, \, 46,\, 25,\, 50,\, 30,\, 30,\, 40,\, 30,\, 47,\, 109
b
4.7,\, 2.8,\, 1.9,\, 0.9,\, 0.9,\, 2.2,\, 2.2,\, 1.2,\, 1.5,\, 0.9
c
4700,\, 4700,\, 4700,\, 4500,\, 5300,\, 4900,\, 5200,\, 4800,\, 1500,\, 5100
25

True or False: When the outlier is removed from a set of data, the range will always decrease.

26

For each of the following scenarios, determine whether the outlier that was removed must have had a value smaller or larger than the values that remain:

a

A set of data has an outlier removed and the mean lowers.

b

A set of data has an outlier removed and the mean rises.

c

A set of data has an outlier removed and the median lowers.

d

A set of data has an outlier removed and the median rises.

27

When an outlier is removed from a data set, describe the effect on the following:

a
Mode
b
Range
c
Mean
d
Median
28

The selling price of recently sold houses are:

\$467\,000,\, \$413\,000,\, \$410\,000,\, \$456\,000,\, \$487\,000,\, \$929\,000

a

Find the mean selling price, to the nearest thousand dollars.

b

Which of the selling prices raises the mean so that it is not reflective of most of the prices?

c

Recalculate the mean selling price excluding this outlier.

29

Seven millionaires with an average net wealth of \$41 million with a standard deviation of \$7 million are having a party. Suddenly Carlos Slim, who has a net wealth estimated to be \$31 billion, walks into the room.

a

What is the new average net wealth (in millions) in the room? Give your answer rounded to the nearest million.

b

When Carlos Slim's net worth is taken into account, will the standard deviation be higher, lower or unchanged from before?

c

Will the mode be higher, lower or unchanged from before if at least two of the millionaires have the same net wealth?

d

Will the range be higher, lower or unchanged from before?

Suitability of measures of centre
30

The selling price of recently sold houses is given below:

\$760\,000,\, \$650\,000,\, \$810\,000,\, \$780\,000,\, \$760\,000,\, \$590\,000,\, \$1\,360\,000

a

Find the mean selling price. Round your answer to the nearest thousand dollars.

b

Find the median selling price.

c

Recalculate the mean selling price excluding the outlier.

d

Recalculate the median selling price excluding the outlier.

e

Which measure of centre best identifies the typical selling price of recently sold houses? Explain your answer.

31

The weight of fish caught in a "weigh and release" fishing competition, in kilograms are given below:

12.5,\, 15.1,\, 13,\, 14.2,\, 14.5,\, 14.9,\, 12.5,\, 14.3,\, 1.5

a

Find the mean weight.

b

Find the median weight.

c

Recalculate the mean weight excluding the outlier.

d

Recalculate the median weight excluding the outlier.

e

Which measure of centre best identifies the typical fish weight? Explain your answer.

32

The salaries of part-time employees at a company are given in the dot plot below. Which measure of centre best reflects the typical wage of a part-time employee? Explain your answer.

33

The age of students that participated in extra-curricular activities were recorded, and their results are presented in the dot plot below.

a

Find the mean age of participation among the sample of students.

b

Find the median age of participation among the sample of students.

c

By looking at the dot plot, are the mean and median reliable measures for the age of the typical student who participates in activities? Explain your answer.

34

A journalist wanted to report on road speed cameras being used as revenue raisers. She obtained data that showed the number of times 20 speed cameras issued a fine to motorists in one month. The results were:

101,\, 102,\, 115,\, 115,\, 121,\, 124,\, 127,\, 128,\, 130,\, 130,\\ 143,\, 143,\, 146,\, 162,\, 162,\, 163,\, 178,\, 183,\, 194,\, 977

The journalist wants to give the impression that speed cameras are just being used to raise revenue. Which measure of centre should she use in her article? Explain your answer.

35

The selling prices of artworks sold at an auction are given below:

\$18\,000,\, \$11\,000,\, \$17\,000,\, \$20\,000,\, \$18\,000,\, \$16\,000,\, \$15\,000,\, \$218\,000

Which measure of centre best identifies the typical selling price of recently sold artwork? Explain your answer.

Sign up to access Worksheet
Get full access to our content with a Mathspace account

Outcomes

U1.AoS1.5

construct and interpret graphical displays of data, and describe the distributions of the variables involved and interpret in the context of the data

What is Mathspace

About Mathspace