topic badge
AustraliaVIC
VCE 11 General 2023

1.05 Data distribution

Worksheet
Shape, clusters and outliers
1

State whether the scores in each graph can be described as positively skewed, negatively skewed or symmetrical:

a
b
c
d
Leaf
16\ 7\ 7
22\ 2\ 2\ 2\ 3\ 3\ 3
33\ 3\ 3\ 6\ 6\ 6\ 7\ 7\ 7\ 7\ 7
44\ 4\ 4\ 4\ 4\ 4
57\ 7
e
f
g
2

The table shows the number of crime novels in a bookshop for different price ranges.

\text{Price of crime novel, to the nearest } \$55101520253035
\text{Frequency}391961993
a

Plot this data as a bar graph.

b

Describe the shape of the data in the graph.

3

For each of the following graphs:

i

Are there any outliers? If so, state them.

ii

Are there any clusters? If so where?

iii

What is the mode?

v

Describe the distribution of the data.

a
Leaf
05
17\ 8
20\ 8
31\ 3\ 3\ 7\ 8\ 9
41\ 3\ 5\ 8\ 8\ 8
5
6
7
8
92

Key: 1 | 2 = 12

b
Leaf
02
1
20\ 3\ 6\ 6
31\ 4\ 5\ 6\ 6\ 7
40\ 4\ 6\ 7\ 9
50

Key: 5 | 2 = 52\text{ hours}

c
d
4

For each of the graphs below:

i

Describe the shape of the distribution.

ii

Determine the lower quartile score and the upper quartile Score.

iii

Hence, calculate the interquartile range.

iv

Using the interquartile range, are there any outliers in the data set? If so, what are they?

a
b
5

How many peaks are there on the following dot plot?

6

Describe the distribution of the following graphs:

a
b
Central tendency
7

The given stem and leaf plot shows the age of people to enter through the gates of a concert in the first 5 seconds:

a

What was the median age?

b

What was the difference between the lowest age and the median?

c

What is the difference between the highest age and the median?

d

What was the mean age? Give your answer to two decimal places.

e

Is the data positively or negatively skewed?

Leaf
10\ 1\ 2\ 2\ 3\ 3\ 4\ 4\ 4\ 8\ 8\ 8
21\ 7
34\ 5\ 5
40
54

Key: 1 | 2 = 12 \text{ years old}

8

Consider the given histogram, representing students' heights in centimetres:

Does the histogram most likely represent grouped data or individual scores? Explain your answer.

10

The stem and leaf plot shows the batting scores of two cricket teams, England and India.

a

What is the highest score from England?

b

What is the highest score in India?

c

Find the mean score of England.

d

Find the mean score of India.

e

Calculate the combined mean of the two teams.

EnglandIndia
1\ 031\ 2\ 4\ 7
6\ 6\ 5\ 5\ 5\ 540\ 2\ 9
7\ 352\ 5
64

Key: 1 | 2 = 12

11

Consider the frequency distribution table :

\text{Score, }x\text{Frequency, }ffx
411
535
16
14
\text{Total:}43365
a

Complete the table.

b

Find the mean of the scores, correct to two decimal places.

c

Find the mode of the scores.

d

Find the range of the scores.

e

How many scores are less than the mode?

12

Consider the dot plot given. From which score can a dot be removed so that the mean, median and mode remain unchanged?

13

Which of the following dot plots have the highest median?

A
B
14

For the following scenarios, determine the value of the x:

a

A rating system of 1 - 4 was used in a survey to determine the usefulness of a new feature. The 14 scores shown below are known to be bi-modal with values 2 \text{ and }4.

2,\quad 4,\quad 2,\quad 4,\quad 3,\quad 2,\quad 3,\quad 4,\quad 4,\quad 1,\quad 1,\quad 2,\quad 3,\quad x
b

A rating system of 1 - 3 was used in a survey to determine the usefulness of a new feature. The 10 scores shown below are known to have a mode of 1.

3,\quad 2,\quad 3,\quad 2,\quad 1,\quad 3,\quad 1,\quad 1,\quad 2,\quad x
15

The six numbers 6, 2, 7, 18, 17 and an unknown number x have a median of 8.5. Find the value of x.

16

Five numbers have a range of 16, a mode of 2, a median of 7 and a mean of 8. The minimum number in the set is 2. Calculate:

a

The minimum

b

The median

c

The maximum

17

Three numbers have a mode of 10 and a mean of 10. Write the three numbers of the data set.

18

Four numbers have a range of 5, a median of 9 and a mode of 11. Write the four numbers of this data set.

Choosing a measure of centre
19

Find the most appropriate measure of centre for the following data set:

a

15,\quad 13,\quad 16,\quad 17,\quad 15,\quad 15,\quad 15

b

8,\quad 10,\quad 14,\quad 18,\quad 19,\quad 91

c
20

Susanah has been growing watermelons. The weight of the watermelons (in kilograms) are shown below:15,\quad 6,\quad 5,\quad 2,\quad 4,\quad 4,\quad 5

a

Calculate the median weight of Oprah's watermelons.

b

Calculate the mean weight of Oprah's watermelons. Round your answer to two decimal places if necessary.

c

Find the most appropriate measure of centre of this data set.

21

Consider the histogram below:

Determine the measure of centre that would be most appropriate to use to represent the data in this graph. Explain your answer.

22

A timed quiz consists of 6 puzzles. The data below shows the times (in seconds) that it took Sophia to complete each question on her first and second attempts of the quiz:

  • Times in first attempt: \quad 11,\quad 15,\quad 17,\quad 20,\quad 23,\quad 34

  • Times in second attempt: \quad 20,\quad 20,\quad 20,\quad 19,\quad 21,\quad 20

a

Calculate the mean time spent on each puzzle in the first attempt.

b

Calculate the mean time spent on each puzzle in the second attempt.

c

For which of Sophia's attempts would the mean number of minutes spent per question be a better indicator of her performance than the median number of minutes spent per question? Explain your answer.

23

The price of petrol at a petrol station was recorded each day for two weeks. The results are presented in the table below:

MondayTuesdayWednesdayThursdayFridaySaturdaySunday
Week 1\$1.70\$1.50\$1.62\$1.46\$1.49\$1.46\$1.55
Week 2\$1.25\$1.36\$1.25\$1.21\$1.21\$1.20\$3.30

The mean petrol price across the 14 days of records is \$1.54 per litre. For which week is this mean a better indication of the price of petrol? Explain your answer.

24

Every week over 45 weeks, a kayaking club ran social sessions and the number of people who attend who attended each session was recorded in the table:

Number of people attending121314151617181920
Number of weeks656565656

Explain why the mean and the median are equally accurate indicators of the typical number of people who attended each session.

25

The histograms below represents the luggage weight of each passenger on board an airline's morning and afternoon flight:

The airline wants to get an indication of how much luggage each passenger is checking in. For which flight's data would the median be a more precise indicator of a typical passenger's luggage weight than the mean? Explain your answer.

Sign up to access Worksheet
Get full access to our content with a Mathspace account

Outcomes

U1.AoS1.2

the concept of a data distribution and its display using a statistical plot

U1.AoS1.3

the five-number summary and possible outliers

What is Mathspace

About Mathspace