topic badge

6.02 Scatterplots and correlation

Worksheet
Correlation
1

For each of the following scatter plots, describe the data as linear or nonlinear:

a
b
c
d
e
f
2

State whether the following scatter plots show a linear relationship, a nonlinear relationship, or no relationship:

a
5
10
15
20
x
5
10
15
20
y
b
5
10
15
20
x
5
10
15
20
y
c
5
10
15
20
x
5
10
15
20
y
d
5
10
15
20
x
5
10
15
20
y
e
5
10
15
20
x
5
10
15
20
y
f
5
10
15
20
x
5
10
15
20
y
g
5
10
15
20
x
5
10
15
20
y
h
5
10
15
20
x
5
10
15
20
y
i
5
10
15
20
x
5
10
15
20
y
j
5
10
15
20
x
5
10
15
20
y
k
5
10
15
20
x
5
10
15
20
y
l
5
10
15
20
x
5
10
15
20
y
m
5
10
15
20
x
5
10
15
20
y
n
5
10
15
20
x
5
10
15
20
y
3

State whether the following scatter plots show a positive correlation, a negative correlation, or neither:

a
b
c
d
e
4

State whether the following scatter plots show a positive correlation, a negative correlation, or neither:

a
5
10
15
20
x
5
10
15
20
y
b
5
10
15
20
x
5
10
15
20
y
c
5
10
15
20
x
5
10
15
20
y
d
5
10
15
20
x
5
10
15
20
y
e
5
10
15
20
x
5
10
15
20
y
f
5
10
15
20
x
5
10
15
20
y
g
5
10
15
20
x
5
10
15
20
y
h
5
10
15
20
x
5
10
15
20
y
i
5
10
15
20
x
5
10
15
20
y
j
5
10
15
20
x
5
10
15
20
y
5

For each of the following scatter plots, describe the correlation in terms of strength and direction:

a
b
c
d
e
6

Describe the relationship between the variables observed in the following scatter plots in terms of strength and direction:

a
5
10
15
20
x
5
10
15
20
y
b
5
10
15
20
x
5
10
15
20
y
c
5
10
15
20
x
5
10
15
20
y
d
5
10
15
20
x
5
10
15
20
y
e
5
10
15
20
x
5
10
15
20
y
f
5
10
15
20
x
5
10
15
20
y
7

Describe the relationship observed between the variables in the following table of values as one of the following:

  • No linear relationship

  • Positive linear relationship

  • Negative linear relationship

a
x1223654554
y2356124752
b
x415101691758614
y20755281438524413169
c
x135891213161820
y48106103603313111- 63866
d
x46253595399617452486
y4.22.639.74.49.51.54.72.78.6
e
x11812141720913315
y- 12- 7- 11- 14- 16- 21- 8- 13- 2- 14
8

For each of the following pairs of variables:

i
Is there a relationship between the two variables. If yes, answer part (ii).
ii
Is the correlation between the two variables positive or negative?
a
Typing speed and dancing ability
b
Temperature and ice cream sales
c
Time spent studying and exam grade
d
Eye colour and IQ
e
Temperature and soup sales
f
Rainfall and traffic accidents
g
Height and weight
h
Age and hearing ability
Applications
9

Scientists conducted a study where each person was asked to read a paragraph and then recount as much information as they can remember. They found that the longer the paragraph, the less information each person could retain.

Is the correlation between the length of the paragraph and the information retained, positive or negative?

10

A study found a strong positive association between the temperature and the number of beach drownings.

a

Does this mean that the temperature causes people to drown? Explain your answer.

b

Is the strong correlation found a coincidence? Explain your answer.

11

The scatter plot shows the relationship between air and sea temperature:

a

Describe the relationship between air and sea temperature.

b

Describe the correlation between air and sea temperature.

12

A database compare the income per capita of a country and the child mortality rate of that country. A sample of the data is shown in the table.

If a scatter plot was created from the entire database, describe the relationship you would expect it to have in terms of strength, direction and shape.

Income per capitaChild Mortality rate
146567
11\,42816
262135
32\,4689
13

A study was conducted to find the relationship between the age at which a child first speaks and their level of intelligence as teenagers. The following table shows the ages, in months, of some teenagers when they first spoke, and their results in an aptitude test:

Age when first spoke14279916211710719
Aptitude test results9669849010187929910493
a

State the independent variable.

b

State the dependent variable.

c

Construct a scatter plot for the data.

d

Do the variables have a positive or negative linear correlation?

e

Is the correlation weak, moderate or strong?

14

In recent years, beekeepers and scientists have become concerned over a phenomenon known as colony collapse disorder (CCD), where the majority of worker bees in a hive disappear, leaving behind the queen and immature bees.

The percentage of beehive losses that can be attributed to CCD each year, since 2005, is shown in the table:

a

Construct a scatter plot for this data.

b

Describe the correlation between the number of years passed and the number of hives lost to CCD.

\text{Year}Y\text{Hives lost to CCD, }(H)\%
\text{2005} 0 18
\text{2006} 113
\text{2007} 223
\text{2008} 326
\text{2009} 428
\text{2010} 530
15

The market price of bananas varies throughout the year. Each month, a consumer group compared the average quantity of bananas supplied by each producer to the average market price (per unit).

a

Construct a scatter plot using the data from the table.

b

Describe the correlation between the supply quantity and the market price of bananas.

c

According to this data, when will a supplier of bananas receive a higher price per banana?

\text{Supply (kg)}\text{Price (dollars) }
550 15.25
600 14.75
650 14.75
700 14.75
750 14.25
800 14.00
850 13.75
900 13.25
950 13.50
1000 13.25
Correlation and causation
16

Determine whether the following are examples of variables with no correlation:

a

The age of a child and their shoe size.

b

The age of a child and their height.

c

The age of a child and the number of pets owned.

d

The age of a child and the amount of adjectives learned.

17

Determine whether the following describe a relationship that is correlated but not causal:

a

The sales of ice cream and increase in temperature.

b

The number of hours worked and how much money is made for a given person.

c

The amount of showers had in a day and the amount of the water bill.

d

The amount of rainfall received, and level of water in a lake.

e

The larger the dimensions of a rectangular verandah, the more area.

f

The season of the year and the number of water related injuries.

g

Increase in temperature, and the level of mercury in a thermometer.

h

The number of students shouting in class and the number of detentions received.

18

Determine whether the following describe a causal relationship and not just a correlation:

a

An individual's decision to work in construction and his diagnosis of skin cancer.

b

The number of minutes spent exercising and the amount of calories burned.

c

A decrease in temperature and the increase in attendance at an ice skating rink.

d

As a child's weight increases, so does her vocabulary.

19

Determine if following statements are true or false:

a

There is a causal relationship between the number of times a coin lands on heads and the likelihood that it lands on heads on the next flip.

b

There is a causal relationship between the amount of weight training a person does and their strength.

20

A study found a strong correlation between the approximate number of pirates out at sea and the average world temperature.

a

Does this mean that the number of pirates out at sea has an impact on world temperature?

b

Is the strong correlation found a coincidence? Explain your answer.

c

If there is correlation between two variables, is there causation?

Outliers
21

The following table shows the number of traffic accidents associated with a sample of drivers in different age groups:

Age20253035404550556065
No. Accidents41443934302522181917
a

State the independent variable.

b

State the dependent variable.

c

Construct a scatter plot for this data.

d

Do the variables have a positive or negative linear correlation?

e

Is there an outlier in this data set?

22

The marks of 12 students in Maths and Sport were recorded in the following table:

a

Construct a scatter plot for the students' marks in maths vs their marks in sports.

b

Describe the correlation between students' marks in maths and marks in sport.

c

Which student's scores appear to represent an outlier?

StudentMark in MathsMark in Sports
16344
29274
36052
47970
58867
68160
76173
89186
97284
104293
116657
129292
23

A researcher is studying the relationship between the number of passers-by near an emergency situation and the time taken until help is offered. The results are shown in the table below:

a

Construct a scatter plot to represent the data in the table.

b

Comment on the relationship between the number of passers-by and the time until assistance is offered by a passer-by to a person in an emergency.

c

Describe the correlation between the number of passers-by and the time until assistance is offered by a passer-by to a person in an emergency.

\text{Passers-by } (p)\text{Time until help}\\ \text{is offered }(t)
18
219
326
437
551
665
24

A study was conducted to compare running times in various outdoor temperatures. The table below lists the time taken to sprint 400 metres by runners in different temperatures:

\text{Temperature }(C)52108176439
\text{Sprint time }(s)60674869654957535952
a

Construct a scatter plot for this data.

b

How many runners were tested in the study?

c

Describe the correlation between temperature and sprint time for the data.

d

Which data point represents an outlier?

25

The following table shows the average IQ of a random group of people against their height:

\text{Height (cm)}140145150155160165170175180185
\text{IQ}1039598111858910814511093
a

Construct a scatter plot for this data.

b

Describe the relationship between IQ and height.

c

How tall is the person who appears to be an outlier?

Sign up to access Worksheet
Get full access to our content with a Mathspace account

Outcomes

3.4.12

describe the patterns and features of bivariate data

3.4.13

describe the association between two numerical variables in terms of direction (positive/negative), form (linear/non-linear) and strength(strong/moderate/weak)

3.4.16

interpret relationships in terms of the variables, for example, describe trend as increasing or decreasing

3.4.19

distinguish between causality and association through examples

What is Mathspace

About Mathspace