topic badge

1.04 Correlation using r-values

Worksheet
Pearson's correlation coefficient
1

Describe the type of correlation the following correlation coefficients indicate:

a
r = 1
b
r = 0
c
r = -1
2

A pair of data sets have a correlation coefficient of \dfrac{1}{10} while a second pair of data sets have a correlation coefficient of \dfrac{3}{5}. Which pair of data sets have the stronger correlation?

3

Describe the relationship between the variables in the following studies:

a

A study found that the correlation coefficient between heights of women and probability of being turned down for a promotion was found to be - 0.90.

b

A study found that the correlation coefficient between population of a city and number of speeding fines recorded was found to be 0.83.

c

A study found that the correlation coefficient between length of hair and length of fingernails was found to be 0.07.

d

A study found that the correlation coefficient between number of bylaws a council has about dog breeding and number of dogs available for adoption at the local shelter was found to be 0.55.

4

For each of the following graphs, write down an appropriate value for the correlation coefficient:

a
1
2
3
4
5
6
7
8
9
x
5
10
15
20
25
30
35
y
b
1
2
3
4
5
6
7
8
9
x
2
4
6
8
10
12
14
16
18
20
22
24
26
28
30
32
34
36
38
y
c
1
2
3
4
5
6
7
8
9
10
x
4
8
12
16
y
d
1
2
3
4
5
6
7
8
9
x
2
4
6
8
10
12
14
16
18
y
5

The scatter diagram shows data of the height of a ball kicked into the air as a function of time:

a

Which type of model is appropriate for the data, Linear or Non-linear?

b

Write down a possible value of Pearson’s correlation coefficient, r, for this set of data.

1
2
3
4
5
6
7
8
9
t
5
10
15
20
25
30
35
40
45
50
55
\text{Height}
6

The scatter diagram shows data of a person's level of happiness as a function of their age:

a

Which type of model is appropriate for the data, linear or non-linear?

b

Write down a possible value of Pearson’s correlation coefficient, r, for this set of data.

1
2
3
4
5
6
7
8
9
\text{Age}
10
20
30
40
50
\text{Happiness}
7

The scatter diagram shows data of the height of an object after it is pushed off a rooftop as a function of time:

a

Which type of model is appropriate for the data, linear or quadratic?

b

Write down a possible value of Pearson’s correlation coefficient, r, for this set of data.

1
2
3
4
5
6
7
8
9
x
100
200
300
400
500
600
700
800
900
y
Correlation and causation
8

Explain whether or not the following statements are true or false.

a

If there is correlation between two variables, there must be causation.

b

If there is causation between two variables, there must be correlation.

9

Explain the difference between coincidence, causation, and a confounding factor.

10

For each of the following data examples, determine if there is a causal relationship between the variables:

a

The number of times a coin lands on heads and the likelihood that it lands on heads on the next flip.

b

The amount of weight training a person does and their strength.

11

Many trees lose their leaves in winter. Does this mean that cold temperatures cause the leaves to fall?

12

A geography teacher observes that many of the students who are involved in the music programme do better at tests. Does this mean that learning music makes students better at geography?

13

The table shows the number of fans sold at a store during days of various temperatures:

\text{Temperature } (\degree\text{C})68101214161820
\text{Number of fans sold}1213141718192123
a

Is there a causal relationship between the variables?

b

Without calculating, consider the correlation coefficient, r, for temperature and number of fans sold. Is the value of r positive or negative?

14

A study found a strong positive association between the temperature and the number of beach drownings.

a

Does this mean that the temperature causes people to drown? Explain your answer.

b

Is the strong correlation found a coincidence? Explain your answer.

15

A study found a strong correlation between the approximate number of pirates out at sea and the average world temperature.

a

Does this mean that the number of pirates out at sea has an impact on world temperature?

b

Is the strong correlation found a coincidence? Explain your answer.

c

If there is correlation between two variables, is there causation?

16

A survey was run to collect data on the number of hats being worn at school each day, compared to the temperature of that day. The r-value for the resulting scatterplot was 0.72, including any outliers.

a

State a conclusion that can be drawn from this data.

b

One of the surveyors suggests that both the temperature and number of hats being worn on a day are higher when it is sunnier. Explain what this suggestion implies.

17

For each of the following correlations, state whether you think it is coincidence, correlation, or a confounding factor. If you think there is a confounding factor, state it.

a

The marriage rate in New York correlates with the number of blunt object murders.

b

The battery life of a phone correlates with how many years the phone has been in use.

c

The number of jumpers being worn at school correlates with the number of heaters that are turned on that day.

d

The number of broken umbrellas on the street correlates with the number of shop signs that have fallen over.

e

The money spent on movie tickets in the US correlates with the average daily precipitation in Pennsylvania.

f

The global mean temperature correlates with the size of the polar icecaps.

g

The high tide of a lake correlates with the day of the week.

Sign up to access Worksheet
Get full access to our content with a Mathspace account

Outcomes

3.1.6

describe an association between two numerical variables in terms of direction (positive/negative), form (linear/non-linear) and strength (strong/moderate/weak)

3.1.9

use a scatterplot to identify the nature of the relationship between variables

3.1.17

recognise that an observed association between two variables does not necessarily mean that there is a causal relationship between them

3.1.18

identify possible non-causal explanations for an association, including coincidence and confounding due to a common response to another variable, and communicate these explanations in a systematic and concise manner

What is Mathspace

About Mathspace