topic badge

7.05 Making predictions

Worksheet
Predictions from an equation
1

The lines of best fit for various bivariate data sets are given below. Using the line of best fit, predict the value of y that corresponds to the given x-value, correct to two decimal places:

a

y = 3.36x; x = 9

b

y = -1.52x+1.32; x = 5

c

y = 43.26x+8.74; x = 0.6

d

y = -0.88x-0.97; x = -19.12

e

y = 8.84 x; x = 7.68

f

y = - 8.71 x + 6.79; x = 3.49

g

y = 22.42 x + 2.93; x = 0.26

h

y = - 0.84 x - 0.19; x = -43.15

2

The lines of best fit for various bivariate data sets are given below. Using the line of best fit, predict the value of x that corresponds to the given y-value, correct to two decimal places:

a

y = 6x; y = 24

b

y = 6x-7; y = 5

c

y = -8.63x-5.84; y = 40

d

y = 0.72x+1.47; y = 1.8

e

y = 9.23 x - 4.18; y = 24.8945

f

y = - 7.76 x - 5.89; y = 713.7724

g

y = - 6.83 x; y = - 59.6259

h

y = 0.45 x + 7.62; y = 7.566

3

A bivariate data set has a line of best fit with equation t = 4.24 s.

Predict the value of t when s = 3.76.

4

A bivariate data set has a line of best fit with equation B = - 3.37 A + 9.87.

Predict the value of B when A = 8.26.

5

A bivariate data set has a line of best fit with equation u = - 9.12 v - 6.93.

Find the value of v that gives a prediction of u = 575.6556.

6

For each of the following data sets:

i

Use technology to find the equation of the line of best fit.

ii

Use the line of best fit to calculate the y-value for each of the given x-values.

a

x = 14

x13457912131619
y-10334557910
b

x = 69

x36515041444758593743
y8850539550642368583
Interpolation and extrapolation
7

For each of the data sets below, determine whether the prediction is an extrapolation or an interpolation:

a

A prediction of y when x = 5

x47811121317181920
y02476488118
b

A prediction of y when x = 33

x37545859435560386435
y7253262173471210210112
c

A prediction of y when x = 14

x13457912131619
y-10334557910
d

A prediction of y when x = 69

x36515041444758593743
y8850539550642368583
8

For each of the data sets below, determine whether the prediction is an extrapolation or an interpolation:

a

A prediction of y = 95.69 is made from the following data set using the line of best fit with equation y = - 0.07 x + 96.18.

x191017141125178
y9494.49796.494.497.894.995.99694.4
b

A prediction of y = 72.77 is made from the following data set using the line of best fit with equation y = 1.26 x - 57.01.

x93578697789668695492
y51.225.438.958.638.260.826.328.55.492
c

A prediction of y = 95.7 is made from the following data set using the line of best fit with equation y = -0.15 x +97.8.

x461151057293
y96.797.397.79496.998.597.395.397.597.5
d

A prediction of y = 63.37 is made from the following data set using the line of best fit with equation y = 0.79 x -20.37.

x461151057293
y96.797.397.79496.998.597.395.397.597.5
Reliability of predictions
9
For each of the following data sets:
i

Predict the value of y for the given x-value.

ii

Comment on whether the prediction is reliable, referring to both the strength of correlation and whether interpolation or extrapolation is used.

a

Correlation coefficient: r = 0.93, x = 15, line of best fit: y = 0.84 x + 2.66.

x107141613195209
y9101436101992113
b

Correlation coefficient: r = 0.46, x = 49, line of best fit: y = 0.64 x + 54.31.

x373352100658183185951
y47.636.4145.61377493.878.482.4101.2117.3
c

Correlation coefficient: r = - 1, x = 1, line of best fit: y = - 3.06 x + 93.51.

x489699514342828569
y-45-202-213-72-37-38-163-156-113
d

Correlation coefficient: r = 0.45, x = 143, line of best fit: y = 0.22 x + 0.7.

x23811144509151955382
y135.2112.8231.721.23512.9
e

Correlation coefficient: r = 0.91, x = 9, line of best fit: y = 0.8 x + 0.36.

x91461210517163
y6166338212131
f

Correlation coefficient: r = -0.43, x = 45, line of best fit: y = -0.63 x + 127.89.

x50851752655828638393
y4932146.6138.67298.989.476.496.4105.4
g

Correlation coefficient: r = -0.99, x = 14, line of best fit: y = -2.99 x + 88.56.

x875992977445717376
y-168-90-196-201-131-49-116-129-134
h

Correlation coefficient: r = 0.41, x = 139, line of best fit: y = 0.16 x + 0.74.

x5390456331166657556
y24.19114.11125.22.415.7
Applications
10

Research on the average number of cigarettes per day, x, smoked during pregnancy and the birth weight of the newborn baby, y, was conducted. Results are recorded in the adjacent table:

a

Calculate the correlation coefficient between the two variables. Round your answer to two decimal places.

b

Describe the statistical relationship between these two variables in terms of strength, direction and shape.

c

Use technology to find the line of best fit for this data. Round all values to two decimal places.

d

Use your line of best fit to predict the birth weight of a newborn whose mother smoked 5 cigarettes per day.

e

Justify the reliability of the prediction.

xy
46.303.90
13.005.80
21.405.00
25.004.80
8.605.50
36.504.50
1.007.00
17.905.10
10.605.50
13.405.10
37.303.80
18.505.70
11

A sample of families were interviewed about their annual family income and their average monthly expenditure. The results are given in the table below:

a

Calculate the correlation coefficient between the two variables. Round your answer to two decimal places.

b

Describe the statistical relationship between these two variables.

c

Use technology to find the line of best fit for this data. Round all values to three decimal places.

d

Use your line of best fit to predict the monthly expenditure for a family whose annual income is \$80\,000.

e

For which of the following annual incomes would the regression line produce the most reliable prediction?

A
\$99\,000
B
\$51\,000
C
\$80\,000
\text{Income }(x)\text{Expenditure }(y)
66\,0001100
75\,0001700
65\,0001400
73\,0001300
54\,000600
90\,0001800
87\,0001100
87\,0001500
94\,0001800
96\,0002200
12

The estimated iron ore grades (percentage of iron content) and the actual recovered iron ore grades are recorded to determine the accuracy of the models used by a mining company. The percentages are recorded in the table below:

\text{Estimated }(x)38\%32\%31\%33\%38\%40\%
\text{Actual }(y)48\%29\%42\%32\%40\%50\%
a

Calculate the correlation coefficient between the two variables, rounded to two decimal places.

b

Describe the statistical relationship between these two variables.

c

Use technology to find the line of best fit for this data. Round all values to two decimal places.

d

Use your line of best fit to predict the actual iron ore grade from a pit where the estimated grade is 35\%. Round your answer to two decimal places.

e

Justify the reliability of this prediction.

13

A team of salespeople submit their expenses and their sales for the month of March to their manager. Figures are recorded in the adjacent table:

a

Calculate the correlation coefficient between the two variables. Round your answer to two decimal places.

b

Describe the statistical relationship between these two variables.

c

Use technology to find the line of best fit for this data. Round all values to two decimal places.

d

Use your line line of best fit to predict the expenses of a person in this department who made sales of \$6000 for the month.

e

Justify the reliability of this prediction.

\text{Sales (hundreds) }(x)\text{Expenses }(y)
55.743.8
4828.9
62.773.9
23.411.9
23.89
60.650
20.220.2
52.128.2
23.419.2
33.331.1
Sign up to access Worksheet
Get full access to our content with a Mathspace account

Outcomes

MS2-12-2

analyses representations of data in order to make inferences, predictions and draw conclusions

MS2-12-7

solves problems requiring statistical processes, including the use of the normal distribution and the correlation of bivariate data

What is Mathspace

About Mathspace