topic badge

5.05 Making predictions using time series data

Worksheet
Predictions
1

A least squares regression line is fitted to some seasonally adjusted data, given by:y=3.5t+22.8

Use the regression line to predict the deseasonalised value for time period 20.

2

A least squares regression line is fitted to some seasonally adjusted data for time periods 1 to 15, given by:y=1.3t+45.7

a

Use the regression line to predict the deseasonalised value for time period 16.

b

Use the regression line to predict the deseasonalised value for time period 60.

c

Which prediction is more reliable?

3

The monthly average cost of a hotel room in Sydney in 2000 is shown in the following table:

\text{Month, } t\text{Jan}\text{Feb}\text{March}\text{April}\text{May}\text{Jun}\text{Jul}\text{Aug}\text{Sep}
\text{Hotel price }, P(\$)250240235237239230228237332
a

Let January 2000 be t=1 and construct a time series graph of the data.

b

Which month appears to be an outlier?

c

Remove the outlier and find the equation of the least squares regression line for the remaining data. Round all values to four decimal places.

d

Predict the average cost of a hotel room in Sydney in November 2000.

4

Data following a 5 point cyclical pattern is collected and seasonally adjusted for time periods 1 to 14. A least squares regression line is fitted to the seasonally adjusted data, which appears linear, and is given by:

y = 2.4378 t + 66.2925

a

Calculate the predicted deseasonalised value for time period 15, to four decimal places.

b

If the seasonal index for this period was 77\%, calculate the true predicted value to four decimal places.

5

Data following a 3 point cyclical pattern is collected and seasonally adjusted for time periods 1 to 12. A least squares regression line is fitted to the seasonally adjusted data and is given by:

y = - 2.1404 t + 51.4172

a

Calculate the predicted deseasonalised value for time period 16, to four decimal places.

b

If the seasonal index for this period was 140\%, calculate the true predicted value to four decimal places.

c

Comment on the reliability of the predicted value from part (b).

d

What does the coefficient of t indicate in the equation of the least squares regression line?

6

The petrol price cycle at a local service station is monitored. The results over two weeks are given in the table below:

\text{Day}\text{Time }(t)\text{Price (cents)}\text{Deseasonalised data}
Week 1\text{Mon}199102.97
\text{Tue}285.298.19
\text{Wed}384105.13
\text{Thu}4104.7103.10
\text{Fri}5132.5X
\text{Sat}6113.9103.15
\text{Sun}7105.4103.44
Week 2\text{Mon}8114.5119.10
\text{Tue}9108.1124.58
\text{Wed}1093.2116.65
\text{Thu}11120.8118.96
\text{Fri}12140.6114.00
\text{Sat}13Y118.91
\text{Sun}14120.8118.56

Seasonal indices:

MonTuesWedThuFriSatSun
0.96140.86770.79901.01551.23331.10421.0189
a

Which is the best day of the cycle to purchase petrol?

b

Calculate the missing values X and Y to two decimal places.

c

Using your calculator, determine the equation of least squares regression line for the deseasonalised data, in terms of t. Round all values to two decimal places.

d

Predict the price of petrol for Thursday in the third week.

e

Comment on the reliability of your prediction.

7

A new pop up ice-cream shop records their sales over their first month. The data is tabulated below. The shop is only open from Friday to Sunday.

\text{Day}\text{Time }(t)\text{Sales (dollars)}\text{Deseasonalised data}
Week 1\text{Fri}120362101.14
\text{Sat}222572040.87
\text{Sun}319362092.75
Week 2\text{Fri}42224X
\text{Sat}525472303.10
\text{Sun}620602226.79
Week 3\text{Fri}723492424.15
\text{Sat}827062446.88
\text{Sun}9Y2431.09
Week 4\text{Fri}1024352512.90
\text{Sat}1128242553.58
\text{Sun}1223982592.15

Seasonal indices:

FriSatSun
0.96901.10590.9251
a

On which day will shop be most likely to need extra help?

b

Calculate the missing values X and Y to two decimal places.

c

Using your calculator, determine the equation of least squares regression line for the deseasonalised data in terms of t. Round all values to two decimal places.

d

Predict the sales for Friday of the sixth week.

e

Comment on the reliability of your prediction

8

The number of customers served at a shopping centre cafe are recorded quarterly over a period of four years and the results are entered into the table below:

\text{Month}\text{Time }(t)\text{Customer} \\ \text{numbers}\text{Cycle} \\ \text{mean}\text{Perentage} \\ \text{of cycle mean}\text{Deaseasonalised} \\ \text{number}
2016 \text{Jan}11687 130.396\%1284
\text{Apr}21218 94.145\%1256
\text{Jul}3886 1293.75 68.483\%1329
\text{Oct}41384 106.976\%1318
2017 \text{Jan}51789 130.823\%1361
\text{Apr}61327 97.038\%1369
\text{Jul}7905 1367.5 66.179\%1358
\text{Oct}81449 105.960\%1380
2018\text{Jan}92325 132.971\%1769
\text{Apr}101745 99.800\%1800
\text{Jul}111112 1748.5 63.597\%1669
\text{Oct}121821 103.632\%1726
2019 \text{Jan}132565 131.471\%1952
\text{Apr}141890 96.873\%1949
\text{Jul}151333 1951 68.324\%2000
\text{Oct}162016 103.332\%1920

Seasonal indices:

JanAprJulOct
131.42\%96.964\%66.646\%104.975\%
a

Use your calculator to determine the equation of the least squares regression line for the deseasonalised data, in terms of t. Round all values to three decimal places.

b

What does the coefficient of t indicate in the least squares regression line?

c

State the value of t for April 2020.

d

Use the regression line from the deseasonalised data and the seasonal index for April to predict the number of customers for April 2020. Round your answer to the nearest whole number.

e

Comment on the reliability of your prediction.

f

The cafe owner used the following calculation to predict the number of customers for July 2021:

\begin{aligned} \text{Predicted Value} & = \left( 55.1588 \times 19 + 1121.15\right) \times \dfrac{66.464}{100} \\ & = 1141.71\ldots \\ & \approx 1142 \end{aligned}

What is wrong with this prediction?

9

The number of customers served at a shopping centre cafe are recorded quarterly over a period of four years and the results are entered into the table below:

\text{Month}\text{Time }(t)\text{Customer numbers}\text{4CMA}
2016 \text{Jan}11687
\text{Apr}21218
\text{Jul}38861306.50
\text{Oct}413841332.88
2017 \text{Jan}517891348.88
\text{Apr}613271359.38
\text{Jul}79051434.50
\text{Oct}814491553.75
2018\text{Jan}923251631.88
\text{Apr}1017451703.13
\text{Jul}1111121778.50
\text{Oct}1218121826.63
2019 \text{Jan}1325651872.38
\text{Apr}1418901925.50
\text{Jul}151333
\text{Oct}162016

Seasonal indices:

JanAprJulOct
131.42\%96.964\%66.646\%104.975\%
a

Use your calculator to determine the equation of the least squares regression line for the 4CMA data, in terms of t. Round all values to four decimal places.

b

What does the coefficient of t indicate in the least squares regression line?

c

State the value of t for January 2021.

d

Use the equation of the regression line from the 4CMA data and the seasonal index for January to predict the number of customers for January 2021. Round your answer to the nearest whole number.

e

Comment on the reliability of your prediction.

f

The cafe owner used the following calculation to predict the number of customers for October 2020.

\begin{aligned} \text{Predicted Value} & = 62.8964 \times 20 + 1054.8731 \\ & = 3122.14 \ldots \\ & \approx 2313 \end{aligned}

What is wrong with the prediction?

10

The number of members attending a gym are recorded weekly over a period of four months and the results are entered into the table below. The owner decides to use a 4 point centred moving average to smooth the data and make predictions.

\text{Week}\text{Time }(t)\text{Attendance numbers}\text{4CMA}
Jan 11760
221123
338151073.25
4415601100.00
Feb 158301129.00
2612671156.25
379031173.13
4816901188.75
Mar198351210.13
21013871219.25
3119541227.63
41217121249.88
Apr1138801274.75
21415201293.88
3151020
4161799

Seasonal indices:

Week 1Week 2Week 3Week 4
69.49\%110.889\%77.455\%142.166\%
a

Use your calculator to determine the equation of the least squares regression line for the 4CMA data, in terms of t. Round all values to four decimal places.

b

What does the coefficient of t indicate in the least squares regression line?

c

State the value of t for Week 2 of May.

d

Use the equation of the regression line from the 4CMA data and the seasonal index for Week 2 to predict the number of customers for Week 2 May. Round your answer to the nearest whole number.

e

Comment on the reliability of your prediction.

f

The gym owner used the following calculation to predict the attendance for Week 2 June.

\begin{aligned} \text{Predicted Value} & = \left( 18.7499 \times 22 + 1031.9506\right) \times 110.889 \\ & = 160\,173.44 \ldots \\ & \approx 160\,173 \end{aligned}

What is wrong with this prediction?

11

The number of hockey sticks sold are recorded tri-annually over a period of four years and the results are entered into the table below. The owner decides to use a 3 point centred moving average to smooth the data and make predictions.

a

Use your calculator to determine the equation of the least squares regression line for the 3MA data, in terms of t. Round all values to two decimal places.

b

What does the coefficient of t indicate in the least squares regression line?

c

State the value of t for May 2021.

d

Predict the whole number of hockey sticks sold in May 2021.

e

Comment on the reliability of your prediction.

f

The sports store owner used the following calculation to predict the sales for May 2022:

\begin{aligned} \left( 1.16 \times 14 + 31.06\right) \times \frac{100}{171.56} & = 27.57 \ldots \\ & \approx 28 \end{aligned}

What is wrong with the prediction?

\text{Month}\text{Time } \\\ (t)\text{Sales}\text{3MA}
2016 \text{Jan}112
\text{May}25431.33
\text{Sep}32534.00
2017 \text{Jan}41536.33
\text{May}56236.67
\text{Sep}63239.00
2018\text{Jan}71640.67
\text{May}86941.33
\text{Sep}93743.33
2019 \text{Jan}101844.67
\text{May}117538.67
\text{Sep}1241

Seasonal indices:

JanMaySep
40.12\%171.56\%88.32\%
12

The number of students cycling to the university library is recorded daily over a period of three weeks and the results are entered into the table to the right. A 7 point moving average is used to smooth the data in order to make predictions. The seasonal indices have been calculated in the table below using the average percentage method.

a

Use your calculator to determine the equation of the least squares regression line for the 7MA data, in terms of t. Round all values to four decimal places.

b

What does the coefficient of t indicate in the least squares regression line?

c

State the value of t for Saturday Week 4.

d

Predict the number of cyclists for Saturday Week 4. Round your answer to the nearest whole number.

e

Comment on the reliability of your prediction.

\text{Week}\text{Day}\text{Time } \\ (t)\text{No.}\text{7MA}
1\text{Mon}166
\text{Tues}268
\text{Wed}371
\text{Thu}465103.57
\text{Fri}5255103.86
\text{Sat}6111103.71
\text{Sun}789103.86
2\text{Mon}868104.00
\text{Tues}967103.29
\text{Wed}1072102.71
\text{Thu}1166102.14
\text{Fri}12250101.86
\text{Sat}13107102.00
\text{Sun}1485102.00
3\text{Mon}1566101.43
\text{Tues}1668100.57
\text{Wed}177299.43
\text{Thu}186298.57
\text{Fri}19244
\text{Sat}2099
\text{Sun}2179

Seasonal indices:

MonTuesWedThuFriSatSun
0.660.670.710.632.461.040.83
13

The following data shows the sales of washing machines at a leading retailer over four quarters of three consecutive years:

\text{Month}\text{Time }(t)\text{Number of}\\\text{ washing machines sold}\text{Percentage of}\\\text{yearly mean}\text{4CMA}
Year 1 \text{March}145539.014\%
\text{June}2105490.375\%
\text{Sept}361352.562\%1167.63
\text{Dec}42543218.049\%1113.38
Year 2\text{March}546640.303\%1063.00
\text{June}660952.670\%X
\text{Sept}765556.649\%1168.63
\text{Dec}82895250.378\%1252.50
Year 3\text{March}956538.176\%1328.00
\text{June}10118179.797\%Y
\text{Sept}1168746.419\%
\text{Dec}123487235.608\%
a

Calculate the seasonal index, correct to three decimal places for the quarters ending in:

i

March

ii

June

iii

September

iv

December

b

The data is smoothed using a 4 point centred moving average as shown in the table above. Calculate the missing values X and Y.

c

Use your calculator to determine the equation of the least squares regression line for the 4CMA data, in terms of t. Round all values to four decimal places.

d

Predict the number of washing machines sold in the quarter ending September Year 5. Give your answer to the nearest whole number.

e

Comment on the reliability of your prediction.

14

The following data shows the number of customers at a hand car wash business for the first 4 weeks of 3 consecutive months:

\text{Week}\text{Time }(t)\text{Number of}\\\text{ customers}\text{Percentage of}\\\text{ monthly mean}\text{Deseasonalised}\\\text{ data}
March \text{1}1800112.44\%A
\text{2}2743104.43\%726.92
\text{3}345363.67\%722.30
\text{4}4850119.47\%705.88
April\text{1}5780116.46\%680.31
\text{2}6676100.93\%B
\text{3}742363.16\%674.46
\text{4}8800119.45\%664.36
May\text{1}9743115.06\%648.04
\text{2}10654101.28\%639.84
\text{3}1139661.23\%C
\text{4}12790122.34\%656.05
a

Calculate the seasonal index, correct to five decimal places for each of the following weeks:

i

Week 1

ii

Week 2

iii

Week 3

iv

Week 4

b

The data is smoothed by deseasonalising the data as shown in the table above. Calculate the missing values A, B and C correct to two decimal places.

c

Use your calculator to determine the equation of the least squares regression line for the deseasonalised data, in terms of t. Round all values to four decimal places.

d

Predict the number of customers for Week 4 in June. Round your answer to the nearest whole number.

e

Comment on the reliability of your prediction.

15

The following data shows the sales of air conditioners at a leading retailer over four quarters from 2017 to 2019:

\text{Month}\text{Time }(t)\text{No. of air}\\ \text{conditioners sold}\text{Proportion}\\ \text{of yearly mean}\text{Deseasonalised}\\ \text{data}
2017 \text{March}13320.3054
\text{June}23200.2943
\text{Sept}39660.8885
\text{Dec}427312.5118
2018\text{March}59870.6340
\text{June}69260.5948
\text{Sept}711170.7175
\text{Dec}831972.0536
2019 \text{March}912160.6910
\text{June}109390.5336
\text{Sept}1114140.8035
\text{Dec}1234701.9719
a

Calculate the seasonal index, correct to three decimal places for the quarters ending in:

i

March

ii

June

iii

September

iv

December

b

Deseasonalise the data and complete the last column of the table. Round data to the nearest whole air conditioner sold.

c

Use your calculator to calculate the least squares regression line that fits the deseasonalised data, in terms of t. Round all values to one decimal place.

d

Predict the whole number of air conditioners sold in the quarter ending December 2020.

e

Comment on the reliability of your prediction.

16

The following data shows the sales of air conditioners at a leading retailer over four quarters of three consecutive years:

\text{Month}\text{Time }(t)\text{Number of}\\ \text{ air conditioners sold}\text{Proportion}\\ \text{ of yearly mean}\text{4CMA}
Year 1 \text{March}110420.8529
\text{June}24860.3978
\text{Sept}36130.50171236.5
\text{Dec}427462.24761266.625
Year 2\text{March}511600.81831347.75
\text{June}66090.4296A
\text{Sept}711390.80351496.875
\text{Dec}827621.94851647.75
Year 3\text{March}917950.96381713.625
\text{June}1011810.6341B
\text{Sept}1110940.5874
\text{Dec}1233801.8148
a

Calculate the seasonal index, correct to four decimal places for the quarters ending in:

i

March

ii

June

iii

September

iv

December

b

The data is smoothed using a 4 point centred moving average as shown in the table below. Calculate the missing values A and B.

c

Use your calculator to calculate the equation of the least squares regression line that fits the 4CMA data, in terms of t. Round all values to four decimal places.

d

Predict the whole number of air conditioners sold in the quarter ending December Year 4.

e

Comment on the reliability of your prediction.

17

The electricity bills of an energy conscious household are noted over two years. The data is represented in the table below:

\text{Month}\text{Time }(t)\text{Billed amount}\text{Proportion}\\ \text{of yearly mean}\text{Deseasonalised}\\ \text{data}
2018 \text{Feb}11220.7114
\text{Apr}22421.4111
\text{Jun}31100.6414
\text{Aug}41560.9096
\text{Oct}51590.9271
\text{Dec}62401.3994
2019\text{Feb}7660.5287
\text{Apr}81691.3538
\text{Jun}9840.6729
\text{Aug}101271.0174
\text{Oct}111240.9933
\text{Dec}121791.4339
a

Calculate the seasonal index for each of the following billing periods, correct to four decimal places:

i

Feb

ii

Apr

iii

Jun

iv

Aug

v

Oct

vi

Dec

b

Deseasonalise the data in the table and complete the last column. Round figures to two decimal places.

c

Use your calculator to calculate the least squares regression line that fits deseasonalised data in terms of t. Round all values to two decimal places.

d

Predict the electricity bill amount for Feb 2021.

e

Comment on the reliability of your prediction.

18

A nursery records the demand of herb seedlings by making note of the sales during periods of the year. The results are given below:

t\text{Time period}\text{Seedlings sold }\text{Deseasonalised data}
1\text{Apr 2017}16361959.05
2\text{Aug 2017}14721957.19
3\text{Dec 2017}29772107.16
4\text{Apr 2018}20272427.25
5\text{Aug 2018}17302300.23
6\text{Dec 2018}34632451.16
7\text{Apr 2019}23422804.45
8\text{Aug 2019}22192950.41
9\text{Dec 2019}36412577.15

Seasonal indices:

AprilAugustDecember
0.83510.75211.4128
a

Use your calculator to calculate the least squares regression line that fits the deseasonalised data in terms of t. Round all values to two decimal places.

b

Predict the whole number of seedlings sold in August 2020.

c

Comment on the reliability of your prediction.

d

Predict the whole number of seedlings sold in April 2021.

e

Comment on the reliability of this second prediction.

Sign up to access Worksheet
Get full access to our content with a Mathspace account

Outcomes

ACMGM092

fit a least-squares line to model long-term trends in time series data

What is Mathspace

About Mathspace