Predict the value of y, using the given value of x and the line of best fit:
y = - 8.71 x + 6.79; x = 3.49
y = 8.84 x; x = 7.68
y = - 0.84 x - 0.19; x = - 43.15
y = 22.42 x + 2.93; x = 0.26
A bivariate data set has a line of best fit with equation t = 4.24 s.
Predict the value of t when s = 3.76.
A bivariate data set has a line of best fit with equation B = - 3.37 A + 9.87.
Predict the value of B when A = 8.26.
Find the value of x, using the given value of y and the line of best fit:
y = 9.23 x - 4.18; y = 24.8945
y = - 7.76 x - 5.89; y = 713.7724
y = - 6.83 x; y = - 59.6259
y = 0.45 x + 7.62; y = 7.566
A bivariate data set has a line of best fit with equation u = - 9.12 v - 6.93.
Find the value of v that gives a prediction of u = 575.6556.
Decide whether the prediction is an extrapolation or an interpolation for each of the data sets below:
A prediction for the y-value when x = 5 is made from the data set below:
x | 4 | 7 | 8 | 11 | 12 | 13 | 17 | 18 | 19 | 20 |
---|---|---|---|---|---|---|---|---|---|---|
y | 0 | 2 | 4 | 7 | 6 | 4 | 8 | 8 | 11 | 8 |
A prediction for the y-value when x = 33 is made from the data set below:
x | 37 | 54 | 58 | 59 | 43 | 55 | 60 | 38 | 64 | 35 |
---|---|---|---|---|---|---|---|---|---|---|
y | 72 | 53 | 26 | 21 | 73 | 47 | 12 | 102 | 10 | 112 |
A prediction of y = 95.69 is made from the following data set using the line of best fit with equation y = - 0.07 x + 96.18:
x | 19 | 10 | 1 | 7 | 14 | 11 | 2 | 5 | 17 | 8 |
---|---|---|---|---|---|---|---|---|---|---|
y | 94 | 94.4 | 97 | 96.4 | 94.4 | 97.8 | 94.9 | 95.9 | 96 | 94.4 |
A prediction of y = 72.77 is made from the following data set using the line of best fit with equation y = 1.26 x - 57.01:
x | 93 | 57 | 86 | 97 | 78 | 96 | 68 | 69 | 54 | 92 |
---|---|---|---|---|---|---|---|---|---|---|
y | 51.2 | 25.4 | 38.9 | 58.6 | 38.2 | 60.8 | 26.3 | 28.5 | 5.4 | 92 |
Predict the value of y for the given x value.
Comment on whether the prediction is reliable, referring to both the strength of correlation and whether interpolation or extrapolation is used.
Correlation coefficient: r = 0.93, x = 15, line of best fit: y = 0.84 x + 2.66.
x | 10 | 7 | 14 | 1 | 6 | 13 | 19 | 5 | 20 | 9 |
---|---|---|---|---|---|---|---|---|---|---|
y | 9 | 10 | 14 | 3 | 6 | 10 | 19 | 9 | 21 | 13 |
Correlation coefficient: r = 0.46, x = 49, line of best fit: y = 0.64 x + 54.31.
x | 37 | 33 | 52 | 100 | 65 | 81 | 83 | 18 | 59 | 51 |
---|---|---|---|---|---|---|---|---|---|---|
y | 47.6 | 36.4 | 145.6 | 137 | 74 | 93.8 | 78.4 | 82.4 | 101.2 | 117.3 |
Correlation coefficient: r = - 1, x = 1, line of best fit: y = - 3.06 x + 93.51.
x | 48 | 96 | 99 | 51 | 43 | 42 | 82 | 85 | 69 | 54 |
---|---|---|---|---|---|---|---|---|---|---|
y | -45 | -202 | -213 | -72 | -37 | -38 | -163 | -156 | -113 | -73 |
Correlation coefficient: r = 0.45, x = 143, line of best fit: y = 0.22 x + 0.7.
x | 23 | 81 | 11 | 44 | 50 | 91 | 51 | 95 | 53 | 82 |
---|---|---|---|---|---|---|---|---|---|---|
y | 1 | 35.2 | 1 | 12.8 | 23 | 1.7 | 21.2 | 35 | 1 | 2.9 |
Research on the average number of cigarettes per day, x, smoked during pregnancy and the birth weight of the newborn baby, y, was conducted. Results are recorded in the adjacent table:
Calculate the correlation coefficient between the two variables. Round your answer to two decimal places.
Describe the statistical relationship between these two variables in terms of strength, direction and shape.
Use your graphing calculator to form an equation for the least squares regression line of y \text{ on } x. Round all values to two decimal places.
Use your regression line to predict the birth weight of a newborn whose mother smoked on average 5 cigarettes per day. Round your answer to two decimal places.
Justify the reliability of the prediction.
x | y |
---|---|
46.30 | 3.90 |
13.00 | 5.80 |
21.40 | 5.00 |
25.00 | 4.80 |
8.60 | 5.50 |
36.50 | 4.50 |
1.00 | 7.00 |
17.90 | 5.10 |
10.60 | 5.50 |
13.40 | 5.10 |
37.30 | 3.80 |
18.50 | 5.70 |
A sample of families were interviewed about their annual family income and their average monthly expenditure. The results are given in the table below:
Calculate the correlation coefficient between the two variables. Round your answer to two decimal places.
Describe the statistical relationship between these two variables.
Use your graphing calculator to form an equation for the least squares regression line of y \text{ on } x. Round all values to three decimal places.
Use your regression line to predict the monthly expenditure for a family whose annual income is \$80\,000.
For which of the following annual incomes would the regression line produce the most reliable prediction?
\text{Income }(x) | \text{Expenditure }(y) |
---|---|
66\,000 | 1100 |
75\,000 | 1700 |
65\,000 | 1400 |
73\,000 | 1300 |
54\,000 | 600 |
90\,000 | 1800 |
87\,000 | 1100 |
87\,000 | 1500 |
94\,000 | 1800 |
96\,000 | 2200 |
The estimated iron ore grades (percentage of iron content) and the actual recovered iron ore grades are recorded to determine the accuracy of the models used by a mining company. The percentages are recorded in the table below:
\text{Estimated }(x) | 38\% | 32\% | 31\% | 33\% | 38\% | 40\% |
---|---|---|---|---|---|---|
\text{Actual }(y) | 48\% | 29\% | 42\% | 32\% | 40\% | 50\% |
Calculate the correlation coefficient between the two variables, rounded to two decimal places.
Describe the statistical relationship between these two variables.
Use your graphing calculator to form an equation for the least squares regression line of y \text{ on } x. Round all values to two decimal places.
Use your regression line to predict the actual iron ore grade from a pit where the estimated grade is 35\%. Round your answer to two decimal places.
For which of the following estimated grades would the regression line produce the least reliable prediction of the actual grade?
A team of salespeople submit their expenses and their sales for the month of March to their manager. Figures are recorded in the adjacent table:
Calculate the correlation coefficient between the two variables. Round your answer to two decimal places.
Describe the statistical relationship between these two variables.
Use your graphing calculator to form an equation for the least squares regression line of y \text{ on } x. Round all values to two decimal places.
Use your regression line to predict the expenses of a person in this department who made sales of \$6000 for the month.
Justify the validity of this prediction.
\text{Sales (hundreds) }(x) | \text{Expenses }(y) |
---|---|
55.7 | 43.8 |
48 | 28.9 |
62.7 | 73.9 |
23.4 | 11.9 |
23.8 | 9 |
60.6 | 50 |
20.2 | 20.2 |
52.1 | 28.2 |
23.4 | 19.2 |
33.3 | 31.1 |
The forecast maximum temperature, in degrees Celsius, and the observed maximum temperature are recorded to determine the accuracy in the temperature prediction models used by the weather bureau. Results are displayed in the adjacent table:
Calculate the correlation coefficient between the two variables. Round your answer to two decimal places.
Describe the statistical relationship between these two variables.
Use your graphing calculator to form an equation for the least squares regression line of y \text{ on } x. Round all values to two decimal places.
Use your regression line to predict the observed maximum temperature on a day in the same month when the forecast was 25 \degree \text{C}. Round your answer to two decimal places.
For which of the following forecast temperatures would the regression line produce the most reliable prediction?
\text{Forecast }(x) | \text{Observed }(y) |
---|---|
30.10 | 31.20 |
27.20 | 27.80 |
26.90 | 24.50 |
29.00 | 27.60 |
30.20 | 31.60 |
34.20 | 31.50 |
33.20 | 33.70 |
36.90 | 34.30 |
27.10 | 25.00 |
30.10 | 31.90 |
During an alcohol education programme, 10 adults were offered up to 6 drinks and were then given a simulated driving test where they scored a result out of a possible 100. The results are displayed in the table below:
\text{Number of drinks }(x) | 3 | 2 | 6 | 4 | 4 | 1 | 6 | 3 | 4 | 2 |
---|---|---|---|---|---|---|---|---|---|---|
\text{Driving score }(y) | 66 | 61 | 43 | 58 | 56 | 73 | 31 | 64 | 55 | 62 |
Calculate the correlation coefficient between the two variables. Round your answer to two decimal places.
Describe the statistical relationship between these two variables.
Use your graphing calculator to form an equation for the least squares regression line of y \text{ on } x. Round all values to two decimal places.
Use your regression line to predict the driving score of a young adult who consumed 5 drinks. Round your answer to one decimal place.
Describe the reliability of the prediction in part (d).
Research has been conducted to deduce whether there is any relationship between the number of pirated downloads as a percentage of total downloads, x, and the incidences of malware infection in computers, y.
x | 40.5 | 61.1 | 77.4 | 34.6 | 65.3 | 63.8 | 56 | 66.5 | 34.3 | 20.1 | 68.7 | 50.8 |
---|---|---|---|---|---|---|---|---|---|---|---|---|
y | 3.7 | 7.1 | 16.7 | 10.6 | 5.2 | 15.8 | 12.7 | 12.8 | 1.5 | 0.9 | 8.5 | 12.9 |
Calculate the correlation coefficient between the two variables. Round your answer to two decimal places.
Describe the statistical relationship between these two variables.
Use your graphing calculator to form an equation for the least squares regression line of y \text{ on } x. Round all values to two decimal places.
Use your regression line to predict the number of computers in 1000 that you would expect to be affected by malware if in that region 80\% of downloads were pirated.
Describe the reliability of this prediction.
The results, as percentages, for a practice spelling test and the real spelling test were collected for 8 students. The results are displayed in the table below:
\text{Practice }(x) | 56.30 | 79.00 | 59.40 | 77.00 | 71.60 | 64.40 | 61.60 | 68.20 |
---|---|---|---|---|---|---|---|---|
\text{Real }(y) | 48.90 | 69.30 | 51.90 | 66.00 | 62.50 | 56.10 | 54.50 | 59.20 |
Calculate the correlation coefficient between the two variables. Round your answer to two decimal places.
Describe the statistical relationship between these two variables.
Use your graphing calculator to form an equation for the least squares regression line of y \text{ on } x. Round all values to two decimal places.
Use your regression line to predict the real Spelling Test result of a student who scored 60\% in their Practice Spelling Test. Round the answer to two decimal places.
Describe the reliability of this prediction.
The data in the table and scatter plot below show the frequencies per month of online marketing emails sent out to subscribers compared with the proportion of subscribers who click and open the email:
\text{Frequency }(x) | \text{Proportion }(y) |
---|---|
3 | 0.41 |
4 | 0.59 |
1 | 1.28 |
5 | 0.47 |
4 | 0.26 |
7 | 0.62 |
10 | 0.75 |
Using a graphics calculator (or other technology), calculate the correlation coefficient between these two variables. Round your answer to two decimal places.
Which piece of data appears to be an outlier?
Remove the outlier and recalculate the correlation coefficient. Round your answer to two decimal places.
Describe the statistical relationship between these two variables (with the outlier removed).
Use your graphing calculator to form an equation for the least squares regression line of y on x (with the outlier removed). Round all values in the equation to two decimal places.
Use your regression line to predict the proportion of emails opened if they are sent 20 times a month. Round your answer to two decimal places.
Describe the reliability of the prediction in part (f).