A line of good fit (or "trend" line) is a straight line that good represents the data on a scatter plot. This line may pass through some of the points, none of the points, or all of the points. However, it always represents the general trend of the points, which then determines whether there is a positive, negative or no linear relationship between the two variables.
Lines of good fit are really handy as they help determine whether there is a relationship between two variables, which can then be used to make predictions.
To draw a line of good fit, we want to minimise the vertical distances from the points to the line. This will roughly create a line that passes through the centre of the points.
The following scatter plot shows the data for two variables, x and y.
Draw a line of good fit for the data.
Use the line of good fit to estimate the value of y when x=4.5
Use the line of good fit to estimate the value of y when x=9
Lines of good fit are really handy as they help determine whether there is a relationship between two variables, which can then be used to make predictions.
Given a set of data relating two variables x and y, it may be possible to form a linear model. This model can then be used to understand the relationship between the variables and make predictions about other possible ordered pairs that fit this relationship.
How reliable are these predictions? Well, any model that fits the observed data will make reliable predictions from interpolations since the model roughly passes through the centre of the data points. We can say that the model follows the trend of the observed data.
However extrapolations are generally unreliable since we make assumptions about how the relationship continues outside of collected data. Sometimes extrapolation can be made more reliable if we have additional information about the relationship.
Several cars underwent a brake test and their age, x, was measured against their stopping distance, y. The scatter plot shows the results and a line of good fit that approximates the positive correlation:
According to the line, what is the stopping distance of a car that is 2 years old?
Using the two marked points on the line, determine the slope of the line of good fit.
Determine the value of the vertical intercept of the line.
Use the line of good fit to estimate the stopping distance of a car that is 6.5 years old.
The table shows the number of people who went to watch a movie x weeks after it was released.
\text{Weeks }(x) | 1 | 2 | 3 | 4 | 5 | 6 | 7 |
---|---|---|---|---|---|---|---|
\text{Number of people }(y) | 37 | 37 | 33 | 33 | 29 | 29 | 25 |
Plot the points from the table.
If a line of good fit were drawn to approximate the relationship, which of the following could be its equation?
Graph the line of good fit whose equation is given by y=-2x+40.
Use the equation of the line of good fit to find the number of people who went to watch the movie 12 weeks after it was released.
A car company looked at the relationship between how much it had spent on advertising and the amount of sales each month over several months. The data has been plotted on the scatter graph and a line of good fit drawn. Two points on the line are \left(3200, 300\right) and \left(5600, 450\right).
Using the two given points, what is the slope of the line of good fit?
The line of good fit can be written in the form S = \dfrac{1}{16} A + b, where S is the money spent on sales in thousands of dollars, and A is the advertising costs.
Determine the value of b, the vertical intercept of the line.
Use the line of good fit to estimate the number of sales next month if \$4800 is to be spent on advertising.
Which of the following is true about the prediction in part (c)?
A prediction made within the observed data is called an interpolation.
A prediction made outside the observed data is called an extrapolation.
Generally, extrapolation is less reliable than interpolation since the model makes assumptions about the relationship outside the observed data set.