A line of best fit or regression line both refer to a linear regression model that best represents the data on a scatter plot. Depending on the strength of the association, measured with the coefficient of determination (r^2), a regression function may pass exactly through all of the points, some of the points, or none of the points. However, it always represents the general trend of the data.
Lines of best fit are really handy as we can use them to help us make predictions or conclusions about the data.
The correlation coefficent, r, can be calculated with technology to describe the strength of the line of best fit. To approximate a line of best fit by eye, balance the number of points above the line with the number of points below the line. You should generally ignore outliers as they can skew the line of best fit.
Below are examples of what a good line of best fit might look like.
During an alcohol education program, 10 adults were offered up to 6 drinks and were then given a simulated driving test where they were scored out of a possible 100 points.
Number of drinks | 3 | 2 | 6 | 4 | 4 | 1 | 6 | 3 | 4 | 2 |
---|---|---|---|---|---|---|---|---|---|---|
Driving score | 64 | 59 | 42 | 57 | 58 | 72 | 33 | 63 | 55 | 62 |
Describe the association between number of drinks and driving score.
Use technology to calculate the correlation coefficient and line of best fit.
Interpret the meaning of the slope and y-intercept of the line of best fit in context of the data.