The coefficient of determination, r^2, is related to Pearson's correlation coefficient r, and is useful when interpreting a linear relationship between two variables. The coefficient of determination is between 0 and 1 and is most useful when converted to a percentage. It is used to describe the level of accuracy with which one variable can be used to predict another.
If we have already identified the value of r, this can simply be squared to get the value of r^2. For instance, if r=0.8 then r^2=0.64, and if r=-0.9 then r^2=0.81.
If we don't already have the value of r, technology can be used to calculate both.
For example, the CAS image below shows the r^2 value is given on the same screen as the r value.
r^2 tells us the proportion of the response variable (y) that can be explained by the variation in the explanatory variable (x).
For example, if r^2=0.92 then we can say that 92\% of the variation in the response variable is explained by the variation in the explanatory variable.
The closer the value of r^2 is to 1, the more that the variation in the response variable is explained by the variation in the explanatory variable. Note that this does not mean that the closer r^2 is to 1, the more the x variable is causing the y variable to happen. Be very careful with the language you use - correlation does not imply causation.
A scientist investigated the link between the number of cancer cells killed by a certain drug and the strength of the drug used. The results were recorded and the coefficient of determination r^2 was found to be 0.92.
Which of the following is true?
A linear association between two data sets is such that the correlation coefficient is -0.72.
What proportion of the variation can be explained by the linear relationship? Give your answer to the nearest percent.
The heights (in \text{cm}) and the weights (in \text{kg}) of 8 primary school children is shown on the scattergraph below.
Calculate the value of the coefficient of determination. Give your answer to two decimal places.
Calculate the value of the correlation coefficient. Give your answer to two decimal places.
What percentage of the variation in weight is accounted for by the height of the child? Give your answer to the nearest whole percent.
Consider these two comments on the claim βThe weight of a child is primarily influenced by their height.β
Which do you think is most correct?
r^2 tells us the proportion of the response variable (y) that can be explained by the variation in the explanatory variable (x).