We can now describe the direction and strength of trends in bivariate data, but there is a more precise way of measuring how strong a linear model fits the data. It is called the correlation coefficient (also called Pearson's correlation coefficient), and it is represented by the letter $r$r. It measures how close bivariate data is to a straight line, and also tells us whether the line has a positive or negative gradient.
Calculating the correlation coefficient by hand is very time-consuming. People usually use technology (either a calculator or a computer) to calculate the coefficient, and then do the more difficult task of interpreting their result.
The direction of a linear relationship is given by the sign of the correlation coefficient.
The strength of a linear relationship is given by the value of $r$r. The closer $r$r is to $1$1 o $-1$−1, the stronger the relationship is likely to be.
There are some special cases to consider.
Many websites have a correlation coefficient calculator, where you enter the ordered pairs in your data set and the calculator returns the coefficient. The correlation coefficient is often labelled $r$r or $R$R. Image 1 shows pairs of values of $x$x and $y$y entered separately but in corresponding order to calculate $R$R. | Of course you can also use a graphics calculator to enter the ordered pairs, and have it return the correlation coefficient value. Image 2 shows that in the Statistics mode, you can again enter the $x$x and corresponding $y$y-values in lists 1 and 2 and retrieve the correlation coefficient $R$R. |
The important work lies in interpreting these calculations, and what they mean for the strength of the relationship between the variables.
Calculate the correlation coefficient for the bivariate data in the table below to the nearest two decimal places.
$x$x | $7$7 | $12$12 | $6$6 | $19$19 | $8$8 | $16$16 | $20$20 | $9$9 | $11$11 | $19$19 |
---|---|---|---|---|---|---|---|---|---|---|
$y$y | $4$4 | $8$8 | $2$2 | $12$12 | $5$5 | $8$8 | $9$9 | $5$5 | $4$4 | $10$10 |
Describe the relationship between variables with a correlation coefficient of $-0.34$−0.34.
No linear relationship
Strong negative linear relationship
Weak positive linear relationship
Strong positive linear relationship
Weak negative linear relationship
The linear relationship between a set of data for variables $x$x and $y$y has a correlation coefficient of $0.3$0.3.
The linear relationship between a set of data for variables $x$x and $t$t has a correlation coefficient of $-0.9$−0.9.
Do the two linear relationships have the same direction?
Yes
No
Which relationship has a stronger correlation?
Between $x$x and $y$y.
Between $x$x and $t$t.