When given bivariate data as a table of values, a scatterplot can be created to graph the data, where the explanatory variable is shown on the horizontal axis and the response variable is shown on the vertical axis. In this way, each data point is displayed as a point in a two-dimensional coordinate system.
Graphs of scatterplots and linear graphs were previously covered in Chapter 9. Select the brand of calculator you use below to work through an example of using a calculator to create a scatterplot.
Casio Classpad
How to use the CASIO Classpad to generate a scatterplot for a set of data.
The average number of pages read to a child each day and the child’s growing vocabulary are measured. Consider the data set given below:
Pages read per day ($x$x) | $25$25 | $27$27 | $29$29 | $3$3 | $13$13 | $31$31 | $18$18 | $29$29 | $29$29 | $5$5 |
---|---|---|---|---|---|---|---|---|---|---|
Total vocabulary ($y$y) | $402$402 | $440$440 | $467$467 | $76$76 | $220$220 | $487$487 | $295$295 | $457$457 | $460$460 | $106$106 |
Use your calculator to generate a scatterplot of the data.
TI Nspire
How to use the TI Nspire to generate a scatterplot for a set of data.
The average number of pages read to a child each day and the child’s growing vocabulary are measured. Consider the data set given below:
Pages read per day ($x$x) | $25$25 | $27$27 | $29$29 | $3$3 | $13$13 | $31$31 | $18$18 | $29$29 | $29$29 | $5$5 |
---|---|---|---|---|---|---|---|---|---|---|
Total vocabulary ($y$y) | $402$402 | $440$440 | $467$467 | $76$76 | $220$220 | $487$487 | $295$295 | $457$457 | $460$460 | $106$106 |
Use your calculator to generate a scatterplot of the data.
Create a scatter plot for the set of data in the table.
$x$x | $1$1 | $3$3 | $5$5 | $7$7 | $9$9 |
---|---|---|---|---|---|
$y$y | $3$3 | $7$7 | $11$11 | $15$15 | $19$19 |
An association between two variables is known as a correlation. A correlation may (or may not) signify a relationship between two variables. To identify any correlation between the two variables, there are three things to focus on when analysing a scatterplot:
The direction of the scatterplot refers to the pattern shown by the data points. The direction of the pattern can be described as having positive correlation, negative correlation or no correlation:
The form of a scatterplot refers to the type of relationship the two variables may appear to share. For example, if the data points lie on or close to a straight line, the scatterplot has a linear form.
Forms other than a line may be apparent in a scatterplot. If the data points lie on or close to a curve, it may be appropriate to infer a non-linear form between the variables.
The strength of a linear correlation relates to how closely the points reassemble a straight line.
Most scatterplots will fall somewhere in between these two extremes, and will display a weak, moderate or strong correlation.
The correlation coefficient (also known as the $r$r value) measures the strength of a linear correlation. This calculation will be discussed in the next lesson in this chapter.
Identify the type of correlation in the following scatter plot.
Think: If we draw a straight line through the points, we will be able to look at the gradient of the line and how closely it fits the points. Here is a line that approximates the trend of the data:
Do: The line that we drew to approximate the data has a gradient of around $+1$+1, so this is a positive correlation. The line fits quite closely to all of the points, so it is a strong correlation. In summary, we would say that this scatterplot indicates a strong, positive correlation.
Describe the correlation between the two variables; eye colour and IQ.
Think: Does a person's eye colour have anything to do with their IQ?
Do: Eye colour and IQ is an example of a pair of variables that have no correlation.
The scatter plot shows the relationship between sea temperatures and the amount of healthy coral.
Describe the correlation between sea temperature the amount of healthy coral.
Select all descriptions that apply.
Negative
Strong
Positive
Weak
Which variable is the response variable?
Sea temperature
Level of healthy coral
Which variable is the explanatory variable?
Level of healthy coral
Sea temperature
The following table shows the number of traffic accidents associated with a sample of drivers of different age groups.
Age | Accidents |
---|---|
$20$20 | $41$41 |
$25$25 | $44$44 |
$30$30 | $39$39 |
$35$35 | $34$34 |
$40$40 | $30$30 |
$45$45 | $25$25 |
$50$50 | $22$22 |
$55$55 | $18$18 |
$60$60 | $19$19 |
$65$65 | $17$17 |
Which of the following scatter plots correctly represents the above data?
Is the correlation between a person's age and the number of accidents they are involved in positive or negative?
Positive
Negative
Is the correlation between a person's age and the number of accidents they are involved in strong or weak?
Strong
Weak
Which age group's data represent an outlier?
30-year-olds
None of them
65-year-olds
20-year-olds
Consider the table of values that show four excerpts from a database comparing the income per capita of a country and the child mortality rate of the country. If a scatter plot was created from the entire database, what relationship would you expect it to have?
Income per capita | Child Mortality rate |
---|---|
$1465$1465 | $67$67 |
$11428$11428 | $16$16 |
$2621$2621 | $35$35 |
$32468$32468 | $9$9 |
Strongly positive
No relationship
Strongly negative