We have learned how to identify  independent and dependent variables . We will now construct and interpret statistical data using scatter plots to determine whether a relationship exists between two variables.
The first step in determining the presence and type of relationship is to plot the data on a scatter plot. A scatter plot is a statistical display that is often used to determine whether the expected relationship exists between two quantitative variables.
Once we have a scatter plot, we can start to perform analysis such as determining association.
One way to analyze scatterplots is to describe the shape that the data takes. Sometimes the data clusters around some kind of curve, so the relationship is:
An association is a way of expressing a relationship between two variables and, more specifically, how strongly pairs of data are related. We describe the association from data using language like positive association, negative association, or no association. We can even further strengthen the language by using the words strong or weak to describe the association.
Linear patterns reveal whether or not two measurements are connected to each other. In other words, the presence of a linear pattern signals that the two sets of have linear association. One way of understanding these relationships is by plotting ordered pairs onto a scatter plot. This makes it easier to recognize patterns in the data, especially whether or not these patterns appear to be linear.
This linear relationship can be seen through close and consistent grouping in a scatter plot. The more closely the dots resemble a straight line, the stronger the association between the variables.
A positive association is when the data appears to gather in a positive relationship, similar to a straight line with a positive slope. In other words, as one variable increases, the other variables also increases or as one variable decreases the other decreases as well. So basically, the variables change in the same direction.
There are three types of positive association:
A negative association is when the data appears to gather in a negative relationship. Similar to a straight line with a negative slope. In other words, as one variable increases, the other one decreases.
Like positive association, there are three types of negative association:
If the points of the scatter plot are spread randomly then we can say there is no correlation.
Identify the type of association in the following scatter plot.
The following table shows the number of traffic accidents associated with a sample of drivers of different age groups.
Age | Accidents |
---|---|
20 | 41 |
25 | 44 |
30 | 39 |
35 | 34 |
40 | 30 |
45 | 25 |
50 | 22 |
55 | 18 |
60 | 19 |
65 | 17 |
Construct a scatter plot to represent the above data.
Is the association between a person's age and the number of accidents they are involved in positive or negative?
Is the association between a person's age and the number of accidents they are involved in strong or weak?
Which age group's data represent an outlier?
A positive association is when the data appears to gather in a positive direction, similar to a straight line with a positive slope. The variables change in the same direction.
A negative association is when the data appears to gather in a negative direction. Similar to a straight line with a negative slope. In other words, as one variable increases, the other one decreases.
When there is no relationship between the variables we say they have no association.