We described and compared sets of data in lesson 7.02 Measures of center using the mean and median, and then described and compared sets of data in lesson 7.03 Measures of spread using standard deviation and interquartile range. We will continue to use data displays to describe the center and spread of data and determine the impact that outliers have on the shape of data.
When we describe the shape of data sets, we focus on how the data points are distributed and whether the shape is symmetric or not.
In symmetric distributions the \text{mean}\approx \text{median}.
In distributions that are skewed left, the \text{mean} < \text{median}
In distributions that are skewed right, the \text{mean} > \text{median}.
In uniform distributions, the \text{mean} \approx \text{median}.
Keep in mind:
The number of minutes spent exercising per day for 10 days is recorded for two people who have just signed up for a new gym membership. Compare the exercise data for each person. What does the shape, center, and spread of the data tell us about each person's exercise habits?
Person A | Person B | |
---|---|---|
Mean | 57 | 54.5 |
Median | 57.5 | 70 |
Standard deviation | 6.34 | 17.55 |
Interquartile range | 11.25 | 25 |
Consider the list of ages of people in a field trip group:\{12, 12, 13, 13, 13, 13, 13, 14, 14, 24 \}
Interpret the data set using shape, center, and spread.
Remove the outlier from the data set and describe how the shape, center, and spread change.
Consider the following data distributions that show statistics for the WNBA and NBA:\text{Highest WNBA Salaries (in millions): } \{0.23, 0.23, 0.23, 0.23, 0.23, 0.23, 0.23, 0.22, 0.22, 0.20\}\\ \text{Highest NBA Salaries (in millions): } \{46, 44, 44, 44, 42, 42, 39, 39, 39, 39\}
Compare the shape, center, and spread for the highest salaries in each basketball association.
Interpret sets of data by considering the shape and skew: