Once we have organised the data, we need to present the data in a form that will be easy to read, understand and analyse.
Some common ways of displaying statistical data are listed below.
The best type of display to be used will depend on the type of data and purpose of the investigation.
Another type of statistical graph, the box and whisker plot is used to display statistical summary data, and will be described in a later section.
A pie chart (pie graph or sector graph) is a chart for displaying categorical data that uses a circle divided into slices(sectors) to show relative sizes of data. This chart is useful to display and compare parts of data that make up a whole, such as proportion of voters voting for particular political parties.
There are $360^\circ$360° in a circle. Using this fact we can:
$\theta=\text{Fraction of group in sector}\times360^\circ$θ=Fraction of group in sector×360°
$\text{Number in a sector}=\frac{\theta}{360}\times\text{Number in whole group}$Number in a sector=θ360×Number in whole group
The sector graph represents the number of people taking leave from work at a particular company.
If $5$5 people took leave in January, how many degrees represent $1$1 person?
How many people took leave in November?
How many people took leave between the beginning of November and the end of March?
What percentage of the people took leave in December?
Give your answer as a percentage, rounding to two decimal places.
These graphs represent the frequency of data values as the length of horizontal bars or vertical columns.
Column graphs (also known as bar graphs) are usually used to display categorical data or discrete numerical data.
Histograms are similar to column graphs, with vertical columns used to display continuous numerical data. The main difference between a column graph and histogram is that histograms do not have spaces between the columns.
The reason that histograms do not have gaps between columns is that the class intervals are not separate categories. Instead, the columns represent the frequency of values observed in the class intervals. The width of the columns indicates the range of values in the class intervals. Below are some examples and a brief description of their differences.
Column graphs | Histograms |
---|---|
|
|
In a survey some people were asked approximately how many minutes they take to decide between brands of a particular product.
Complete the table.
Minutes Taken | Tally | Frequency |
---|---|---|
1 | $\editable{}$ | |
2 | $\editable{}$ | |
3 | $\editable{}$ |
How many people took part in the survey?
How much time did most people take to choose between brands?
Complete the column graph using the results from the table above.
The histogram below shows the number of hours that students in a particular class had slept for the night before.
How many students are in the class?
How many students had at least $8$8 hours of sleep that night?
What percentage of students had less than $6$6 hours of sleep?
Dot plots are a graphical way of displaying the distribution of numerical or categorical data on a simple scale with dots representing the frequency of data values. They are best used for small to medium size sets of data and are good for visually highlighting how the data is spread and whether there are any gaps in the data or outliers. We will look at identifying outliers in more detail in our next lesson.
In a dot plot, each individual value is represented by a single dot, displayed above a horizontal line. When data values are identical, the dots are stacked vertically. The graph appears similar to a pictograph or column graph with the number of dots representing the total count.
Here is a dot plot of the number of goals scored in each of Bob’s soccer games.
How many times were five goals scored?
Which number of goals were scored equally and most often?
$1$1
$0$0
$4$4
$3$3
$2$2
$5$5
How many games were played in total?
Christa is a casual nurse. She used a dot plot to keep track of the number of shifts she did each week for a number of weeks.
Over how many weeks did Christa record her shifts?
For how many weeks did she work $5$5 shifts?
$\editable{}$ weeks
How many weeks did she work less than $6$6 shifts?
$\editable{}$ weeks
When Christa works at least $6$6 shifts a week, she buys a weekly train ticket. What proportion of the time did she buy a weekly train ticket?
A stem and leaf plot, or stem plot, is used for organising and displaying numerical data. It is appropriate for small to moderately sized data sets. The graph is similar to a column graph on its side. An advantage of a stem and leaf plot over a column graph is the individual scores are retained and further calculations can be made accurately.
In a stem and leaf plot, the right-most digit in each data value is split from the other digits, to become the 'leaf'. The remaining digits become the 'stem'.
The values in a stem and leaf plot should be arranged in ascending order (from lowest to highest) from the centre out. To emphasise this, it is often called an ordered stem and leaf plot.
The data values $10,13,16,21,26,27,28,35,35,36,41,41,45,46,49,50,53,56,58$10,13,16,21,26,27,28,35,35,36,41,41,45,46,49,50,53,56,58 are displayed in the stem and leaf plot below.
Which of the following is true of a stem-and-leaf plot?
Stem | Leaf | |
$0$0 | $7$7 | |
$1$1 | ||
$2$2 | ||
$3$3 | $1$1 $3$3 $3$3 $3$3 | |
$4$4 | $1$1 $2$2 $3$3 $4$4 $9$9 | |
$5$5 | $1$1 $2$2 $4$4 $5$5 $5$5 | |
$6$6 | $0$0 | |
|
The scores are ordered.
A stem-and-leaf plot does not give an idea of outliers and clusters.
It is only appropriate for data where scores have high frequencies.
The individual scores cannot be read on a stem-and-leaf plot.
The stem-and-leaf plot below shows the age of people to enter through the gates of a concert in the first $5$5 seconds.
Stem | Leaf | |
$1$1 | $1$1 $2$2 $4$4 $5$5 $6$6 $6$6 $7$7 $9$9 $9$9 | |
$2$2 | $2$2 $3$3 $5$5 $5$5 $7$7 | |
$3$3 | $1$1 $3$3 $8$8 $9$9 | |
$4$4 | ||
$5$5 | $8$8 | |
|
How many people passed through the gates in the first $5$5 seconds?
What was the age of the youngest person?
The youngest person was $\editable{}$ years old.
What was the age of the oldest person?
The oldest person was $\editable{}$ years old.
What proportion of the concert-goers were under $20$20 years old?