Numerical data, such as times, heights, weights or temperatures, are based on measurements making any data value possible within a large range of values. Instead of having a visual for every single data point, we can represent the frequency of each value or the number of times that the value occurs in the data set. When displaying frequency information for this type of data, a special chart called a histogram is used.
The histogram represents the distribution of the data. It allows us to see clearly where all of the recorded times fall along a numerical scale. The x \text{-axis} represents the measurements in the data set, and the y \text{-axis} represents the frequency, or number of times that the measure occurs in the data set.
Let's look at some examples of histograms and practice interpreting them.
A government agency records how long people wait on hold to speak to their representatives. The results are displayed in the histogram below:
Complete the corresponding frequency table:
Length of hold (minutes) | Frequency |
---|---|
1 | |
2 | |
3 | |
4 | |
5 |
How many phone calls were made?
How long in total did these people wait on the hold?
What was the mean wait time? Give your answer as a decimal.
The amount of snowfall (in centimeters) is recorded at the base of the mountain each day.
To create a frequency histogram of the data, which values go on the horizontal axis?
The snowfall recorded each day, to the nearest centimeter, is as follows: 6,\,2,\,0,\,3,\,2,\,2,\,3,\,4,\,2,\,0,\,3,\,2,\,3,\,4,\,6,\,4,\,3,\,0,\,5,\,3
Construct a frequency histogram of the data.
On how many days did 3 centimeters of snow fall?
On how many days did at least 4 centimeters of snow fall?
In the next example, the data needs to be grouped into intervals (also called bins) in order to construct the frequency table and the histogram to represent the times taken for 72 runners to complete a ten kilometer race.
Interval (minutes) | Frequency |
---|---|
45\ - \lt 50 | 9 |
50\ - \lt 55 | 7 |
55\ - \lt 60 | 20 |
60\ - \lt 65 | 30 |
65\ - \lt 70 | 6 |
Every data value must go into exactly one and only one interval or bin.
There are some general guidelines to use when attempting to create intervals:
The key features of a histogram are:
The horizontal axis is a numerical scale (like a number line)
The data on the horizontal axis may be grouped into intervals
There are no gaps between the columns of a histogram
The height of each column will be the frequency
Histograms are not the same as bar graphs. The two major differences between them are:
In a bar graph, the bars do not touch.
Bar graphs are normally used to represent categorical data (ie. eye color, hair color, gender, etc.) along the horizontal axis, rather than numerical data.
Consider the following set of values:
44,62,56,53,31,78,59,46,32,41,65
Which set of five intervals should we use to analyze this data?
Create a frequency table using the set of intervals from part (a).
Construct a histogram to display the data shown in the frequency table.
Every data value must go into exactly one and only one interval or interval.
The key features of a histogram are:
The horizontal axis is a numerical scale (like a number line)
The data on the horizontal axis may be grouped into intervals
There are no gaps between the columns of a histogram
The height of each column will be the frequency