iGCSE (2021 Edition)

18.10 Cumulative frequency tables and graphs (Extended)

Lesson

Cumulative frequency scores

Some frequency tables have an extra column for cumulative frequency, which is a running total of the frequencies. In other words, the cumulative frequency for a row is that row's frequency plus the frequencies of all the previous scores in the data set.

Worked example

Example 1

Sam recorded the number of pets owned by $10$ people in his class. Here is the regular frequency table.

| Number of Pets | Frequency |
| --- | --- |
| $0$ | $2$ |
| $1$ | $5$ |
| $2$ | $2$ |
| $3$ | $1$ |

Now let's look at how we would calculate and add in a cumulative frequency column. Remember, we add each frequency to the previous running total. The first value in the cumulative frequency column will be the same as the value in the frequency column (since there's no previous value to add it to).

| Number of Pets | Frequency | Cumulative Frequency |
| --- | --- | --- |
| $0$ | $2$ | $2$ |
| $1$ | $5$ | $2+5=7$ |
| $2$ | $2$ | $7+2=9$ |
| $3$ | $1$ | $9+1=10$ |

Notice that the final value in the cumulative frequency column is the same as the total number of people surveyed. This is a useful check that the cumulative frequencies have been calculated correctly.
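The running total can also be checked with a few lines of code. The following is a minimal Python sketch using the frequencies from Sam's table; the variable names are illustrative only.

```python
from itertools import accumulate

# Scores and frequencies from Sam's table
pets = [0, 1, 2, 3]
frequencies = [2, 5, 2, 1]

# accumulate() yields the running total after each frequency
cumulative = list(accumulate(frequencies))

for score, f, cf in zip(pets, frequencies, cumulative):
    print(score, f, cf)

# The last running total should equal the number of people surveyed
assert cumulative[-1] == sum(frequencies)
```

The final check mirrors the observation above: the last cumulative frequency matches the total of the frequency column.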

Practice question

Question 1

The number of sightings of the Northern Lights was recorded across various Canadian locations over a period of $1$ month. The numbers below represent the number of sightings at each location.

$12,8,9,8,11,7,7,11,10,9,9,11,7,10,11,7,8,9,11,9$

  1. Complete the table.

    | Number of Sightings | Number of Locations ($f$) | Cumulative Frequency ($cf$) |
    | --- | --- | --- |
    | $7$ | | |
    | $8$ | | |
    | $9$ | | |
    | $10$ | | |
    | $11$ | | |
    | $12$ | | |
  2. In how many locations were there at least $11$ sightings?

  3. In how many locations were there fewer than $11$ sightings?

  4. What was the median number of sightings across all $20$ locations?

 

Cumulative frequency graph

This is a line graph that:

  • starts at the smallest score of the first class, at a height of zero, and rises to the cumulative frequency of that class at the highest score of the class.
  • increases by the frequency of each class, with each point plotted at the largest score in the class interval.
  • ends at a height equal to the total number of scores.

 

Consider the following cumulative frequency table:

| Class | Frequency | Cumulative frequency |
| --- | --- | --- |
| $0-10$ | $5$ | $5$ |
| $11-20$ | $16$ | $21$ |
| $21-30$ | $10$ | $31$ |
| $31-40$ | $7$ | $38$ |
| $41-50$ | $4$ | $42$ |
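The points that make up the graph for this table can be listed with a short Python sketch. It assumes the line starts at a height of zero at the smallest score of the first class, and that each cumulative frequency is plotted at the largest score of its class; the names `classes` and `points` are illustrative.

```python
from itertools import accumulate

# (smallest score, largest score, frequency) for each class in the table
classes = [(0, 10, 5), (11, 20, 16), (21, 30, 10), (31, 40, 7), (41, 50, 4)]

running_totals = list(accumulate(f for _, _, f in classes))

# Assumed starting point: smallest score of the first class at height 0
points = [(classes[0][0], 0)]

# Each cumulative frequency is plotted at the largest score of its class
points += [(upper, cf) for (_, upper, _), cf in zip(classes, running_totals)]

print(points)
```

Joining these points in order with straight segments gives the cumulative frequency line.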

 

The cumulative frequency graph would be: 

Finding the median from the cumulative frequency graph

We can find the median of the data set using the cumulative frequency graph by:

- Finding the middle point on the cumulative frequency axis (half the total number of scores)

- Drawing a horizontal line to the polygon and then a vertical line down to the horizontal axis

 

Estimating quartiles and percentiles from the cumulative frequency graph

We can find the percentiles and quartiles of the data set using the cumulative frequency graph by:

- Finding the corresponding percentage of the total number of scores and locating that number on the cumulative frequency axis. For example, for the $20$th percentile, find $20\%$ of the total number of scores.

- Drawing a horizontal line to the polygon and then a vertical line down to the horizontal axis
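Reading a value off the graph can be imitated numerically. The sketch below is one possible approach, not the only one: it assumes the graph is made of straight segments between plotted points and uses linear interpolation to find where a horizontal line meets it. The functions `read_off` and `percentile` are hypothetical helpers, and the points come from the class table earlier in this lesson, with an assumed starting height of zero.

```python
def read_off(points, target_cf):
    """Return the horizontal-axis value where the graph reaches target_cf.

    points: (score, cumulative frequency) pairs in increasing order.
    Assumes straight segments between points (linear interpolation).
    """
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        if y0 <= target_cf <= y1:
            if y1 == y0:  # flat segment: no unique crossing, use its left end
                return x0
            return x0 + (target_cf - y0) * (x1 - x0) / (y1 - y0)
    raise ValueError("target_cf is outside the range of the graph")


def percentile(points, p):
    total = points[-1][1]  # the last height is the total number of scores
    return read_off(points, p * total / 100)


# Largest score of each class with its cumulative frequency,
# starting from an assumed height of 0 at the smallest score
points = [(0, 0), (10, 5), (20, 21), (30, 31), (40, 38), (50, 42)]

median = percentile(points, 50)  # horizontal line drawn at height 21
p20 = percentile(points, 20)     # horizontal line drawn at height 8.4

print(median, p20)
```

A reading taken by eye from a printed graph will usually be rounder than the interpolated value, so small differences between the two are expected.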

Worked example

Example 2

Consider the cumulative frequency graph given below:

[Figure: cumulative frequency graph]

Use the graph to estimate: 
a) The median.

b) The $90$th percentile.

c) The lower quartile.

d) The upper quartile.

e) The interquartile range.

 

 

a) 

Think: The median is the $50\%$ mark of the data, because $50\%$ of the data lies below it and $50\%$ lies above it.

Do: We need to find $50\%$ of the total number of scores, which according to the graph is $50$.

$50\% \text{ of } 50 = 50\% \times 50 = 25$

 

Now to find the median we draw a horizontal line from $25$ on the vertical axis until it hits the cumulative frequency graph, then draw a line vertically down:

We can see that the dashed line hits the horizontal axis in the column bounded by $20$ and $25$. So we take the average of these numbers to find the median. Therefore the median is $22.5$.

Note: If the column were labelled with just one number, rather than two numbers at the end points, then that one number would be the median.

b) 

Think: The $90$th percentile is the $90\%$ mark of the data, because $90\%$ of the data lies below it.

Do: We need to find $90\%$ of the total number of scores, which according to the graph is $50$.

$90\% \text{ of } 50 = 90\% \times 50 = 45$

Now to find the percentile we draw a horizontal line from $45$ on the vertical axis until it hits the cumulative frequency graph, then draw a line vertically down:

We can see that the dashed line hits the horizontal axis in the column bounded by $30$ and $35$. So we take the average of these numbers to find the percentile. Therefore the $90$th percentile is $32.5$.

c) 

Think: The lower quartile is the $25\%$ mark of the data, because $25\%$ of the data lies below it.

Do: We need to find $25\%$ of the total number of scores, which according to the graph is $50$.

$25\% \text{ of } 50 = 25\% \times 50 = 12.5$

Now to find the lower quartile we draw a horizontal line from $12.5$ on the vertical axis until it hits the cumulative frequency graph, then draw a line vertically down:

We can see that the dashed line hits the horizontal axis in the column bounded by $15$ and $20$. So we take the average of these numbers to find the lower quartile. Therefore the lower quartile is $17.5$.

d) 

Think: The upper quartile is the $75\%$ mark of the data, because $75\%$ of the data lies below it.

Do: We need to find $75\%$ of the total number of scores, which according to the graph is $50$.

$75\% \text{ of } 50 = 75\% \times 50 = 37.5$

Now to find the upper quartile we draw a horizontal line from $37.5$ on the vertical axis until it hits the cumulative frequency graph, then draw a line vertically down:

We can see that the dashed line hits the horizontal axis in the column bounded by $25$ and $30$. So we take the average of these numbers to find the upper quartile. Therefore the upper quartile is $27.5$.

 

e) 

Think: The interquartile range is the difference between the upper quartile and lower quartile. 

Do:  

Interquartile range $= 27.5 - 17.5 = 10$
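The arithmetic in parts (a) to (e) can be collected in a short Python sketch. The total of $50$ and the quartile readings are taken from the worked example; the dictionary `targets` is just an illustrative name for the heights at which the horizontal lines are drawn.

```python
total = 50  # total number of scores read from the graph in this example

# Height on the cumulative frequency axis for each measure
targets = {
    name: pct * total / 100
    for name, pct in [
        ("lower quartile", 25),
        ("median", 50),
        ("upper quartile", 75),
        ("90th percentile", 90),
    ]
}
print(targets)

# Values read from the graph in parts (c) and (d)
lower_quartile = 17.5
upper_quartile = 27.5

# Interquartile range = upper quartile - lower quartile
iqr = upper_quartile - lower_quartile
print(iqr)
```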

 

 

Cumulative frequency and grouped data

For grouped data, cumulative frequencies are calculated in the same way, by adding a cumulative frequency column to the frequency distribution table. The difference with grouped data is that we can only estimate the median, because the individual scores within each class are not known.

Worked example

Example 2

The frequency distribution table below shows the heights, in centimetres, of a group of children aged $5$ to $11$.

| Child's height (cm) | Class centre | Frequency | Cumulative frequency |
| --- | --- | --- | --- |
| $91-100$ | $95$ | $5$ | $5$ |
| $101-110$ | $105$ | $22$ | $27$ |
| $111-120$ | $115$ | $30$ | $57$ |
| $121-130$ | $125$ | $31$ | $88$ |
| $131-140$ | $135$ | $18$ | $106$ |
| $141-150$ | $145$ | $6$ | $112$ |

Use the table to answer the following questions:

  1. How many children were in the group?
  2. How many children had heights greater than $130$ cm but less than or equal to $140$ cm?
  3. Which class interval contained the most children?
  4. How many children had a height less than or equal to $120$ cm?
  5. How many children had a height greater than $130$ cm?

Do:

  1. The final cumulative frequency value tells us there were $112$ children in the group. This is equal to the sum of the values in the frequency column.
  2. The frequency column indicates there are $18$ children with heights in the range $131-140$.
  3. The class interval with the highest frequency is $121-130$, with a frequency of $31$.
  4. The cumulative frequency of the $111-120$ class interval tells us that $57$ children had a height less than or equal to $120$ cm.
  5. Here we can add the final two frequencies: $18+6=24$. Alternatively, we could subtract the cumulative frequency of $88$ (corresponding to the class interval containing the height $130$ cm) from the total number of children in the group: $112-88=24$.
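These five answers can be cross-checked against the table with a minimal Python sketch. The tuple layout and variable names below are illustrative choices, not part of the original table.

```python
# (lower bound, upper bound, frequency) for each class in the height table
classes = [(91, 100, 5), (101, 110, 22), (111, 120, 30),
           (121, 130, 31), (131, 140, 18), (141, 150, 6)]

# 1. Total number of children
total = sum(f for _, _, f in classes)

# 2. Children in the 131-140 class
in_131_140 = next(f for lo, hi, f in classes if (lo, hi) == (131, 140))

# 3. Class interval with the highest frequency
modal_class = max(classes, key=lambda c: c[2])[:2]

# 4. Children with height less than or equal to 120 cm
at_most_120 = sum(f for _, hi, f in classes if hi <= 120)

# 5. Children with height greater than 130 cm
above_130 = sum(f for lo, _, f in classes if lo > 130)

print(total, in_131_140, modal_class, at_most_120, above_130)
```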

Practice question

Question 2

Complete the table and answer the following questions:

  1. Complete the frequency distribution table:

    | Class | Class centre ($x$) | Frequency ($f$) | Cumulative frequency | Centre times frequency ($fx$) |
    | --- | --- | --- | --- | --- |
    | $1-9$ | | $8$ | | |
    | $10-18$ | | $16$ | | |
    | $19-27$ | | $4$ | | |
    | $28-36$ | | $21$ | | |
    | $37-45$ | | $16$ | | |
    | Totals | | | | |
  2. Using the class centres as 'scores', calculate the mean to 2 decimal places.

  3. What is the median class?

    A. $28-36$
    B. $1-9$
    C. $10-18$
    D. $19-27$
    E. $37-45$
  4. What is the modal class?

    A. $28-36$
    B. $37-45$
    C. $1-9$
    D. $10-18$
    E. $19-27$

Cumulative frequency graphs for grouped data

We can construct a cumulative frequency graph for grouped data using the class interval on the horizontal axis. 

Worked example 

Example 3

a) The global life expectancy data from 2016 is shown in the frequency distribution table below. Construct a cumulative frequency graph for the data set.

| Class interval | Frequency | Cumulative frequency |
| --- | --- | --- |
| $51-54$ | $5$ | $5$ |
| $55-60$ | $10$ | $15$ |
| $61-64$ | $25$ | $40$ |
| $65-70$ | $26$ | $66$ |
| $71-74$ | $40$ | $106$ |
| $75-80$ | $49$ | $155$ |
| $81-84$ | $28$ | $183$ |
| Total | $183$ | |

Do: Plot the cumulative frequency for each class interval to get the height of each column. The columns should be ascending each time.

The cumulative frequency graph is displayed below:

b) Estimate the median life expectancy age using the graph.

Think: The median age is in the middle of the data set when the data is in ascending order. The number of scores altogether is 183, so the median corresponds to half of 183 on the cumulative frequency axis, which is 91.5.

Do: Starting at 91.5 along the vertical axis, draw a line from the vertical axis to the cumulative frequency graph and then a perpendicular line down to the horizontal axis. Estimate the value of the median by its position on the horizontal axis.

 
 

The median is approximately 73 years.
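As a rough numerical cross-check of this reading, the sketch below takes half the total frequency as the target height and interpolates along the segment of the graph that crosses it. It assumes each cumulative frequency is plotted at the largest score of its class and that the points are joined by straight segments; the variable names are illustrative.

```python
from bisect import bisect_left

# Largest score of each class and its cumulative frequency, from part (a)
uppers = [54, 60, 64, 70, 74, 80, 84]
cum_freq = [5, 15, 40, 66, 106, 155, 183]

target = cum_freq[-1] / 2  # half the total number of scores

# Index of the first class whose cumulative frequency reaches the target
i = bisect_left(cum_freq, target)

# Interpolate along the straight segment ending at that class
# (assumes i >= 1, which holds for this data)
x0, y0 = uppers[i - 1], cum_freq[i - 1]
x1, y1 = uppers[i], cum_freq[i]
median_estimate = x0 + (target - y0) * (x1 - x0) / (y1 - y0)

print(round(median_estimate, 2))
```

Under these assumptions the estimate falls between 72 and 73 years, which is consistent with the value of about 73 read from the graph.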

 

 

Outcomes

0607C11.6

Cumulative frequency table and curve. Median, quartiles and interquartile range.

0607E11.6

Cumulative frequency table and curve. Median, quartiles, percentiles and interquartile range.
