Time series data refers to bivariate data where the explanatory variable, or independent variable, is time. This can be time in hours, days, months, weeks, quarters or any unit of time that is collected at regular intervals. Time is always plotted on the horizontal axis.
Some examples of time series data include:
fuel prices - they tend to rise and fall in a cyclical pattern according to the day of the week
air-conditioner sales - they do not remain constant all year as sales increase and decrease according to the time of year
ocean tides - their height is cyclical over a 24 hour period
A time series graph is a bit like a scattergraph however the points are connected in sequential order by lines.
Trend - by looking at the graph we can see overall whether the response variable is increasing (positive trend), decreasing (negative trend), or stable (not increasing or decreasing).
Note: To be seasonal, each cycle must have the same number of points. If there is a repeating pattern but the number of data points is different for each cycle, then we say it is cyclical rather than seasonal. In the above example each cycle has 4 points so we say it is seasonal and there are 4 seasons per cycle.
Irregular Fluctuations-once we've observed trend and seasonality, we can determine whether the data for some seasons or for entire cycles has irregular fluctuations.
Below is an example of a time series graph that shows house and apartment prices in Melbourne.
The overall trend is increasing. There is not enough detail to see the seasonality of the data or count how many cycles in one season. There are some clear fluctuations.
We can see prices flattening out leading up to the Global Financial Crisis (GFC), and then house prices took a bit of a dive from September 2008 to March 2009. Recovery was fast and house prices rose rapidly.
There was with a big jump in December 2010.
Almost a year later in September 2011, prices seem to be falling. At this time, investors may have been wondering whether they were in a pricing bubble which means prices are well above the normal market value. When the bubble bursts, prices drop rapidly and this causes a lot of uncertainty in the market.
In this case, if we were to allocate numerical values to the time periods, t=1 would be Sept 06, t=2 would be Dec 06 etc.
Consider the following set of Time Series data:
Time Period | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 |
---|---|---|---|---|---|---|---|---|---|---|---|---|
Data | 5 | 10 | 18 | 1 | 11 | 20 | 7 | 13 | 22 | 10 | 15 | 25 |
Plot a time series graph for the data.
State the number of seasons per cycle.
Identify the underlying trend as increasing, decreasing, or constant.
State the time period that appears to be an outlier, if there is one.
Characteristics of time series graphs:
Time is always plotted on the horizontal axis.
Trend - by looking at the graph we can see overall whether the data is increasing (positive trend), decreasing (negative trend), or stable (not increasing or decreasing).
Seasonality - by counting the number of points from peak to peak or trough to trough we can determine how many seasons or time periods constitute one cycle of the data.
Outliers - once we've observed trend and seasonality, we can determine outliers, which are data points that do not follow the pattern of the rest of the data.