We look for a model that will summarise an apparent relation between two variable quantities. The model, if successful, will enable predictions to be made about further observations of an experiment and it may help in developing an understanding of the mechanisms involved in a process.
As a rule, we look for the simplest model that will provide a reasonably good fit for a set of data. And, of all possible models, a linear model is the simplest.
Even when it is clear that a nonlinear model would describe the data better than would a linear model, we may choose to use a linear model that will be approximately correct over a small portion of the data range. We may wish to approximate a complicated nonlinear process by a string of separate linear models that each apply to a small sub-range of the data.
There is a repeating pattern which could be modelled using sine or cosine functions but, as an approximation, we could fit several straight lines, each applicable to a part of the range.
We must be careful with modelling in this way to specify which linear graph is applicable for each part of the range of data. For example, the left-most line applies to the time period from about $2$2 hours to $8$8 hours, and so on.
The choice of a mathematical model for data often rests on extra information or assumptions about the data. For example, the idea of modelling the tide data using trigonometric functions comes from considerations beyond the actual data. We believe, before any data is collected, that the tides follow a repeating pattern. Therefore, we will need a mathematical function that repeats. Combinations of the sine function have this property.
If we did not use extra knowledge or common sense of this sort we might try modelling the tidal data using a polynomial function. One might find by experiment that it is possible to discover a degree $6$6 polynomial that fits the collected data very closely. However, just as with the linear approximations, we would have to specify that the function only applies in the particular $24$24-hour time period. This polynomial model would be wildly inaccurate on subsequent days because the polynomial increases without bound outside of the small window of time in which the data set was obtained.
It is proposed to carry out an experiment to do with trees that are being grown for timber in a plantation. The researchers are interested in the diameters of the tree trunks at time intervals after the trees were first planted. Before any measurements are made, various theories are discussed concerning the possible outcomes.
On the basis of this thinking, certain mathematical models are discarded as being impossible or unlikely and the experiment proceeds with the aim of discovering which of the remaining, plausible models seems to best reflect reality.
On the basis of this thinking, the researchers would probably discard an exponential model, a quadratic model, all periodic functions and various others, but might consider linear models (including composites of several linear pieces), a logarithm function, a square root function, or one of the several special S-shaped functions.
Every week, Rochelle deposits $\$800$$800 into a savings account. She plots her accumulated savings over several weeks.
Would a linear model accurately describe Rochelle’s savings after the first week? Choose the most appropriate response.
No, Rochelle's savings will not change over time.
Yes, the difference in the balance from week to week increases at a linear rate.
Yes, the accumulated savings will approach a constant amount after many weeks.
Yes, for every week that passes, Rochelle's savings will increase by a constant amount.
Due to a certain process of nuclear decay, the amount of Potassium-40 in a geological sample halves every $0.13$0.13 billion years. The scatter plot below compares the age of a sample ($t$t, in billions of years) with the proportion ($P$P) of the original amount of Potassium-40 present in the sample, for several different geological samples. The graph also features an exponential model.
As $t$t gets larger and larger, what value does $P$P approach?
The value of $P$P will approach $\editable{}$.
Consider a particular geological sample containing an initial amount of Potassium-40. As this sample ages, will the amount of Potassium-40 present always be greater than zero?
Yes, the amount present will continue to halve every $0.13$0.13 billion years and never reach zero.
No, there will ultimately be no Potassium-40 left in the sample.
A supermarket uses rectangular prism cardboard boxes of different widths, $x$x, and volumes, $y$y. A scatter plot of different sized boxes is shown below.
Jack is an employee who usually packs and unpacks smaller boxes. He uses the following linear model to represent the box sizes:
For which values of $x$x is this model most accurate?
$30\le x\le40$30≤x≤40
$40\le x\le46$40≤x≤46
$24\le x\le46$24≤x≤46
$24\le x\le30$24≤x≤30
Georgia is an employee who usually packs and unpacks larger boxes. She uses a different linear model to represent the box sizes, shown below:
For which values of $x$x is this model most accurate?
$40\le x\le46$40≤x≤46
$24\le x\le46$24≤x≤46
$30\le x\le40$30≤x≤40
$24\le x\le30$24≤x≤30
The graph below shows both Jack's and Georgia's models for the values of $x$x that they most accurately describe:
Do the two models together accurately describe all of the boxes used by the supermarket? Choose the most appropriate response.
Yes - Jack's model describes all of the small boxes, and Georgia's model describes all of the large boxes, so together they must describe all of the boxes.
No - There are boxes used by the supermarket with $x>46$x>46 that are not described accurately by either model.
Yes - Jack's model is accurate in the region $24\le x\le30$24≤x≤30 and Georgia's model is accurate in the region $40\le x\le46$40≤x≤46, and together these are all the possible values for $x$x.
No - Jack's model is only accurate in the region $24\le x\le30$24≤x≤30 and Georgia's model is only accurate in the region $40\le x\le46$40≤x≤46, but there is a box used by the supermarket that has $x=35$x=35 which is in neither region.