Linear interpolation formula
The linear interpolation formula is the simplest method used to estimate the value of a function between any two known points. This formula is also useful for curve fitting using linear polynomials. This formula is often used for data forecasting, data prediction and other mathematical and scientific applications. The linear interpolation equation is given by:
\[y = y_1 + (x-x_1) \frac{(y_2-y_1)}{(x_2-x_1)}\]
where:
x1 and y1 are the first coordinates.
x2 and y2 are the second coordinates.
x is the point to perform the interpolation.
y is the interpolated value.
Solved example for linear interpolation
The best way to understand linear interpolation is through the use of an example.
Find the value of y if x = 5 and some set of value given are (3,2), (7,9).
Step 1: First assign each coordinate the right value
x = 5 (note that this is given)
x1 = 3 and y1 = 2
x2 = 7 and y2 = 9
Step 2: Substitute these values into the equations, then get the answer for y.
\(y = 2 +(5-3)\frac{(9-2)}{(7-3)} \quad y = \frac{11}{2}\)
How to do linear interpolation
There are a few useful steps that will help you compute the desired value such as the median, 1st quartile and 3rd quartile. We will go through each step with the use of an example so that it is clear.
In this example, we will look at grouped data with class intervals.
Class | Frequency |
0-10 | 5 |
11-20 | 10 |
21-30 | 1 |
31-40 | 8 |
41-50 | 18 |
51-60 | 6 |
61-70 | 20 |
Frequency is how often a value in a specific class appears in the data.
Step 1: Given the class and the frequency, you have to create another column called the cumulative frequency (also known as CF).
Class | Frequency | CF |
0-10 | 5 | 5 |
11-20 | 10 | 15 |
21-30 | 1 | 16 |
31-40 | 8 | 24 |
41-50 | 18 | 42 |
51-60 | 6 | 48 |
61-70 | 20 | 68 |
Step 2: Plot the cumulative frequency graph. To do this, you plot the upper boundary of the class against the cumulative frequency.
Finding the median
The median is the value in the middle of the data.
The position of the median is at the \(\Big( \frac{n}{2} \Big)^{th}\) value, where n is the total cumulative frequency
In this example, n = 68
Step 1: Solve for the position of the median \(\frac{68}{2} = 34^{th} \space position\)
Step 2: Look for where the 34th position lies in the data using the cumulative frequency.
According to the cumulative frequency, the 34th value lies in the 41-50 class interval.
Step 3: Given the graph, use linear interpolation to find the specific median value.
We treat the segment of the graph where the class interval lies as a straight line and use the gradient formula to assist.
\(\text{Gradient} = \frac{(\text{Median cf - previous cf})}{(\text{upper bound - lower bound})} =\frac{(42-24)}{(50-41)} = 2\)
We can manipulate this formula and substitute the value of the median (m) as the upper bound and the position of the median as the median cf which is also equal to the gradient.
\(\text{Gradient} = \frac{(34-24)}{(m-41)}\)
So it follows that,
\(2 = \frac{(34-24)}{(m-41)} \quad 2 = \frac{10}{m-41} \quad m-41 = \frac{10}{2} \quad m-41 = 5 \quad m = 46\)
So the median is 46.
Finding the first quartile
The 1st quartile is also known as the lower quartile. This is where the first 25% of the data lies.
The position of the 1st quartile is the \(\Big(\frac{n}{4} \Big)^{th}\) value.
The steps to find the 1st quartile are very similar to the steps to find the median.
Step 1: solve for the position of the 1st quartile \(\frac{68}{4} = 17^{th} \text{ position}\)
Step 2: Look for where the 17th position lies in the data using the cumulative frequency.
According to the cumulative frequency, the 17th value lies in the 31-40 class interval.
Step 3: Given the graph, use linear interpolation to find the specific 1st quartile value.
We treat the segment of the graph where the class interval lies as a straight line and use the gradient formula to assist.
\(\text{Gradient} = \frac{(1^{st}\text{quartile cf - previous cf})}{(\text{upper bound - lower bound})} =\frac{(24-16)}{(40-31)} = \frac{8}{9}\)
We can manipulate this formula and substitute the value of the 1st quartile (Q1) as the upper bound and the position of the 1st quartile as the 1st quartile cf which is also equal to the gradient.
\(\text{Gradient} = \frac{(17-16)}{(Q_1-31)}\)
It follows that,
\(\frac{8}{9} = \frac{(17-16)}{(Q_1 - 31)} \quad \frac{8}{9} = \frac{1}{Q_1 - 31} \quad Q_1 - 31 = \frac{9}{8} \quad Q_1 = 32.125\)
So the 1st quartile is 32.125.
Finding the third quartile
The 1st quartile is also known as the lower quartile. This is where the first 25% of the data lies.
The position of the 3rd quartile is the \(\Big(\frac{3n}{4} \Big)^{th}\) value.
Step 1: solve for the position of the 3rd quartile \(\frac{3(68)}{4} = 51^{st} \text{ position}\)
Step 2: Look for where the 51st position lies in the data using the cumulative frequency.
According to the cumulative frequency, the 51st value lies in the 61-70 class interval.
Step 3: Given the graph, use linear interpolation to find the specific 3rd quartile value.
We treat the segment of the graph where the class interval lies as a straight line and use the gradient formula to assist.
\(\text{Gradient} = \frac{3^{rd} \text{quartile cf - previous cf}}{\text{upper bound - lower bound}} = \frac{(68-48)}{(70-61)} = \frac{20}{9}\)
We can manipulate this formula and substitute the value of the 3rd quartile (Q3) as the upper bound and the position of the 3rd quartile as the 3rd quartile cf which is also equal to the gradient.
\(\text{Gradient} = \frac{(51-48)}{(Q_3 -61)}\)
It follows that, \(\frac{20}{9} = \frac{(51-48)}{(Q_3 - 61)} \quad \frac{20}{9} = \frac{3}{Q_3 - 61} \quad Q_3 - 61 = \frac{27}{20} \quad Q_3 = 62.35\)
So the 3rd quartile is 32.125.
Linear Interpolation - Key takeaways
- Linear interpolation is used to find an unknown value of a function between any two known points.
- The formula for linear interpolation is \(y = y_1 +(x-x_1) \frac{(y_2-y_1)}{(x_2-x_1)}\)
- Linear interpolation can also be used to find the median, 1st quartile and 3rd quartile
- The position of the median is \(\frac{n}{2}\)
- The position of the 1st quartile is \(\frac{n}{4}\)
- The position of the 3rd quartile \(\frac{3n}{4}\)
- A graph of the upper bounds in each class interval plotted against the cumulative frequency can be used to locate the median, 1st quartile and 3rd quartile.
- The gradient formula can be used to find the specific value of the median, 1st quartile and 3rd quartile
How we ensure our content is accurate and trustworthy?
At StudySmarter, we have created a learning platform that serves millions of students. Meet
the people who work hard to deliver fact based content as well as making sure it is verified.
Content Creation Process:
Lily Hulatt is a Digital Content Specialist with over three years of experience in content strategy and curriculum design. She gained her PhD in English Literature from Durham University in 2022, taught in Durham University’s English Studies Department, and has contributed to a number of publications. Lily specialises in English Literature, English Language, History, and Philosophy.
Get to know Lily
Content Quality Monitored by:
Gabriel Freitas is an AI Engineer with a solid experience in software development, machine learning algorithms, and generative AI, including large language models’ (LLMs) applications. Graduated in Electrical Engineering at the University of São Paulo, he is currently pursuing an MSc in Computer Engineering at the University of Campinas, specializing in machine learning topics. Gabriel has a strong background in software engineering and has worked on projects involving computer vision, embedded AI, and LLM applications.
Get to know Gabriel