Visualizing their salary using a range may not account for the salary gap between the junior staff and the senior staff.
Quartiles and interquartile range are what account for the data values between the two extreme ends of the data set.
In this article, we will be learning about quartiles and the inter-quartile range.
What is a quartile?
Quartiles are the values that divide a set into quarters (four parts).
Though quartiles divide the data set into four parts, we thus have three quartiles: the first quartile, the second quartile, and the third quartile.
The lower quartile
The lower quartile, also known as the first quartile is what accounts for the data under 25%. It is technically the middle point value between the lowest data point and the median of the data set. It is also denoted by .
We recall that the median of a data set is the midpoint value. The lower quartile is the median of the set of values from the lowest data point to the median of the entire data set.
To find the lower quartile, we use the median as a reference point.
- If the number of data values in the data set is odd, disregard the middle number. The lower quartile is the median of the lower half of the data set.
- If the number of data values in the data set is even, the lower quartile is still the median of the lower half of the data set.
Find the lower quartile of the data set 9, 12, 3, 5, 8, 3, 4.
Solution
Step 1.
We rearrange the data values in ascending order to get
3, 3, 4, 5, 8, 9, 12
Step 2.
We identify that 5 is the median in the entire data set. However, that means that the lower half of the data is now left with 3, 3, 4.
Step 3.
The median for that is 3. Therefore, the lower quartile is
.
Find the lower quartile of the given data set 78, 62, 46, 89, 98, 23, 45, 77.
Solution
Step 1.
We rearrange the data values in ascending order to get,
23, 45, 46, 62, 77, 78, 89, 98
Step 2.
Since the number of the data values is even, we can split them into two equal parts with the lower half being,
23, 45, 46, 62
Step 3.
To find the median for these values, we will need to find the average of the two values in the middle, since this data set is also even. Thus, the lower quartile is given by,
The second quartile
The second quartile denoted by is the median of the data set. This is the middle point value of the entire data set.
To find the second quartile, we identify the middle value of the given data set if the number of data values is odd. If the number of the data values in the given data set is even, we find the average of the two middle values. That average is the second quartile.
Find the second quartile of the data set 9, 12, 3, 5, 8, 3, 4
Solution
Step 1.
We rearrange data values in ascending order, to get
3, 3, 4, 5, 8, 9, 12
Step 2.
5 here is identified as the middle value in the data set. Therefore, the second quartile is
Find the second quartile of the given data set 78, 62, 46, 89, 98, 23, 45, 77.
Solution
Step 1.
We rearrange the data values in ascending order
23, 45, 46, 62, 77, 78, 89, 98
Step 2.
Since the number of data set is even, two numbers can be identified as the middle values. These are 62 and 77. We will find the average of these values, to get
The third quartile
The third quartile, also known as the upper quartile is the value under which 75% of the data are found when arranged in increasing order. It is denoted by . This value is the middle point value between the median and the highest data value.
To find the upper quartile, use the median as a reference point.
- If the number of data values in the data set is odd, disregard the middle number. The upper quartile is the median of the upper half of the data set.
- If the number of data values in the data set is even, the upper quartile is still the median of the upper half of the data set.
Find the upper quartile of the data set 9, 12, 3, 5, 8, 3, 4
Solution
Step 1. We rearrange the data values in ascending order to get,
3, 3, 4, 5, 8, 9, 12
Step 2.
We identify that 5 is the median in the entire data set. However, that means that the upper half of the data is now left with
8, 9, 12.
The median for that is 9. Therefore,
Find the upper quartile of the given data set 78, 62, 46, 89, 98, 23, 45, 77.
Solution
Step 1.
We rearrange the data values in ascending order to get,
23, 45, 46, 62, 77, 78, 89, 98
Step 2.
Since the number of the data values is even, we can split them into two equal parts with the upper half being,
77, 78, 89, 98
Step 3.
To find the median for these values, we will need to find the average of the two values in the middle, since this data set is also even.
Importance of quartiles in statistics
There are significant uses of finding quartiles in statistics. These are discussed below.
- Quartiles easily identify a dataset's central tendency and its variability.
- Quartiles help identify outliers in a dataset.
- Quartiles give information on the shape of the distribution of data.
- They summarize large data sets.
- Quartiles are the primary elements used to calculate interquartile ranges.
Interquartile range and quartile deviation
The interquartile range is the difference between the upper quartile and the lower quartile value.
This means that to find the interquartile range of any given data successfully, you will need to know the upper and lower quartiles.
Interquartile range formula
The interquartile range formula is given by
where ,
To find the quartiles and interquartile range of a given data set you can proceed as follows,
Order the values in ascending order.
Find the median. This is always labeled as the second quartile ( ).
Now find the median of both halves of the data set. The lowest half is labelled , and the highest half is labelled.
Find the interquartile range (IQR) by subtracting from .
Quartiles and interquartile range calculation
In this section, we are going to take an example of how quartiles and interquartile range are calculated.
Find the interquartile range for the data set 6, 47, 49, 15, 43, 41, 7, 39, 43, 41, 36.
Solution
Step 1.
We rearrange the data set in order from lowest to highest, to get
6, 7, 15, 36, 39, 41, 41, 43, 43, 47, 49
Step 2.
We find the median by locating the middle data point, which is 41. This is also known as the second quartile,
Step 3.
With finding the median for both halves, we need to understand that the point where the median is located divides the data points into two.
Hence, the median for the first half will be the first quartile, whist the median for the second half will be the third quartile. Let us find the median for the first half first.
The first half is 6, 7, 15, 36, 39 . The median is 15. Thus
We find the median for the second half too, which is 41, 43, 43, 47, 49 . The median is 43. Thus.
Now, we can proceed to calculate the interquartile range,
Plotting interquartile ranges
Plotting interquartile ranges on a graph means you would be drawing a box plot. To construct one, we follow the following steps,
- Rearrange the values in the data set from lowest to highest.
- Identify the highest and lowest values in the data set.
- Identify the data set's midpoint value (median).
- Find the upper and lower quartiles.
- Find the inter-quartile range.
- Construct the box plot with the necessary values found.
The table below is the data of basketball players' points scored per game over a seven-game span. Visualize this on a box plot.
Game | Points |
1 | 10 |
2 | 17 |
3 | 5 |
4 | 32 |
5 | 16 |
6 | 18 |
7 | 20 |
Solution
Step 1.
We rearrange the values in the data set from lowest to highest.
5, 10, 16, 17, 18, 20, 32.
Step 2.
Now identify the highest and lowest values in the data set
Step 3.
We can now identify the midpoint value (median) of the data set,
Step 4.
We will now find the upper and lower quartiles.
The lower quartile is the median for the first half of the data set. That means that we are finding the median for 5, 10, 16
The upper quartile is the median for the second half of the data set. That means that we are finding the median for 18, 20, 32
Step 5.
We can now find the inter-quartile range by the formula,
Step 6.
Now that we have all our necessary values, we will construct our box and whisker plot.
We will first draw a number line that fits the data, and plot all the necessary values we found.
Construct a rectangle that encloses the median of the entire data set that its vertical lines pass through the upper and lower quartiles. Now construct a vertical line through the median that hits both ends of the rectangle.
Quartile deviation
Quartile deviation is defined as half of the difference between the upper and lower quartile.
Quartile deviation is one of the measures that measure dispersion in a data set. Mathematically, this measures the extent to which the lower and upper quartiles differ from the median. It is calculated by dividing a data set's inter-quartile range by 2.
The quartile deviation is also known as the semi-inter-quartile range. Its formula is defined by,
What will be the quartile deviation for the data set 6, 9, 3, 6, 6, 5, 2, 3, 8?
Solution
Step 1.
We rearrange the data set in order from lowest to highest,
2, 3, 3, 5, 6, 6, 6, 8, 9
Step 2.
We find the median by locating the middle data point, which is 6. This means the second quartile is 6.
Step 3.
We find the median for both halves. Let us start with the first.
2, 3, 3, 5
We have both values are 3, therefore the first quartile is 3.
Now we will find the median for the second half
6, 6, 8, 9
Since we have two figures here, we will find the average of them.
Now that we have found the lower and upper quartiles, we want to know how much these values deviate from the median (which is the middle point value of the data set). Finding the quartile deviation means we will subtract the first quartile from the third quartile, and divide it by 2,
Quartiles and Interquartile Range - Key takeaways
- A quartile is a type of quantile that divides an ordered data set into four quarters.
- The interquartile range is the difference between the upper quartile and the lower quartile value.
- The third quartile accounts for data under 75%.
- The formula for inter-quartile range is .
- The formula for quartile deviation is
How we ensure our content is accurate and trustworthy?
At StudySmarter, we have created a learning platform that serves millions of students. Meet
the people who work hard to deliver fact based content as well as making sure it is verified.
Content Creation Process:
Lily Hulatt is a Digital Content Specialist with over three years of experience in content strategy and curriculum design. She gained her PhD in English Literature from Durham University in 2022, taught in Durham University’s English Studies Department, and has contributed to a number of publications. Lily specialises in English Literature, English Language, History, and Philosophy.
Get to know Lily
Content Quality Monitored by:
Gabriel Freitas is an AI Engineer with a solid experience in software development, machine learning algorithms, and generative AI, including large language models’ (LLMs) applications. Graduated in Electrical Engineering at the University of São Paulo, he is currently pursuing an MSc in Computer Engineering at the University of Campinas, specializing in machine learning topics. Gabriel has a strong background in software engineering and has worked on projects involving computer vision, embedded AI, and LLM applications.
Get to know Gabriel