Standard Deviation

Mobile Features AB

You might want to look at Measures of Central Tendency before learning about standard deviation. If you are already familiar with the mean of a data set, let's go!

Get started

Review generated flashcards

Sign up for free
You have reached the daily AI limit

Start learning or create your own AI flashcards

StudySmarter Editorial Team

Team Standard Deviation Teachers

  • 5 minutes reading time
  • Checked by StudySmarter Editorial Team
Save Article Save Article
Sign up for free to save, edit & create flashcards.
Save Article Save Article
  • Fact Checked Content
  • Last Updated: 01.09.2022
  • 5 min reading time
Contents
Contents
  • Fact Checked Content
  • Last Updated: 01.09.2022
  • 5 min reading time
  • Content creation process designed by
    Lily Hulatt Avatar
  • Content cross-checked by
    Gabriel Freitas Avatar
  • Content quality checked by
    Gabriel Freitas Avatar
Sign up for free to save, edit & create flashcards.
Save Article Save Article

Jump to a key chapter

    Standard deviation is a measure of dispersion, and it is used in statistics to see how spread out values are from the mean in a data set.

    Standard deviation formula

    The formula for standard deviation is:

    \[ \sigma = \sqrt{\dfrac{\sum(x_i-\mu)^2}{N}}\]

    Where:

    \(\sigma\) is the standard deviation

    \(\sum\) is the sum

    \(x_i\) is an individual number in the data set

    \( \mu\) is the mean of the data set

    \(N\) is the total number of values in the data set

    So, in words, the standard deviation is the square root of the sum of how far each data point is from the mean squared, divided by the total number of data points.

    The variance of a set of data is equal to the standard deviation squared, \(\sigma^2\).

    Standard deviation graph

    The concept of standard deviation is pretty useful because it helps us predict how many of the values in a data set will be at a certain distance from the mean. When carrying out a standard deviation, we assume that the values in our data set follow a normal distribution. This means that they are distributed around the mean in a bell-shaped curve, as below.

     probability standard deviation graph studysmarterStandard deviation graph. Image: M W Toews, CC BY-2.5 i

    The \(x\)-axis represents the standard deviations around the mean, which in this case is \(0\). The \(y\)-axis shows the probability density, which means how many of the values in the data set fall between the standard deviations of the mean. This graph, therefore, tells us that \(68.2\%\) of the points in a normally-distributed data set fall between \(-1\) standard deviation and \(+1\) standard deviation of the mean, \(\mu\).

    How do you calculate standard deviation?

    In this section, we will look at an example of how to calculate the standard deviation of a sample data set. Let's say you measured the height of your classmates in cm and recorded the results. Here's your data:

    165, 187, 172, 166, 178, 175, 185, 163, 176, 183, 186, 179

    From this data we can already determine \(N\), the number of data points. In this case, \(N = 12\). Now we need to calculate the mean, \(\mu\). To do that we simply add all the values together and divide by the total number of data points, \(N\).

    \[ \begin{align} \mu &= \frac{165 + 187+172+166+178+175+185+163+176+183+186+179}{12} \\ &= 176.25. \end{align} \]

    Now we have to find

    \[ \sum(x_i-\mu)^2.\]

    For this we can construct a table:

    \(x_i\)

    \(x_i - \mu\)

    \((x_i-\mu)^2\)

    165

    -11.25

    126.5625

    187

    10.75

    115.5625

    172

    -4.25

    18.0625

    166

    -10.25

    105.0625

    178

    1.75

    3.0625

    175

    -1.25

    1.5625

    185

    8.75

    76.5625

    163

    -13.25

    175.5625

    176

    -0.25

    0.0625

    183

    6.75

    45.5625

    186

    9.75

    95.0625

    179

    2.75

    7.5625

    For the standard deviation equation, we need the sum by adding all the values in the last column. This gives \(770.25\).

    \[ \sum(x_i-\mu)^2 = 770.25.\]

    We now have all the values we need to plug into the equation and get the standard deviation for this data set.

    \[ \begin{align} \sigma &= \sqrt{\dfrac{\sum(x_i-\mu)^2}{N}} \\ &= \sqrt{\frac{770.25}{12}} \\ &= 8.012. \end{align}\]

    This means that, on average, the values in the data set will be \(8.012\, cm\) away from the mean. As seen on the normal distribution graph above, we know that \(68.2\%\) of the data points are between \(-1\) standard deviation and \(+1\) standard deviation of the mean. In this case, the mean is \(176.25\, cm\) and the standard deviation \(8.012\, cm\). Therefore, \( \mu - \sigma = 168.24\, cm\) and \( \mu - \sigma = 184.26\, cm\), meaning that \(68.2\%\) of values are between \(168.24\, cm\) and \(184.26\, cm\) .

    The age of five workers (in years) in an office was recorded. Find the standard deviation of the ages: 44, 35, 27, 56, 52.

    We have 5 data points, so \(N=5\). Now we can find the mean, \(\mu\).

    \[ \mu = \frac{44+35+27+56+52}{5} = 42.8\]

    We now have to find

    \[ \sum(x_i-\mu)^2.\]

    For this, we can construct a table such as above.

    \(x_i\)\(x_i - \mu\)

    \((x_i-\mu)^2\)

    441.21.44
    35-7.860.84
    27-15.8249.64
    5613.2174.24
    529.284.64

    To find

    \[ \sum(x_i-\mu)^2,\]

    we can simply add all the numbers in the last column. This gives

    \[ \sum(x_i-\mu)^2 = 570.8\]

    We can now plug everything into the standard deviation equation.

    \[ \begin{align} \sigma &= \sqrt{\dfrac{\sum(x_i-\mu)^2}{N}} \\ &= \sqrt{\frac{570.8}{5}} \\ &= 10.68. \end{align}\]

    So the standard deviation is \(10.68\) years.

    Standard Deviation - Key takeaways

    • Standard deviation is a measure of dispersion, or how far away the values in a data set are from the mean.
    • The symbol for standard deviation is sigma, \(\sigma\)
    • The equation for standard deviation is \[ \sigma = \sqrt{\dfrac{\sum(x_i-\mu)^2}{N}} \]
    • The variance is equal to \(\sigma^2\)
    • Standard deviation is used for data sets that follow a normal distribution.
    • The graph for a normal distribution is bell-shaped.
    • In a data set that follows a normal distribution, \(68.2\%\) of values fall within \(\pm \sigma\) the mean.


    Images

    Standard deviation graph: https://commons.wikimedia.org/wiki/File:Standard_deviation_diagram.svg

    Frequently Asked Questions about Standard Deviation

    What is standard deviation?

    Standard deviation is a measure of dispersion, used in statistics to find the dispersion of values in a data set around the mean.

    Can standard deviation be negative?

    No, standard deviation cannot be negative because it is the square root of a number.

    How do you work out standard deviation?

    By using the formula 𝝈=√ (∑(xi-𝜇)^2/N) where 𝝈 is the standard deviation, ∑ is the sum, xi is an individual number in the data set, 𝜇 is the mean of the data set and N is the total number of values in the data set.

    Save Article
    How we ensure our content is accurate and trustworthy?

    At StudySmarter, we have created a learning platform that serves millions of students. Meet the people who work hard to deliver fact based content as well as making sure it is verified.

    Content Creation Process:
    Lily Hulatt Avatar

    Lily Hulatt

    Digital Content Specialist

    Lily Hulatt is a Digital Content Specialist with over three years of experience in content strategy and curriculum design. She gained her PhD in English Literature from Durham University in 2022, taught in Durham University’s English Studies Department, and has contributed to a number of publications. Lily specialises in English Literature, English Language, History, and Philosophy.

    Get to know Lily
    Content Quality Monitored by:
    Gabriel Freitas Avatar

    Gabriel Freitas

    AI Engineer

    Gabriel Freitas is an AI Engineer with a solid experience in software development, machine learning algorithms, and generative AI, including large language models’ (LLMs) applications. Graduated in Electrical Engineering at the University of São Paulo, he is currently pursuing an MSc in Computer Engineering at the University of Campinas, specializing in machine learning topics. Gabriel has a strong background in software engineering and has worked on projects involving computer vision, embedded AI, and LLM applications.

    Get to know Gabriel

    Discover learning materials with the free StudySmarter app

    Sign up for free
    1
    About StudySmarter

    StudySmarter is a globally recognized educational technology company, offering a holistic learning platform designed for students of all ages and educational levels. Our platform provides learning support for a wide range of subjects, including STEM, Social Sciences, and Languages and also helps students to successfully master various tests and exams worldwide, such as GCSE, A Level, SAT, ACT, Abitur, and more. We offer an extensive library of learning materials, including interactive flashcards, comprehensive textbook solutions, and detailed explanations. The cutting-edge technology and tools we provide help students create their own learning materials. StudySmarter’s content is not only expert-verified but also regularly updated to ensure accuracy and relevance.

    Learn more
    StudySmarter Editorial Team

    Team Math Teachers

    • 5 minutes reading time
    • Checked by StudySmarter Editorial Team
    Save Explanation Save Explanation

    Study anywhere. Anytime.Across all devices.

    Sign-up for free

    Sign up to highlight and take notes. It’s 100% free.

    Join over 22 million students in learning with our StudySmarter App

    The first learning app that truly has everything you need to ace your exams in one place

    • Flashcards & Quizzes
    • AI Study Assistant
    • Study Planner
    • Mock-Exams
    • Smart Note-Taking
    Join over 22 million students in learning with our StudySmarter App
    Sign up with Email