Standard deviation measures how spread out numbers are from their average. A small standard deviation means values cluster tightly around the mean; a large standard deviation means they are widely dispersed. It is the most fundamental measure of variability in statistics and appears everywhere from test scores to stock market analysis to quality control.

What Standard Deviation Tells You

Imagine two classes of students both scoring an average of 75% on an exam. Class A has scores ranging from 70 to 80 (SD = 3), while Class B ranges from 45 to 100 (SD = 18). The average is identical, but the stories are completely different. Class A performed consistently; Class B had extreme variation. Standard deviation captures this difference that the average alone cannot.

Step-by-Step Calculation

For the dataset: 4, 8, 6, 5, 3, 7, 8, 5:

  1. Find the mean: (4+8+6+5+3+7+8+5) / 8 = 46/8 = 5.75
  2. Subtract the mean from each value and square the result:
    (4−5.75)² = 3.0625, (8−5.75)² = 5.0625, (6−5.75)² = 0.0625, (5−5.75)² = 0.5625, (3−5.75)² = 7.5625, (7−5.75)² = 1.5625, (8−5.75)² = 5.0625, (5−5.75)² = 0.5625
  3. Sum the squared differences: 23.5
  4. Divide by N (population) or N−1 (sample): Population: 23.5/8 = 2.9375. Sample: 23.5/7 = 3.357
  5. Take the square root: Population SD = 1.71. Sample SD = 1.83
Formulas
Population SD: σ = √(Σ(x−μ)² / N) Sample SD: s = √(Σ(x−x̄)² / (N−1))

Use population SD (σ) when you have data for the entire group. Use sample SD (s) when your data is a subset of a larger population. The N−1 correction (Bessel’s correction) reduces bias in sample estimates.

The 68-95-99.7 Rule

For normally distributed data (bell curve), standard deviation creates predictable boundaries:

  • 68% of values fall within ±1 standard deviation of the mean
  • 95% fall within ±2 standard deviations
  • 99.7% fall within ±3 standard deviations

This is why values beyond 3 standard deviations are considered extreme outliers. In quality control, the "Six Sigma" methodology targets processes where defects fall beyond 6 standard deviations from the mean. Use the Standard Deviation Calculator to compute SD instantly for any dataset.

Key Takeaways

  • Standard deviation measures spread — how far values typically deviate from the mean.
  • Use sample SD (N−1) for subsets of data; population SD (N) for complete datasets.
  • The 68-95-99.7 rule applies to normally distributed data and defines expected ranges.
  • Low SD = consistent data, high SD = variable data. Neither is inherently good or bad — it depends on context.
  • Always pair SD with the mean — SD alone is meaningless without knowing the center of the data.

Frequently Asked Questions

When should I use standard deviation vs variance?

Variance is the square of standard deviation (SD squared). Standard deviation is preferred for interpretation because it is in the same units as your data. If your data is in dollars, SD is in dollars while variance is in dollars-squared, which is hard to interpret. Variance is used in advanced statistical formulas and calculations behind the scenes.

What is the difference between population and sample standard deviation?

Population SD divides by N (total count) and is used when you have data for an entire group. Sample SD divides by N-1 (Bessel's correction) and is used when your data is a subset of a larger population. The N-1 correction produces a slightly larger value that better estimates the true population variability from a sample.

What is a high standard deviation?

Whether an SD is 'high' depends entirely on the context and the mean. An SD of 10 is tiny for household income data (mean ~$75,000) but enormous for exam scores (mean ~75). The coefficient of variation (CV = SD/Mean x 100%) normalizes this: a CV below 15% generally indicates low variability, 15-30% moderate, and above 30% high.