← All Resources

Statistics Explained: Mean, Standard Deviation, Z-Scores, and Probability

✍️
By Derek Giordano, BA Business Marketing  ·  Updated May 2026  ·  Reviewed for accuracy
📅 Updated May 2026⏱ 14 min read🧮 Statistics Calculator

Statistics shows up everywhere, and most people's reaction to it is somewhere between "I barely passed that class" and outright dread. But here's the thing — you don't need to love statistics to benefit from understanding the basics. It's how scientists determine whether a drug works, how pollsters predict elections, how quality engineers catch manufacturing defects, and how financial analysts price risk. I've tried to explain each concept below the way I wish someone had explained it to me: in plain language, with real examples, and with links to calculators that do the actual math.

Measures of Center: Mean, Median, and Mode

These three numbers each describe the "middle" of a dataset, but they do it in very different ways. Choosing the wrong one is one of the most common mistakes I see.

Mean (average): Add all values, divide by the count. For the dataset {4, 7, 8, 9, 12}: mean = 40 ÷ 5 = 8. Simple enough. But the mean has a weakness — it gets yanked around by extreme values. Change that 12 to 120 and the mean jumps from 8 to 29.6, even though most values are still single digits.

Median: The middle value when data is sorted. For {4, 7, 8, 9, 12}: median = 8. For even-numbered datasets, average the two middle values. Here's why the median matters: changing 12 to 120 doesn't budge it at all. And that's exactly why median household income ($74,580 in the U.S.) gets reported instead of average household income ($105,000+) — a handful of billionaires pull the average way up, but the median tells you what a typical household actually earns.

Mode: The most frequently occurring value. For {4, 7, 7, 9, 12}: mode = 7. A dataset can have no mode, one mode, or several. Honestly, mode doesn't come up that often in practice outside of categorical data — "the most common shoe size" or "the most popular color."

Use the Mean, Median, Mode Calculator to compute all three instantly for any dataset.

When to use which: Use mean for symmetric data without outliers (test scores, temperatures). Use median for skewed data or data with outliers (income, home prices, age at a concert). Use mode for categorical data or to find the most common value.

Standard Deviation: How Spread Out Is the Data?

Standard deviation measures dispersion — how far individual values typically fall from the mean. Low SD means everything clusters tightly around the average. High SD means wide spread. It's the number that reveals what the mean hides.

Worked example: Two classrooms both average 80% on a test, but with very different distributions:

ClassroomScoresMeanStandard Deviation
Class A78, 79, 80, 81, 82801.6
Class B55, 70, 80, 90, 1058018.7

Both classes average 80, but look at the difference. Class A's scores are nearly identical (SD = 1.6) — everyone understood the material about equally. Class B has a spread from 55 to 105 (SD = 18.7), meaning some students are badly lost while others are crushing it. Same average, completely different stories. That's what standard deviation gives you. Use the Standard Deviation Calculator to compute it for any dataset.

The Normal Distribution (Bell Curve)

A lot of things in nature follow a pattern: most values cluster near the center, with fewer and fewer as you move toward the extremes. Heights. Test scores. Blood pressure readings. Manufacturing tolerances. This bell-shaped pattern is called the normal distribution, and once you understand it, you'll start seeing it everywhere.

When data follows a normal distribution, a set of powerful rules kicks in:

RangePercentage of DataExample (Mean=100, SD=15, like IQ)
Within ±1 SD of mean68.2%85–115
Within ±2 SD of mean95.4%70–130
Within ±3 SD of mean99.7%55–145

This is called the 68-95-99.7 rule (or empirical rule). It only applies to normally distributed data.

The practical implications are huge. If a manufacturing process produces bolts with a mean diameter of 10.0 mm and a standard deviation of 0.02 mm, you instantly know that 99.7% of bolts will be between 9.94 and 10.06 mm. Anything outside ±3 SD is almost certainly defective. That's the foundation of quality control in every industry — and it all comes back to this one rule.

Z-Scores: Standardizing Any Dataset

A z-score converts any value to a universal scale — it tells you how many standard deviations that value sits from the mean. Once you get this concept, a lot of statistics clicks into place.

Formula: z = (value − mean) ÷ standard deviation

Example: On a test with mean = 75 and SD = 10, a score of 90 has a z-score of (90 − 75) ÷ 10 = 1.5. This means the score is 1.5 standard deviations above average. From the normal distribution table, a z-score of 1.5 corresponds to approximately the 93rd percentile — better than 93% of all scores.

Z-scores solve a problem that trips people up all the time. Say you scored 85 on a history test (mean 70, SD 10) and 42 on a chemistry test (mean 35, SD 5). Which performance was better? The history z-score is 1.5; the chemistry z-score is 1.4. You actually did slightly better in history relative to each class, despite the raw numbers being on completely different scales. That's the power of standardization. Use the Z-Score Calculator to convert any value.

Probability: Quantifying Uncertainty

Probability is just a number between 0 and 1 (or 0% and 100%) expressing how likely something is. Zero means impossible. One means certain. Everything interesting happens in between.

Basic probability: P(event) = favorable outcomes ÷ total outcomes. A fair die has P(rolling a 3) = 1/6 ≈ 16.7%.

Independent events: When one event does not affect another, multiply probabilities. P(two heads in a row) = 0.5 × 0.5 = 0.25 (25%). P(rolling 6 three times in a row) = (1/6)³ ≈ 0.46%.

Complementary events: P(not A) = 1 − P(A). If there is a 30% chance of rain, there is a 70% chance of no rain.

For more complex probability scenarios involving successes in multiple trials, the Binomial Probability Calculator computes exact probabilities. For general probability calculations, use the Probability Calculator.

Percentiles: Where You Stand in the Crowd

A percentile tells you what percentage of values in a dataset fall below a given value. If your test score is at the 85th percentile, you scored higher than 85% of test-takers. Straightforward.

Percentiles are everywhere once you start looking: standardized tests (SAT, GRE, ACT), pediatric growth charts, blood pressure categories, bone density scans. They give you context that raw numbers alone can't. A blood pressure reading of 135/85 means a lot more when you know it sits at the 75th percentile for your age group.

Z-scores and percentiles are directly related through the normal distribution. A z-score of 0 = 50th percentile. A z-score of +1 ≈ 84th percentile. A z-score of +2 ≈ 98th percentile. A z-score of −1 ≈ 16th percentile.

Common Statistical Mistakes

Confusing correlation with causation. This one drives me crazy because it's everywhere in media. Ice cream sales and drowning deaths both increase in summer. They're correlated — but ice cream doesn't cause drowning. A third variable (warm weather) drives both. Correlation measures the strength of a relationship; it says nothing about which variable causes the other, or whether something else entirely is responsible.

Ignoring sample size. Flip a coin 10 times and get 7 heads (70%)? Doesn't mean the coin is unfair. With only 10 trials, that kind of variability is normal. Flip 10,000 times and get 7,000 heads? Now you've got strong evidence. Larger samples produce more reliable statistics. Period.

Reporting mean when median is more appropriate. Average net worth in the 55–64 age group is $1.57 million; median is $364,500. That's a massive gap, and it's because a few billionaires pull the average way up. For skewed data, the median is almost always the more honest number. Whenever you see "average" in a headline about income or wealth, be skeptical.

Cherry-picking time periods. "The stock market returned 25% last year" and "The stock market lost 15% over two years" can both be true depending on which window you choose. Always ask: what's the full time period, and is this slice representative?

Frequently Asked Questions

What is the difference between mean, median, and mode?
Mean is the arithmetic average. Median is the middle value when sorted. Mode is the most frequent value. For skewed data (like income), median is usually more representative because extreme values pull the mean toward the tail.
What does standard deviation tell you?
Standard deviation measures how spread out data points are from the mean. For normally distributed data, about 68% of values fall within 1 SD, 95% within 2 SD, and 99.7% within 3 SD of the mean.
What is a z-score?
A z-score tells you how many standard deviations a value is from the mean. It lets you compare values across different scales and determine percentile rankings within a distribution.
What is the normal distribution and why does it matter?
The normal distribution (bell curve) is a symmetric pattern where most values cluster near the mean. It matters because many natural phenomena follow this pattern, and it enables the 68-95-99.7 rule for calculating probabilities.
How is probability calculated?
Basic probability = favorable outcomes ÷ total outcomes. For independent events, multiply probabilities. For mutually exclusive events, add them. P(heads) = 1/2. P(two heads in a row) = 1/2 × 1/2 = 1/4.

Run the Numbers

Compute statistics instantly. Use the free Statistics Calculator for full descriptive statistics, the Z-Score Calculator for standardized scores, and the Probability Calculator for likelihood calculations — no signup required.

Related tools: Standard Deviation Calculator · Mean, Median, Mode Calculator · Binomial Probability Calculator · Percentage Calculator · Percent Error Calculator

← Back to all resources
📚 Sources: [1] Khan Academy — Statistics and Probability [2] NCES — Condition of Education (Testing Data) [3] U.S. Census Bureau — Income and Poverty Statistics [4] NCTM — National Council of Teachers of Mathematics