Why the Average Is Not Enough
Imagine two cities both have an average daily temperature of 20°C (68°F). Sounds like they have similar weather, right? But what if City A stays between 18°C and 22°C all year, while City B swings from -5°C in winter to 45°C in summer? The average is the same, but the experience of living there is completely different.
This is why we need to measure spread - how much the values in a dataset differ from each other. Two of the most important tools for measuring spread are the range and the variance.
The Range: The Simplest Measure of Spread
The range is the easiest spread measure to understand. You take the largest value, subtract the smallest value, and that is your range.
Range = Highest value − Lowest value
A basketball player scores these points over 5 games: 12, 18, 15, 22, 10
Highest score: 22. Lowest score: 10.
Range = 22 − 10 = 12 points
This tells us the player's scoring varied by 12 points from their worst to best game.
The range gives you a quick snapshot, but it has a major limitation: it only looks at the two most extreme values and ignores everything in between.
Consider two students' test scores over 5 exams:
Student A: 60, 80, 82, 83, 100 → Range = 40
Student B: 60, 61, 62, 63, 100 → Range = 40
Both have the same range of 40, but Student A's scores are more clustered in the middle, while Student B has scores bunched at the low end with one high outlier. The range cannot tell you this.
When the Range Is Useful
Despite its limitations, the range is handy for quick checks. A nurse checking a patient's blood pressure over a week might note the range first: "Your systolic pressure ranged from 118 to 142." That immediately tells both the nurse and the patient something useful.
Introducing Variance: A Smarter Measure of Spread
Variance looks at every value in your data and asks: how far is each one from the mean? Then it combines all of those distances into one number. A low variance means the values are clustered close to the mean. A high variance means they are spread far apart.
Calculating Variance Step by Step
Let us walk through this with a simple example so the idea becomes clear.
Monthly salaries of 4 employees at a small shop: $2,000 · $2,500 · $3,000 · $2,500
Step 1 - Find the mean:
(2,000 + 2,500 + 3,000 + 2,500) ÷ 4 = $2,500
Step 2 - Find each value's distance from the mean:
- $2,000 − $2,500 = −$500
- $2,500 − $2,500 = $0
- $3,000 − $2,500 = +$500
- $2,500 − $2,500 = $0
Step 3 - Square each distance (to remove negative signs and emphasize larger gaps):
- (−500)² = 250,000
- (0)² = 0
- (500)² = 250,000
- (0)² = 0
Step 4 - Find the mean of those squared distances:
(250,000 + 0 + 250,000 + 0) ÷ 4 = 125,000
The variance is 125,000 (in "squared dollars," which is a bit awkward - we will address this in the next lesson on standard deviation).
Why Do We Square the Distances?
This is a question many beginners ask, and it is a great one. If you just added up the raw distances without squaring, the positives and negatives would cancel each other out and you would get zero every time. Squaring makes all the values positive and also gives extra weight to values that are far from the mean.
Population Variance vs. Sample Variance
You may encounter two slightly different formulas for variance. The difference is small but worth knowing about.
If your data includes every single member of the group you care about (for example, every student in a classroom), you divide by the total count. This is called population variance.
If your data is a sample - a smaller group chosen to represent a larger one (for example, 100 shoppers surveyed out of thousands) - you divide by one less than the count. This is called sample variance, and the small adjustment helps make the estimate more accurate.
You survey 5 people about how many cups of coffee they drink per day: 1, 2, 3, 2, 2. The mean is 2.
Squared distances from the mean: 1, 0, 1, 0, 0
Population variance (if these 5 are everyone you care about): (1+0+1+0+0) ÷ 5 = 0.4
Sample variance (if these 5 represent a larger group): (1+0+1+0+0) ÷ 4 = 0.5
The difference is small here, and it gets even smaller as your sample size grows.
Why Spread Matters in Real Life
Weather and Travel Planning
If you are packing for a trip and the average temperature at your destination is 22°C, you might pack only summer clothes. But if the variance is high, temperatures might swing from 10°C at night to 34°C during the day. You would want layers. The average alone does not prepare you.
Salary Negotiations
A job listing says the average salary for a position is $60,000. But what is the spread? If the range is $55,000 to $65,000, salaries are tightly packed and you know roughly what to expect. If the range is $35,000 to $120,000, the actual salary depends heavily on experience, location, and negotiation. The spread changes how you approach the conversation.
Sports Consistency
Two soccer players each score an average of 1 goal per match over a season. Player A scores 0 or 1 in most games - very consistent. Player B scores 0 in many games but occasionally scores 4 or 5 - high variance. A coach deciding between them might prefer Player A for reliability or Player B for a high-stakes match where a big performance could be the difference.
Comparing Range and Variance
The range is quick and easy but only considers two data points. The variance uses every data point and gives a fuller picture of spread. Think of the range as glancing at a thermometer once in the morning and once at night, while variance is like checking it every hour and calculating how much the temperature fluctuated throughout the day.
The range tells you the gap between the highest and lowest values - fast and simple, but it misses the details. Variance tells you how spread out all the values are around the mean - it uses every data point and gives a much richer picture. Together with the mean, these measures help you understand not just what is typical, but how much things vary. And in real life, the variation often matters more than the average.