Measures of Dispersion
"The mean tells you where the centre is. The standard deviation tells you how far from centre the data wanders."
1. Chapter Overview
CENTRAL TENDENCY tells you WHERE the data clusters. DISPERSION tells you how SPREAD OUT it is. Both are essential. This chapter covers: absolute and relative measures of dispersion — Range, Quartile Deviation, Mean Deviation, Standard Deviation (the most important), Coefficient of Variation, and the Lorenz Curve (for inequality).
2. Why Dispersion Matters
Two datasets with IDENTICAL means:
- Dataset A: [50, 50, 50, 50, 50] → Mean = 50. Variation = ZERO.
- Dataset B: [0, 25, 50, 75, 100] → Mean = 50. Variation = LARGE.
The mean is the SAME. The story is COMPLETELY DIFFERENT.
Absolute vs Relative Dispersion
- Absolute: Expressed in the SAME UNITS as the data (dollars, kg, cm). Example: Range, Standard Deviation.
- Relative: Expressed as a RATIO or PERCENTAGE. UNIT-FREE. Allows COMPARISON across datasets with different units. Example: Coefficient of Variation.
3. Measures of Dispersion
Range
- Range = Maximum value — Minimum value
- Simplest measure. ONLY uses two extreme values. IGNORES everything in between. Highly affected by outliers.
Quartile Deviation (Semi-Interquartile Range)
- Q.D. = (Q₃ — Q₁) / 2
- Q₁ (First quartile): 25% of observations below it. Q₃ (Third quartile): 75% below it.
- Covers the MIDDLE 50% of data. Not affected by extreme values at either end.
Mean Deviation (M.D.)
- Average of the ABSOLUTE deviations from a measure of central tendency (usually mean or median)
- M.D. = Σ|X — A| / N (where A is mean or median)
- Uses ALL values. But: using absolute values makes it mathematically less tractable.
Standard Deviation (S.D., σ) — THE MOST IMPORTANT
- σ = √[Σ(X — X̄)² / N] (for population)
- The SQUARE ROOT of the average of SQUARED deviations from the mean
- Why squared? (1) Eliminates sign (negative deviations become positive when squared). (2) Gives MORE WEIGHT to larger deviations. (3) Mathematically TRACTABLE.
Variance
- Variance = σ² = Σ(X — X̄)² / N
- The standard deviation SQUARED
Coefficient of Variation (C.V.)
- C.V. = (σ / X̄) × 100
- Relative measure. Allows COMPARISON of dispersion across datasets with DIFFERENT MEANS or DIFFERENT UNITS.
- LOWER C.V. = more consistent/less variability.
4. Lorenz Curve — Measuring Inequality
- Graphical representation of INEQUALITY (income, wealth, land distribution)
- Horizontal axis: CUMULATIVE % of population (from poorest to richest)
- Vertical axis: CUMULATIVE % of income (or whatever is being measured)
- Line of perfect equality: 45° diagonal. Bottom 20% of population have 20% of income; bottom 50% have 50%; etc.
- Lorenz Curve: the ACTUAL distribution. BOWS BELOW the line of equality.
- The FARTHER the Lorenz Curve is from the line of equality → the GREATER the inequality
- Gini Coefficient (not in detail in Class 11, but related): numerical measure derived from Lorenz Curve
5. Which Measure to Use?
| Measure | Pros | Cons |
|---|---|---|
| Range | Simplest | Uses only 2 values. Unstable. |
| Quartile Deviation | Ignores extremes | Ignores most data |
| Mean Deviation | Uses all data | Absolute values — mathematically less useful |
| Standard Deviation | Mathematically BEST. Foundation of statistics. | Sensitive to outliers |
| Coefficient of Variation | Allows cross-dataset comparison | Derived from mean and SD — inherits their assumptions |
Standard Deviation is the default choice. Complement with C.V. for relative comparison.
6. Exam Focus
- Range, Quartile Deviation — definitions and formulas
- Standard Deviation — formula, calculation, why squared deviations
- Variance — definition
- Coefficient of Variation — formula, when used
- Lorenz Curve — what it shows, how to read it, line of equality
7. Conclusion
Knowing the AVERAGE is not enough. You must know the SPREAD:
- RANGE: Max — Min. Crude but instant.
- STANDARD DEVIATION: The gold standard. Uses all data. Mathematically powerful. The foundation of inferential statistics.
- C.V.: For comparing apples and oranges (data with different means/units).
- LORENZ CURVE: For SEEING inequality.
'The mean is a seductive liar. The standard deviation is the honest friend who tells you the truth about how messy the data really is.'
