Chi-squared distribution

Take independent standard normals, square each, and add them up. The result is chi-squared with degrees of freedom.

The density itself is unlovely, but every interactive feature follows from that definition:

00.050.10.150.2024681012141618xdensity
Chi-squared (mean = k = 4.0) Normal(k, √(2k))
PDF . The sum of k squared standard normals: . The dashed Normal(k, √2k) approximation tightens as k grows.

What to notice

  • Small k is strongly right-skewed. At the mode sits at zero and the density blows up there — a single Normal squared spends most of its time near zero.
  • Large k goes Normal. The dashed curve is Normal(k, √(2k)). By the two are nearly indistinguishable, which is why variance-based tests quietly revert to z-tests when degrees of freedom are high.
  • Mean and variance are both linear in k. Adding one more squared Normal adds 1 to the mean and 2 to the variance.

Why it matters

Every statistical test whose null distribution is “something squared divided by something” turns out to involve chi-squared:

  • Sample variance. If your data are iid Normal, the scaled sample variance is exactly chi-squared:
    This is the denominator of Student’s t.
  • Pearson’s χ² test. Summing squared residuals of observed vs. expected counts produces an approximate chi-squared.
  • Likelihood ratios. Under regularity conditions, −2 log(LR) is chi-squared in the sample-size limit (Wilks’ theorem).

Together with the F distribution — the ratio of two chi-squareds — it covers most of the classical frequentist toolkit.