Regression problem setup

  • Let $D = \{(x_1, y_1), \dots, (x_n, y_n)\}$ be a training dataset sampled from some distribution $P$.
  • Let $x$ denote an input and $y$ the corresponding output.
  • Assume the true relationship is given by

    $$y = f(x) + \varepsilon,$$

    where $\varepsilon$ is random noise with $\mathbb{E}[\varepsilon] = 0$ and variance $\sigma^2$.

  • A learning algorithm trained on $D$ produces an estimator $\hat{f}_D(x)$.
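The setup above can be sketched with a small simulation. This is a minimal illustration under assumed choices not fixed by the text: a hypothetical true function $f(x) = \sin(x)$, Gaussian noise with standard deviation $0.3$, and a training set of 30 points.

```python
import numpy as np

rng = np.random.default_rng(0)

def f(x):
    """The (assumed) true function generating the data."""
    return np.sin(x)

def sample_training_set(n=30, sigma=0.3):
    """Draw D = {(x_i, y_i)} with y = f(x) + eps, E[eps] = 0, Var[eps] = sigma^2."""
    x = rng.uniform(0, 2 * np.pi, size=n)
    eps = rng.normal(0.0, sigma, size=n)   # zero-mean noise with variance sigma^2
    return x, f(x) + eps

x_train, y_train = sample_training_set()
print(x_train.shape, y_train.shape)
```

Each call to `sample_training_set` plays the role of one draw of the training set $D$; a learning algorithm fitted to `(x_train, y_train)` yields one realisation of $\hat{f}_D$.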

The quantity we want to analyse

We want to analyse the performance of a model at a fixed test point $x$. The total expected squared error at $x$ is:

$$\mathbb{E}_{D, \varepsilon}\Big[\big(y - \hat{f}_D(x)\big)^2\Big]$$

This expectation is taken with respect to:

  • The training set $D$: randomly drawn from the true distribution $P$.
  • The noise $\varepsilon$: variability in $y$ for a fixed $x$.

Bias

The bias is defined as the difference between the true function and the expected model prediction over all possible training sets $D$:

$$\mathrm{Bias}(x) = f(x) - \mathbb{E}_D\big[\hat{f}_D(x)\big]$$

where:

  • $f(x)$: the true function that generates the data.
  • $\hat{f}_D(x)$: the model's prediction at $x$ given training set $D$.
  • $\bar{f}(x) = \mathbb{E}_D\big[\hat{f}_D(x)\big]$: the expected prediction over different training sets.

Variance

The variance measures how much the model's predictions at $x$ vary across different training sets $D$:

$$\mathrm{Var}(x) = \mathbb{E}_D\Big[\big(\hat{f}_D(x) - \bar{f}(x)\big)^2\Big]$$
This captures the sensitivity of the model to different training sets.
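Both quantities can be approximated by Monte Carlo: draw many training sets, record each prediction $\hat{f}_D(x_0)$ at a fixed test point, and compare their mean and spread to $f(x_0)$. This sketch assumes a hypothetical setup (true function $\sin$, noise std $0.3$, and a deliberately underfitting degree-1 polynomial as the learner), none of which is prescribed by the text.

```python
import numpy as np

rng = np.random.default_rng(1)
f = np.sin
sigma, n, x0 = 0.3, 30, 2.0            # noise std, training-set size, test point

preds = []
for _ in range(2000):                  # draw many independent training sets D
    x = rng.uniform(0, 2 * np.pi, size=n)
    y = f(x) + rng.normal(0, sigma, size=n)
    coef = np.polyfit(x, y, 1)         # the learning algorithm: a line fit
    preds.append(np.polyval(coef, x0)) # \hat f_D(x0) for this D

preds = np.array(preds)
mean_pred = preds.mean()               # estimate of E_D[\hat f_D(x0)]
bias = f(x0) - mean_pred               # Bias(x0)
variance = preds.var()                 # Var(x0)
print(f"bias={bias:.3f}  variance={variance:.3f}")
```

Because a straight line cannot track $\sin$, the bias at $x_0 = 2$ is substantial while the variance stays small, illustrating the two terms measuring different failure modes.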

Noise

The irreducible noise, $\sigma^2$, is the variance in $y$ that is independent of the model:

$$\sigma^2 = \mathbb{E}_\varepsilon\Big[\big(y - f(x)\big)^2\Big]$$

This is the inherent randomness in the data, which cannot be reduced by any model.

Decomposing the Error while Ignoring the Noise

Start with the expected squared error between the true function and the estimator:

$$\mathbb{E}_D\Big[\big(f(x) - \hat{f}_D(x)\big)^2\Big]$$

Add and subtract the mean prediction $\bar{f}(x) = \mathbb{E}_D\big[\hat{f}_D(x)\big]$:

$$= \mathbb{E}_D\Big[\big(f(x) - \bar{f}(x) + \bar{f}(x) - \hat{f}_D(x)\big)^2\Big]$$

Expand the square:

$$= \big(f(x) - \bar{f}(x)\big)^2 + \mathbb{E}_D\Big[\big(\bar{f}(x) - \hat{f}_D(x)\big)^2\Big] + 2\big(f(x) - \bar{f}(x)\big)\,\mathbb{E}_D\big[\bar{f}(x) - \hat{f}_D(x)\big]$$

Eliminate the cross-term: since $\mathbb{E}_D\big[\bar{f}(x) - \hat{f}_D(x)\big] = \bar{f}(x) - \bar{f}(x) = 0$, the last term vanishes, leaving

$$\mathbb{E}_D\Big[\big(f(x) - \hat{f}_D(x)\big)^2\Big] = \mathrm{Bias}^2(x) + \mathrm{Var}(x)$$
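The noise-free identity above can be checked numerically. The check below reuses a hypothetical setup ($f = \sin$, degree-1 polynomial learner, noise std $0.3$); it holds exactly for the sample statistics because the sample mean of squared deviations decomposes the same way as the expectation.

```python
import numpy as np

rng = np.random.default_rng(2)
f, sigma, n, x0 = np.sin, 0.3, 30, 2.0

preds = []
for _ in range(5000):                        # many independent training sets D
    x = rng.uniform(0, 2 * np.pi, size=n)
    y = f(x) + rng.normal(0, sigma, size=n)
    preds.append(np.polyval(np.polyfit(x, y, 1), x0))
preds = np.array(preds)

lhs = np.mean((f(x0) - preds) ** 2)          # E_D[(f(x0) - \hat f_D(x0))^2]
bias2 = (f(x0) - preds.mean()) ** 2          # squared bias
var = preds.var()                            # variance (population, ddof=0)
print(lhs, bias2 + var)                      # agree up to floating-point error
```

Note that the agreement is exact only when the variance uses the population convention (`ddof=0`), matching the derivation's use of the mean prediction as the centring point.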

Including the Noise Term

  • Recall that the observed output is $y = f(x) + \varepsilon$, where $\varepsilon$ has mean $0$ and variance $\sigma^2$.
  • Since $\varepsilon$ is independent of $\hat{f}_D(x)$, the overall expected squared error is:

    $$\mathbb{E}_{D, \varepsilon}\Big[\big(y - \hat{f}_D(x)\big)^2\Big] = \mathbb{E}_D\Big[\big(f(x) - \hat{f}_D(x)\big)^2\Big] + \sigma^2$$

  • This gives us the full decomposition:

    $$\mathbb{E}_{D, \varepsilon}\Big[\big(y - \hat{f}_D(x)\big)^2\Big] = \mathrm{Bias}^2(x) + \mathrm{Var}(x) + \sigma^2$$
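The full decomposition can likewise be verified by simulation: draw a fresh noisy test observation $y_0 = f(x_0) + \varepsilon$ for each training set and compare the average squared error to $\mathrm{Bias}^2 + \mathrm{Var} + \sigma^2$. The setup is the same hypothetical one used above; since this identity holds only in expectation, the two sides agree approximately, up to Monte Carlo error.

```python
import numpy as np

rng = np.random.default_rng(3)
f, sigma, n, x0 = np.sin, 0.3, 30, 2.0

preds, sq_errors = [], []
for _ in range(20000):
    x = rng.uniform(0, 2 * np.pi, size=n)
    y = f(x) + rng.normal(0, sigma, size=n)
    pred = np.polyval(np.polyfit(x, y, 1), x0)   # \hat f_D(x0)
    y0 = f(x0) + rng.normal(0, sigma)            # noisy test observation
    preds.append(pred)
    sq_errors.append((y0 - pred) ** 2)

preds = np.array(preds)
total = np.mean(sq_errors)                       # E_{D,eps}[(y - \hat f_D(x0))^2]
bias2 = (f(x0) - preds.mean()) ** 2              # Bias^2(x0)
var = preds.var()                                # Var(x0)
print(total, bias2 + var + sigma**2)             # approximately equal
```

Here the $\sigma^2 = 0.09$ noise floor persists no matter how good the learner is, while bias and variance depend on the choice of model class, which is the practical content of the decomposition.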