Formally, let $X_1, X_2, \ldots, X_n$ be a random sample from a distribution with parameter $\theta$. Suppose we have observed the values of $X_1, X_2, \ldots, X_n$ as $x_1, x_2, \ldots, x_n$. A maximum likelihood estimate (MLE) of $\theta$, denoted $\hat{\theta}_{ML}$, is a value of $\theta$ that maximises the likelihood function:

$$L(x_1, x_2, \ldots, x_n; \theta).$$
Maximum Likelihood Estimator:
A maximum likelihood estimator of the parameter $\theta$, denoted $\hat{\Theta}_{ML}$, is a random variable
whose value is given by $\hat{\Theta}_{ML} = \hat{\theta}_{ML}$ when $X_1 = x_1, X_2 = x_2, \ldots, X_n = x_n$.
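As a concrete sketch of this definition, the code below assumes a small Bernoulli($\theta$) sample (the data and variable names are hypothetical) and recovers the MLE by maximising the likelihood over a grid of candidate $\theta$ values:

```python
import numpy as np

# Hypothetical observed sample, assumed drawn from a Bernoulli(theta) distribution.
x = np.array([1, 0, 1, 1, 0, 1, 1, 0])

def likelihood(theta, x):
    # L(x_1, ..., x_n; theta) = prod_i theta^{x_i} (1 - theta)^{1 - x_i}
    return np.prod(theta ** x * (1 - theta) ** (1 - x))

# Evaluate the likelihood on a grid and take the maximiser.
grid = np.linspace(0.001, 0.999, 999)
theta_hat = grid[np.argmax([likelihood(t, x) for t in grid])]
print(theta_hat)  # for Bernoulli data the MLE is the sample mean, 5/8
```

For the Bernoulli case the MLE has the closed form $\hat{\theta}_{ML} = \bar{x}$, so the grid search should land on the sample mean.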
Cost Function for computing the MLE
Cost Function: a function that maps a set of events to a number representing the "cost" of that event occurring; it is also called the loss function or objective function. For computing the MLE, there is a one-to-one mapping between the likelihood function and the cost function: given some data $x_1, \ldots, x_n$, the cost is taken to be the negative log-likelihood,

$$C(\theta) = -\log L(x_1, \ldots, x_n; \theta),$$

so maximising the likelihood is equivalent to minimising the cost.
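To illustrate this one-to-one mapping, the sketch below (assuming a small hypothetical exponential sample) evaluates both the likelihood and the negative log-likelihood cost on the same parameter grid and checks that the maximiser of one is the minimiser of the other:

```python
import numpy as np

# Hypothetical data, assumed drawn from an Exponential(lam) distribution
# with density f(x; lam) = lam * exp(-lam * x).
x = np.array([0.5, 1.2, 0.3, 2.1, 0.8])

def likelihood(lam, x):
    return np.prod(lam * np.exp(-lam * x))

def cost(lam, x):
    # C(lam) = -log L(x; lam) = -(n log lam - lam * sum(x))
    return -(len(x) * np.log(lam) - lam * np.sum(x))

grid = np.linspace(0.01, 5.0, 5000)
lam_max_lik = grid[np.argmax([likelihood(l, x) for l in grid])]
lam_min_cost = grid[np.argmin([cost(l, x) for l in grid])]
print(lam_max_lik, lam_min_cost)  # both should pick the same grid point
```

Because the logarithm is strictly increasing, negating it flips the ordering, so the $\lambda$ that maximises $L$ is exactly the $\lambda$ that minimises $C$.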
Why the Negative Logarithm?
- Convention - optimisation software is typically written to solve minimisation problems, so we minimise the negative log-likelihood rather than maximise the likelihood.
- Convenience - the logarithm turns the product over independent observations into a sum, which makes differentiation easier.
- Numerical stability - a product of many small probabilities can underflow to zero due to machine-precision limits; summing log-probabilities avoids this.
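A minimal sketch of the stability point, using 1000 hypothetical per-sample probabilities of 0.1: their direct product is $10^{-1000}$, far below the smallest positive float64 (roughly $10^{-308}$), so it underflows to zero, while the sum of logs stays perfectly representable.

```python
import numpy as np

# 1000 hypothetical per-sample probabilities, each equal to 0.1.
p = np.full(1000, 0.1)

naive = np.prod(p)           # 0.1**1000 underflows to exactly 0.0 in float64
stable = np.sum(np.log(p))   # 1000 * log(0.1), a finite, well-behaved number

print(naive, stable)
```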