Maximum Likelihood (ML) estimation is a method for estimating the parameter of a statistical model that makes the observed data as likely as possible, given the model. The idea is simple: you want to find the value of that maximises the likelihood of the observed data .
- ML can overfit with small datasets; MAP incorporates priors to mitigate this.
- Both ML and MAP use optimisation to find their best parameter values. ML has no notion of probability distribution over parameter values - all uncertainty in parameters are due to randomness in the data - the frequentist perspective.