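Negative Log-Likelihood as Cost Function :

$$J(\theta) = -\sum_{i=1}^{n} \left[ y_i \log \sigma(\theta^\top x_i) + (1 - y_i) \log\left(1 - \sigma(\theta^\top x_i)\right) \right]$$

Where:
- $J(\theta)$ : The cost function to be minimised.
- $\sigma(z) = \frac{1}{1 + e^{-z}}$ : Sigmoid function.
- $y_i$ : Binary target label (0 or 1) of the $i$-th example $x_i$.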
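
As a rough illustration, the negative log-likelihood of a logistic model can be evaluated in plain Python as below. This is a minimal sketch, not a reference implementation: the names `sigmoid`, `nll`, `theta`, `X`, `y` and the clipping constant `eps` are assumptions introduced for the example, and the cost is taken as an unnormalised sum over examples.

```python
import math


def sigmoid(z):
    """Logistic function 1 / (1 + exp(-z)), written to avoid overflow."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def nll(theta, X, y, eps=1e-12):
    """Negative log-likelihood of binary labels y under a logistic model.

    theta: list of parameters, X: list of feature lists, y: list of 0/1 labels.
    eps is an assumed clipping constant that keeps log() away from zero.
    """
    total = 0.0
    for x_i, y_i in zip(X, y):
        score = sum(t * v for t, v in zip(theta, x_i))  # linear score theta . x_i
        p = sigmoid(score)
        p = min(max(p, eps), 1.0 - eps)  # clip the probability to (0, 1)
        total -= y_i * math.log(p) + (1 - y_i) * math.log(1.0 - p)
    return total
```

With all parameters at zero every predicted probability is 0.5, so each example contributes `log 2` to the cost regardless of its label, which makes a convenient sanity check.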
Optimisations :
- Minimise $J(\theta)$ to find the optimal parameters $\theta^*$.
- Use Gradient Descent.
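Gradient of Cost Function : $\nabla_\theta J(\theta)$, whose $j$-th component is:

$$\frac{\partial J}{\partial \theta_j} = \sum_{i=1}^{n} \left( \sigma(\theta^\top x_i) - y_i \right) x_{ij}$$

where $x_{ij}$ is the $j$-th feature of the $i$-th example.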
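
The gradient and the descent loop might be sketched as follows. This assumes the cost is the unnormalised sum of per-example negative log-likelihoods, a fixed learning rate, and a fixed number of steps; the names `gradient`, `gradient_descent`, `lr` and `steps` are hypothetical and chosen only for this example.

```python
import math


def sigmoid(z):
    """Logistic function 1 / (1 + exp(-z)), written to avoid overflow."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def gradient(theta, X, y):
    """Gradient of the negative log-likelihood with respect to theta.

    The j-th component is the sum over examples of (sigmoid(theta . x_i) - y_i) * x_ij.
    """
    grad = [0.0] * len(theta)
    for x_i, y_i in zip(X, y):
        score = sum(t * v for t, v in zip(theta, x_i))
        err = sigmoid(score) - y_i  # prediction error for this example
        for j, x_ij in enumerate(x_i):
            grad[j] += err * x_ij
    return grad


def gradient_descent(X, y, lr=0.1, steps=1000):
    """Plain batch gradient descent starting from theta = 0.

    lr (step size) and steps are assumed hyperparameters, not tuned values.
    """
    theta = [0.0] * len(X[0])
    for _ in range(steps):
        g = gradient(theta, X, y)
        theta = [t - lr * g_j for t, g_j in zip(theta, g)]
    return theta


if __name__ == "__main__":
    # Toy data: first column is a constant 1 acting as the bias feature.
    X = [[1.0, 0.0], [1.0, 1.0], [1.0, 2.0], [1.0, 3.0]]
    y = [0, 0, 1, 1]
    print(gradient_descent(X, y))
```

A fixed step count keeps the sketch short; in practice one would more likely stop when the gradient norm or the change in cost falls below a tolerance.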