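Negative Log-Likelihood as Cost Function :

$$J(\theta) = -\sum_{i=1}^{n} \left[ y_i \log \sigma(\theta^\top x_i) + (1 - y_i) \log\left(1 - \sigma(\theta^\top x_i)\right) \right]$$

Where:

  • $J(\theta)$ : The cost function to be minimised.
  • $\sigma(z) = 1 / (1 + e^{-z})$ : Sigmoid function.
  • $y_i$ : Binary target label (0 or 1) of the $i$-th of $n$ examples, with feature vector $x_i$.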
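As a rough illustration, the negative log-likelihood of sigmoid outputs against binary labels can be computed with NumPy as sketched below. The names `theta`, `X`, `y` and the clipping constant `eps` are assumptions made for this example, as is the choice to sum (rather than average) over examples.

```python
import numpy as np

def sigmoid(z):
    """Logistic sigmoid, 1 / (1 + exp(-z))."""
    return 1.0 / (1.0 + np.exp(-z))

def negative_log_likelihood(theta, X, y, eps=1e-12):
    """Summed negative log-likelihood for binary labels y in {0, 1}.

    X holds one example per row; theta is the parameter vector.
    """
    p = sigmoid(X @ theta)
    p = np.clip(p, eps, 1.0 - eps)  # assumed safeguard against log(0)
    return -np.sum(y * np.log(p) + (1.0 - y) * np.log(1.0 - p))

# Toy data (hypothetical): first column is a constant 1 acting as the bias term.
X = np.array([[1.0, 0.5], [1.0, -1.5], [1.0, 2.0]])
y = np.array([1.0, 0.0, 1.0])
theta = np.zeros(2)

cost_at_zero = negative_log_likelihood(theta, X, y)
print(cost_at_zero)
```

With all parameters at zero every predicted probability is 0.5, so the cost for these three examples is 3 log 2, roughly 2.079.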

Optimisations :

  • Minimise $J(\theta)$ to find the optimal parameters $\theta$.
  • Use Gradient Descent.
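
In its basic form, gradient descent repeats the update below until the cost stops decreasing. The step size $\alpha$ and the iteration index $t$ are notation assumed here for illustration, not taken from the original.

```latex
\theta^{(t+1)} = \theta^{(t)} - \alpha \, \nabla_\theta J\big(\theta^{(t)}\big), \qquad \alpha > 0
```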

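Gradient of Cost Function : $\nabla_\theta J(\theta)$, whose $j$-th component is:

$$\frac{\partial J(\theta)}{\partial \theta_j} = \sum_{i=1}^{n} \left( \sigma(\theta^\top x_i) - y_i \right) x_{ij}$$

where $x_{ij}$ is the $j$-th feature of example $i$.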
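
A minimal sketch of the gradient and a plain gradient-descent loop follows, assuming the summed negative log-likelihood as the cost. The function names, the learning rate, the number of steps, and the toy data are all hypothetical choices for this example.

```python
import numpy as np

def sigmoid(z):
    """Logistic sigmoid, 1 / (1 + exp(-z))."""
    return 1.0 / (1.0 + np.exp(-z))

def nll(theta, X, y, eps=1e-12):
    """Summed negative log-likelihood for binary labels y in {0, 1}."""
    p = np.clip(sigmoid(X @ theta), eps, 1.0 - eps)
    return -np.sum(y * np.log(p) + (1.0 - y) * np.log(1.0 - p))

def nll_gradient(theta, X, y):
    """Gradient of the summed cost: X^T (sigmoid(X theta) - y).

    Component j is the sum over examples of (prediction - label) * feature j.
    """
    return X.T @ (sigmoid(X @ theta) - y)

def gradient_descent(X, y, learning_rate=0.1, n_steps=500):
    """Fixed-step gradient descent starting from theta = 0."""
    theta = np.zeros(X.shape[1])
    for _ in range(n_steps):
        theta = theta - learning_rate * nll_gradient(theta, X, y)
    return theta

# Toy data (hypothetical). The classes overlap, so the minimiser is finite.
# The first column is a constant 1 acting as the bias term.
X = np.array([[1.0, -2.0], [1.0, -1.0], [1.0, 0.0],
              [1.0, 1.0], [1.0, 2.0], [1.0, 0.5]])
y = np.array([0.0, 0.0, 1.0, 0.0, 1.0, 1.0])

theta_hat = gradient_descent(X, y)
print(theta_hat, nll(theta_hat, X, y))
```

Because the cost here is a sum rather than an average, the gradient grows with the number of examples, so a learning rate that works on this toy set would likely need to be reduced for larger datasets.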