• Also known as logit regression.
  • A model that outputs a real number in , thresholded at to obtain binary outcomes (class 0 or class 1).
  • Goal : Predict the probability that an example belongs to the β€œ1” class.
  • Application : Widely used in medical and social sciences.
  • Approach :
    • Use the so-called log-odds (logit) to create a .
    • Apply the sigmoid function (logistic function) to convert log-odds to a real number in - i.e., a probability.

Logit = Log-Odds

Given an example having features , its target , is a Bernoulli random variable. Takes value 1 with some probability and 0 with probability .

  • Probability : A number that describes the likelihood of an event occurring.
  • Odds : The ratio of the probability of an event occurring to the probability of it not occurring.
  • Logit (log-odds) : The logarithm of the odds.

The logit

For the -th example, ,

where :

  • : Intercept term.
  • : Coefficients (aka. parameters of the model).
  • ;
  • , where is the target variable (also called β€œdependent variable”).

The Sigmoid

Solve:

We exponentiate both sides:

Multiply both sides by :

Bring all terms to one side:

Solve for :

The RHS is the sigmoid function . It maps the logit (a real value) to a probability (in ). It signifies , the probability that label 1 is predicted for .

Note : It follows that .