- Also known as logit regression.
- A model that outputs a real number in , thresholded at to obtain binary outcomes (class 0 or class 1).
- Goal : Predict the probability that an example belongs to the β1β class.
- Application : Widely used in medical and social sciences.
- Approach :
- Use the so-called log-odds (logit) to create a .
- Apply the sigmoid function (logistic function) to convert log-odds to a real number in - i.e., a probability.
Logit = Log-Odds
Given an example having features , its target , is a Bernoulli random variable. Takes value 1 with some probability and 0 with probability .
- Probability : A number that describes the likelihood of an event occurring.
- Odds : The ratio of the probability of an event occurring to the probability of it not occurring.
- Logit (log-odds) : The logarithm of the odds.
The logit
For the -th example, ,
where :
- : Intercept term.
- : Coefficients (aka. parameters of the model).
- ;
- , where is the target variable (also called βdependent variableβ).
The Sigmoid
Solve:
We exponentiate both sides:
Multiply both sides by :
Bring all terms to one side:
Solve for :
The RHS is the sigmoid function . It maps the logit (a real value) to a probability (in ). It signifies , the probability that label 1 is predicted for .
Note : It follows that .