Suppose you are trying to explain some outcome [math]y[/math], where [math]y[/math] is equal to 0 or 1 (e.g., whether someone is a nonsmoker or a smoker). You also have data on a vector of explanatory variables [math]x[/math] (e.g., someone’s age, their gender, their level of education, etc.) and on a treatment variable [math]D[/math], which we will also assume is binary, so that [math]D[/math] is equal to 0 or 1 (e.g., whether someone has attended an information session on the negative effects of smoking).
If you were interested in knowing what the effect of attending the information session on the likelihood that someone is a smoker, i.e., the impact of [math]D[/math] on [math]y[/math] The equation of interest in this case is
A Rant on Estimation with Binary Dependent Variables (Technical)
Suppose you are trying to explain some outcome [math]y[/math], where [math]y[/math] is equal to 0 or 1 (e.g., whether someone is a nonsmoker or a smoker). You also have data on a vector of explanatory variables [math]x[/math] (e.g., someone’s age, their gender, their level of education, etc.) and on a treatment variable [math]D[/math], which we will also assume is binary, so that [math]D[/math] is equal to 0 or 1 (e.g., whether someone has attended an information session on the negative effects of smoking).
If you were interested in knowing what the effect of attending the information session on the likelihood that someone is a smoker, i.e., the impact of [math]D[/math] on [math]y[/math] The equation of interest in this case is