I have mentioned a few times that there is an unspoken ontological order of things in applied work, wherein one first needs to take care of the problem of identification before one should worry about properly modeling the dependent variable’s data-generating process. In other words, before you obsess over whether you should estimate a Poisson or a negative binomial regression, your time is better spend thinking about whether the effect of your variable of interest on your dependent variable is properly identified.
This week, however, I wanted to move away from my usual focus on the identification of causal effects to look at the modeling of DGPs.
Let us take an example from the first article I ever published (and which, to this day, remains my most-cited article). In that article, my coauthor and I were interested in the marketing behavior of the households in our sample. In some time periods, some households happened to be net sellers (i.e., their sales exceeded their purchases), some households happened to be net buyers (i.e., their purchases exceeded their sales), and some households happened to be autarkic (i.e., their neither bought nor sold).