One of the things I often tell students when discussing whether to use linear regression or a more complicated nonlinear (i.e., maximum likelihood-based) procedure is that one advantage of linear regression is that it prevents identification by functional form.
By “identification via functional form,” what I mean is that the distributional or functional form assumptions made in the context of more complicated nonlinear procedures can lead you to estimate a coefficient which is purely identified because of those distributional or functional form assumptions.
I always had a hard time clearly explaining the intuition behind this, until my colleague Arne Henningsen, with whom I co-taught my advanced econometrics class at the University of Copenhagen, gave a really good example to the class. Here is that example.