Testing for a common latent variable in a linear regression: Or how to "fix" a bad variable by adding multiple proxies for it

Year: 2005
Working paper number: 132
Author: Wittenberg, Martin
Abstract:

We analyse models in which additional "controls" or proxies are included in a regression. This might occur intentionally if there is significant measurement error in a key regressor or if a key variable is not measured at all. We develop a test of the hypothesis that a subset of the regressors are all proxying for the same latent variable, and we show how an estimate of the structural coefficient might be obtained more efficiently than with the methods available in the current literature. We apply the procedure to the determinants of sleep among young South Africans. We show that the income variable in the time use survey is badly measured. Nevertheless, the measured impact of income on sleep is significant and amounts to a difference of 35 minutes per day between children with the median income and those in the topmost income bracket. Including a variety of asset proxies increases the estimated size of the coefficient enormously. The specification tests indicate, however, that some of the asset proxies have independent effects. Access to electricity, in particular, is not simply proxying for income. Instead it seems to be capturing access to various forms of entertainment, such as television. Even when this independent effect is properly accounted for, the size of the income coefficient is still 40% to 100% larger than in the specifications without the proxies.
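The intuition behind adding several proxies for one badly measured variable can be illustrated with a small simulation. This is only a sketch under stated assumptions, not the estimator or the test developed in the paper: the data are synthetic, the structural coefficient `beta`, the sample size, and the helper `ols_slopes` are hypothetical, and every proxy is assumed to equal the latent variable plus independent classical measurement error of unit variance.

```python
import numpy as np


def ols_slopes(y, X):
    """Return OLS slope coefficients (an intercept is fitted, then dropped)."""
    design = np.column_stack([np.ones(len(y)), X])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    return coef[1:]


rng = np.random.default_rng(0)
n = 200_000
beta = 1.0  # hypothetical structural coefficient, chosen for illustration

# Unobserved latent variable (think "true income") and the outcome it drives.
latent = rng.normal(size=n)
y = beta * latent + rng.normal(size=n)

# Three noisy proxies: latent variable plus independent unit-variance error.
proxies = np.column_stack([latent + rng.normal(size=n) for _ in range(3)])

# Regression on a single noisy proxy: the slope is attenuated towards zero.
b_single = ols_slopes(y, proxies[:, :1])[0]

# Regression on all proxies jointly; with equal unit loadings the sum of the
# proxy coefficients is a simple summary of the effect of the latent variable.
b_multi = ols_slopes(y, proxies)

print("single proxy slope:", round(b_single, 3))
print("sum of slopes with three proxies:", round(b_multi.sum(), 3))
```

Under these assumptions the population slope on a single proxy is `beta / 2`, while the three proxy coefficients sum to `3 * beta / 4`, so the combined estimate is still attenuated but lies closer to the structural coefficient. The simulation does not address the case the abstract highlights, in which a proxy such as access to electricity has an independent effect of its own.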


Publication file: wp132.pdf