Title
Interpretation of Regressions with Multiple Proxies
Author(s)
Darren Lubotsky Darren Lubotsky (Princeton University)
Martin Wittenberg Martin Wittenberg (University of the Witwatersrand South Africa)
Abstract
We consider the situation in which there are multiple proxies for one unobserved explanatory variable in a linear regression and provide a procedure by which the coefficient of interest can be extracted "post hoc" from a multiple regression in which all the proxies are used simultaneously. This post hoc estimator is strictly superior in large samples to coefficients derived using any index or linear combination of the proxies that is created prior to the regression. To use an index created from the proxies that extracts the largest possible signal from them requires knowledge of information that is not available to the researcher. Using the proxies simultaneously in a multiple regression delivers this information, and the researcher then simply combines the coefficients in a known way to obtain the estimate of the effect of the unobserved factor. This procedure is also much more robust than ad hoc index construction to departures from the assumption of an underlying common factor. We provide some Monte Carlo simulations and applications to existing empirical problems to show that the reduction in attenuation bias can be non-negligible, even in finite samples.
Creation Date
2001-09
Section URL ID
IRS
Paper Number
457
URL
https://dataspace.princeton.edu/bitstream/88435/dsp01fq977t77z/1/457.pdf
File Function
Jel
C1; C31
Keyword(s)
Proxy variables; measurement error; index construction
Suppress
false
Series
1