TY - JOUR
T1 - Clustering, spatial correlations, and randomization inference
AU - Barrios, Thomas
AU - Diamond, Rebecca
AU - Imbens, Guido W.
AU - Koleśar, Michal
N1 - Funding Information:
Thomas Barrios is a Graduate Student, Department of Economics, Harvard University, Cambridge, MA 02138 (E-mail: [email protected]). Rebecca Diamond is a Graduate Student, Department of Economics, Harvard University, Cambridge, MA 02138 (E-mail: [email protected]). Guido Imbens is Professor, Department of Economics, Harvard University and Research Associate, National Bureau of Economic Research, Cambridge, MA 02138 (E-mail: [email protected]). Michal Kolesár is Graduate Student, Department of Economics, Harvard University, Cambridge, MA 02138 (E-mail: [email protected]). Financial support for this research was generously provided through NSF grants 0631252, 0820361, and 0961707. The authors thank participants in the econometrics workshop at Harvard University, the referees, the editor, and the associate editor for comments, and, in particular, Gary Chamberlain for helpful discussions.
PY - 2012
Y1 - 2012
N2 - It is a standard practice in regression analyses to allow for clustering in the error covariance matrix if the explanatory variable of interest varies at a more aggregate level (e.g., the state level) than the units of observation (e.g., individuals). Often, however, the structure of the error covariance matrix is more complex, with correlations not vanishing for units in different clusters. Here, we explore the implications of such correlations for the actual and estimated precision of least squares estimators. Our main theoretical result is that with equal-sized clusters, if the covariate of interest is randomly assigned at the cluster level, only accounting for nonzero covariances at the cluster level, and ignoring correlations between clusters as well as differences in within-cluster correlations, leads to valid confidence intervals. However, in the absence of random assignment of the covariates, ignoring general correlation structures may lead to biases in standard errors. We illustrate our findings using the 5% public-use census data. Based on these results, we recommend that researchers, as a matter of routine, explore the extent of spatial correlations in explanatory variables beyond state-level clustering.
AB - It is a standard practice in regression analyses to allow for clustering in the error covariance matrix if the explanatory variable of interest varies at a more aggregate level (e.g., the state level) than the units of observation (e.g., individuals). Often, however, the structure of the error covariance matrix is more complex, with correlations not vanishing for units in different clusters. Here, we explore the implications of such correlations for the actual and estimated precision of least squares estimators. Our main theoretical result is that with equal-sized clusters, if the covariate of interest is randomly assigned at the cluster level, only accounting for nonzero covariances at the cluster level, and ignoring correlations between clusters as well as differences in within-cluster correlations, leads to valid confidence intervals. However, in the absence of random assignment of the covariates, ignoring general correlation structures may lead to biases in standard errors. We illustrate our findings using the 5% public-use census data. Based on these results, we recommend that researchers, as a matter of routine, explore the extent of spatial correlations in explanatory variables beyond state-level clustering.
KW - Clustered standard errors
KW - Confidence intervals
KW - Misspecification
KW - Random assignment
UR - http://www.scopus.com/inward/record.url?scp=84864382738&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84864382738&partnerID=8YFLogxK
U2 - 10.1080/01621459.2012.682524
DO - 10.1080/01621459.2012.682524
M3 - Article
AN - SCOPUS:84864382738
SN - 0162-1459
VL - 107
SP - 578
EP - 591
JO - Journal of the American Statistical Association
JF - Journal of the American Statistical Association
IS - 498
ER -