Kosuke Imai, Gary King, and Clayton Nall. "The Essential Role of Pair Matching in Cluster-Randomized Experiments, with Application to the Mexican Universal Health Insurance Evaluation," copy at http://gking.harvard.edu/files/abs/cluster-abs.shtml. (Article: PDF)

Abstract

A basic feature of many field experiments is that investigators are only able to randomize clusters of individuals --- such as households, communities, firms, medical practices, schools, or classrooms --- even when the individual is the unit of interest. To recoup the resulting efficiency loss, some studies pair similar clusters and randomize treatment within pairs. However, many other studies avoid pairing, in part because of claims in prominent methodological articles, echoed by clinical trials standards organizations, that this matched-pair, cluster-randomization design has serious problems. We prove that all such claims are unfounded. We then derive the model underlying the estimator recommended in the literature for this design. It is shown that this existing estimator is unbiased only in situations when matching is unnecessary, and that its commonly used standard error is invalid. To overcome this problem without modeling assumptions, we develop a simple design-based estimator with much improved statistical properties. We also propose a model-based approach that encompasses our design-based estimator as well as the estimator in the literature. Our methods also address individual-level noncompliance, which is common in applications but ignored by most existing methods. We show that from the perspective of bias, efficiency, power, or robustness, and in large or small samples, pairing should be used in cluster-randomized experiments whenever feasible; failing to do so is equivalent to discarding a considerable fraction of one's data. We develop these techniques in the context of a randomized evaluation we are conducting of the Mexican Universal Health Insurance Program.

Also see related research on causal inference.