Causal inference from observational data—that is, data that did not come from an experiment—isnotoriously difficult: because the probability distribution of the treatment variable Z is unknown,measured or unmeasured variables that correlate with both Z and the outcome Y may confound causal estimates. This thesis will present methods for designing and modeling causal observational studies that combine design-based techniques with regression to account for measured covariates X.Regression discontinuity designs occur when treatment assignment is a function of a variable T: when T exceeds a threshold c, treatment is assigned. Conventionally, researchers analyze RDDsby regressing Y on both T and Z. This thesis argues for modeling RDDs as naturally-randomizedexperiments in two steps: modeling the relationship between Y and T, and using that design to infer and estimate effects of Z on Y. We illustrate this approach by reanalyzing a dataset used to estimate the effects of academic probation on students’ grade point averages.The rest of the thesis focuses on propensity-score stratification with high-dimensional data(p >> n). If treatment assignment is a random unknown function of X, researchers can adjust causal estimates for X by estimating propensity scores: subjects’ respective probabilities of treatment assignment conditional on X. Researchers then stratify subjects based on their propensity scores and model the data as if treatment were randomized within strata. However, when the dimension of X is large, propensity-score estimation is impossible. We propose a method in which a subset of X is used to estimate propensity scores. Next, the entire matrix X can be used to model Y, using a high-dimensional regression technique; the model is trained on subjects excluded from the stratification. The model’s predictions of Y can then be used to test balance on, and adjust for, the entire set of covariates in X. We illustrate this method by evaluating two high-school educational programs.
【 预 览 】
附件列表
Files
Size
Format
View
New Perspectives on Regression Adjustment in Causal Inference, with Applications to Educational Program Evaluation.