Causal Inference Using Educational Observational Data: Statistical Bias Reduction Methods and Multilevel Data Extensions

dc.contributor.advisor: Abbott, Robert [en_US]
dc.contributor.author: Hernandez, Jose Manuel [en_US]
dc.date.accessioned: 2015-09-29T18:01:57Z
dc.date.issued: 2015-09-29
dc.date.submitted: 2015 [en_US]
dc.description: Thesis (Ph.D.)--University of Washington, 2015 [en_US]
dc.description.abstract: This study uses a data-driven simulation design, which departs from the model-based approaches most commonly adopted in quasi-experimental Monte Carlo (MC) simulation studies, to answer two main questions. First, it explores the finite-sample properties of the quasi-experimental methods most widely used in education to control for observable selection bias and compares them to traditional regression methods. Second, it offers insight into the effects of ignoring the multilevel structure of data commonly found in the field when using quasi-experimental methods. Specifically, treatment effects were estimated using (1) Ordinary Least Squares (OLS) multiple linear regression (treatment effects adjusted for mean differences on confounders), (2) Propensity Score Matching (PSM) using nearest neighbor 1:n matching with replacement, (3) Inverse Probability Weighting (IPW) on the propensity score, and (4) propensity score subclassification. Five factors were varied to simulate the data, all fully crossed: four sample sizes (600, 1000, 2000, and 5000); three levels of association among simulated variables (low, moderate, high); two treatment exposure levels (25% and 50%); four treatment effect sizes expressed as Cohen’s d (none, low, moderate, and high); and five intraclass correlation coefficient (ICC) levels (0, .10, .20, .30, and .40). Each of the resulting 480 conditions was analyzed with all four methods, for a total of 1920 condition-by-method combinations. Additionally, the estimation methods were applied to data from the Education Longitudinal Study of 2002 (ELS:2002), and the results were compared to those of the simulation. Findings indicate that under certain conditions all of the compared methods perform similarly and yield similar treatment effect estimates. When the clustering of the data is ignored, bias is introduced in the smaller sample size conditions. [en_US]
dc.embargo.lift: 2016-09-28T18:01:57Z
dc.embargo.terms: Restrict to UW for 1 year -- then make Open Access [en_US]
dc.format.mimetype: application/pdf [en_US]
dc.identifier.other: Hernandez_washington_0250E_14698.pdf [en_US]
dc.identifier.uri: http://hdl.handle.net/1773/33788
dc.language.iso: en_US [en_US]
dc.rights: Copyright is held by the individual authors. [en_US]
dc.subject: Causal Inference; Monte Carlo Simulation; Multilevel Modeling; Propensity Score Matching [en_US]
dc.subject.other: Statistics [en_US]
dc.subject.other: Education policy [en_US]
dc.subject.other: Educational evaluation [en_US]
dc.subject.other: education - seattle [en_US]
dc.title: Causal Inference Using Educational Observational Data: Statistical Bias Reduction Methods and Multilevel Data Extensions [en_US]
dc.type: Thesis [en_US]
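
The condition counts stated in the abstract follow from fully crossing the five simulation factors. The sketch below enumerates that grid in Python to check the arithmetic. The factor levels come from the abstract; the dictionary keys, the short method labels, and the variable names are illustrative assumptions and are not taken from the thesis.

```python
from itertools import product

# Factor levels as listed in the abstract.
# The key names are illustrative, not the author's.
factors = {
    "sample_size": [600, 1000, 2000, 5000],
    "association": ["low", "moderate", "high"],
    "treatment_exposure": [0.25, 0.50],
    "effect_size": ["none", "low", "moderate", "high"],
    "icc": [0.0, 0.10, 0.20, 0.30, 0.40],
}

# Short labels (assumed) for the four estimation methods in the abstract.
methods = ["OLS", "PSM nearest neighbor", "IPW", "subclassification"]

# Fully crossed design: one condition per combination of factor levels.
conditions = [dict(zip(factors, levels)) for levels in product(*factors.values())]

# Every simulated condition is analyzed with every method.
cells = [(condition, method) for condition in conditions for method in methods]

print(len(conditions))  # 480
print(len(cells))       # 1920
```

The product 4 x 3 x 2 x 4 x 5 gives the 480 simulated conditions, and pairing each with the four methods gives the 1920 analyses reported in the abstract.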

Files

Original bundle

Name: Hernandez_washington_0250E_14698.pdf
Size: 2.67 MB
Format: Adobe Portable Document Format