Causal Inference Using Educational Observational Data:  Statistical Bias Reduction Methods and Multilevel Data Extensions

Hernandez, Jose Manuel

Causal Inference Using Educational Observational Data: Statistical Bias Reduction Methods and Multilevel Data Extensions

dc.contributor.advisor	Abbott, Robert	en_US
dc.contributor.author	Hernandez, Jose Manuel	en_US
dc.date.accessioned	2015-09-29T18:01:57Z
dc.date.issued	2015-09-29
dc.date.submitted	2015	en_US
dc.description	Thesis (Ph.D.)--University of Washington, 2015	en_US
dc.description.abstract	This study utilizes a data driven simulation design, which deviates from the traditional model-based approaches most commonly adopted in quasi-experimental Monte Carlo (MC) simulation studies, to answer two main questions. First, this study explores the finite sample properties of the most utilized quasi-experimental methods that control for observable selection bias in the field of education and compares them to traditional regression methods. Second, this study lends an insight into the effects of ignoring the multilevel structure of data commonly found in the field when using quasi-experimental methods. Specifically, treatment effects were estimated using (1) Ordinary Least Squares (OLS) multiple linear regression (treatment effects, adjusted for mean differences on confounders), (2) Propensity Score Matching (PSM) using nearest neighbor 1:n with replacement, (3) Propensity Score Matching using Inverse Probability Weighting (IPW) of the propensity score, and (4) Propensity Score Matching using Sub-classification (Subclassification). There were five main factors that were varied to simulate the data, all of which were fully crossed, as follows: Four sample sizes (600, 1000, 2000, and 5000); three association levels among simulated variables (low, moderate, high); two treatment exposure levels (25% and 50%); four treatment effect sizes using Cohen’s d (none, low, moderate, and high); and five levels of ICCs (0, .10, .20, .30, and .40). These 480 conditions were each analyzed with four methods of analysis, for a total of 1920 conditions. Additionally, using data from the Educational Longitudinal Study of 2002 (ELS:2002), an applied study demonstration of the different estimation methods in question was performed and compared to the simulation results. Findings indicate that under certain conditions all methods compared perform the same and have similar estimates of treatment effects. Additionally, when the clustering of the data is ignored bias is introduced for smaller sample size conditions.	en_US
dc.embargo.lift	2016-09-28T18:01:57Z
dc.embargo.terms	Restrict to UW for 1 year -- then make Open Access	en_US
dc.format.mimetype	application/pdf	en_US
dc.identifier.other	Hernandez_washington_0250E_14698.pdf	en_US
dc.identifier.uri	http://hdl.handle.net/1773/33788
dc.language.iso	en_US	en_US
dc.rights	Copyright is held by the individual authors.	en_US
dc.subject	Causal Inference; Monte Carlo Simulation; Multilevel Modeling; Propensity Score Matching	en_US
dc.subject.other	Statistics	en_US
dc.subject.other	Education policy	en_US
dc.subject.other	Educational evaluation	en_US
dc.subject.other	education - seattle	en_US
dc.title	Causal Inference Using Educational Observational Data: Statistical Bias Reduction Methods and Multilevel Data Extensions	en_US
dc.type	Thesis	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Hernandez_washington_0250E_14698.pdf
Size:: 2.67 MB
Format:: Adobe Portable Document Format

Download

Collections

Education - Seattle