Evaluating the Performance of Different Multiple Imputation Methods When Imputing Missingness in Time-Series-Cross-Sectional Data
| dc.contributor.advisor | Chan, Kwun Chuen | |
| dc.contributor.advisor | Sadinle, Mauricio | |
| dc.contributor.author | Dai, Xiaochen | |
| dc.date.accessioned | 2020-02-04T19:24:32Z | |
| dc.date.issued | 2020-02-04 | |
| dc.date.submitted | 2019 | |
| dc.description | Thesis (Master's)--University of Washington, 2019 | |
| dc.description.abstract | This thesis evaluates the performance of different multiple imputation methods in imputing country-level proportions of key indicators that are missing in time-series-cross-sectional (TSCS) data. When imputing the country-level proportions missing in TSCS data due to questions not asked in the survey, we found that Amelia and Multiple Imputation by Chain Equation for two-level panel data (mice.2l.pan) performed best among seven methods being evaluated for both methods converged fast, produced reasonable and stable imputations and had small out-of-sample root mean squared error (RMSE) less than ±5% for proportions imputed and 95% coverage rate (CR_95) very close to 95%. In addition, we found that including incomplete auxiliary variables that are correlated with targeted incomplete variables improved the imputation performance regardless of the missing rate of the auxiliary variables. However, including the cluster means had little impact on the imputation performance. The goal of the thesis is to produce empirical evidence on the performance of different multiple imputation methods in imputing missingness in TSCS data. | |
| dc.embargo.lift | 2021-02-03T19:24:32Z | |
| dc.embargo.terms | Restrict to UW for 1 year -- then make Open Access | |
| dc.format.mimetype | application/pdf | |
| dc.identifier.other | Dai_washington_0250O_20868.pdf | |
| dc.identifier.uri | http://hdl.handle.net/1773/45119 | |
| dc.language.iso | en_US | |
| dc.rights | CC BY-SA | |
| dc.subject | Amelia | |
| dc.subject | cross validation | |
| dc.subject | missing data | |
| dc.subject | multiple imputation | |
| dc.subject | time-series-cross-sectional data | |
| dc.subject | Statistics | |
| dc.subject | Public health | |
| dc.subject.other | Biostatistics | |
| dc.title | Evaluating the Performance of Different Multiple Imputation Methods When Imputing Missingness in Time-Series-Cross-Sectional Data | |
| dc.type | Thesis |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Dai_washington_0250O_20868.pdf
- Size:
- 2.51 MB
- Format:
- Adobe Portable Document Format
