Comparison of Two Newly Developed Multiple Imputation Methods for MNAR Cross-Sectional Data

Liu, April Xianxian 1989-

Comparison of Two Newly Developed Multiple Imputation Methods for MNAR Cross-Sectional Data

dc.contributor.advisor	Pahwa, Punam
dc.contributor.committeeMember	Khan, Shahedul
dc.contributor.committeeMember	Feng, Cindy
dc.contributor.committeeMember	Janzen, Bonnie
dc.contributor.committeeMember	Dosman, James
dc.contributor.committeeMember	Volodin, Andrei
dc.creator	Liu, April Xianxian 1989-
dc.creator.orcid	0000-0002-1744-536X
dc.date.accessioned	2020-03-09T19:36:10Z
dc.date.available	2020-03-09T19:36:10Z
dc.date.created	2020-01
dc.date.issued	2020-03-09
dc.date.submitted	January 2020
dc.date.updated	2020-03-09T19:36:11Z
dc.description.abstract	The problem of missing not at random (MNAR) data is a highly complex problem to the difficulty of joint modeling the outcome values and missing pattern while taking the variability of the missing data into consideration. In recent years, two methods by Galimard et. al (2016) and Ogundimu & Collins (2017) each developed their own multiple imputation (MI) methods for handling MNAR data. However, they have yet to be tested for their effectiveness in research sufficiently. This dissertation investigates the effectiveness of Galimard et. al and Ogundimu & Collins’ MIs alongside complete case (CC) analysis and Rubin’s MI when applied to two real-life datasets of different size (n1 = 4451, n2 = 1607) with induced missing data of MCAR, MAR, and MNAR mechanisms of 15%, 30%, and 50% missing data percentage. In addition, the methods will also be applied to simulated datasets with imputation and response models more complicated than in Galimard et. al and Ogundimu & Collins’ studies to see how widely they can be applied in datasets with different missing mechanisms and data percentage. It was found in the application results that Galimard et. al’s MI delivered the same results as CC in all missing mechanism and percentage combinations. For both datasets, Ogundimu & Collins’ MI performed better than the other 3 methods for 50% MNAR, though overall, both Galimard et. al and Ogundimu & Collins’ MIs performed better on MCAR and MAR data than MNAR. In simulation, Galimard et. al’s MI also delivered results consistently identical to CC for all missing percentage and mechanism combinations. Ogundimu & Collins’ MI consistently delivered superior results than the other 3 methods for 15% and 30% MNAR. However, Ogundimu & Collins’ MI should be used with caution because it did not converge for 50% missing and only converged for approximately 100 – 400 datasets out of 1000 for 15% and 30%. It will be interesting if future studies can apply Galimard et. al and Ogundimu & Collins’ MI methods other real-life datasets and easily-converge simulated datasets to see how well they can work when applied broadly in research and industry.
dc.format.mimetype	application/pdf
dc.identifier.uri	http://hdl.handle.net/10388/12694
dc.subject	Missing Not At Random, Multiple Imputation
dc.title	Comparison of Two Newly Developed Multiple Imputation Methods for MNAR Cross-Sectional Data
dc.type	Thesis
dc.type.material	text
thesis.degree.department	School of Public Health
thesis.degree.discipline	Biostatistics
thesis.degree.grantor	University of Saskatchewan
thesis.degree.level	Doctoral
thesis.degree.name	Doctor of Philosophy (Ph.D.)

Files

Original bundle

Now showing 1 - 1 of 1

Name:: LIU-DISSERTATION-2020.pdf
Size:: 3.51 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: LICENSE.txt
Size:: 2.26 KB
Format:: Plain Text
Description:

Download

Collections

Graduate Theses and Dissertations