Repository logo
 

MULTIPLE IMPUTATION TO CORRECT FOR MEASUREMENT ERROR: Application to Chronic Disease Case Ascertainment in Administrative Health Databases

dc.contributor.advisorLix, Lisaen_US
dc.contributor.committeeMemberLiu, Juxinen_US
dc.contributor.committeeMemberBickis, Miken_US
dc.contributor.committeeMemberSari, Nazmien_US
dc.creatorYao, Xueen_US
dc.date.accessioned2013-01-03T22:34:24Z
dc.date.available2013-01-03T22:34:24Z
dc.date.created2012-10en_US
dc.date.issued2012-11-16en_US
dc.date.submittedOctober 2012en_US
dc.description.abstractDiagnosis codes in administrative health databases (AHDs) are commonly used to ascertain chronic disease cases for research and surveillance. Low sensitivity of diagnosis codes has been demonstrated in many studies that validate AHDs against a gold standard data source in which the true disease status is known. This will result in misclassification of disease status, which can lead to biased prevalence estimates and loss of power to detect associations between diseases status and health outcomes. Model-based case detection algorithms in combination with multiple imputation (MI) methods in validation dataset/main dataset designs could be used to correct for misclassification of chronic disease status in AHDs. Under this approach, a predictive model of disease status (e.g., logistic model) is constructed in the validation dataset, the model parameters are estimated and MI methods are used to impute true disease status in the main dataset. This research considered scenarios that the misclassification of the observed disease status is independent of disease predictors and dependent on disease predictors. When the misclassification of the observed disease status is independent of disease predictors, the MI methods based on Frequentist logistic model (with and without bias correction) and Bayesian logistic model were compared. And when the misclassification of the observed disease status is dependent on disease predictors, the MI based on Frequentist logistic model with different variables as covariates were compared. Monte Carlo techniques were used to investigate the effects of the following data and model characteristics on bias and error in chronic disease prevalence estimates from AHDs: sensitivity of observed disease status based on diagnosis codes, size of the validation dataset, number of imputations, and the magnitude of measurement error in covariates of the predictive model. Relative bias, root mean squared error and coverage of 95% confidence interval were used to measure the performance. Without bias correction, the Bayesian MI model has lower RMSE than the Frequentist MI model. And the Frequentist MI model with bias correction is demonstrated via a simulation study to have superior performance to Bayesian MI model and the Frequentist MI model without bias correction. The results indicate that MI works well for measurement error correction if the missing true values are not missing not at random no matter whether the observed disease diagnosis is dependent on other disease predictors or not. Increasing the size of the validation dataset can improve the performance of MI better than increasing the number of imputations.en_US
dc.identifier.urihttp://hdl.handle.net/10388/ETD-2012-10-760en_US
dc.language.isoengen_US
dc.subjectKeywords: disease prevalence, measurement error, multiple imputationen_US
dc.titleMULTIPLE IMPUTATION TO CORRECT FOR MEASUREMENT ERROR: Application to Chronic Disease Case Ascertainment in Administrative Health Databasesen_US
dc.type.genreThesisen_US
dc.type.materialtexten_US
thesis.degree.departmentSchool of Public Healthen_US
thesis.degree.disciplineBiostatisticsen_US
thesis.degree.grantorUniversity of Saskatchewanen_US
thesis.degree.levelMastersen_US
thesis.degree.nameMaster of Science (M.Sc.)en_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
YAO-THESIS.pdf
Size:
1.85 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1000 B
Format:
Plain Text
Description: