메뉴 건너뛰기




Volumn 179, Issue 6, 2014, Pages 764-774

Comparison of random forest and parametric imputation models for imputing missing data using MICE: A CALIBER study

Author keywords

angina stable; imputation; missing data; missingness at random; regression trees; simulation; survival

Indexed keywords

ALGORITHM; DATA ACQUISITION; EPIDEMIOLOGY; ESTIMATION METHOD; FOREST ECOSYSTEM; HEALTH RISK; MULTIVARIATE ANALYSIS; NONLINEARITY; NUMERICAL MODEL; REGRESSION ANALYSIS; SIMULATION;

EID: 84895894249     PISSN: 00029262     EISSN: 14766256     Source Type: Journal    
DOI: 10.1093/aje/kwt312     Document Type: Article
Times cited : (463)

References (36)
  • 1
    • 77953522089 scopus 로고    scopus 로고
    • Issues in multiple imputation of missing data for large general practice clinical databases
    • Marston L, Carpenter JR, Walters KR, et al. Issues in multiple imputation of missing data for large general practice clinical databases. Pharmacoepidemiol Drug Saf. 2010;19(6):618-626.
    • (2010) Pharmacoepidemiol Drug Saf , vol.19 , Issue.6 , pp. 618-626
    • Marston, L.1    Carpenter, J.R.2    Walters, K.R.3
  • 4
    • 79953732420 scopus 로고    scopus 로고
    • Mice: Multivariate imputation by chained equations in r
    • van Buuren S, Groothuis-Oudshoorn K. mice: Multivariate Imputation by Chained Equations in R. J Stat Softw. 2011; 45(3):1-67.
    • (2011) J Stat Softw , vol.45 , Issue.3 , pp. 1-67
    • Van Buuren, S.1    Groothuis-Oudshoorn, K.2
  • 5
    • 84867917862 scopus 로고    scopus 로고
    • Multiple imputation of missing covariates with non-linear effects and interactions: An evaluation of statistical methods
    • Seaman SR, Bartlett JW, White IR. Multiple imputation of missing covariates with non-linear effects and interactions: an evaluation of statistical methods. BMC Med Res Methodol. 2012;12(1):46.
    • (2012) BMC Med Res Methodol , vol.12 , Issue.1 , pp. 46
    • Seaman, S.R.1    Bartlett, J.W.2    White, I.R.3
  • 6
    • 84870333136 scopus 로고    scopus 로고
    • Auxiliary variables in multiple imputation in regression with missing X: A warning against including too many in small sample research
    • Hardt J, Herke M, Leonhart R. Auxiliary variables in multiple imputation in regression with missing X: a warning against including too many in small sample research. BMC Med Res Methodol. 2012;12(1):184.
    • (2012) BMC Med Res Methodol , vol.12 , Issue.1 , pp. 184
    • Hardt, J.1    Herke, M.2    Leonhart, R.3
  • 7
    • 77958547578 scopus 로고    scopus 로고
    • Multiple imputation for missing data via sequential regression trees
    • Burgette LF, Reiter JP. Multiple imputation for missing data via sequential regression trees. Am J Epidemiol. 2010;172(9): 1070-1076.
    • (2010) Am J Epidemiol , vol.172 , Issue.9 , pp. 1070-1076
    • Burgette, L.F.1    Reiter, J.P.2
  • 8
    • 0035478854 scopus 로고    scopus 로고
    • Random forests
    • Breiman L. Random forests. Mach Learn. 2001;45(1):5-32.
    • (2001) Mach Learn , vol.45 , Issue.1 , pp. 5-32
    • Breiman, L.1
  • 10
    • 82455175511 scopus 로고    scopus 로고
    • Brief review of regression-based and machine learning methods in genetic epidemiology: The Genetic Analysis Workshop 17 experience
    • Dasgupta A, Sun YV, König IR, et al. Brief review of regression-based and machine learning methods in genetic epidemiology: the Genetic Analysis Workshop 17 experience. Genet Epidemiol. 2011;35(S1):S5-S11.
    • (2011) Genet Epidemiol , vol.35 , Issue.1
    • Dasgupta, A.1    Sun, Y.V.2    König, I.R.3
  • 11
    • 4944264367 scopus 로고    scopus 로고
    • Relative risk forests for exercise heart rate recovery as a predictor of mortality
    • Ishwaran H, Blackstone EH, Pothier CE, et al. Relative risk forests for exercise heart rate recovery as a predictor of mortality. J Am Stat Assoc. 2004;99(467):591-600.
    • (2004) J Am Stat Assoc , vol.99 , Issue.467 , pp. 591-600
    • Ishwaran, H.1    Blackstone, E.H.2    Pothier, C.E.3
  • 13
    • 84855356469 scopus 로고    scopus 로고
    • Potential responders to FOLFOX therapy for colorectal cancer by random forests analysis
    • Tsuji S, Midorikawa Y, Takahashi T, et al. Potential responders to FOLFOX therapy for colorectal cancer by random forests analysis. Br J Cancer. 2012;106(1):126-132.
    • (2012) Br J Cancer , vol.106 , Issue.1 , pp. 126-132
    • Tsuji, S.1    Midorikawa, Y.2    Takahashi, T.3
  • 14
    • 84855177476 scopus 로고    scopus 로고
    • MissForest-non-parametric missing value imputation for mixed-type data
    • Stekhoven DJ, Bühlmann P. MissForest-non-parametric missing value imputation for mixed-type data. Bioinformatics. 2012;28(1):112-118.
    • (2012) Bioinformatics , vol.28 , Issue.1 , pp. 112-118
    • Stekhoven, D.J.1    Bühlmann, P.2
  • 15
    • 80052845770 scopus 로고    scopus 로고
    • Imputation of missing values of tumour stage in population-based cancer registration
    • Eisemann N, Waldmann A, Katalinic A. Imputation of missing values of tumour stage in population-based cancer registration. BMC Med Res Methodol. 2011;11(1):129.
    • (2011) BMC Med Res Methodol , vol.11 , Issue.1 , pp. 129
    • Eisemann, N.1    Waldmann, A.2    Katalinic, A.3
  • 16
    • 84872151662 scopus 로고    scopus 로고
    • Data resource profile: CArdiovascular disease research using LInked BEspoke studies and electronic Records (CALIBER)
    • Denaxas S, George J, Herrett E, et al. Data resource profile: CArdiovascular disease research using LInked BEspoke studies and electronic Records (CALIBER). Int J Epidemiol. 2012; 41(6):1625-1638.
    • (2012) Int J Epidemiol , vol.41 , Issue.6 , pp. 1625-1638
    • Denaxas, S.1    George, J.2    Herrett, E.3
  • 17
    • 84895899772 scopus 로고    scopus 로고
    • R package, version 0.1-2). Vienna, Austria: Comprehensive R Archive Network; 2013 Accessed November 12 2013)
    • Shah AD. CALIBERrfimpute: Imputation in MICE using Random Forest. (R package, version 0.1-2). Vienna, Austria: Comprehensive R Archive Network; 2013. (http://cran.r-project. org/web/packages/CALIBERrfimpute/index.html). (Accessed November 12, 2013).
    • CALIBERrfimpute: Imputation in MICE Using Random Forest
    • Shah, A.D.1
  • 18
    • 72949102852 scopus 로고    scopus 로고
    • Validation and validity of diagnoses in the General Practice Research Database: A systematic review
    • Herrett E, Thomas SL, Schoonen WM, et al. Validation and validity of diagnoses in the General Practice Research Database: a systematic review. Br J Clin Pharmacol. 2010; 69(1):4-14.
    • (2010) Br J Clin Pharmacol , vol.69 , Issue.1 , pp. 4-14
    • Herrett, E.1    Thomas, S.L.2    Schoonen, W.M.3
  • 19
    • 79958148713 scopus 로고    scopus 로고
    • Health and Social Care Information Centre Leeds United Kingdom: Health and Social Care Information Centre Accessed November 11 2013
    • Health and Social Care Information Centre. Hospital Episode Statistics. Leeds, United Kingdom: Health and Social Care Information Centre; 2013. (http://www.hscic.gov.uk/hes). (Accessed November 11, 2013).
    • (2013) Hospital Episode Statistics
  • 20
    • 77955453843 scopus 로고    scopus 로고
    • The myocardial ischaemia national audit project (MINAP)
    • Herrett E, Smeeth L, Walker L, et al. The Myocardial Ischaemia National Audit Project (MINAP). Heart. 2010;96(16):1264-1267.
    • (2010) Heart , vol.96 , Issue.16 , pp. 1264-1267
    • Herrett, E.1    Smeeth, L.2    Walker, L.3
  • 21
    • 79958012898 scopus 로고    scopus 로고
    • Threshold haemoglobin levels and the prognosis of stable coronary disease: Two new cohorts and a systematic review and meta-analysis
    • Shah AD, Nicholas O, Timmis AD, et al. Threshold haemoglobin levels and the prognosis of stable coronary disease: two new cohorts and a systematic review and meta-analysis. PLoS Med. 2011;8(5):e1000439.
    • (2011) PLoS Med , vol.8 , Issue.5
    • Shah, A.D.1    Nicholas, O.2    Timmis, A.D.3
  • 22
    • 80053191610 scopus 로고    scopus 로고
    • Neutrophils and clinical outcomes in patients with acute coronary syndromes and/or cardiac revascularization: A systematic review on more than 34,000 subjects
    • Guasti L, Dentali F, Castiglioni L, et al. Neutrophils and clinical outcomes in patients with acute coronary syndromes and/or cardiac revascularization: a systematic review on more than 34,000 subjects. Thromb Haemost. 2011;106(4): 591-599.
    • (2011) Thromb Haemost , vol.106 , Issue.4 , pp. 591-599
    • Guasti, L.1    Dentali, F.2    Castiglioni, L.3
  • 23
    • 79959987174 scopus 로고    scopus 로고
    • Low lymphocyte count and cardiovascular diseases
    • Núñez J, Miñana G, Bodí V, et al. Low lymphocyte count and cardiovascular diseases. Curr Med Chem. 2011;18(21): 3226-3233.
    • (2011) Curr Med Chem , vol.18 , Issue.21 , pp. 3226-3233
    • Núñez, J.1    Miñana, G.2    Bodí, V.3
  • 24
    • 0030817628 scopus 로고    scopus 로고
    • Validity and efficiency of approximation methods for tied survival times in Cox regression
    • Hertz-Picciotto I, Rockhill B. Validity and efficiency of approximation methods for tied survival times in Cox regression. Biometrics. 1997;53(3):1151-1156.
    • (1997) Biometrics , vol.53 , Issue.3 , pp. 1151-1156
    • Hertz-Picciotto, I.1    Rockhill, B.2
  • 25
    • 69949108828 scopus 로고    scopus 로고
    • Imputing missing covariate values for the Cox model
    • White IR, Royston P. Imputing missing covariate values for the Cox model. Stat Med. 2009;28(15):1982-1998.
    • (2009) Stat Med , vol.28 , Issue.15 , pp. 1982-1998
    • White, I.R.1    Royston, P.2
  • 26
    • 2442736478 scopus 로고    scopus 로고
    • Small-sample degrees of freedom with multiple imputation
    • Barnard J, Rubin D. Small-sample degrees of freedom with multiple imputation. Biometrika. 1999;86(4):948-955.
    • (1999) Biometrika , vol.86 , Issue.4 , pp. 948-955
    • Barnard, J.1    Rubin, D.2
  • 27
    • 84863304598 scopus 로고    scopus 로고
    • R Development Core Team Vienna, Austria: R Foundation for Statistical Computing Accessed November 11 2013
    • R Development Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing; 2010. (http://www.R-project.org/). (Accessed November 11, 2013).
    • (2010) R: A Language and Environment for Statistical Computing
  • 28
    • 84884721665 scopus 로고    scopus 로고
    • R package, version 1.3). Vienna, Austria: Comprehensive R Archive Network Accessed November 11 2013
    • Stekhoven DJ. missForest: Nonparametric Missing Value Imputation using Random Forest. (R package, version 1.3). Vienna, Austria: Comprehensive R Archive Network; 2012. (http://cran.r-project.org/web/packages/missForest/index. html). (Accessed November 11, 2013).
    • (2012) MissForest: Nonparametric Missing Value Imputation Using Random Forest
    • Stekhoven, D.J.1
  • 29
    • 33751088498 scopus 로고    scopus 로고
    • R package version 2.36-2). Vienna Austria: Comprehensive R Archive Network Accessed November 23 2010
    • Therneau T, Lumley T. survival: Survival Analysis, Including Penalised Likelihood. (R package, version 2.36-2). Vienna, Austria: Comprehensive R Archive Network; 2010. (http://cran. r-project.org/web/packages/survival/index. html). (Accessed November 23, 2010).
    • (2010) Survival: Survival Analysis Including Penalised Likelihood
    • Therneau, T.1    Lumley, T.2
  • 30
    • 0345040873 scopus 로고    scopus 로고
    • Classification and regression by randomForest
    • Liaw A, Wiener M. Classification and regression by randomForest. R News. 2002;2(3):18-22.
    • (2002) R News , vol.2 , Issue.3 , pp. 18-22
    • Liaw, A.1    Wiener, M.2
  • 31
    • 0031599142 scopus 로고    scopus 로고
    • Mersenne twister: A 623-dimensionally equidistributed uniform pseudo-random number generator
    • Matsumoto M, Nishimura T. Mersenne twister: a 623-dimensionally equidistributed uniform pseudo-random number generator. ACM Trans Model Comput Simul. 1998; 8(1):3-30.
    • (1998) ACM Trans Model Comput Simul , vol.8 , Issue.1 , pp. 3-30
    • Matsumoto, M.1    Nishimura, T.2
  • 33
    • 79959703102 scopus 로고    scopus 로고
    • Estimating residual variance in random forest regression
    • Mendez G, Lohr S. Estimating residual variance in random forest regression. Comput Stat Data Anal. 2011;55(11): 2937-2950.
    • (2011) Comput Stat Data Anal , vol.55 , Issue.11 , pp. 2937-2950
    • Mendez, G.1    Lohr, S.2
  • 34
    • 0030539070 scopus 로고    scopus 로고
    • Multiple imputation after 18+ years
    • Rubin DB. Multiple imputation after 18+ years. J Am Stat Assoc. 1996;91(434):473-489.
    • (1996) J Am Stat Assoc , vol.91 , Issue.434 , pp. 473-489
    • Rubin, D.B.1
  • 35
    • 78650635637 scopus 로고    scopus 로고
    • Comparison of imputation methods for handling missing covariate data when fitting a Cox proportional hazards model: A resampling study
    • Marshall A, Altman DG, Holder RL. Comparison of imputation methods for handling missing covariate data when fitting a Cox proportional hazards model: a resampling study. BMC Med Res Methodol. 2010;10:112.
    • (2010) BMC Med Res Methodol , vol.10 , pp. 112
    • Marshall, A.1    Altman, D.G.2    Holder, R.L.3
  • 36
    • 84856274182 scopus 로고    scopus 로고
    • REALCOMIMPUTE software for multilevel multiple imputation with mixed response types
    • Carpenter JR, Goldstein H, Kenward MG. REALCOMIMPUTE software for multilevel multiple imputation with mixed response types. J Stat Softw. 2011;45(5):1-14.
    • (2011) J Stat Softw , vol.45 , Issue.5 , pp. 1-14
    • Carpenter, J.R.1    Goldstein, H.2    Kenward, M.G.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.