메뉴 건너뛰기




Volumn 105, Issue 2, 2011, Pages 157-170

Empirical comparison of tree ensemble variable importance measures

Author keywords

Boosted trees; Conditional inference forests; Decision trees; Ensemble learning; Fault identification; Random forests; Variable importance

Indexed keywords

ACCURACY; ARTICLE; CONTROLLED STUDY; DECISION TREE; INTERMETHOD COMPARISON; MATHEMATICAL COMPUTING; MATHEMATICAL VARIABLE; PRIORITY JOURNAL; PROCESS MODEL; PROCESS OPTIMIZATION; QUALITATIVE ANALYSIS; SENSITIVITY AND SPECIFICITY; STATISTICAL ANALYSIS; STATISTICAL MODEL; VALIDATION PROCESS;

EID: 79951953906     PISSN: 01697439     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.chemolab.2010.12.004     Document Type: Article
Times cited : (82)

References (36)
  • 1
    • 33746317317 scopus 로고    scopus 로고
    • Predicting habitat suitability with machine learning models: the potential area of Pinus sylvestris L. in the Iberian Peninsula
    • Garzón M.B., Blazek R., Neteler M., Dios R.S.D., Ollero H.S., Furlanello C. Predicting habitat suitability with machine learning models: the potential area of Pinus sylvestris L. in the Iberian Peninsula. Ecological Modelling 2006, 197(3):383-393.
    • (2006) Ecological Modelling , vol.197 , Issue.3 , pp. 383-393
    • Garzón, M.B.1    Blazek, R.2    Neteler, M.3    Dios, R.S.D.4    Ollero, H.S.5    Furlanello, C.6
  • 6
    • 30344489020 scopus 로고    scopus 로고
    • QSAR analysis of phenolic antioxidants using MOLMAP descriptors of local properties
    • Gupta S., Matthew S., Abreu P.M., Aires-de-Sousa J. QSAR analysis of phenolic antioxidants using MOLMAP descriptors of local properties. Bioorganic & medicinal chemistry 2006, 14(4):1199-1206.
    • (2006) Bioorganic & medicinal chemistry , vol.14 , Issue.4 , pp. 1199-1206
    • Gupta, S.1    Matthew, S.2    Abreu, P.M.3    Aires-de-Sousa, J.4
  • 9
    • 13244262710 scopus 로고    scopus 로고
    • Few amino acid positions in rpoB are associated with most of the rifampin resistance in Mycobacterium tuberculosis
    • Cummings M., Segal M. Few amino acid positions in rpoB are associated with most of the rifampin resistance in Mycobacterium tuberculosis. BMC Bioinformatics 2004, 5(1):137-143.
    • (2004) BMC Bioinformatics , vol.5 , Issue.1 , pp. 137-143
    • Cummings, M.1    Segal, M.2
  • 10
    • 30644464444 scopus 로고    scopus 로고
    • Gene selection and classification of microarray data using random forest
    • Diaz-Uriarte R., Alvarez de Andres S. Gene selection and classification of microarray data using random forest. BMC Bioinformatics 2006, 7(1):3-15.
    • (2006) BMC Bioinformatics , vol.7 , Issue.1 , pp. 3-15
    • Diaz-Uriarte, R.1    Alvarez de Andres, S.2
  • 12
    • 35748978234 scopus 로고    scopus 로고
    • Empirical characterization of random forest variable importance measures
    • Archer K.J., Kimes R.V. Empirical characterization of random forest variable importance measures. Computational Statistics & Data Analysis 2008, 52(4):2249-2260.
    • (2008) Computational Statistics & Data Analysis , vol.52 , Issue.4 , pp. 2249-2260
    • Archer, K.J.1    Kimes, R.V.2
  • 13
    • 67650770061 scopus 로고    scopus 로고
    • Predictor correlation impacts machine learning algorithms: implications for genomic studies
    • Nicodemus K.K., Malley J.D. Predictor correlation impacts machine learning algorithms: implications for genomic studies. Bioinformatics 2009, 25(15):1884-1890.
    • (2009) Bioinformatics , vol.25 , Issue.15 , pp. 1884-1890
    • Nicodemus, K.K.1    Malley, J.D.2
  • 15
    • 33847096395 scopus 로고    scopus 로고
    • Bias in random forest variable importance measures: illustrations, sources and a solution
    • Strobl C., Boulesteix A.L., Zeileis A., Hothorn T. Bias in random forest variable importance measures: illustrations, sources and a solution. BMC Bioinformatics 2007, 8:25-45.
    • (2007) BMC Bioinformatics , vol.8 , pp. 25-45
    • Strobl, C.1    Boulesteix, A.L.2    Zeileis, A.3    Hothorn, T.4
  • 17
    • 70450064693 scopus 로고    scopus 로고
    • Variable importance assessment in regression: linear regression versus random forest
    • Grömping U. Variable importance assessment in regression: linear regression versus random forest. The American Statistician 2009, 63(4):308-319.
    • (2009) The American Statistician , vol.63 , Issue.4 , pp. 308-319
    • Grömping, U.1
  • 19
    • 53549131556 scopus 로고    scopus 로고
    • A bias correction algorithm for the Gini variable importance measure in classification trees
    • Sandri M., Zuccolotto P. A bias correction algorithm for the Gini variable importance measure in classification trees. Journal of Computational and Graphical Statistics 2008, 17(3):611-628.
    • (2008) Journal of Computational and Graphical Statistics , vol.17 , Issue.3 , pp. 611-628
    • Sandri, M.1    Zuccolotto, P.2
  • 22
    • 0030211964 scopus 로고    scopus 로고
    • Bagging predictors
    • Breiman L. Bagging predictors. Machine Learning 1996, 24(2):123-140.
    • (1996) Machine Learning , vol.24 , Issue.2 , pp. 123-140
    • Breiman, L.1
  • 23
    • 0035478854 scopus 로고    scopus 로고
    • Random forests
    • Breiman L. Random forests. Machine Learning 2001, 45(1):5-32.
    • (2001) Machine Learning , vol.45 , Issue.1 , pp. 5-32
    • Breiman, L.1
  • 25
    • 0035470889 scopus 로고    scopus 로고
    • Greedy function approximation: a gradient boosting machine
    • Friedman J.H. Greedy function approximation: a gradient boosting machine. The Annals of Statistics 2001, 29(5):1189-1232.
    • (2001) The Annals of Statistics , vol.29 , Issue.5 , pp. 1189-1232
    • Friedman, J.H.1
  • 28
    • 41349116565 scopus 로고    scopus 로고
    • A framework to identify physiological responses in microarray-based gene expression studies: selection and interpretation of biologically relevant genes
    • Rodenburg W., Heidema A.G., Boer J.M.A., Bovee-Oudenhoven I.M.J., Feskens E.J.M., Mariman E.C.M., Keijer J. A framework to identify physiological responses in microarray-based gene expression studies: selection and interpretation of biologically relevant genes. Physiological Genomics Oct. 2008, 33(1):78-90.
    • (2008) Physiological Genomics , vol.33 , Issue.1 , pp. 78-90
    • Rodenburg, W.1    Heidema, A.G.2    Boer, J.M.A.3    Bovee-Oudenhoven, I.M.J.4    Feskens, E.J.M.5    Mariman, E.C.M.6    Keijer, J.7
  • 29
    • 79951480123 scopus 로고    scopus 로고
    • R Development Core Team R Foundation for Statistical Computing, Vienna, Austria
    • R Development Core Team R: A Language and Environment for Statistical Computing 2010, R Foundation for Statistical Computing, Vienna, Austria.
    • (2010) R: A Language and Environment for Statistical Computing
  • 30
    • 0345040873 scopus 로고    scopus 로고
    • Classification and regression by randomForest
    • Liaw A., Wiener M. Classification and regression by randomForest. R News 2002, 2(3):18-22.
    • (2002) R News , vol.2 , Issue.3 , pp. 18-22
    • Liaw, A.1    Wiener, M.2
  • 32
    • 77949388276 scopus 로고    scopus 로고
    • The behaviour of random forest permutation-based variable importance measures under predictor correlation
    • Nicodemus K.K., Malley J.D., Strobl C., Ziegler A. The behaviour of random forest permutation-based variable importance measures under predictor correlation. BMC Bioinformatics 2010, 11(1):110-122.
    • (2010) BMC Bioinformatics , vol.11 , Issue.1 , pp. 110-122
    • Nicodemus, K.K.1    Malley, J.D.2    Strobl, C.3    Ziegler, A.4
  • 33
    • 21244436700 scopus 로고    scopus 로고
    • Performance of some variable selection methods when multicollinearity is present
    • Chong I., Jun C. Performance of some variable selection methods when multicollinearity is present. Chemometrics and Intelligent Laboratory Systems Jul. 2005, 78(1):103-112.
    • (2005) Chemometrics and Intelligent Laboratory Systems , vol.78 , Issue.1 , pp. 103-112
    • Chong, I.1    Jun, C.2
  • 34
    • 0034621334 scopus 로고    scopus 로고
    • Fault detection in industrial processes using canonical variate analysis and dynamic principal component analysis
    • Russell E.L., Chiang L.H., Braatz R.D. Fault detection in industrial processes using canonical variate analysis and dynamic principal component analysis. Chemometrics and Intelligent Laboratory Systems 2000, 51(1):81-93.
    • (2000) Chemometrics and Intelligent Laboratory Systems , vol.51 , Issue.1 , pp. 81-93
    • Russell, E.L.1    Chiang, L.H.2    Braatz, R.D.3
  • 35
    • 0027561446 scopus 로고
    • A plant-wide industrial process control problem
    • Downs J.J., Vogel E.F. A plant-wide industrial process control problem. Computers & Chemical Engineering 1993, 17(3):245-255.
    • (1993) Computers & Chemical Engineering , vol.17 , Issue.3 , pp. 245-255
    • Downs, J.J.1    Vogel, E.F.2
  • 36
    • 79951948649 scopus 로고    scopus 로고
    • Data-driven techniques for fault detection and diagnosis in chemical processes
    • Russell E.L., Chiang L.H., Braatz R.D. Data-driven techniques for fault detection and diagnosis in chemical processes. Springer 2000.
    • (2000) Springer
    • Russell, E.L.1    Chiang, L.H.2    Braatz, R.D.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.