메뉴 건너뛰기




Volumn 24, Issue 1, 2014, Pages 21-34

A new variable importance measure for random forests with missing data

Author keywords

Missing data; Missing values; Permutation importance; Random forests; Variable importance measures

Indexed keywords


EID: 84891629894     PISSN: 09603174     EISSN: None     Source Type: Journal    
DOI: 10.1007/s11222-012-9349-1     Document Type: Article
Times cited : (152)

References (50)
  • 1
    • 0017173699 scopus 로고
    • Sleep in Mammals: ecological and constitutional correlates
    • Allison, T., Cicchetti, D. V.: Sleep in Mammals: ecological and constitutional correlates. Science 194(4266), 732-734 (1976).
    • (1976) Science , vol.194 , Issue.4266 , pp. 732-734
    • Allison, T.1    Cicchetti, D.V.2
  • 2
    • 77952814988 scopus 로고    scopus 로고
    • Permutation importance: a corrected feature importance measure
    • Altmann, A., Tolosi, L., Sander, O., Lengauer, T.: Permutation importance: a corrected feature importance measure. Bioinformatics 26(10), 1340-1347 (2010).
    • (2010) Bioinformatics , vol.26 , Issue.10 , pp. 1340-1347
    • Altmann, A.1    Tolosi, L.2    Sander, O.3    Lengauer, T.4
  • 3
    • 35748978234 scopus 로고    scopus 로고
    • Empirical characterization of random forest variable importance measures
    • Archer, K., Kimes, R.: Empirical characterization of random forest variable importance measures. Comput. Stat. Data Anal. 52(4), 2249-2260 (2008).
    • (2008) Comput. Stat. Data Anal. , vol.52 , Issue.4 , pp. 2249-2260
    • Archer, K.1    Kimes, R.2
  • 4
    • 54249099241 scopus 로고    scopus 로고
    • Consistency of random forests and other averaging classifiers
    • Biau, G., Devroye, L., Lugosi, G.: Consistency of random forests and other averaging classifiers. J. Mach. Learn. Res. 9, 2015-2033 (2008).
    • (2008) J. Mach. Learn. Res. , vol.9 , pp. 2015-2033
    • Biau, G.1    Devroye, L.2    Lugosi, G.3
  • 6
    • 0030211964 scopus 로고    scopus 로고
    • Bagging predictors
    • Breiman, L.: Bagging predictors. Mach. Learn. 24(2), 123-140 (1996).
    • (1996) Mach. Learn. , vol.24 , Issue.2 , pp. 123-140
    • Breiman, L.1
  • 7
    • 0035478854 scopus 로고    scopus 로고
    • Random forests
    • Breiman, L.: Random forests. Mach. Learn. 45(1), 5-32 (2001).
    • (2001) Mach. Learn. , vol.45 , Issue.1 , pp. 5-32
    • Breiman, L.1
  • 8
    • 0041382385 scopus 로고    scopus 로고
    • (accessed 03. 02. 2011)
    • Breiman, L., Cutler, A.: Random forests (2008). http://www. stat. berkeley. edu/users/breiman/RandomForests/cc_home. htm (accessed 03. 02. 2011).
    • (2008) Random forests
    • Breiman, L.1    Cutler, A.2
  • 10
    • 84857095805 scopus 로고    scopus 로고
    • The use of classification trees for bioinformatics
    • Chen, X., Wang, M., Zhang, H.: The use of classification trees for bioinformatics. Data Min. Knowl. Discov. 1(1), 55-63 (2011).
    • (2011) Data Min. Knowl. Discov. , vol.1 , Issue.1 , pp. 55-63
    • Chen, X.1    Wang, M.2    Zhang, H.3
  • 12
    • 30644464444 scopus 로고    scopus 로고
    • Gene selection and classification of microarray data using random forest
    • Díaz-Uriarte, R., Alvarez de Andrés, S.: Gene selection and classification of microarray data using random forest. BMC Bioinform. 7(1), 3 (2006).
    • (2006) BMC Bioinform. , vol.7 , Issue.1 , pp. 3
    • Díaz-Uriarte, R.1    Alvarez de Andrés, S.2
  • 13
    • 1842692307 scopus 로고    scopus 로고
    • Bias correction in classification tree construction
    • Williams College, Williamstown, MA, USA, C. E. Brodley and A. P. Danyluk (Eds.), San Mateo: Morgan Kaufmann
    • Dobra, A., Gehrke, J.: Bias correction in classification tree construction. In: Brodley, C. E., Danyluk, A. P. (eds.) Proceedings of the Eighteenth International Conference on Machine Learning (ICML 2001), Williams College, Williamstown, MA, USA, pp. 90-97. Morgan Kaufmann, San Mateo (2001).
    • (2001) Proceedings of the Eighteenth International Conference on Machine Learning (ICML 2001) , pp. 90-97
    • Dobra, A.1    Gehrke, J.2
  • 19
    • 33749677657 scopus 로고    scopus 로고
    • Unbiased recursive partitioning: a conditional inference framework
    • Hothorn, T., Hornik, K., Zeileis, A.: Unbiased recursive partitioning: a conditional inference framework. J. Comput. Graph. Stat. 15(3), 651-674 (2006).
    • (2006) J. Comput. Graph. Stat. , vol.15 , Issue.3 , pp. 651-674
    • Hothorn, T.1    Hornik, K.2    Zeileis, A.3
  • 23
    • 1542573450 scopus 로고    scopus 로고
    • Classification trees with unbiased multiway splits
    • Kim, H., Loh, W.: Classification trees with unbiased multiway splits. J. Am. Stat. Assoc. 96, 589-604 (2001).
    • (2001) J. Am. Stat. Assoc. , vol.96 , pp. 589-604
    • Kim, H.1    Loh, W.2
  • 24
    • 33745653724 scopus 로고    scopus 로고
    • Random forests and adaptive nearest neighbors
    • Lin, Y., Jeon, Y.: Random forests and adaptive nearest neighbors. J. Am. Stat. Assoc. 101(474), 578-590 (2006).
    • (2006) J. Am. Stat. Assoc. , vol.101 , Issue.474 , pp. 578-590
    • Lin, Y.1    Jeon, Y.2
  • 26
    • 25444453244 scopus 로고    scopus 로고
    • Screening large-scale association study data: Exploiting interactions using random forests
    • Lunetta, K., Hayward, B. L., Segal, J., van Eerdewegh, P.: Screening large-scale association study data: exploiting interactions using random forests. BMC Genetics 5(1) (2004).
    • (2004) BMC Genetics , vol.5 , Issue.1
    • Lunetta, K.1    Hayward, B.L.2    Segal, J.3    van Eerdewegh, P.4
  • 27
    • 82255174148 scopus 로고    scopus 로고
    • Letter to the editor: On the stability and ranking of predictors from random forest variable importance measures
    • Nicodemus, K.: Letter to the editor: On the stability and ranking of predictors from random forest variable importance measures. Brief. Bioinform. (2011).
    • (2011) Brief. Bioinform
    • Nicodemus, K.1
  • 28
    • 77949388276 scopus 로고    scopus 로고
    • The behaviour of random forest permutation-based variable importance measures under predictor correlation
    • Nicodemus, K., Malley, J., Strobl, C., Ziegler, A.: The behaviour of random forest permutation-based variable importance measures under predictor correlation. BMC Bioinform. 11(1), 110 (2010).
    • (2010) BMC Bioinform. , vol.11 , Issue.1 , pp. 110
    • Nicodemus, K.1    Malley, J.2    Strobl, C.3    Ziegler, A.4
  • 29
    • 34248177183 scopus 로고    scopus 로고
    • The problem of disguised missing data
    • Pearson, R. K.: The problem of disguised missing data. ACM SIGKDD Explor. Newsl. 8(1), 83-92 (2006).
    • (2006) ACM SIGKDD Explor. Newsl. , vol.8 , Issue.1 , pp. 83-92
    • Pearson, R.K.1
  • 33
    • 41349116565 scopus 로고    scopus 로고
    • A framework to identify physiological responses in microarray-based gene expression studies: selection and interpretation of biologically relevant genes
    • Rodenburg, W., Heidema, A. G., Boer, J. M. A., Bovee-Oudenhoven, I. M. J., Feskens, E. J. M., Mariman, E. C. M., Keijer, J.: A framework to identify physiological responses in microarray-based gene expression studies: selection and interpretation of biologically relevant genes. Physiol. Genomics 33(1), 78-90 (2008).
    • (2008) Physiol. Genomics , vol.33 , Issue.1 , pp. 78-90
    • Rodenburg, W.1    Heidema, A.G.2    Boer, J.M.A.3    Bovee-Oudenhoven, I.M.J.4    Feskens, E.J.M.5    Mariman, E.C.M.6    Keijer, J.7
  • 34
    • 0017133178 scopus 로고
    • Inference and missing data
    • Rubin, D. B.: Inference and missing data. Biometrika 63(3), 581-592 (1976).
    • (1976) Biometrika , vol.63 , Issue.3 , pp. 581-592
    • Rubin, D.B.1
  • 37
    • 85047673373 scopus 로고    scopus 로고
    • Missing data: our view of the state of the art
    • Schafer, J. L., Graham, J. W.: Missing data: our view of the state of the art. Psychol. Methods 7(2), 147-177 (2002).
    • (2002) Psychol. Methods , vol.7 , Issue.2 , pp. 147-177
    • Schafer, J.L.1    Graham, J.W.2
  • 38
    • 34548250123 scopus 로고    scopus 로고
    • Unbiased split selection for classification trees based on the gini index
    • Strobl, C., Boulesteix, A.-L., Augustin, T.: Unbiased split selection for classification trees based on the gini index. Comput. Stat. Data Anal. 52(1), 483-501 (2007).
    • (2007) Comput. Stat. Data Anal. , vol.52 , Issue.1 , pp. 483-501
    • Strobl, C.1    Boulesteix, A.-L.2    Augustin, T.3
  • 39
    • 33847096395 scopus 로고    scopus 로고
    • Bias in random forest variable importance measures: illustrations, sources and a solution
    • Strobl, C., Boulesteix, A.-L., Zeileis, A., Hothorn, T.: Bias in random forest variable importance measures: illustrations, sources and a solution. BMC Bioinform. 8(1), 25 (2007).
    • (2007) BMC Bioinform. , vol.8 , Issue.1 , pp. 25
    • Strobl, C.1    Boulesteix, A.-L.2    Zeileis, A.3    Hothorn, T.4
  • 41
    • 72449170109 scopus 로고    scopus 로고
    • An introduction to recursive partitioning: rationale, application, and characteristics of classification and regression trees, bagging, and random forests
    • Strobl, C., Malley, J., Tutz, G.: An introduction to recursive partitioning: rationale, application, and characteristics of classification and regression trees, bagging, and random forests. Psychol. Methods 14(4), 323-348 (2009).
    • (2009) Psychol. Methods , vol.14 , Issue.4 , pp. 323-348
    • Strobl, C.1    Malley, J.2    Tutz, G.3
  • 42
    • 71249150835 scopus 로고    scopus 로고
    • Identification of genes and haplotypes that predict rheumatoid arthritis using random forests
    • Tang, R., Sinnwell, J., Li, J., Rider, D., de Andrade, M., Biernacka, J.: Identification of genes and haplotypes that predict rheumatoid arthritis using random forests. BMC Proceedings 3(7), S68 (2009).
    • (2009) BMC Proceedings , vol.3 , Issue.7
    • Tang, R.1    Sinnwell, J.2    Li, J.3    Rider, D.4    de Andrade, M.5    Biernacka, J.6
  • 43
    • 79953732420 scopus 로고    scopus 로고
    • MICE: Multivariate Imputation by Chained Equations in R. J
    • in press
    • van Buuren, S., Groothuis-Oudshoorn, K.: MICE: Multivariate Imputation by Chained Equations in R. J. Stat. Softw. 01-68 (2010, in press).
    • (2010) Stat. Softw. 01-68
    • van Buuren, S.1    Groothuis-Oudshoorn, K.2
  • 45
    • 77951970995 scopus 로고    scopus 로고
    • Maximal conditional chi-square importance in random forests
    • Wang, M., Chen, X., Zhang, H.: Maximal conditional chi-square importance in random forests. Bioinformatics 26(6), 831-837 (2010).
    • (2010) Bioinformatics , vol.26 , Issue.6 , pp. 831-837
    • Wang, M.1    Chen, X.2    Zhang, H.3
  • 46
    • 0028443213 scopus 로고
    • Bias in information based measures in decision tree induction
    • White, A., Liu, W.: Bias in information based measures in decision tree induction. Mach. Learn. 15(3), 321-329 (1994).
    • (1994) Mach. Learn. , vol.15 , Issue.3 , pp. 321-329
    • White, A.1    Liu, W.2
  • 47
    • 78651256743 scopus 로고    scopus 로고
    • Multiple imputation using chained equations: issues and guidance for practice
    • White, I. R., Royston, P., Wood, A. M.: Multiple imputation using chained equations: issues and guidance for practice. Stat. Med. 30(4), 377-399 (2011).
    • (2011) Stat. Med. , vol.30 , Issue.4 , pp. 377-399
    • White, I.R.1    Royston, P.2    Wood, A.M.3
  • 48
    • 71249150072 scopus 로고    scopus 로고
    • Selection of important variables by statistical learning in genome-wide association analysis
    • Yang, W. W. W., Gu, C. C.: Selection of important variables by statistical learning in genome-wide association analysis. BMC Proceedings 3(7) (2009).
    • (2009) BMC Proceedings , vol.3 , Issue.7
    • Yang, W.W.W.1    Gu, C.C.2
  • 49
    • 78650726103 scopus 로고    scopus 로고
    • Predicting individual tree attributes from airborne laser point clouds based on the random forests technique
    • Yu, X., Hyyppä, J., Vastaranta, M., Holopainen, M., Viitala, R.: Predicting individual tree attributes from airborne laser point clouds based on the random forests technique. ISPRS J. Photogramm. Remote Sens. 66(1), 28-37 (2011).
    • (2011) ISPRS J. Photogramm. Remote Sens. , vol.66 , Issue.1 , pp. 28-37
    • Yu, X.1    Hyyppä, J.2    Vastaranta, M.3    Holopainen, M.4    Viitala, R.5
  • 50
    • 78651580875 scopus 로고    scopus 로고
    • Gene selection using random forest and proximity differences criterion on dna microarray data
    • Zhou, Q., Hong, W., Luo, L., Yang, F.: Gene selection using random forest and proximity differences criterion on dna microarray data. J. Conv. Inf. Technol. 5(6), 161-170 (2010).
    • (2010) J. Conv. Inf. Technol. , vol.5 , Issue.6 , pp. 161-170
    • Zhou, Q.1    Hong, W.2    Luo, L.3    Yang, F.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.