



Volume 26, Issue 14, 2010, Pages 1752-1758

On safari to random Jungle: A fast implementation of random forests for high-dimensional data

Author keywords

[No Author keywords available]

Indexed keywords

ARTICLE; COMPUTER PROGRAM; CROHN DISEASE; GENETIC ASSOCIATION; GENETIC PREDISPOSITION; GENETICS; GENOMICS; HUMAN; HUMAN GENOME; METHODOLOGY; SINGLE NUCLEOTIDE POLYMORPHISM;

EID: 77954485448     PISSN: 13674803     EISSN: 14602059     Source Type: Journal    
DOI: 10.1093/bioinformatics/btq257     Document Type: Article
Times cited : (202)

References (57)
  • 1
    • 35748978234
    • Empirical characterization of random forest variable importance measures
    • Archer, K.J. and Kimes, R.V. (2008) Empirical characterization of random forest variable importance measures. Comput. Stat. Data Anal., 52, 2249-2260.
    • (2008) Comput. Stat. Data Anal. , vol.52 , pp. 2249-2260
    • Archer, K.J.1    Kimes, R.V.2
  • 2
    • 24744462114
    • Tumor necrosis factor-related apoptosis-inducing ligand-mediated proliferation of tumor cells with receptor-proximal apoptosis defects
    • Baader, E. et al. (2005) Tumor necrosis factor-related apoptosis-inducing ligand-mediated proliferation of tumor cells with receptor-proximal apoptosis defects. Cancer Res., 65, 7888-7895.
    • (2005) Cancer Res. , vol.65 , pp. 7888-7895
    • Baader, E.1
  • 3
    • 0035883162
    • Disruption of NF-kappaB signaling reveals a novel role for NF-kappaB in the regulation of TNF-related apoptosis-inducing ligand expression
    • Baetu, T.M. et al. (2001) Disruption of NF-kappaB signaling reveals a novel role for NF-kappaB in the regulation of TNF-related apoptosis-inducing ligand expression. J. Immunol., 167, 3164-3173.
    • (2001) J. Immunol. , vol.167 , pp. 3164-3173
    • Baetu, T.M.1
  • 4
    • 48349136889
    • Genome-wide association defines more than 30 distinct susceptibility loci for Crohn's disease
    • Barrett, J.C. et al. (2008) Genome-wide association defines more than 30 distinct susceptibility loci for Crohn's disease. Nat. Genet., 40, 955-962.
    • (2008) Nat. Genet. , vol.40 , pp. 955-962
    • Barrett, J.C.1
  • 5
    • 0030211964
    • Bagging predictors
    • Breiman, L. (1996) Bagging predictors. Mach. Learn., 24, 123-140.
    • (1996) Mach. Learn. , vol.24 , pp. 123-140
    • Breiman, L.1
  • 6
    • 0035478854
    • Random Forests
    • Breiman, L. (2001) Random Forests. Mach. Learn., 45, 5-32.
    • (2001) Mach. Learn. , vol.45 , pp. 5-32
    • Breiman, L.1
  • 7
    • 77954489577
    • Random Forests 5.1. Available at (last accessed date April 16, 2010)
    • Breiman, L. and Cutler, A. (2004) Random Forests 5.1. Available at http://www.stat.berkeley.edu/~breiman/RandomForests/cc_software.htm (last accessed date April 16, 2010).
    • (2004)
    • Breiman, L.1    Cutler, A.2
  • 8
    • 12744259874
    • Identifying SNPs predictive of phenotype using random forests
    • Bureau, A. et al. (2005) Identifying SNPs predictive of phenotype using random forests. Genet. Epidemiol., 28, 171-182.
    • (2005) Genet. Epidemiol. , vol.28 , pp. 171-182
    • Bureau, A.1
  • 9
    • 33646875285
    • Meta-analysis: colorectal and small bowel cancer risk in patients with Crohn's disease
    • Canavan, C. et al. (2006) Meta-analysis: colorectal and small bowel cancer risk in patients with Crohn's disease. Aliment Pharmacol. Ther., 23, 1097-1104.
    • (2006) Aliment Pharmacol. Ther. , vol.23 , pp. 1097-1104
    • Canavan, C.1
  • 10
    • 53349100175
    • Pathway analysis of single-nucleotide polymorphisms potentially associated with glioblastoma multiforme susceptibility using random forests
    • Chang, J.S. et al. (2008) Pathway analysis of single-nucleotide polymorphisms potentially associated with glioblastoma multiforme susceptibility using random forests. Cancer Epidemiol. Biomarkers Prev., 17, 1368-1373.
    • (2008) Cancer Epidemiol. Biomarkers Prev. , vol.17 , pp. 1368-1373
    • Chang, J.S.1
  • 11
    • 67349166946
    • Genome-wide association studies: detecting gene-gene interactions that underlie human diseases
    • Cordell, H.J. (2009) Genome-wide association studies: detecting gene-gene interactions that underlie human diseases. Nat. Rev. Genet., 10, 392-404.
    • (2009) Nat. Rev. Genet. , vol.10 , pp. 392-404
    • Cordell, H.J.1
  • 13
    • 30644464444
    • Gene selection and classification of microarray data using random forest
    • Diaz-Uriarte, R. and Alvarez de Andres, S. (2006) Gene selection and classification of microarray data using random forest. BMC Bioinformatics, 7, 3.
    • (2006) BMC Bioinformatics , vol.7 , pp. 3
    • Diaz-Uriarte, R.1    Alvarez de Andres, S.2
  • 14
    • 33845340501
    • A genome-wide association study identifies IL23R as an inflammatory bowel disease gene
    • Duerr, R.H. et al. (2006) A genome-wide association study identifies IL23R as an inflammatory bowel disease gene. Science, 314, 1461-1463.
    • (2006) Science , vol.314 , pp. 1461-1463
    • Duerr, R.H.1
  • 15
    • 0025167285
    • Increased risk of large-bowel cancer in Crohn's disease with colonic involvement
    • Ekbom, A. et al. (1990) Increased risk of large-bowel cancer in Crohn's disease with colonic involvement. Lancet, 336, 357-359.
    • (1990) Lancet , vol.336 , pp. 357-359
    • Ekbom, A.1
  • 16
    • 0036828844
    • Induction of NOD2 in myelomonocytic and intestinal epithelial cells via nuclear factor-kappaB activation
    • Gutierrez, O. et al. (2002) Induction of NOD2 in myelomonocytic and intestinal epithelial cells via nuclear factor-kappaB activation. J. Biol. Chem., 277, 41701-41705.
    • (2002) J. Biol. Chem. , vol.277 , pp. 41701-41705
    • Gutierrez, O.1
  • 17
    • 0034527775
    • Selecting SNPs in two-stage analysis of disease association data: a model-free approach
    • Hoh, J. et al. (2000) Selecting SNPs in two-stage analysis of disease association data: a model-free approach. Ann. Hum. Genet., 64, 413-417.
    • (2000) Ann. Hum. Genet. , vol.64 , pp. 413-417
    • Hoh, J.1
  • 18
    • 33749677657
    • Unbiased recursive partitioning
    • Hothorn, T. et al. (2006) Unbiased recursive partitioning. J. Comput. Graph. Stat., 15, 651-674.
    • (2006) J. Comput. Graph. Stat. , vol.15 , pp. 651-674
    • Hothorn, T.1
  • 19
    • 61449230785
    • Interpretation of genetic association studies: markers with replicated highly significant odds ratios may be poor classifiers
    • Jakobsdottir, J. et al. (2009) Interpretation of genetic association studies: markers with replicated highly significant odds ratios may be poor classifiers. PLoS Genet., 5, e1000337.
    • (2009) PLoS Genet. , vol.5
    • Jakobsdottir, J.1
  • 20
    • 60849093174
    • A random forest approach to the detection of epistatic interactions in case-control studies
    • Jiang, R. et al. (2009) A random forest approach to the detection of epistatic interactions in case-control studies. BMC Bioinformatics, 10 (Suppl. 1), S65.
    • (2009) BMC Bioinformatics , vol.10 , Issue.SUPPL. 1
    • Jiang, R.1
  • 21
    • 58149347608
    • Patient-centered yes/no prognosis using learning machines
    • König, I.R. et al. (2008) Patient-centered yes/no prognosis using learning machines. Int. J. Data Min. Bioinform., 2, 289-341.
    • (2008) Int. J. Data Min. Bioinform. , vol.2 , pp. 289-341
    • König, I.R.1
  • 22
    • 0345040873
    • Classification and Regression by randomForest
    • Liaw, A. and Wiener, M. (2002) Classification and Regression by randomForest. R News, 2, 18-22.
    • (2002) R News , vol.2 , pp. 18-22
    • Liaw, A.1    Wiener, M.2
  • 23
    • 25444453244
    • Screening large-scale association study data: exploiting interactions using random forests
    • Lunetta, K.L. et al. (2004) Screening large-scale association study data: exploiting interactions using random forests. BMC Genet., 5, 32.
    • (2004) BMC Genet. , vol.5 , pp. 32
    • Lunetta, K.L.1
  • 24
    • 0001457509
    • Some methods of classification and analysis of multivariate observations
    • University of California Press, Berkeley and Los Angeles, California
    • Macqueen, J.B. (1967) Some methods of classification and analysis of multivariate observations. In Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability, University of California Press, Berkeley and Los Angeles, California, pp. 281-297.
    • (1967) Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability , pp. 281-297
    • Macqueen, J.B.1
  • 25
    • 70349956433
    • Finding the missing heritability of complex diseases
    • Manolio, T.A. et al. (2009) Finding the missing heritability of complex diseases. Nature, 461, 747-753.
    • (2009) Nature , vol.461 , pp. 747-753
    • Manolio, T.A.1
  • 26
    • 16844366786
    • Genome-wide strategies for detecting multiple loci that influence complex diseases
    • Marchini, J. et al. (2005) Genome-wide strategies for detecting multiple loci that influence complex diseases. Nat. Genet., 37, 413-417.
    • (2005) Nat. Genet. , vol.37 , pp. 413-417
    • Marchini, J.1
  • 27
    • 34347344976
    • A new multipoint method for genome-wide association studies by imputation of genotypes
    • Marchini, J. et al. (2007) A new multipoint method for genome-wide association studies by imputation of genotypes. Nat. Genet., 39, 906-913.
    • (2007) Nat. Genet. , vol.39 , pp. 906-913
    • Marchini, J.1
  • 28
    • 42349112088
    • Genome-wide association studies for complex traits: consensus, uncertainty and challenges
    • McCarthy, M.I. et al. (2008) Genome-wide association studies for complex traits: consensus, uncertainty and challenges. Nat. Rev. Genet., 9, 356-369.
    • (2008) Nat. Rev. Genet. , vol.9 , pp. 356-369
    • McCarthy, M.I.1
  • 29
    • 33744937046
    • Machine learning for detecting gene-gene interactions: a review
    • McKinney, B.A. et al. (2006) Machine learning for detecting gene-gene interactions: a review. Appl. Bioinformatics, 5, 77-88.
    • (2006) Appl. Bioinformatics , vol.5 , pp. 77-88
    • McKinney, B.A.1
  • 30
    • 63449100294
    • Capturing the spectrum of interaction effects in genetic association studies by simulated evaporative cooling network analysis
    • McKinney, B.A. et al. (2009) Capturing the spectrum of interaction effects in genetic association studies by simulated evaporative cooling network analysis. PLoS Genet., 5, e1000432.
    • (2009) PLoS Genet. , vol.5
    • McKinney, B.A.1
  • 31
    • 64549095229
    • Performance of random forest when SNPs are in linkage disequilibrium
    • Meng, Y. et al. (2009) Performance of random forest when SNPs are in linkage disequilibrium. BMC Bioinformatics, 10, 78.
    • (2009) BMC Bioinformatics , vol.10 , pp. 78
    • Meng, Y.1
  • 32
    • 38049072586
    • Genetic Analysis Workshop 15: simulation of a complex genetic model for rheumatoid arthritis in nuclear families including a dense SNP map with linkage disequilibrium between marker loci and trait loci
    • Miller, M.B. et al. (2007) Genetic Analysis Workshop 15: simulation of a complex genetic model for rheumatoid arthritis in nuclear families including a dense SNP map with linkage disequilibrium between marker loci and trait loci. BMC Proc., 1 (Suppl. 1), S4.
    • (2007) BMC Proc. , vol.1 , Issue.SUPPL. 1
    • Miller, M.B.1
  • 33
    • 77949497074
    • Bioinformatics challenges for genome-wide association studies
    • Moore, J.H. et al. (2010) Bioinformatics challenges for genome-wide association studies. Bioinformatics, 26, 445-455.
    • (2010) Bioinformatics , vol.26 , pp. 445-455
    • Moore, J.H.1
  • 34
    • 67650770061
    • Predictor correlation impacts machine learning algorithms: implications for genomic studies
    • Nicodemus, K.K. and Malley, J.D. (2009) Predictor correlation impacts machine learning algorithms: implications for genomic studies. Bioinformatics, 25, 1884-1890.
    • (2009) Bioinformatics , vol.25 , pp. 1884-1890
    • Nicodemus, K.K.1    Malley, J.D.2
  • 35
    • 77949388276
    • The behaviour of random forest permutation-based variable importance measures under predictor correlation
    • Nicodemus, K.K. et al. (2010) The behaviour of random forest permutation-based variable importance measures under predictor correlation. BMC Bioinformatics, 11, 110.
    • (2010) BMC Bioinformatics , vol.11 , pp. 110
    • Nicodemus, K.K.1
  • 36
    • 0035870249
    • Overexpression of a dominant-negative signal transducer and activator of transcription 3 variant in tumor cells leads to production of soluble factors that induce apoptosis and cell cycle arrest
    • Niu, G. et al. (2001) Overexpression of a dominant-negative signal transducer and activator of transcription 3 variant in tumor cells leads to production of soluble factors that induce apoptosis and cell cycle arrest. Cancer Res., 61, 3276-3280.
    • (2001) Cancer Res. , vol.61 , pp. 3276-3280
    • Niu, G.1
  • 37
    • 10744227070
    • Control of pancreas and liver gene expression by HNF transcription factors
    • Odom, D.T. et al. (2004) Control of pancreas and liver gene expression by HNF transcription factors. Science, 303, 1378-1381.
    • (2004) Science , vol.303 , pp. 1378-1381
    • Odom, D.T.1
  • 38
    • 0036604306
    • A receptor for the heterodimeric cytokine IL-23 is composed of IL-12Rbeta1 and a novel cytokine receptor subunit, IL-23R
    • Parham, C. et al. (2002) A receptor for the heterodimeric cytokine IL-23 is composed of IL-12Rbeta1 and a novel cytokine receptor subunit, IL-23R. J. Immunol., 168, 5699-5708.
    • (2002) J. Immunol. , vol.168 , pp. 5699-5708
    • Parham, C.1
  • 39
    • 0035227182
    • Classification methods for confronting heterogeneity
    • Province, M.A. et al. (2001) Classification methods for confronting heterogeneity. Adv. Genet., 42, 273-286.
    • (2001) Adv. Genet. , vol.42 , pp. 273-286
    • Province, M.A.1
  • 40
  • 41
    • 34247554965
    • Genome-wide association study identifies new susceptibility loci for Crohn disease and implicates autophagy in disease pathogenesis
    • Rioux, J.D. et al. (2007) Genome-wide association study identifies new susceptibility loci for Crohn disease and implicates autophagy in disease pathogenesis. Nat. Genet., 39, 596-604.
    • (2007) Nat. Genet. , vol.39 , pp. 596-604
    • Rioux, J.D.1
  • 42
    • 35848935648
    • Parallels between global transcriptional programs of polarizing Caco-2 intestinal epithelial cells in vitro and gene expression programs in normal colon and colon cancer
    • Saaf, A.M. et al. (2007) Parallels between global transcriptional programs of polarizing Caco-2 intestinal epithelial cells in vitro and gene expression programs in normal colon and colon cancer. Mol. Biol. Cell., 18, 4245-4260.
    • (2007) Mol. Biol. Cell. , vol.18 , pp. 4245-4260
    • Saaf, A.M.1
  • 43
    • 34547623750
    • Genomewide association analysis of coronary artery disease
    • Samani, N.J. et al. (2007) Genomewide association analysis of coronary artery disease. N. Engl. J. Med., 357, 443-453.
    • (2007) N. Engl. J. Med. , vol.357 , pp. 443-453
    • Samani, N.J.1
  • 44
    • 0025448521
    • The strength of weak learnability
    • Schapire, R.E. (1990) The strength of weak learnability. Mach. Learn., 5, 197-227.
    • (1990) Mach. Learn. , vol.5 , pp. 197-227
    • Schapire, R.E.1
  • 45
    • 38049070957
    • Picking single-nucleotide polymorphisms in forests
    • Schwarz, D.F. et al. (2007) Picking single-nucleotide polymorphisms in forests. BMC Proc., 1(Suppl. 1), S59.
    • (2007) BMC Proc. , vol.1 , Issue.SUPPL. 1
    • Schwarz, D.F.1
  • 46
    • 71249140807
    • Evaluation of single-nucleotide polymorphism imputation using random forests
    • Schwarz, D.F. et al. (2009) Evaluation of single-nucleotide polymorphism imputation using random forests. BMC Proc., 3, S65.
    • (2009) BMC Proc. , vol.3
    • Schwarz, D.F.1
  • 47
    • 0037040834
    • Sp1 transcription factor as a molecular target for nitric oxide- and cyclic nucleotide-mediated suppression of cGMP-dependent protein kinase-Ialpha expression in vascular smooth muscle cells
    • Sellak, H. et al. (2002) Sp1 transcription factor as a molecular target for nitric oxide- and cyclic nucleotide-mediated suppression of cGMP-dependent protein kinase-Ialpha expression in vascular smooth muscle cells. Circ. Res., 90, 405-412.
    • (2002) Circ. Res. , vol.90 , pp. 405-412
    • Sellak, H.1
  • 48
    • 48549095457
    • Conditional variable importance for random forests
    • Strobl, C. et al. (2008) Conditional variable importance for random forests. BMC Bioinformatics, 9, 307.
    • (2008) BMC Bioinformatics , vol.9 , pp. 307
    • Strobl, C.1
  • 49
    • 33847096395
    • Bias in random forest variable importance measures: illustrations, sources and a solution
    • Strobl, C. et al. (2007) Bias in random forest variable importance measures: illustrations, sources and a solution. BMC Bioinformatics, 8, 25.
    • (2007) BMC Bioinformatics , vol.8 , pp. 25
    • Strobl, C.1
  • 50
    • 38049048625
    • Classification of rheumatoid arthritis status with candidate gene and genome-wide single-nucleotide polymorphisms using random forests
    • Sun, Y.V. et al. (2007) Classification of rheumatoid arthritis status with candidate gene and genome-wide single-nucleotide polymorphisms using random forests. BMC Proc., 1(Suppl. 1), S62.
    • (2007) BMC Proc. , vol.1 , Issue.SUPPL. 1
    • Sun, Y.V.1
  • 51
    • 0036735391
    • Cyclooxygenase-2 overexpression inhibits death receptor 5 expression and confers resistance to tumor necrosis factor-related apoptosis-inducing ligand-induced apoptosis in human colon cancer cells
    • Tang, X. et al. (2002) Cyclooxygenase-2 overexpression inhibits death receptor 5 expression and confers resistance to tumor necrosis factor-related apoptosis-inducing ligand-induced apoptosis in human colon cancer cells. Cancer Res., 62, 4903-4908.
    • (2002) Cancer Res. , vol.62 , pp. 4903-4908
    • Tang, X.1
  • 52
    • 0043016254
    • Rottlerin sensitizes colon carcinoma cells to tumor necrosis factor-related apoptosis-inducing ligand-induced apoptosis via uncoupling of the mitochondria independent of protein kinase C
    • Tillman, D.M. et al. (2003) Rottlerin sensitizes colon carcinoma cells to tumor necrosis factor-related apoptosis-inducing ligand-induced apoptosis via uncoupling of the mitochondria independent of protein kinase C. Cancer Res., 63, 5118-5125.
    • (2003) Cancer Res. , vol.63 , pp. 5118-5125
    • Tillman, D.M.1
  • 53
    • 84969213492
    • Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls
    • Wellcome Trust Case Control Consortium
    • Wellcome Trust Case Control Consortium (2007) Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature, 447, 661-678.
    • (2007) Nature , vol.447 , pp. 661-678
  • 54
    • 53049107304
    • Sp1-mediated TRAIL induction in chemosensitization
    • Xu, J. et al. (2008) Sp1-mediated TRAIL induction in chemosensitization. Cancer Res., 68, 6718-6726.
    • (2008) Cancer Res. , vol.68 , pp. 6718-6726
    • Xu, J.1
  • 55
    • 65849256988
    • Willows: a memory efficient tree and forest construction package
    • Zhang, H. et al. (2009) Willows: a memory efficient tree and forest construction package. BMC Bioinformatics, 10, 130.
    • (2009) BMC Bioinformatics , vol.10 , pp. 130
    • Zhang, H.1
  • 56
    • 38049006418
    • Data mining, neural nets, trees-problems 2 and 3 of Genetic Analysis Workshop 15
    • Ziegler, A. et al. (2007) Data mining, neural nets, trees-problems 2 and 3 of Genetic Analysis Workshop 15. Genet. Epidemiol., 31 (Suppl. 1), S51-S60.
    • (2007) Genet. Epidemiol. , vol.31 , Issue.SUPPL. 1
    • Ziegler, A.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.