Volume 11, Issue 12, 2016, Pages

Effective feature selection for classification of promoter sequences

Author keywords

[No Author keywords available]

Indexed keywords

DECISION TREE; EUKARYOTE; K NEAREST NEIGHBOR; NONHUMAN; PROMOTER REGION; REGULATORY MECHANISM; SPECIES; SUPPORT VECTOR MACHINE; ALGORITHM; BIOLOGICAL MODEL; BIOLOGY; GENE EXPRESSION REGULATION; PROCEDURES;

EID: 85006288684     PISSN: None     EISSN: 19326203     Source Type: Journal    
DOI: 10.1371/journal.pone.0167165     Document Type: Article
Times cited : (6)

References (50)
  • 1
    • 4744365291
    • Comprehensive vertical sample-based KNN/LSVM classification for gene expression analysis
    • Pan F, Wang B, Hu X, Perrizo W. Comprehensive vertical sample-based KNN/LSVM classification for gene expression analysis. Journal of Biomedical Informatics. 2004 Aug 31; 37(4):240-8. doi: 10.1016/j.jbi.2004.07.003 PMID: 15465477
    • (2004) Journal of Biomedical Informatics , vol.37 , Issue.4 , pp. 240-248
    • Pan, F.1    Wang, B.2    Hu, X.3    Perrizo, W.4
  • 2
    • 84925687964
    • The current status and challenges in computational analysis of genomic big data
    • Qin Y, Yalamanchili HK, Qin J, Yan B, Wang J. The current status and challenges in computational analysis of genomic big data. Big Data Research. 2015 Mar 31; 2(1):12-8.
    • (2015) Big Data Research , vol.2 , Issue.1 , pp. 12-18
    • Qin, Y.1    Yalamanchili, H.K.2    Qin, J.3    Yan, B.4    Wang, J.5
  • 3
    • 0036107903
    • Discovery of regulatory elements by a computational method for phylogenetic footprinting
    • Blanchette M, Tompa M. Discovery of regulatory elements by a computational method for phylogenetic footprinting. Genome research. 2002 May 1; 12(5):739-48. doi: 10.1101/gr.6902 PMID: 11997340
    • (2002) Genome research , vol.12 , Issue.5 , pp. 739-748
    • Blanchette, M.1    Tompa, M.2
  • 4
    • 0029038960
    • Predicting Pol II promoter sequences using transcription factor binding sites
    • Prestridge DS. Predicting Pol II promoter sequences using transcription factor binding sites. Journal of molecular biology. 1995 Jun 23; 249(5):923-32. doi: 10.1006/jmbi.1995.0349 PMID: 7791218
    • (1995) Journal of molecular biology , vol.249 , Issue.5 , pp. 923-932
    • Prestridge, D.S.1
  • 5
    • 34247110449
    • Eukaryotic promoter prediction based on relative entropy and positional information
    • Wu S, Xie X, Liew AW, Yan H. Eukaryotic promoter prediction based on relative entropy and positional information. Physical Review E. 2007 Apr 12; 75(4):041908.
    • (2007) Physical Review E , vol.75 , Issue.4
    • Wu, S.1    Xie, X.2    Liew, A.W.3    Yan, H.4
  • 7
    • 84970984388
    • Promoter Sequence Analysis through No Gap Multiple Sequence Alignment of Motif Pairs
    • Kouser K, Rangarajan L. Promoter Sequence Analysis through No Gap Multiple Sequence Alignment of Motif Pairs. Procedia Computer Science. 2015 Dec 31; 58:356-62.
    • (2015) Procedia Computer Science , vol.58 , pp. 356-362
    • Kouser, K.1    Rangarajan, L.2
  • 8
    • 84904490572
    • Effective automated feature construction and selection for classification of biological sequences
    • Kamath U, De Jong K, Shehu A. Effective automated feature construction and selection for classification of biological sequences. PloS one. 2014 Jul 17; 9(7): e99982. doi: 10.1371/journal.pone.0099982 PMID: 25033270
    • (2014) PloS one , vol.9 , Issue.7
    • Kamath, U.1    De Jong, K.2    Shehu, A.3
  • 9
    • 84927712367
    • repDNA-a Python package to generate various modes of feature vectors for DNA sequences by incorporating user-defined physicochemical properties and sequence-order effects
    • Liu B, Liu F, Fang L, Wang X, Chou K C. repDNA-a Python package to generate various modes of feature vectors for DNA sequences by incorporating user-defined physicochemical properties and sequence-order effects. Bioinformatics. 2015, 31(8):1307-1309. doi: 10.1093/bioinformatics/btu820 PMID: 25504848
    • (2015) Bioinformatics , vol.31 , Issue.8 , pp. 1307-1309
    • Liu, B.1    Liu, F.2    Fang, L.3    Wang, X.4    Chou, K.C.5
  • 10
    • 84956620000
    • repRNA-a web server for generating various feature vectors of RNA sequences
    • Liu B, Liu F, Fang L, Wang X, Chou K C. repRNA-a web server for generating various feature vectors of RNA sequences. Molecular Genetics and Genomics. 2016, 291(1): 473-481. doi: 10.1007/s00438-015-1078-7 PMID: 26085220
    • (2016) Molecular Genetics and Genomics , vol.291 , Issue.1 , pp. 473-481
    • Liu, B.1    Liu, F.2    Fang, L.3    Wang, X.4    Chou, K.C.5
  • 11
    • 84979865452
    • Pse-in-One: A web server for generating various modes of pseudo components of DNA, RNA, and protein sequences
    • Liu B, Liu F, Wang X, Chen J, Fang L, Chou K C. Pse-in-One: a web server for generating various modes of pseudo components of DNA, RNA, and protein sequences. Nucleic Acids Res. 2015 Jul 1; 43(W1): W65-71. doi: 10.1093/nar/gkv458 PMID: 25958395
    • (2015) Nucleic Acids Res. , vol.43 , pp. W65-W71
    • Liu, B.1    Liu, F.2    Wang, X.3    Chen, J.4    Fang, L.5    Chou, K.C.6
  • 12
    • 35748932917
    • A review of feature selection techniques in bioinformatics
    • Saeys Y, Inza I, Larrañaga P. A review of feature selection techniques in bioinformatics. Bioinformatics. 2007 Oct 1; 23(19):2507-17. doi: 10.1093/bioinformatics/btm344 PMID: 17720704
    • (2007) Bioinformatics , vol.23 , Issue.19 , pp. 2507-2517
    • Saeys, Y.1    Inza, I.2    Larrañaga, P.3
  • 13
    • 81055156693
    • A brief survey on sequence classification
    • Xing Z, Pei J, Keogh E. A brief survey on sequence classification. ACM SIGKDD Explorations Newsletter. 2010 Nov 9; 12(1):40-8.
    • (2010) ACM SIGKDD Explorations Newsletter , vol.12 , Issue.1 , pp. 40-48
    • Xing, Z.1    Pei, J.2    Keogh, E.3
  • 15
    • 34347342058
    • Discovering DNA Motifs with Nucleotide Dependency
    • Leung HC, Chin FY. Discovering DNA Motifs with Nucleotide Dependency. In BIBE 2006 Oct 16 (pp. 70-80).
    • (2006) BIBE , pp. 70-80
    • Leung, H.C.1    Chin, F.Y.2
  • 16
    • 0025275564
    • Application of a new method of pattern recognition in DNA sequence analysis: A study of E. coli promoters
    • Alexandrov NN, Mironov AA. Application of a new method of pattern recognition in DNA sequence analysis: a study of E. coli promoters. Nucleic acids research. 1990 Apr 11; 18(7):1847-52. PMID: 2186368
    • (1990) Nucleic acids research , vol.18 , Issue.7 , pp. 1847-1852
    • Alexandrov, N.N.1    Mironov, A.A.2
  • 19
    • 79959473605
    • Generic spaced DNA motif discovery using Genetic Algorithm
    • Chan TM, Leung KS, Lee KH. Generic spaced DNA motif discovery using Genetic Algorithm. In IEEE Congress on Evolutionary Computation 2010 Jul 18 (pp. 1-8). IEEE.
    • (2010) IEEE Congress on Evolutionary Computation , pp. 1-8
    • Chan, T.M.1    Leung, K.S.2    Lee, K.H.3
  • 20
    • 84865790047
    • An integrated encyclopedia of DNA elements in the human genome
    • ENCODE Project Consortium. An integrated encyclopedia of DNA elements in the human genome. Nature. 2012 Sep 6; 489(7414):57-74. doi: 10.1038/nature11247 PMID: 22955616
    • (2012) Nature , vol.489 , Issue.7414 , pp. 57-74
  • 22
    • 17644384367
    • Minimum redundancy feature selection from microarray gene expression data
    • Ding C, Peng H. Minimum redundancy feature selection from microarray gene expression data. J Bioinform Comput Biol. 2005 Apr; 3(2):185-205. PMID: 15852500
    • (2005) J Bioinform Comput Biol , vol.3 , Issue.2 , pp. 185-205
    • Ding, C.1    Peng, H.2
  • 23
    • 52249114284
    • Selecting subsets of newly extracted features from PCA and PLS in microarray data analysis
    • Li GZ, Bu HL, Yang MQ, Zeng XQ, Yang JY. Selecting subsets of newly extracted features from PCA and PLS in microarray data analysis. BMC genomics. 2008 Sep 16; 9(2):1.
    • (2008) BMC genomics , vol.9 , Issue.2 , pp. 1
    • Li, G.Z.1    Bu, H.L.2    Yang, M.Q.3    Zeng, X.Q.4    Yang, J.Y.5
  • 25
    • 84885838906
    • LibD3C: Ensemble classifiers with a clustering and dynamic selection strategy
    • Lin C, Chen W, Qiu C, Wu Y, Krishnan S, Zou Q. LibD3C: ensemble classifiers with a clustering and dynamic selection strategy. Neurocomputing. 2014 Jan 10; 123:424-35.
    • (2014) Neurocomputing , vol.123 , pp. 424-435
    • Lin, C.1    Chen, W.2    Qiu, C.3    Wu, Y.4    Krishnan, S.5    Zou, Q.6
  • 26
    • 42149102103
    • Two different classes of co-occurring motif pairs found by a novel visualization method in human promoter regions
    • Murakami K, Imanishi T, Gojobori T, Nakai K. Two different classes of co-occurring motif pairs found by a novel visualization method in human promoter regions. BMC genomics. 2008 Mar 1; 9(1):1.
    • (2008) BMC genomics , vol.9 , Issue.1 , pp. 1
    • Murakami, K.1    Imanishi, T.2    Gojobori, T.3    Nakai, K.4
  • 27
    • 77955565395
    • Decision forest for classification of gene expression data
    • Huang J, Fang H, Fan X. Decision forest for classification of gene expression data. Computers in biology and medicine. 2010 Aug 31; 40(8):698-704. doi: 10.1016/j.compbiomed.2010.06.004 PMID: 20591424
    • (2010) Computers in biology and medicine , vol.40 , Issue.8 , pp. 698-704
    • Huang, J.1    Fang, H.2    Fan, X.3
  • 28
    • 0344441000
    • Diagnostic and prognostic prediction using gene expression profiles in high-dimensional microarray data
    • Simon R. Diagnostic and prognostic prediction using gene expression profiles in high-dimensional microarray data. British journal of cancer. 2003 Nov 3; 89(9):1599-604. doi: 10.1038/sj.bjc.6601326 PMID: 14583755
    • (2003) British journal of cancer , vol.89 , Issue.9 , pp. 1599-1604
    • Simon, R.1
  • 29
    • 33747344944
    • Comparison and evaluation of methods for generating differentially expressed gene lists from microarray data
    • Jeffery IB, Higgins DG, Culhane AC. Comparison and evaluation of methods for generating differentially expressed gene lists from microarray data. BMC bioinformatics. 2006 Jul 26; 7(1):359.
    • (2006) BMC bioinformatics , vol.7 , Issue.1 , pp. 359
    • Jeffery, I.B.1    Higgins, D.G.2    Culhane, A.C.3
  • 30
    • 0033636139
    • Support vector machine classification and validation of cancer tissue samples using microarray expression data
    • Furey TS, Cristianini N, Duffy N, Bednarski DW, Schummer M, Haussler D. Support vector machine classification and validation of cancer tissue samples using microarray expression data. Bioinformatics. 2000 Oct 1; 16(10):906-14. PMID: 11120680
    • (2000) Bioinformatics , vol.16 , Issue.10 , pp. 906-914
    • Furey, T.S.1    Cristianini, N.2    Duffy, N.3    Bednarski, D.W.4    Schummer, M.5    Haussler, D.6
  • 31
    • 0036161259
    • Gene selection for cancer classification using support vector machines
    • Guyon I, Weston J, Barnhill S, Vapnik V. Gene selection for cancer classification using support vector machines. Machine learning. 2002 Jan 1; 46(1-3):389-422.
    • (2002) Machine learning , vol.46 , Issue.1-3 , pp. 389-422
    • Guyon, I.1    Weston, J.2    Barnhill, S.3    Vapnik, V.4
  • 35
    • 0036358995
    • The spectrum kernel: A string kernel for SVM protein classification
    • Leslie CS, Eskin E, Noble WS. The spectrum kernel: A string kernel for SVM protein classification. In Pacific symposium on biocomputing 2002 Jan 2 (Vol. 7, No. 7, pp. 566-575).
    • (2002) Pacific symposium on biocomputing , vol.7 , Issue.7 , pp. 566-575
    • Leslie, C.S.1    Eskin, E.2    Noble, W.S.3
  • 36
    • 33947325162
    • Learning interpretable SVMs for biological sequence classification
    • Rätsch G, Sonnenburg S, Schäfer C. Learning interpretable SVMs for biological sequence classification. BMC bioinformatics. 2006 Mar 20; 7(Suppl 1): S9.
    • (2006) BMC bioinformatics , vol.7 , pp. S9
    • Rätsch, G.1    Sonnenburg, S.2    Schäfer, C.3
  • 38
    • 25144481906
    • Semi-supervised protein classification using cluster kernels
    • Weston J, Leslie C, Ie E, Zhou D, Elisseeff A, Noble WS. Semi-supervised protein classification using cluster kernels. Bioinformatics. 2005 Aug 1; 21(15):3241-7. doi: 10.1093/bioinformatics/bti497 PMID: 15905279
    • (2005) Bioinformatics , vol.21 , Issue.15 , pp. 3241-3247
    • Weston, J.1    Leslie, C.2    Ie, E.3    Zhou, D.4    Elisseeff, A.5    Noble, W.S.6
  • 39
    • 85006309196
    • Classification and regression trees
    • Breiman L. Classification and regression trees. Chapman and Hall/CRC, 1998.
    • (1998) Chapman and Hall/CRC
    • Breiman, L.1
  • 40
    • 84979683664
    • C4.5: Programs for machine learning
    • Quinlan JR. C4.5: Programs for machine learning. Morgan Kaufmann. 1993 Jan:38.
    • (1993) Morgan Kaufmann , pp. 38
    • Quinlan, J.R.1
  • 41
    • 51349111653
    • What are decision trees?
    • Kingsford C, Salzberg SL. What are decision trees? Nature biotechnology. 2008 Sep 1; 26(9):1011-3. doi: 10.1038/nbt0908-1011 PMID: 18779814
    • (2008) Nature biotechnology , vol.26 , Issue.9 , pp. 1011-1013
    • Kingsford, C.1    Salzberg, S.L.2
  • 43
    • 79955613590
    • Measuring relevance between discrete and continuous features based on neighborhood mutual information
    • Hu Q, Zhang L, Zhang D, Pan W, An S, Pedrycz W. Measuring relevance between discrete and continuous features based on neighborhood mutual information. Expert Systems with Applications. 2011 Sep 30; 38(9):10737-50.
    • (2011) Expert Systems with Applications , vol.38 , Issue.9 , pp. 10737-10750
    • Hu, Q.1    Zhang, L.2    Zhang, D.3    Pan, W.4    An, S.5    Pedrycz, W.6
  • 44
    • 62549089350
    • Using cell fate attractors to uncover transcriptional regulation of HL60 neutrophil differentiation
    • Huang AC, Hu L, Kauffman SA, Zhang W, Shmulevich I. Using cell fate attractors to uncover transcriptional regulation of HL60 neutrophil differentiation. BMC systems biology. 2009 Feb 18; 3(1):1.
    • (2009) BMC systems biology , vol.3 , Issue.1
    • Huang, A.C.1    Hu, L.2    Kauffman, S.A.3    Zhang, W.4    Shmulevich, I.5
  • 45
    • 2542584654
    • Detection of functional DNA motifs via statistical over-representation
    • Frith MC, Fu Y, Yu L, Chen JF, Hansen U, Weng Z. Detection of functional DNA motifs via statistical over-representation. Nucleic acids research. 2004 Mar 15; 32(4):1372-81. doi: 10.1093/nar/gkh299 PMID: 14988425
    • (2004) Nucleic acids research , vol.32 , Issue.4 , pp. 1372-1381
    • Frith, M.C.1    Fu, Y.2    Yu, L.3    Chen, J.F.4    Hansen, U.5    Weng, Z.6
  • 47
    • 79960819723
    • Identification of human housekeeping genes and tissue-selective genes by microarray meta-analysis
    • Chang CW, Cheng WC, Chen CR, Shu WY, Tsai ML, Huang CL, Hsu IC. Identification of human housekeeping genes and tissue-selective genes by microarray meta-analysis. PloS one. 2011 Jul 27; 6(7): e22859. doi: 10.1371/journal.pone.0022859 PMID: 21818400
    • (2011) PloS one , vol.6 , Issue.7 , pp. e22859
    • Chang, C.W.1    Cheng, W.C.2    Chen, C.R.3    Shu, W.Y.4    Tsai, M.L.5    Huang, C.L.6    Hsu, I.C.7
  • 48
    • 46049110099
    • TiGER: A database for tissue-specific gene expression and regulation
    • Liu X, Yu X, Zack DJ, Zhu H, Qian J. TiGER: a database for tissue-specific gene expression and regulation. BMC bioinformatics. 2008 Jun 9; 9(1):271.
    • (2008) BMC bioinformatics , vol.9 , Issue.1 , pp. 271
    • Liu, X.1    Yu, X.2    Zack, D.J.3    Zhu, H.4    Qian, J.5
  • 49
    • 16344371755
    • Identifying differentially expressed genes from microarray experiments via statistic synthesis
    • Yang YH, Xiao Y, Segal MR. Identifying differentially expressed genes from microarray experiments via statistic synthesis. Bioinformatics. 2005 Apr 1; 21(7):1084-93. doi: 10.1093/bioinformatics/bti108 PMID: 15513985
    • (2005) Bioinformatics , vol.21 , Issue.7 , pp. 1084-1093
    • Yang, Y.H.1    Xiao, Y.2    Segal, M.R.3
  • 50
    • 19544362938
    • Bayesian model averaging: Development of an improved multiclass, gene selection and classification tool for microarray data
    • Yeung KY, Bumgarner RE, Raftery AE. Bayesian model averaging: development of an improved multiclass, gene selection and classification tool for microarray data. Bioinformatics. 2005 May 15; 21(10):2394-402. doi: 10.1093/bioinformatics/bti319 PMID: 15713736
    • (2005) Bioinformatics , vol.21 , Issue.10 , pp. 2394-2402
    • Yeung, K.Y.1    Bumgarner, R.E.2    Raftery, A.E.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.