Volume 6, Issue 11, 2011, Pages

Sequence-based classification using discriminatory motif feature selection

Author keywords

[No Author keywords available]

Indexed keywords

DNA; SMALL INTERFERING RNA; PROTEIN;

EID: 80755126015     PISSN: None     EISSN: 19326203     Source Type: Journal    
DOI: 10.1371/journal.pone.0027382     Document Type: Article
Times cited : (8)

References (58)
  • 1
    • 13144306071
    • Genome-wide association studies for common diseases and complex traits
    • Hirschhorn J, Daly M, (2005) Genome-wide association studies for common diseases and complex traits. Nature Reviews Genetics 6: 95-108.
    • (2005) Nature Reviews Genetics , vol.6 , pp. 95-108
    • Hirschhorn, J.1    Daly, M.2
  • 2
    • 33645777446
    • CpG island methylation in human lymphocytes is highly correlated with DNA sequence, repeats, and predicted DNA structure
    • Bock C, Paulsen M, Tierling S, Mikeska T, Lengauer T, et al. (2006) CpG island methylation in human lymphocytes is highly correlated with DNA sequence, repeats, and predicted DNA structure. PLoS Genet 2: e26.
    • (2006) PLoS Genet , vol.2
    • Bock, C.1    Paulsen, M.2    Tierling, S.3    Mikeska, T.4    Lengauer, T.5
  • 3
    • 33749368402
    • Evidence of influence of genomic DNA sequence on human X chromosome inactivation
    • Wang Z, Willard HF, Mukherjee S, Furey TS, (2006) Evidence of influence of genomic DNA sequence on human X chromosome inactivation. PLoS Comput Biol 2: e113.
    • (2006) PLoS Comput Biol , vol.2
    • Wang, Z.1    Willard, H.F.2    Mukherjee, S.3    Furey, T.S.4
  • 5
    • 69949162927
    • SOLpro: accurate sequence-based prediction of protein solubility
    • Magnan CN, Randall A, Baldi P, (2009) SOLpro: accurate sequence-based prediction of protein solubility. Bioinformatics 25: 2200-2207.
    • (2009) Bioinformatics , vol.25 , pp. 2200-2207
    • Magnan, C.N.1    Randall, A.2    Baldi, P.3
  • 6
    • 77950342398
    • An overview of in silico protein function prediction
    • Sleator RD, Walsh P, (2010) An overview of in silico protein function prediction. Archives of Microbiology 192: 151-155.
    • (2010) Archives of Microbiology , vol.192 , pp. 151-155
    • Sleator, R.D.1    Walsh, P.2
  • 8
    • 84883575579
    • Fast string kernels using inexact matching for protein sequences
    • Leslie C, Kuang R, (2004) Fast string kernels using inexact matching for protein sequences. J Mach Learn Res 5: 1435-1455.
    • (2004) J Mach Learn Res , vol.5 , pp. 1435-1455
    • Leslie, C.1    Kuang, R.2
  • 9
    • 36949013631
    • Support vector machine-based classification of protein folds using the structural properties of amino acid residues and amino acid residue pairs
    • Shamim MTA, Anwaruddin M, Nagarajaram H, (2007) Support vector machine-based classification of protein folds using the structural properties of amino acid residues and amino acid residue pairs. Bioinformatics 23: 3320-3327.
    • (2007) Bioinformatics , vol.23 , pp. 3320-3327
    • Shamim, M.T.A.1    Anwaruddin, M.2    Nagarajaram, H.3
  • 11
    • 19444386649
    • Prediction of siRNA functionality using generalized string kernel and support vector machine
    • Teramoto R, Aoki M, Kimura T, Kanaoka M, (2005) Prediction of siRNA functionality using generalized string kernel and support vector machine. FEBS Letters 579: 2878-2882.
    • (2005) FEBS Letters , vol.579 , pp. 2878-2882
    • Teramoto, R.1    Aoki, M.2    Kimura, T.3    Kanaoka, M.4
  • 12
    • 30344447264
    • Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine
    • Xue C, Li F, He T, Liu G, Li Y, et al. (2005) Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine. BMC Bioinformatics 6: 310.
    • (2005) BMC Bioinformatics , vol.6 , pp. 310
    • Xue, C.1    Li, F.2    He, T.3    Liu, G.4    Li, Y.5
  • 13
    • 34447309058
    • De novo SVM classification of precursor microRNAs from genomic pseudo hairpins using global and intrinsic folding measures
    • Ng KLS, Mishra SK, (2007) De novo SVM classification of precursor microRNAs from genomic pseudo hairpins using global and intrinsic folding measures. Bioinformatics 23: 1321-1330.
    • (2007) Bioinformatics , vol.23 , pp. 1321-1330
    • Ng, K.L.S.1    Mishra, S.K.2
  • 14
    • 84898968688
    • Mismatch string kernels for SVM protein classification
    • Leslie C, Eskin E, Noble W, (2003) Mismatch string kernels for SVM protein classification. In: Neural Information Processing Systems 15. pp. 1441-1448. URL http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.58.4737.
    • (2003) In: Neural Information Processing Systems , vol.15 , pp. 1441-1448
    • Leslie, C.1    Eskin, E.2    Noble, W.3
  • 15
    • 1542714925
    • Mismatch string kernels for discriminative protein classification
    • Leslie CS, Eskin E, Cohen A, Weston J, Noble WS, (2004) Mismatch string kernels for discriminative protein classification. Bioinformatics 20: 467-476.
    • (2004) Bioinformatics , vol.20 , pp. 467-476
    • Leslie, C.S.1    Eskin, E.2    Cohen, A.3    Weston, J.4    Noble, W.S.5
  • 16
    • 29144512262
    • RASE: recognition of alternatively spliced exons in c.elegans
    • Ratsch G, Sonnenburg S, Scholkopf B, (2005) RASE: recognition of alternatively spliced exons in c.elegans. Bioinformatics 21: i369-i377.
    • (2005) Bioinformatics , vol.21
    • Ratsch, G.1    Sonnenburg, S.2    Scholkopf, B.3
  • 17
    • 46249106500
    • POIMs: positional oligomer importance matrices - understanding support vector machine-based signal detectors
    • Sonnenburg S, Zien A, Philips P, Rätsch G, (2008) POIMs: positional oligomer importance matrices - understanding support vector machine-based signal detectors. Bioinformatics 24: i6-i14.
    • (2008) Bioinformatics , vol.24
    • Sonnenburg, S.1    Zien, A.2    Philips, P.3    Rätsch, G.4
  • 18
    • 26444467746
    • Learning interpretable SVMs for biological sequence classification
    • Sonnenburg S, Rätsch G, Schäfer C, (2005) Learning interpretable SVMs for biological sequence classification. BMC BIOINFORMATICS 3500: 389-407.
    • (2005) BMC BIOINFORMATICS , vol.3500 , pp. 389-407
    • Sonnenburg, S.1    Rätsch, G.2    Schäfer, C.3
  • 24
    • 79954531273
    • Identifying discriminative classification-based motifs in biological sequences
    • Vens C, Rosso M, Danchin EGJ, (2011) Identifying discriminative classification-based motifs in biological sequences. Bioinformatics 27: 1231-1238.
    • (2011) Bioinformatics , vol.27 , pp. 1231-1238
    • Vens, C.1    Rosso, M.2    Danchin, E.G.J.3
  • 25
    • 0027912333
    • Detecting subtle sequence signals - A Gibbs sampling strategy for multiple alignment
    • Lawrence C, Altschul S, Boguski M, Liu J, Neuwald A, et al. (1993) Detecting subtle sequence signals- A Gibbs sampling strategy for multiple alignment. Science 262: 208-214.
    • (1993) Science , vol.262 , pp. 208-214
    • Lawrence, C.1    Altschul, S.2    Boguski, M.3    Liu, J.4    Neuwald, A.5
  • 26
    • 0002759539
    • Unsupervised learning of multiple motifs in biopolymers using expectation maximization
    • Bailey T, Elkan C, (1995) Unsupervised learning of multiple motifs in biopolymers using expectation maximization. Machine Learning 21: 51-80.
    • (1995) Machine Learning , vol.21 , pp. 51-80
    • Bailey, T.1    Elkan, C.2
  • 27
    • 0032826179
    • Identifying DNA and protein patterns with statistically significant alignments of multiple sequences
    • Hertz G, Stormo G, (1999) Identifying DNA and protein patterns with statistically significant alignments of multiple sequences. Bioinformatics 15: 563-577.
    • (1999) Bioinformatics , vol.15 , pp. 563-577
    • Hertz, G.1    Stormo, G.2
  • 28
    • 0034628901
    • Computational identification of cis-regulatory elements associated with groups of functionally related genes in Saccharomyces cerevisiae
    • Hughes J, Estep P, Tavazoie S, Church G, (2000) Computational identification of cis-regulatory elements associated with groups of functionally related genes in Saccharomyces cerevisiae. Journal of Molecular Biology 296: 1205-1214.
    • (2000) Journal of Molecular Biology , vol.296 , pp. 1205-1214
    • Hughes, J.1    Estep, P.2    Tavazoie, S.3    Church, G.4
  • 29
    • 0032483307
    • Extracting regulatory sites from the upstream region of yeast genes by computational analysis of oligonucleotide frequencies
    • Van Helden J, Andre B, Collado-Vides J, (1998) Extracting regulatory sites from the upstream region of yeast genes by computational analysis of oligonucleotide frequencies. Journal of Molecular Biology 281: 827-842.
    • (1998) Journal of Molecular Biology , vol.281 , pp. 827-842
    • Van Helden, J.1    Andre, B.2    Collado-Vides, J.3
  • 30
    • 0042905768
    • YMF: A program for discovery of novel transcription factor binding sites by statistical overrepresentation
    • Sinha S, Tompa M, (2003) YMF: A program for discovery of novel transcription factor binding sites by statistical overrepresentation. Nucleic Acids Research 31: 3586.
    • (2003) Nucleic Acids Research , vol.31 , pp. 3586
    • Sinha, S.1    Tompa, M.2
  • 32
    • 0037361669
    • Discovery of conserved sequence patterns using a stochastic dictionary model
    • Gupta M, Liu J, (2003) Discovery of conserved sequence patterns using a stochastic dictionary model. Journal of the American Statistical Association 98: 55-66.
    • (2003) Journal of the American Statistical Association , vol.98 , pp. 55-66
    • Gupta, M.1    Liu, J.2
  • 33
    • 84889870510
    • From promoter sequence to expression: a probabilistic framework
    • In: Proceedings of the sixth annual international conference on Computational biology. ACM
    • Segal E, Barash Y, Simon I, Friedman N, Koller D, (2002) From promoter sequence to expression: a probabilistic framework. In: Proceedings of the sixth annual international conference on Computational biology. ACM.
    • (2002)
    • Segal, E.1    Barash, Y.2    Simon, I.3    Friedman, N.4    Koller, D.5
  • 34
    • 23144460837
    • WordSpy: identifying transcription factor binding motifs by building a dictionary and learning a grammar
    • Wang G, Yu T, Zhang W, (2005) WordSpy: identifying transcription factor binding motifs by building a dictionary and learning a grammar. Nucleic Acids Research 33: W412-W416.
    • (2005) Nucleic Acids Research , vol.33 , pp. 412-416
    • Wang, G.1    Yu, T.2    Zhang, W.3
  • 35
    • 33745631199
    • A steganalysis-based approach to comprehensive identification and characterization of functional regulatory elements
    • Wang G, Zhang W, (2006) A steganalysis-based approach to comprehensive identification and characterization of functional regulatory elements. Genome Biol 7: R49.
    • (2006) Genome Biol , vol.7
    • Wang, G.1    Zhang, W.2
  • 36
    • 21144439147
    • Assessing computational tools for the discovery of transcription factor binding sites
    • Tompa M, Li N, Bailey TL, Church GM, Moor BD, et al. (2005) Assessing computational tools for the discovery of transcription factor binding sites. Nature Biotechnology 23: 137-144.
    • (2005) Nature Biotechnology , vol.23 , pp. 137-144
    • Tompa, M.1    Li, N.2    Bailey, T.L.3    Church, G.M.4    Moor, B.D.5
  • 37
    • 0034201441
    • EMBOSS: the European Molecular Biology Open Software Suite
    • Rice P, Longden I, Bleasby A, (2000) EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet 16: 276-277.
    • (2000) Trends Genet , vol.16 , pp. 276-277
    • Rice, P.1    Longden, I.2    Bleasby, A.3
  • 38
    • 75949117507
    • Moods: fast search for position weight matrix matches in DNA sequences
    • Korhonen J, Martinmäki P, Pizzi C, Rastas P, Ukkonen E, (2009) Moods: fast search for position weight matrix matches in DNA sequences. Bioinformatics 25: 3181-3182.
    • (2009) Bioinformatics , vol.25 , pp. 3181-3182
    • Korhonen, J.1    Martinmäki, P.2    Pizzi, C.3    Rastas, P.4    Ukkonen, E.5
  • 39
    • 0035478854
    • Random forests
    • Breiman L, (2001) Random forests. Machine Learning 45: 5-32.
    • (2001) Machine Learning , vol.45 , pp. 5-32
    • Breiman, L.1
  • 42
    • 29144499905
    • Working set selection using second order information for training support vector machines
    • Fan R, Chen P, Lin C, (2005) Working set selection using second order information for training support vector machines. The Journal of Machine Learning Research 6: 1918.
    • (2005) The Journal of Machine Learning Research , vol.6 , pp. 1918
    • Fan, R.1    Chen, P.2    Lin, C.3
  • 43
    • 60349089645
    • Nucleosome positioning and gene regulation: advances through genomics
    • Jiang C, Pugh B, (2009) Nucleosome positioning and gene regulation: advances through genomics. Nature Reviews Genetics 10: 161-72.
    • (2009) Nature Reviews Genetics , vol.10 , pp. 161-172
    • Jiang, C.1    Pugh, B.2
  • 44
    • 34250347716
    • Independent and complementary methods for large-scale structural analysis of mammalian chromatin
    • Dennis JH, Fan HY, Reynolds SM, Yuan G, Meldrim JC, et al. (2007) Independent and complementary methods for large-scale structural analysis of mammalian chromatin. Genome Research 17: 928-939.
    • (2007) Genome Research , vol.17 , pp. 928-939
    • Dennis, J.H.1    Fan, H.Y.2    Reynolds, S.M.3    Yuan, G.4    Meldrim, J.C.5
  • 46
    • 78650576256
    • Contributions of histone sequence preferences to nucleosome organization: proposed definitions and methodology
    • Kaplan N, Hughes T, Lieb J, Widom J, Segal E, (2010) Contributions of histone sequence preferences to nucleosome organization: proposed definitions and methodology. Genome Biology 11: 140.
    • (2010) Genome Biology , vol.11 , pp. 140
    • Kaplan, N.1    Hughes, T.2    Lieb, J.3    Widom, J.4    Segal, E.5
  • 47
    • 33846862405
    • High-throughput mapping of the chromatin structure of human promoters
    • Ozsolak F, Song JS, Liu XS, Fisher DE, (2007) High-throughput mapping of the chromatin structure of human promoters. Nature Biotechnology 25: 244-248.
    • (2007) Nature Biotechnology , vol.25 , pp. 244-248
    • Ozsolak, F.1    Song, J.S.2    Liu, X.S.3    Fisher, D.E.4
  • 48
    • 75149135277
    • G+C content dominates intrinsic nucleosome occupancy
    • Tillo D, Hughes T, (2009) G+C content dominates intrinsic nucleosome occupancy. BMC Bioinformatics 10: 442.
    • (2009) BMC Bioinformatics , vol.10 , pp. 442
    • Tillo, D.1    Hughes, T.2
  • 49
    • 34748826166
    • A high-resolution atlas of nucleosome occupancy in yeast
    • Lee W, Tillo D, Bray N, Morse RH, Davis RW, et al. (2007) A high-resolution atlas of nucleosome occupancy in yeast. Nat Genet 39: 1235-1244.
    • (2007) Nat Genet , vol.39 , pp. 1235-1244
    • Lee, W.1    Tillo, D.2    Bray, N.3    Morse, R.H.4    Davis, R.W.5
  • 51
    • 33846041078
    • The universal protein resource (UniProt)
    • The UniProt Consortium
    • The UniProt Consortium (2007) The universal protein resource (UniProt). Nucleic Acids Research 35: D193-D197.
    • (2007) Nucleic Acids Research , vol.35 , pp. 193-197
  • 52
    • 8844222708
    • Targetdb: a target registration database for structural genomics projects
    • Chen L, Oughtred R, Berman HM, Westbrook J, (2004) Targetdb: a target registration database for structural genomics projects. Bioinformatics 20: 2860-2862.
    • (2004) Bioinformatics , vol.20 , pp. 2860-2862
    • Chen, L.1    Oughtred, R.2    Berman, H.M.3    Westbrook, J.4
  • 53
    • 18844434395
    • Understanding the relationship between the primary structure of proteins and their amyloidogenic propensity: clues from inclusion body formation
    • Idicula-Thomas S, Balaji PV, (April 2005) Understanding the relationship between the primary structure of proteins and their amyloidogenic propensity: clues from inclusion body formation. Protein Engineering Design and Selection 18: 175-180.
    • (2005) Protein Engineering Design and Selection , vol.18 , pp. 175-180
    • Idicula-Thomas, S.1    Balaji, P.V.2
  • 54
  • 56
    • 0035470889
    • Greedy function approximation: A gradient boosting machine
    • Friedman JH, (2001) Greedy function approximation: A gradient boosting machine. The Annals of Statistics 29: 1189-1232.
    • (2001) The Annals of Statistics , vol.29 , pp. 1189-1232
    • Friedman, J.H.1
  • 57
    • 38049141180
    • Discriminative motif discovery in DNA and protein sequences using the DEME algorithm
    • Redhead E, Bailey T, (2007) Discriminative motif discovery in DNA and protein sequences using the DEME algorithm. BMC Bioinformatics 8: 385.
    • (2007) BMC Bioinformatics , vol.8 , pp. 385
    • Redhead, E.1    Bailey, T.2
  • 58
    • 79953300078
    • Fimo: scanning for occurrences of a given motif
    • Grant CE, Bailey TL, Noble WS, (2011) Fimo: scanning for occurrences of a given motif. Bioinformatics 27: 1017-1018.
    • (2011) Bioinformatics , vol.27 , pp. 1017-1018
    • Grant, C.E.1    Bailey, T.L.2    Noble, W.S.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.