메뉴 건너뛰기




Volumn 1, Issue 2, 2005, Pages 181-201

Effective statistical features for coding and non-coding DNA sequence classification for yeast, C. elegans and human

Author keywords

coding statistics; correlation analysis; DNA sequence; exon intron classification; feature selection; information content

Indexed keywords

ANIMAL; CAENORHABDITIS ELEGANS; DNA SEQUENCE; EXON; HUMAN; NUCLEOTIDE SEQUENCE; OPEN READING FRAME;

EID: 34249308376     PISSN: 17445485     EISSN: 17445493     Source Type: Journal    
DOI: 10.1504/ijbra.2005.007577     Document Type: Article
Times cited : (7)

References (32)
  • 1
    • 0000241874 scopus 로고
    • Genmark: parallel gene recognition for both DNA strands
    • Borodovsky, M. and McIninch, J. (1993) ‘Genmark: parallel gene recognition for both DNA strands’, Comput. Chem., Vol. 17, pp.123–133.
    • (1993) Comput. Chem. , vol.17 , pp. 123-133
    • Borodovsky, M.1    McIninch, J.2
  • 2
    • 0031586003 scopus 로고    scopus 로고
    • Prediction of complete gene structures in human genomic DNA
    • Burge, C. and Karlin, S. (1997) ‘Prediction of complete gene structures in human genomic DNA’, J. Mol. Biol., Vol. 268, pp.78–94.
    • (1997) J. Mol. Biol. , vol.268 , pp. 78-94
    • Burge, C.1    Karlin, S.2
  • 3
    • 0030585734 scopus 로고    scopus 로고
    • Evaluation of gene structure prediction programs
    • Burset, M. and Guigo, R. (1996) ‘Evaluation of gene structure prediction programs’, Genomic, Vol. 34, pp.353–367.
    • (1996) Genomic , vol.34 , pp. 353-367
    • Burset, M.1    Guigo, R.2
  • 4
    • 0020480512 scopus 로고
    • Recognition of protein coding regions in DNA sequences
    • Fickett, J.W. (1982) ‘Recognition of protein coding regions in DNA sequences’, Nucleic Acids Res., Vol. 10, pp.5303–5318.
    • (1982) Nucleic Acids Res. , vol.10 , pp. 5303-5318
    • Fickett, J.W.1
  • 5
    • 0027059264 scopus 로고
    • Assessment of protein coding measures
    • Fickett, J.W. and Tung, C.S. (1992) ‘Assessment of protein coding measures’, Nucleic Acids Res., Vol. 20, pp.6641–6450.
    • (1992) Nucleic Acids Res. , vol.20
    • Fickett, J.W.1    Tung, C.S.2
  • 6
    • 0030218799 scopus 로고    scopus 로고
    • Finding genes by computer: the state of the art
    • Fickett, J.W. (1996) ‘Finding genes by computer: the state of the art’, Trends Genet., Vol. 12, pp.316–320.
    • (1996) Trends Genet. , vol.12 , pp. 316-320
    • Fickett, J.W.1
  • 7
    • 1842507532 scopus 로고    scopus 로고
    • Comparison of various algorithms for recognizing short coding sequences of human genes
    • Gao, F. and Zhang, C.T. (2004) ‘Comparison of various algorithms for recognizing short coding sequences of human genes’, Bioinformatics, Vol. 20, pp.673–681.
    • (2004) Bioinformatics , vol.20 , pp. 673-681
    • Gao, F.1    Zhang, C.T.2
  • 9
    • 0002345468 scopus 로고    scopus 로고
    • DNA composition, codon usage and exon prediction
    • Bishop, M. (Ed.) Academic Press
    • Guigo, R. (1999) ‘DNA composition, codon usage and exon prediction’, in Bishop, M. (Ed.): Genetic Databases, Academic Press, pp.53–80.
    • (1999) Genetic Databases , pp. 53-80
    • Guigo, R.1
  • 10
    • 0023685544 scopus 로고
    • A survey on intron and exon lengths
    • Hawkins, J.D. (1988) ‘A survey on intron and exon lengths’, Nucleic Acids Res., Vol. 16, pp.9893–9908.
    • (1988) Nucleic Acids Res. , vol.16 , pp. 9893-9908
    • Hawkins, J.D.1
  • 12
    • 0001818891 scopus 로고
    • Measuring correlations in symbol sequences
    • Herzel, H. and Grosse, I. (1995) ‘Measuring correlations in symbol sequences’, Physica A, Vol. 216, pp.518–542.
    • (1995) Physica A , vol.216 , pp. 518-542
    • Herzel, H.1    Grosse, I.2
  • 14
    • 10044287193 scopus 로고    scopus 로고
    • Selection of statistical features based on mutual information for classification of human coding and non-coding DNA sequences
    • August 23–26, Cambridge, United Kingdom
    • Liew, A.W.C., Wu, Y. and Yan, H. (2004) ‘Selection of statistical features based on mutual information for classification of human coding and non-coding DNA sequences’, Proceedings of the 17th International Conference on Pattern Recognition, August 23–26, Cambridge, United Kingdom.
    • (2004) Proceedings of the 17th International Conference on Pattern Recognition
    • Liew, A.W.C.1    Wu, Y.2    Yan, H.3
  • 15
    • 0032519353 scopus 로고    scopus 로고
    • GeneMark.hmm: new solutions for gene finding
    • Lukashin, A.V. and Borodovsky, M. (1998) ‘GeneMark.hmm: new solutions for gene finding’, Nucleic Acids Res., Vol. 26, pp.1107–1115.
    • (1998) Nucleic Acids Res. , vol.26 , pp. 1107-1115
    • Lukashin, A.V.1    Borodovsky, M.2
  • 16
    • 8844252293 scopus 로고    scopus 로고
    • TigrScan and GlimmerHMM: two open–source ab initio eukaryotic gene-finders
    • doi:10.1093/bioinformatics/bth315
    • Majoros, W.H., Pertea, M. and Salzberg, S.L. (2004) ‘TigrScan and GlimmerHMM: two open–source ab initio eukaryotic gene-finders’, Bioinformatics, doi:10.1093/bioinformatics/bth315.
    • (2004) Bioinformatics
    • Majoros, W.H.1    Pertea, M.2    Salzberg, S.L.3
  • 17
    • 0032903891 scopus 로고    scopus 로고
    • On negative selection against ATG triplets near start codons in eukaryotic and prokaryotic genomes
    • Saito, R. and Tomita, M. (1999) ‘On negative selection against ATG triplets near start codons in eukaryotic and prokaryotic genomes’, J. Mol. Evol., Vol. 48, pp.213–217.
    • (1999) J. Mol. Evol. , vol.48 , pp. 213-217
    • Saito, R.1    Tomita, M.2
  • 18
    • 0032518163 scopus 로고    scopus 로고
    • Microbial gene identification using interpolated Markov models
    • Salzberg, S.L., Delcher, A.L., Kasif, S. and White, O. (1998) ‘Microbial gene identification using interpolated Markov models’, Nucleic Acids Res., Vol. 26, pp.544–548.
    • (1998) Nucleic Acids Res. , vol.26 , pp. 544-548
    • Salzberg, S.L.1    Delcher, A.L.2    Kasif, S.3    White, O.4
  • 20
    • 84940644968 scopus 로고
    • A mathematical theory of communication
    • Shannon, C.E. (1948) ‘A mathematical theory of communication’, The Bell System Technical Journal, Vol. 27, pp.379–423.
    • (1948) The Bell System Technical Journal , vol.27 , pp. 379-423
    • Shannon, C.E.1
  • 21
    • 0019542835 scopus 로고
    • Method to determine the reading frame of a protein from the purine/pyrimidine genome sequence and its possible evolutionary justication
    • Shepherd, J.C. (1981) ‘Method to determine the reading frame of a protein from the purine/pyrimidine genome sequence and its possible evolutionary justication’, Proceedings National Academy Sciences, USA, Vol. 78, pp.1596–1600.
    • (1981) Proceedings National Academy Sciences, USA , vol.78 , pp. 1596-1600
    • Shepherd, J.C.1
  • 22
    • 0020039567 scopus 로고
    • Codon preference and its use in identifying protein coding regions in long DNA sequences
    • Staden, R. and McLachlan, A.D. (1982) ‘Codon preference and its use in identifying protein coding regions in long DNA sequences’, Nucleic Acids Research, Vol. 10, pp.141–156.
    • (1982) Nucleic Acids Research , vol.10 , pp. 141-156
    • Staden, R.1    McLachlan, A.D.2
  • 26
    • 0036167758 scopus 로고    scopus 로고
    • Recognizing shorter coding regions of human genes based on the statistics of stop codons
    • Wang, Y., Zhang, C.T. and Dong, P. (2002) ‘Recognizing shorter coding regions of human genes based on the statistics of stop codons’, Biopolymers, Vol. 63, pp.207–216.
    • (2002) Biopolymers , vol.63 , pp. 207-216
    • Wang, Y.1    Zhang, C.T.2    Dong, P.3
  • 27
    • 84929026405 scopus 로고    scopus 로고
    • Classification of short human exons and introns based on statistical features
    • Art. No. 061916
    • Wu, Y., Liew, A.W.C., Yan, H. and Yang, M. (2003) ‘Classification of short human exons and introns based on statistical features’, Physical Review E, Art. No. 061916, Vol. 67, No. 6, pp.1–7.
    • (2003) Physical Review E , vol.67 , Issue.6 , pp. 1-7
    • Wu, Y.1    Liew, A.W.C.2    Yan, H.3    Yang, M.4
  • 29
    • 0027995001 scopus 로고
    • Z curves, an intuitive tool for visualizing and analyzing DNA sequences
    • Zhang, R. and Zhang, C.T. (1994) ‘Z curves, an intuitive tool for visualizing and analyzing DNA sequences’, Journal Biomolecular Structure Dynamics, Vol. 11, pp.767–782.
    • (1994) Journal Biomolecular Structure Dynamics , vol.11 , pp. 767-782
    • Zhang, R.1    Zhang, C.T.2
  • 30
    • 0031558402 scopus 로고    scopus 로고
    • A symmetrical theory of DNA sequences and its application
    • Zhang, C.T. (1997a) ‘A symmetrical theory of DNA sequences and its application’, J. Theoretical Biol., Vol. 187, pp.297–306.
    • (1997) J. Theoretical Biol. , vol.187 , pp. 297-306
    • Zhang, C.T.1
  • 31
    • 0034662286 scopus 로고    scopus 로고
    • Recognition of protein coding genes in the yeast genome at better than 95% accuracy based on the Z curve
    • Zhang, C.T. and Wang, J. (2000) ‘Recognition of protein coding genes in the yeast genome at better than 95% accuracy based on the Z curve’, Nucleic Acids Res., Vol. 28, pp.2804–2814.
    • (2000) Nucleic Acids Res. , vol.28 , pp. 2804-2814
    • Zhang, C.T.1    Wang, J.2
  • 32
    • 0031027525 scopus 로고    scopus 로고
    • Identification of protein coding regions in the human genome by quadratic discriminant analysis
    • Zhang, M.Q. (1997b) ‘Identification of protein coding regions in the human genome by quadratic discriminant analysis’, Proc. Natl. Acad. Sci., USA, Vol. 94, pp.565–568.
    • (1997) Proc. Natl. Acad. Sci., USA , vol.94 , pp. 565-568
    • Zhang, M.Q.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.