메뉴 건너뛰기




Volumn 2, Issue 3, 2002, Pages 233-247

Construction of stochastic context trees for genetic texts

Author keywords

Complexity; Genetic texts; Information measure; Statistical modelling; Suffix tree visualisation; Variable memory Markov model

Indexed keywords

BETA GLOBIN; DNA; DOUBLE STRANDED DNA; NUCLEOTIDE DERIVATIVE; OLIGONUCLEOTIDE; PROTEIN; TRANSCRIPTION FACTOR;

EID: 0036455560     PISSN: 13866338     EISSN: None     Source Type: Journal    
DOI: None     Document Type: Conference Paper
Times cited : (13)

References (45)
  • 1
    • 0033531330 scopus 로고    scopus 로고
    • A standard deviation based quantification differentiates coding from non-coding DNA sequences and gives insight to their evolutionary history
    • Almirantis, Y. (1999). A standard deviation based quantification differentiates coding from non-coding DNA sequences and gives insight to their evolutionary history. J. Theor. Biol. 196, 297-308.
    • (1999) J. Theor. Biol. , vol.196 , pp. 297-308
    • Almirantis, Y.1
  • 3
    • 0032183995 scopus 로고    scopus 로고
    • The minimum description length principle in coding and modelling
    • Barron, A., Rissanen, J. and Yu, B. (1998). The minimum description length principle in coding and modelling. IEEE Trans. Inform. Theory 44, 2743-2760.
    • (1998) IEEE Trans. Inform. Theory , vol.44 , pp. 2743-2760
    • Barron, A.1    Rissanen, J.2    Yu, B.3
  • 4
    • 0035109647 scopus 로고    scopus 로고
    • Variation of probabilistic suffix trees: Statistical modeling and prediction of protein families
    • Bejerano, G. and Yona, G. (2001). Variation of probabilistic suffix trees: Statistical modeling and prediction of protein families. Bioinformatics 17, 23-43.
    • (2001) Bioinformatics , vol.17 , pp. 23-43
    • Bejerano, G.1    Yona, G.2
  • 5
    • 0034619234 scopus 로고    scopus 로고
    • Fourier and wavelet transform analysis, a tool for visualizing regular patterns in DNA sequences
    • Dodin, G., Vandergheynst, P., Levoir, P., Cordier, C. and Marcourt, L. (2000). Fourier and wavelet transform analysis, a tool for visualizing regular patterns in DNA sequences. J. Theor. Biol. 206, 323-326.
    • (2000) J. Theor. Biol. , vol.206 , pp. 323-326
    • Dodin, G.1    Vandergheynst, P.2    Levoir, P.3    Cordier, C.4    Marcourt, L.5
  • 6
    • 0000952690 scopus 로고    scopus 로고
    • Distribution of base pair repeats in coding and non-coding DNA sequences
    • Dokholyan, N. V., Buldyrev, S. V., Havlin, S. and Stanley, H. E. (1997). Distribution of base pair repeats in coding and non-coding DNA sequences. Phys. Rev. Letts. 79, 5182-5185.
    • (1997) Phys. Rev. Letts. , vol.79 , pp. 5182-5185
    • Dokholyan, N.V.1    Buldyrev, S.V.2    Havlin, S.3    Stanley, H.E.4
  • 8
    • 0027065108 scopus 로고
    • Mathematical characterization of chaos game representation. New algorithms for nucleotide sequence analysis location
    • Dutta, C. and Das, J. (1992). Mathematical characterization of chaos game representation. New algorithms for nucleotide sequence analysis location. J. Mol. Biol. 228, 715-719.
    • (1992) J. Mol. Biol. , vol.228 , pp. 715-719
    • Dutta, C.1    Das, J.2
  • 9
    • 0019079941 scopus 로고
    • On grammars, complexity, and information measures of biological macromolecules
    • Ebeling, W. and Jimenez-Montano, M. A. (1980). On grammars, complexity, and information measures of biological macromolecules. Math. Biosci. 52, 53-71.
    • (1980) Math. Biosci. , vol.52 , pp. 53-71
    • Ebeling, W.1    Jimenez-Montano, M.A.2
  • 10
    • 0002813049 scopus 로고    scopus 로고
    • Classification of symbol sequences over their frequency dictionaries: Towards the connection between structure and natural taxonomy
    • Gorban, A. N., Popova, T. G., Sadovsky, M. G. (2000). Classification of symbol sequences over their frequency dictionaries: Towards the connection between structure and natural taxonomy. Open Sys. & Information Dyn. 7, 1-17.
    • (2000) Open Sys. & Information Dyn. , vol.7 , pp. 1-17
    • Gorban, A.N.1    Popova, T.G.2    Sadovsky, M.G.3
  • 11
    • 0034180314 scopus 로고    scopus 로고
    • Species independence of mutual information in coding and noncoding DNA
    • Grosse, I., Herzel, H., Buldyrev, S. V. and Stanley, H. E. (2000). Species independence of mutual information in coding and noncoding DNA. Phys. Rev. E 61, 5624-5629.
    • (2000) Phys. Rev. E , vol.61 , pp. 5624-5629
    • Grosse, I.1    Herzel, H.2    Buldyrev, S.V.3    Stanley, H.E.4
  • 12
    • 0000100455 scopus 로고
    • A new challenge for compression algorithms: Genetic sequences
    • Grumbach, S. and Tahi, F. (1994). A new challenge for compression algorithms: Genetic sequences. J. Inf. Process. Manage 30, 875-886.
    • (1994) J. Inf. Process. Manage , vol.30 , pp. 875-886
    • Grumbach, S.1    Tahi, F.2
  • 13
  • 14
    • 0033592774 scopus 로고    scopus 로고
    • Variations of the mononucleotide and short oligonucleotide distributions in the genomes of various organisms
    • Haring, D. and Kypr, J. (1999). Variations of the mononucleotide and short oligonucleotide distributions in the genomes of various organisms. J. Theor. Biol. 201, 141-156.
    • (1999) J. Theor. Biol. , vol.201 , pp. 141-156
    • Haring, D.1    Kypr, J.2
  • 15
    • 5244285298 scopus 로고    scopus 로고
    • Correlations in DNA sequences: The role of protein coding segments
    • Herzel, H. and Grosse, I. (1997). Correlations in DNA sequences: The role of protein coding segments. Phys. Rev. E 55, 800-811.
    • (1997) Phys. Rev. E , vol.55 , pp. 800-811
    • Herzel, H.1    Grosse, I.2
  • 16
    • 0030595138 scopus 로고    scopus 로고
    • Nucleosome DNA sequence pattern revealed by multiple alignment of experimentally mapped sequences
    • Ioshikhes, I., Bolshoy, A., Derenshteyn, K., Borodovsky, M. and Trifonov, E. N. (1996). Nucleosome DNA sequence pattern revealed by multiple alignment of experimentally mapped sequences. J. Mol. Biol. 262, 129-139.
    • (1996) J. Mol. Biol. , vol.262 , pp. 129-139
    • Ioshikhes, I.1    Bolshoy, A.2    Derenshteyn, K.3    Borodovsky, M.4    Trifonov, E.N.5
  • 17
    • 0027458843 scopus 로고
    • Patchiness and correlations in DNA sequences
    • Karlin, S. and Brendel, V. (1993). Patchiness and correlations in DNA sequences. Science 259, 677-680.
    • (1993) Science , vol.259 , pp. 677-680
    • Karlin, S.1    Brendel, V.2
  • 18
    • 0029060923 scopus 로고
    • Dinucleotide relative abundance extremes: A genomic signature
    • Karlin, S. and Burge, C. (1995). Dinucleotide relative abundance extremes: A genomic signature. Trends Genet. 11, 283-290.
    • (1995) Trends Genet. , vol.11 , pp. 283-290
    • Karlin, S.1    Burge, C.2
  • 19
    • 0028606501 scopus 로고
    • Comparisons of eukaryotic genomic sequences
    • Karlin, S. and Ladunga, I. (1994). Comparisons of eukaryotic genomic sequences. Proc. Nat. Acad. Sci. USA 91, 12832-12836.
    • (1994) Proc. Nat. Acad. Sci. USA , vol.91 , pp. 12832-12836
    • Karlin, S.1    Ladunga, I.2
  • 21
    • 0026492782 scopus 로고
    • Long-range doublet correlations in DNA and the coding regions
    • Mani, G. S. (1992). Long-range doublet correlations in DNA and the coding regions. J. Theor. Biol. 158, 447-464.
    • (1992) J. Theor. Biol. , vol.158 , pp. 447-464
    • Mani, G.S.1
  • 23
    • 0012035110 scopus 로고    scopus 로고
    • Protein primary sequences as markov chains
    • Novosibirsk, Institute of Cytology and Genetics Press
    • Mitra, C.K. and Arusharka, S. (2000). Protein primary sequences as markov chains. In: Proceedings of BGRS'2000, Novosibirsk, Institute of Cytology and Genetics Press 2, 180-182.
    • (2000) Proceedings of BGRS'2000 , vol.2 , pp. 180-182
    • Mitra, C.K.1    Arusharka, S.2
  • 24
    • 0028961335 scopus 로고
    • SCOP: A structural classification of proteins database for the investigation of sequences and structures
    • Murzin, A. G., Brenner, S. E., Hubbard, T. and Chothia, C. (1995). SCOP: a structural classification of proteins database for the investigation of sequences and structures. J. Mol. Biol. 247, 536-540.
    • (1995) J. Mol. Biol. , vol.247 , pp. 536-540
    • Murzin, A.G.1    Brenner, S.E.2    Hubbard, T.3    Chothia, C.4
  • 25
    • 0021759169 scopus 로고
    • Doublet frequencies in evolutionary distinct groups
    • Nussinov, R. (1984). Doublet frequencies in evolutionary distinct groups. Nucleic Acids Res. 12, 1749-1763.
    • (1984) Nucleic Acids Res. , vol.12 , pp. 1749-1763
    • Nussinov, R.1
  • 27
    • 0012095411 scopus 로고    scopus 로고
    • Context dependencies in amino acid sequences of protein domains
    • Novosibirsk, Institute of Cytology and Genetics Press
    • Orlov, Y. L., Ivanisenko, V. A. and Potapov, V. N. (2000). Context dependencies in amino acid sequences of protein domains. In: Proceedings of BGRS'2000, Novosibirsk, Institute of Cytology and Genetics Press 2, 211-215.
    • (2000) Proceedings of BGRS'2000 , vol.2 , pp. 211-215
    • Orlov, Y.L.1    Ivanisenko, V.A.2    Potapov, V.N.3
  • 28
    • 0035224579 scopus 로고    scopus 로고
    • An algorithm for finding signals of unknown length in DNA sequences
    • Pavesi, G., Mauri, G. and Pesole, G. (2001). An algorithm for finding signals of unknown length in DNA sequences. Bioinformatics 17 (Suppl.1), S207-S214.
    • (2001) Bioinformatics , vol.17 , Issue.SUPPL. 1
    • Pavesi, G.1    Mauri, G.2    Pesole, G.3
  • 29
    • 0033499841 scopus 로고    scopus 로고
    • Segmentation of yeast DNA using hidden Markov models
    • Peshkin, L. and Gelfand, M. S. (1999). Segmentation of yeast DNA using hidden Markov models. Bioinformatics 15, 980-986.
    • (1999) Bioinformatics , vol.15 , pp. 980-986
    • Peshkin, L.1    Gelfand, M.S.2
  • 30
    • 0029898695 scopus 로고    scopus 로고
    • Pair preferences: A quantitative measure of regularities in protein sequences
    • Rani, M. and Mitra, C. K. (1996). Pair preferences: A quantitative measure of regularities in protein sequences. J. Biomol. Struct. Dyn. 13, 935-944.
    • (1996) J. Biomol. Struct. Dyn. , vol.13 , pp. 935-944
    • Rani, M.1    Mitra, C.K.2
  • 31
    • 0032678532 scopus 로고    scopus 로고
    • Fast universal coding with context models
    • Rissanen, J. (1999). Fast universal coding with context models. IEEE Trans. Inform. Theory 45, 1065-1071.
    • (1999) IEEE Trans. Inform. Theory , vol.45 , pp. 1065-1071
    • Rissanen, J.1
  • 32
    • 0030282113 scopus 로고    scopus 로고
    • The power of amnesia: Learning probabilistic automata with variable memory length
    • Ron, D., Singer, Y. and Tishby, N. (1996). The power of amnesia: Learning probabilistic automata with variable memory length. Machine Learning 25, 117-149.
    • (1996) Machine Learning , vol.25 , pp. 117-149
    • Ron, D.1    Singer, Y.2    Tishby, N.3
  • 33
    • 0023001414 scopus 로고
    • Sequence periodicities in chicken nucleosome core DNA
    • Satchwell, S. C., Drew, H. R. and Travers, A. A. (1986). Sequence periodicities in chicken nucleosome core DNA. J. Mol. Biol. 191, 659-675.
    • (1986) J. Mol. Biol. , vol.191 , pp. 659-675
    • Satchwell, S.C.1    Drew, H.R.2    Travers, A.A.3
  • 35
    • 0031558556 scopus 로고    scopus 로고
    • Estimating the entropy of DNA sequences
    • Schmitt, A. O. and Herzel, H. (1997). Estimating the entropy of DNA sequences. J. Theor. Biol. 188, 369-377.
    • (1997) J. Theor. Biol. , vol.188 , pp. 369-377
    • Schmitt, A.O.1    Herzel, H.2
  • 36
    • 84856043672 scopus 로고
    • A mathematical theory of communication
    • Shannon, C. E. (1948). A mathematical theory of communication. Bell Syst. Tech. J. 27, pt.I., 379-423; pt.II., 623-656.
    • (1948) Bell Syst. Tech. J. , vol.27 , Issue.PART I-II
    • Shannon, C.E.1
  • 37
    • 0033081336 scopus 로고    scopus 로고
    • A signal encoded in vertebrate DNA that influences nucleosome positioning and alignment
    • Stein, A. and Bina, M. (1999). A signal encoded in vertebrate DNA that influences nucleosome positioning and alignment. Nucleic Acids Res. 27, 848-853.
    • (1999) Nucleic Acids Res. , vol.27 , pp. 848-853
    • Stein, A.1    Bina, M.2
  • 38
    • 0033548562 scopus 로고    scopus 로고
    • The stationary statistical properties of human coding sequences
    • Torney, D. C., Whittaker, C. C. and Xie, G. (1999). The stationary statistical properties of human coding sequences. J. Mol. Biol. 286, 1461-1469.
    • (1999) J. Mol. Biol. , vol.286 , pp. 1461-1469
    • Torney, D.C.1    Whittaker, C.C.2    Xie, G.3
  • 39
    • 0024334889 scopus 로고
    • The multiple codes of nucleotide sequences
    • Trifonov, E. N. (1989). The multiple codes of nucleotide sequences. Bull. Math. Biol. 51, 417-432.
    • (1989) Bull. Math. Biol. , vol.51 , pp. 417-432
    • Trifonov, E.N.1
  • 40
    • 0007060573 scopus 로고    scopus 로고
    • Genetic level of DNA sequences is determined by superposition of many codes
    • (in Russian)
    • Trifonov, E. N. (1997). Genetic level of DNA sequences is determined by superposition of many codes. Mol. Biol. (Mosk.) 31, 759-767 (in Russian).
    • (1997) Mol. Biol. (Mosk.) , vol.31 , pp. 759-767
    • Trifonov, E.N.1
  • 43
    • 0033506881 scopus 로고    scopus 로고
    • Modelling and predicting transcriptional units of Escherichia coli genes using hidden Markov models
    • Yada, T., Nakao, M., Totoki, Y. and Nakai, K. (1999). Modelling and predicting transcriptional units of Escherichia coli genes using hidden Markov models. Bioinformatics 15, 987-993.
    • (1999) Bioinformatics , vol.15 , pp. 987-993
    • Yada, T.1    Nakao, M.2    Totoki, Y.3    Nakai, K.4
  • 44
    • 0031760475 scopus 로고    scopus 로고
    • A new Fourier transform approach for protein coding measure based on the format of the Z curve
    • Yan, M., Lin, Z.-S. and Zhang C.-T. (1998). A new Fourier transform approach for protein coding measure based on the format of the Z curve. Bioinformatics 14, 685-690.
    • (1998) Bioinformatics , vol.14 , pp. 685-690
    • Yan, M.1    Lin, Z.-S.2    Zhang, C.-T.3
  • 45
    • 0031558402 scopus 로고    scopus 로고
    • A symmetrical theory of DNA sequences and its applications
    • Zhang, C.-T. (1997). A symmetrical theory of DNA sequences and its applications. J. Theor. Biol. 187, 297-306.
    • (1997) J. Theor. Biol. , vol.187 , pp. 297-306
    • Zhang, C.-T.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.