-
1
-
-
0033531330
-
A standard deviation based quantification differentiates coding from non-coding DNA sequences and gives insight to their evolutionary history
-
Almirantis, Y. (1999). A standard deviation based quantification differentiates coding from non-coding DNA sequences and gives insight to their evolutionary history. J. Theor. Biol. 196, 297-308.
-
(1999)
J. Theor. Biol.
, vol.196
, pp. 297-308
-
-
Almirantis, Y.1
-
2
-
-
0034100094
-
Efficient detection of unusual words
-
Apostolico, A., Bock, M. E., Lonardi, S. and Xu, X. (2000). Efficient detection of unusual words. J. Comput. Biol. 7, 71-94.
-
(2000)
J. Comput. Biol.
, vol.7
, pp. 71-94
-
-
Apostolico, A.1
Bock, M.E.2
Lonardi, S.3
Xu, X.4
-
3
-
-
0032183995
-
The minimum description length principle in coding and modelling
-
Barron, A., Rissanen, J. and Yu, B. (1998). The minimum description length principle in coding and modelling. IEEE Trans. Inform. Theory 44, 2743-2760.
-
(1998)
IEEE Trans. Inform. Theory
, vol.44
, pp. 2743-2760
-
-
Barron, A.1
Rissanen, J.2
Yu, B.3
-
4
-
-
0035109647
-
Variation of probabilistic suffix trees: Statistical modeling and prediction of protein families
-
Bejerano, G. and Yona, G. (2001). Variation of probabilistic suffix trees: Statistical modeling and prediction of protein families. Bioinformatics 17, 23-43.
-
(2001)
Bioinformatics
, vol.17
, pp. 23-43
-
-
Bejerano, G.1
Yona, G.2
-
5
-
-
0034619234
-
Fourier and wavelet transform analysis, a tool for visualizing regular patterns in DNA sequences
-
Dodin, G., Vandergheynst, P., Levoir, P., Cordier, C. and Marcourt, L. (2000). Fourier and wavelet transform analysis, a tool for visualizing regular patterns in DNA sequences. J. Theor. Biol. 206, 323-326.
-
(2000)
J. Theor. Biol.
, vol.206
, pp. 323-326
-
-
Dodin, G.1
Vandergheynst, P.2
Levoir, P.3
Cordier, C.4
Marcourt, L.5
-
6
-
-
0000952690
-
Distribution of base pair repeats in coding and non-coding DNA sequences
-
Dokholyan, N. V., Buldyrev, S. V., Havlin, S. and Stanley, H. E. (1997). Distribution of base pair repeats in coding and non-coding DNA sequences. Phys. Rev. Lett. 79, 5182-5185.
-
(1997)
Phys. Rev. Lett.
, vol.79
, pp. 5182-5185
-
-
Dokholyan, N.V.1
Buldyrev, S.V.2
Havlin, S.3
Stanley, H.E.4
-
7
-
-
0003516147
-
-
Cambridge University Press
-
Durbin, R., Eddy, S. R., Krogh, A. and Mitchison, G. (1998). Biological Sequence Analysis: Probabilistic Models of Protein and Nucleic Acids. Cambridge University Press, pp. 1-347.
-
(1998)
Biological Sequence Analysis: Probabilistic Models of Protein and Nucleic Acids
, pp. 1-347
-
-
Durbin, R.1
Eddy, S.R.2
Krogh, A.3
Mitchison, G.4
-
8
-
-
0027065108
-
Mathematical characterization of chaos game representation. New algorithms for nucleotide sequence analysis location
-
Dutta, C. and Das, J. (1992). Mathematical characterization of chaos game representation. New algorithms for nucleotide sequence analysis location. J. Mol. Biol. 228, 715-719.
-
(1992)
J. Mol. Biol.
, vol.228
, pp. 715-719
-
-
Dutta, C.1
Das, J.2
-
9
-
-
0019079941
-
On grammars, complexity, and information measures of biological macromolecules
-
Ebeling, W. and Jimenez-Montano, M. A. (1980). On grammars, complexity, and information measures of biological macromolecules. Math. Biosci. 52, 53-71.
-
(1980)
Math. Biosci.
, vol.52
, pp. 53-71
-
-
Ebeling, W.1
Jimenez-Montano, M.A.2
-
10
-
-
0002813049
-
Classification of symbol sequences over their frequency dictionaries: Towards the connection between structure and natural taxonomy
-
Gorban, A. N., Popova, T. G. and Sadovsky, M. G. (2000). Classification of symbol sequences over their frequency dictionaries: Towards the connection between structure and natural taxonomy. Open Sys. & Information Dyn. 7, 1-17.
-
(2000)
Open Sys. & Information Dyn.
, vol.7
, pp. 1-17
-
-
Gorban, A.N.1
Popova, T.G.2
Sadovsky, M.G.3
-
11
-
-
0034180314
-
Species independence of mutual information in coding and noncoding DNA
-
Grosse, I., Herzel, H., Buldyrev, S. V. and Stanley, H. E. (2000). Species independence of mutual information in coding and noncoding DNA. Phys. Rev. E 61, 5624-5629.
-
(2000)
Phys. Rev. E
, vol.61
, pp. 5624-5629
-
-
Grosse, I.1
Herzel, H.2
Buldyrev, S.V.3
Stanley, H.E.4
-
12
-
-
0000100455
-
A new challenge for compression algorithms: Genetic sequences
-
Grumbach, S. and Tahi, F. (1994). A new challenge for compression algorithms: Genetic sequences. J. Inf. Process. Manage 30, 875-886.
-
(1994)
J. Inf. Process. Manage
, vol.30
, pp. 875-886
-
-
Grumbach, S.1
Tahi, F.2
-
13
-
-
0033499888
-
On the complexity measures of genetic sequences
-
Gusev, V. D., Nemytikova, L. A. and Chuzhanova, N. A. (1999). On the complexity measures of genetic sequences. Bioinformatics 15, 994-999.
-
(1999)
Bioinformatics
, vol.15
, pp. 994-999
-
-
Gusev, V.D.1
Nemytikova, L.A.2
Chuzhanova, N.A.3
-
14
-
-
0033592774
-
Variations of the mononucleotide and short oligonucleotide distributions in the genomes of various organisms
-
Haring, D. and Kypr, J. (1999). Variations of the mononucleotide and short oligonucleotide distributions in the genomes of various organisms. J. Theor. Biol. 201, 141-156.
-
(1999)
J. Theor. Biol.
, vol.201
, pp. 141-156
-
-
Haring, D.1
Kypr, J.2
-
15
-
-
5244285298
-
Correlations in DNA sequences: The role of protein coding segments
-
Herzel, H. and Grosse, I. (1997). Correlations in DNA sequences: The role of protein coding segments. Phys. Rev. E 55, 800-811.
-
(1997)
Phys. Rev. E
, vol.55
, pp. 800-811
-
-
Herzel, H.1
Grosse, I.2
-
16
-
-
0030595138
-
Nucleosome DNA sequence pattern revealed by multiple alignment of experimentally mapped sequences
-
Ioshikhes, I., Bolshoy, A., Derenshteyn, K., Borodovsky, M. and Trifonov, E. N. (1996). Nucleosome DNA sequence pattern revealed by multiple alignment of experimentally mapped sequences. J. Mol. Biol. 262, 129-139.
-
(1996)
J. Mol. Biol.
, vol.262
, pp. 129-139
-
-
Ioshikhes, I.1
Bolshoy, A.2
Derenshteyn, K.3
Borodovsky, M.4
Trifonov, E.N.5
-
17
-
-
0027458843
-
Patchiness and correlations in DNA sequences
-
Karlin, S. and Brendel, V. (1993). Patchiness and correlations in DNA sequences. Science 259, 677-680.
-
(1993)
Science
, vol.259
, pp. 677-680
-
-
Karlin, S.1
Brendel, V.2
-
18
-
-
0029060923
-
Dinucleotide relative abundance extremes: A genomic signature
-
Karlin, S. and Burge, C. (1995). Dinucleotide relative abundance extremes: A genomic signature. Trends Genet. 11, 283-290.
-
(1995)
Trends Genet.
, vol.11
, pp. 283-290
-
-
Karlin, S.1
Burge, C.2
-
19
-
-
0028606501
-
Comparisons of eukaryotic genomic sequences
-
Karlin, S. and Ladunga, I. (1994). Comparisons of eukaryotic genomic sequences. Proc. Nat. Acad. Sci. USA 91, 12832-12836.
-
(1994)
Proc. Nat. Acad. Sci. USA
, vol.91
, pp. 12832-12836
-
-
Karlin, S.1
Ladunga, I.2
-
20
-
-
0032844391
-
Nucleosomal DNA property database
-
Levitsky, V. G., Ponomarenko, M. P., Ponomarenko, J. V., Frolov, A. S. and Kolchanov, N. A. (1999). Nucleosomal DNA property database. Bioinformatics 15, 582-592.
-
(1999)
Bioinformatics
, vol.15
, pp. 582-592
-
-
Levitsky, V.G.1
Ponomarenko, M.P.2
Ponomarenko, J.V.3
Frolov, A.S.4
Kolchanov, N.A.5
-
21
-
-
0026492782
-
Long-range doublet correlations in DNA and the coding regions
-
Mani, G. S. (1992). Long-range doublet correlations in DNA and the coding regions. J. Theor. Biol. 158, 447-464.
-
(1992)
J. Theor. Biol.
, vol.158
, pp. 447-464
-
-
Mani, G.S.1
-
22
-
-
0028765892
-
Linguistic features of non-coding DNA sequences
-
Mantegna, R. N., Buldyrev, S. V., Goldberger, A. L., Havlin, S., Peng, C.-K., Simons, M. and Stanley, H. E. (1994). Linguistic features of non-coding DNA sequences. Phys. Rev. Lett. 73, 3169-3172.
-
(1994)
Phys. Rev. Lett.
, vol.73
, pp. 3169-3172
-
-
Mantegna, R.N.1
Buldyrev, S.V.2
Goldberger, A.L.3
Havlin, S.4
Peng, C.-K.5
Simons, M.6
Stanley, H.E.7
-
23
-
-
0012035110
-
Protein primary sequences as Markov chains
-
Novosibirsk, Institute of Cytology and Genetics Press
-
Mitra, C. K. and Arusharka, S. (2000). Protein primary sequences as Markov chains. In: Proceedings of BGRS'2000, Novosibirsk, Institute of Cytology and Genetics Press 2, 180-182.
-
(2000)
Proceedings of BGRS'2000
, vol.2
, pp. 180-182
-
-
Mitra, C.K.1
Arusharka, S.2
-
24
-
-
0028961335
-
SCOP: A structural classification of proteins database for the investigation of sequences and structures
-
Murzin, A. G., Brenner, S. E., Hubbard, T. and Chothia, C. (1995). SCOP: A structural classification of proteins database for the investigation of sequences and structures. J. Mol. Biol. 247, 536-540.
-
(1995)
J. Mol. Biol.
, vol.247
, pp. 536-540
-
-
Murzin, A.G.1
Brenner, S.E.2
Hubbard, T.3
Chothia, C.4
-
25
-
-
0021759169
-
Doublet frequencies in evolutionary distinct groups
-
Nussinov, R. (1984). Doublet frequencies in evolutionary distinct groups. Nucleic Acids Res. 12, 1749-1763.
-
(1984)
Nucleic Acids Res.
, vol.12
, pp. 1749-1763
-
-
Nussinov, R.1
-
27
-
-
0012095411
-
Context dependencies in amino acid sequences of protein domains
-
Novosibirsk, Institute of Cytology and Genetics Press
-
Orlov, Y. L., Ivanisenko, V. A. and Potapov, V. N. (2000). Context dependencies in amino acid sequences of protein domains. In: Proceedings of BGRS'2000, Novosibirsk, Institute of Cytology and Genetics Press 2, 211-215.
-
(2000)
Proceedings of BGRS'2000
, vol.2
, pp. 211-215
-
-
Orlov, Y.L.1
Ivanisenko, V.A.2
Potapov, V.N.3
-
28
-
-
0035224579
-
An algorithm for finding signals of unknown length in DNA sequences
-
Pavesi, G., Mauri, G. and Pesole, G. (2001). An algorithm for finding signals of unknown length in DNA sequences. Bioinformatics 17 (Suppl.1), S207-S214.
-
(2001)
Bioinformatics
, vol.17
, Issue.SUPPL. 1
-
-
Pavesi, G.1
Mauri, G.2
Pesole, G.3
-
29
-
-
0033499841
-
Segmentation of yeast DNA using hidden Markov models
-
Peshkin, L. and Gelfand, M. S. (1999). Segmentation of yeast DNA using hidden Markov models. Bioinformatics 15, 980-986.
-
(1999)
Bioinformatics
, vol.15
, pp. 980-986
-
-
Peshkin, L.1
Gelfand, M.S.2
-
30
-
-
0029898695
-
Pair preferences: A quantitative measure of regularities in protein sequences
-
Rani, M. and Mitra, C. K. (1996). Pair preferences: A quantitative measure of regularities in protein sequences. J. Biomol. Struct. Dyn. 13, 935-944.
-
(1996)
J. Biomol. Struct. Dyn.
, vol.13
, pp. 935-944
-
-
Rani, M.1
Mitra, C.K.2
-
31
-
-
0032678532
-
Fast universal coding with context models
-
Rissanen, J. (1999). Fast universal coding with context models. IEEE Trans. Inform. Theory 45, 1065-1071.
-
(1999)
IEEE Trans. Inform. Theory
, vol.45
, pp. 1065-1071
-
-
Rissanen, J.1
-
32
-
-
0030282113
-
The power of amnesia: Learning probabilistic automata with variable memory length
-
Ron, D., Singer, Y. and Tishby, N. (1996). The power of amnesia: Learning probabilistic automata with variable memory length. Machine Learning 25, 117-149.
-
(1996)
Machine Learning
, vol.25
, pp. 117-149
-
-
Ron, D.1
Singer, Y.2
Tishby, N.3
-
33
-
-
0023001414
-
Sequence periodicities in chicken nucleosome core DNA
-
Satchwell, S. C., Drew, H. R. and Travers, A. A. (1986). Sequence periodicities in chicken nucleosome core DNA. J. Mol. Biol. 191, 659-675.
-
(1986)
J. Mol. Biol.
, vol.191
, pp. 659-675
-
-
Satchwell, S.C.1
Drew, H.R.2
Travers, A.A.3
-
35
-
-
0031558556
-
Estimating the entropy of DNA sequences
-
Schmitt, A. O. and Herzel, H. (1997). Estimating the entropy of DNA sequences. J. Theor. Biol. 188, 369-377.
-
(1997)
J. Theor. Biol.
, vol.188
, pp. 369-377
-
-
Schmitt, A.O.1
Herzel, H.2
-
36
-
-
84856043672
-
A mathematical theory of communication
-
Shannon, C. E. (1948). A mathematical theory of communication. Bell Syst. Tech. J. 27, pt.I., 379-423; pt.II., 623-656.
-
(1948)
Bell Syst. Tech. J.
, vol.27
, Issue.PART I-II
-
-
Shannon, C.E.1
-
37
-
-
0033081336
-
A signal encoded in vertebrate DNA that influences nucleosome positioning and alignment
-
Stein, A. and Bina, M. (1999). A signal encoded in vertebrate DNA that influences nucleosome positioning and alignment. Nucleic Acids Res. 27, 848-853.
-
(1999)
Nucleic Acids Res.
, vol.27
, pp. 848-853
-
-
Stein, A.1
Bina, M.2
-
38
-
-
0033548562
-
The stationary statistical properties of human coding sequences
-
Torney, D. C., Whittaker, C. C. and Xie, G. (1999). The stationary statistical properties of human coding sequences. J. Mol. Biol. 286, 1461-1469.
-
(1999)
J. Mol. Biol.
, vol.286
, pp. 1461-1469
-
-
Torney, D.C.1
Whittaker, C.C.2
Xie, G.3
-
39
-
-
0024334889
-
The multiple codes of nucleotide sequences
-
Trifonov, E. N. (1989). The multiple codes of nucleotide sequences. Bull. Math. Biol. 51, 417-432.
-
(1989)
Bull. Math. Biol.
, vol.51
, pp. 417-432
-
-
Trifonov, E.N.1
-
40
-
-
0007060573
-
Genetic level of DNA sequences is determined by superposition of many codes
-
(in Russian)
-
Trifonov, E. N. (1997). Genetic level of DNA sequences is determined by superposition of many codes. Mol. Biol. (Mosk.) 31, 759-767 (in Russian).
-
(1997)
Mol. Biol. (Mosk.)
, vol.31
, pp. 759-767
-
-
Trifonov, E.N.1
-
41
-
-
0034619248
-
Information content of protein sequences
-
Weiss, O., Jimenez-Montano, M. A. and Herzel, H. (2000). Information content of protein sequences. J. Theor. Biol. 206, 379-386.
-
(2000)
J. Theor. Biol.
, vol.206
, pp. 379-386
-
-
Weiss, O.1
Jimenez-Montano, M.A.2
Herzel, H.3
-
42
-
-
0031564633
-
Identification and characterization of genomic nucleosome-positioning sequences
-
Widlund, H. R., Cao, H., Simonsson, S., Magnusson, E., Simonsson, T., Nielsen, P. E., Kahn, J. D., Crothers, D. M. and Kubista, M. (1997). Identification and characterization of genomic nucleosome-positioning sequences. J. Mol. Biol. 267, 807-817.
-
(1997)
J. Mol. Biol.
, vol.267
, pp. 807-817
-
-
Widlund, H.R.1
Cao, H.2
Simonsson, S.3
Magnusson, E.4
Simonsson, T.5
Nielsen, P.E.6
Kahn, J.D.7
Crothers, D.M.8
Kubista, M.9
-
43
-
-
0033506881
-
Modelling and predicting transcriptional units of Escherichia coli genes using hidden Markov models
-
Yada, T., Nakao, M., Totoki, Y. and Nakai, K. (1999). Modelling and predicting transcriptional units of Escherichia coli genes using hidden Markov models. Bioinformatics 15, 987-993.
-
(1999)
Bioinformatics
, vol.15
, pp. 987-993
-
-
Yada, T.1
Nakao, M.2
Totoki, Y.3
Nakai, K.4
-
44
-
-
0031760475
-
A new Fourier transform approach for protein coding measure based on the format of the Z curve
-
Yan, M., Lin, Z.-S. and Zhang, C.-T. (1998). A new Fourier transform approach for protein coding measure based on the format of the Z curve. Bioinformatics 14, 685-690.
-
(1998)
Bioinformatics
, vol.14
, pp. 685-690
-
-
Yan, M.1
Lin, Z.-S.2
Zhang, C.-T.3
-
45
-
-
0031558402
-
A symmetrical theory of DNA sequences and its applications
-
Zhang, C.-T. (1997). A symmetrical theory of DNA sequences and its applications. J. Theor. Biol. 187, 297-306.
-
(1997)
J. Theor. Biol.
, vol.187
, pp. 297-306
-
-
Zhang, C.-T.1