-
1
-
-
0001841122
-
On the computational complexity of approximating probability distributions by probabilistic automata
-
Abe, N., & Warmuth, M. (1992). On the computational complexity of approximating probability distributions by probabilistic automata. Machine Learning, 9, 205-260.
-
(1992)
Machine Learning
, vol.9
, pp. 205-260
-
-
Abe, N.1
Warmuth, M.2
-
2
-
-
0030642529
-
Text compression by context tree weighting
-
IEEE Computer Society Press
-
Aberg, J., & Shtarkov, Y. (1997). Text compression by context tree weighting. In Proceedings Data Compression Conference (DCC), pp. 377-386. IEEE Computer Society Press.
-
(1997)
Proceedings Data Compression Conference (DCC)
, pp. 377-386
-
-
Aberg, J.1
Shtarkov, Y.2
-
3
-
-
0031043056
-
A new method for analyzing protein sequence relationships based on sammon maps
-
Agrafiotis, D. (1997). A new method for analyzing protein sequence relationships based on sammon maps. Protein Science, 6, 287-293.
-
(1997)
Protein Science
, vol.6
, pp. 287-293
-
-
Agrafiotis, D.1
-
4
-
-
0030671788
-
A corpus for the evaluation of lossless compression algorithms
-
Arnold, R., & Bell, T. (1997). A corpus for the evaluation of lossless compression algorithms. In Designs, Codes and Cryptography, pp. 201-210.
-
(1997)
Designs, Codes and Cryptography
, pp. 201-210
-
-
Arnold, R.1
Bell, T.2
-
5
-
-
0035301887
-
Texture mixing and texture movie synthesis using statistical learning
-
Bar-Joseph, Z., El-Yaniv, R., Lischinski, D., & Werman, M. (2001). Texture mixing and texture movie synthesis using statistical learning. IEEE Transactions on Visualization and Computer Graphics, 7(2), 120-135.
-
(2001)
IEEE Transactions on Visualization and Computer Graphics
, vol.7
, Issue.2
, pp. 120-135
-
-
Bar-Joseph, Z.1
El-Yaniv, R.2
Lischinski, D.3
Werman, M.4
-
6
-
-
0035109647
-
Variations on probabilistic suffix trees: Statistical modeling and the prediction of protein families
-
Bejerano, G., & Yona, G. (2001). Variations on probabilistic suffix trees: Statistical modeling and the prediction of protein families.. Bioinformatics, 17(1), 23-43.
-
(2001)
Bioinformatics
, vol.17
, Issue.1
, pp. 23-43
-
-
Bejerano, G.1
Yona, G.2
-
7
-
-
0004015324
-
-
Prentice-Hall, Inc
-
Bell, T., Cleary, J., & Witten, I. (1990). Text Compression. Prentice-Hall, Inc.
-
(1990)
Text Compression
-
-
Bell, T.1
Cleary, J.2
Witten, I.3
-
9
-
-
0008275693
-
Semantically motivated improvements for PPM variants
-
Bunton, S. (1997). Semantically motivated improvements for PPM variants. The Computer Journal, 40(2/3), 76-92.
-
(1997)
The Computer Journal
, vol.40
, Issue.2-3
, pp. 76-92
-
-
Bunton, S.1
-
10
-
-
0003573193
-
A block-sorting lossless data compression algorithm
-
Digital Equipement Corporation
-
Burrows, M., & Wheeler, D. J. (1994). A block-sorting lossless data compression algorithm.. Tech. rep. 124, Digital Equipement Corporation.
-
(1994)
Tech. Rep.
, vol.124
-
-
Burrows, M.1
Wheeler, D.J.2
-
11
-
-
27344448874
-
-
Protein structure prediction center
-
CASP (2002). Protein structure prediction center, http:// predictioncenter.llnl.gov/.
-
(2002)
-
-
-
12
-
-
0035008559
-
Compressing XML with multiplexed hierarchical PPM models
-
Cheney, J. (2001). Compressing XML with multiplexed hierarchical PPM models. In Data Compression Conference, pp. 163-172.
-
(2001)
Data Compression Conference
, pp. 163-172
-
-
Cheney, J.1
-
14
-
-
0003081845
-
Unbounded length contexts for PPM
-
Cleary, J., & Teahan, W. (1997). Unbounded length contexts for PPM. Computer Journal, 40, 67-75.
-
(1997)
Computer Journal
, vol.40
, pp. 67-75
-
-
Cleary, J.1
Teahan, W.2
-
15
-
-
0021405335
-
Data compression using adaptive coding and partial string matching
-
Cleary, J., & Witten, I. (1984). Data compression using adaptive coding and partial string matching. IEEE Transactions on Communications, COM-32(4), 396-402.
-
(1984)
IEEE Transactions on Communications
, vol.COM-32
, Issue.4
, pp. 396-402
-
-
Cleary, J.1
Witten, I.2
-
16
-
-
0017994420
-
A convergent gambling estimate of the entropy of English
-
Cover, T., & King, R. (1978). A convergent gambling estimate of the entropy of English. IEEE Transactions on Information Theory, 24(4), 413-421.
-
(1978)
IEEE Transactions on Information Theory
, vol.24
, Issue.4
, pp. 413-421
-
-
Cover, T.1
King, R.2
-
18
-
-
0016990971
-
The origin and evolution of protein superfamilies
-
Dayhoff, M. (1976). The origin and evolution of protein superfamilies. In Federation Proceedings, pp. 2132-2138.
-
(1976)
Federation Proceedings
, pp. 2132-2138
-
-
Dayhoff, M.1
-
20
-
-
0035014847
-
Multi-class protein fold recognition using support vector machines and neural networks
-
Ding, C., & Dubchak, I. (2001). Multi-class protein fold recognition using support vector machines and neural networks. Bioinformatics, 17, 349-358.
-
(2001)
Bioinformatics
, vol.17
, pp. 349-358
-
-
Ding, C.1
Dubchak, I.2
-
22
-
-
0142135034
-
Using machine-learning methods for musical style modeling
-
Dubnov, S., Assayag, G., Lartillot, O., & Bejerano, G. (2003). Using machine-learning methods for musical style modeling. IEEE Computer, 36(10), 73-80.
-
(2003)
IEEE Computer
, vol.36
, Issue.10
, pp. 73-80
-
-
Dubnov, S.1
Assayag, G.2
Lartillot, O.3
Bejerano, G.4
-
23
-
-
0028273691
-
Relations between entropy and error probability
-
Feder, M., & Merhav, N. (1994). Relations between entropy and error probability. IEEE Transactions on Information Theory, 40(1), 259-266.
-
(1994)
IEEE Transactions on Information Theory
, vol.40
, Issue.1
, pp. 259-266
-
-
Feder, M.1
Merhav, N.2
-
24
-
-
0028290005
-
Parser for protein folding units
-
Holm, L., & Sander, C. (1994). Parser for protein folding units. Proteins, 19, 256-268.
-
(1994)
Proteins
, vol.19
, pp. 256-268
-
-
Holm, L.1
Sander, C.2
-
26
-
-
0033289037
-
Using the Fisher kernel method to detect remote protein homologies
-
Jaakkola, T., Diekhans, M., & Haussler., D. (1999). Using the Fisher kernel method to detect remote protein homologies. In Proceedings of the Seventh International Conference on Intelligent Systems for Molecular Biology, pp. 149-158.
-
(1999)
Proceedings of the Seventh International Conference on Intelligent Systems for Molecular Biology
, pp. 149-158
-
-
Jaakkola, T.1
Diekhans, M.2
Haussler, D.3
-
27
-
-
0043231338
-
Prediction of protein structural classes by a new measure of information discrepancy
-
Jin, L., Fang, W., &: Tang, H. (2003). Prediction of protein structural classes by a new measure of information discrepancy. Computational Biology and Chemistry, 27, 373-380.
-
(2003)
Computational Biology and Chemistry
, vol.27
, pp. 373-380
-
-
Jin, L.1
Fang, W.2
Tang, H.3
-
28
-
-
0032648630
-
On-line algorithms for combining language models
-
Kalai, A., Chen, S., Blum, A., & Rosenfeld, R. (1999). On-line algorithms for combining language models. In Proceedings of the International Conference on Accoustics, Speech, and Signal Processing.
-
(1999)
Proceedings of the International Conference on Accoustics, Speech, and Signal Processing
-
-
Kalai, A.1
Chen, S.2
Blum, A.3
Rosenfeld, R.4
-
29
-
-
84945308250
-
Improved smoothing for probabilistic suffix trees seen as variable order Markov chains
-
Kermorvant, C., & Dupont, P. (2002). Improved smoothing for probabilistic suffix trees seen as variable order Markov chains. In European Conference on Machine Learning (ECML), pp. 185-194.
-
(2002)
European Conference on Machine Learning (ECML)
, pp. 185-194
-
-
Kermorvant, C.1
Dupont, P.2
-
31
-
-
0034672751
-
Structural class prediction: An application of residue distribution along the sequence
-
Kumarevel, T., Gromiha, M., & Ponnuswamy, M. (2000). Structural class prediction: An application of residue distribution along the sequence. Biophys Chem., 88, 81-101.
-
(2000)
Biophys Chem.
, vol.88
, pp. 81-101
-
-
Kumarevel, T.1
Gromiha, M.2
Ponnuswamy, M.3
-
32
-
-
0020718764
-
A note on the Ziv-Lempel model for compressing individual sequences
-
Langdon, G. (1983). A note on the Ziv-Lempel model for compressing individual sequences. IEEE Transactions on Information Theory, 29, 284- 287.
-
(1983)
IEEE Transactions on Information Theory
, vol.29
, pp. 284-287
-
-
Langdon, G.1
-
34
-
-
1542714925
-
Mismatch string kernels for discriminative protein classification
-
Leslie, C., Eskin, E., Cohen, A., Weston, J., & Noble, W. (2004). Mismatch string kernels for discriminative protein classification. Bioinformatics, 20, 467-476.
-
(2004)
Bioinformatics
, vol.20
, pp. 467-476
-
-
Leslie, C.1
Eskin, E.2
Cohen, A.3
Weston, J.4
Noble, W.5
-
35
-
-
0017309766
-
Structural patterns in globular proteins
-
Levitt, M., & Chothia, C. (1976). Structural patterns in globular proteins. Nature, 261, 552-558.
-
(1976)
Nature
, vol.261
, pp. 552-558
-
-
Levitt, M.1
Chothia, C.2
-
36
-
-
0037967496
-
An analysis of the burrows-wheeler transform
-
Manzini, G. (2001). An analysis of the burrows-wheeler transform. Journal of the ACM, 48(3), 407-430.
-
(2001)
Journal of the ACM
, vol.48
, Issue.3
, pp. 407-430
-
-
Manzini, G.1
-
37
-
-
3142657660
-
Concentration inequalities for the missing mass and for histogram rule error
-
McAllester, D., &: Ortiz, L. (2003). Concentration inequalities for the missing mass and for histogram rule error. Journal of Machine Learning Research, 4, 895-911.
-
(2003)
Journal of Machine Learning Research
, vol.4
, pp. 895-911
-
-
McAllester, D.1
Ortiz, L.2
-
38
-
-
0000747663
-
Maximum entropy Markov models for information extraction and segmentation
-
Morgan Kaufmann, San Francisco, CA
-
McCallum, A., Freitag, D., & Pereira, F. (2000). Maximum entropy Markov models for information extraction and segmentation. In Proc. 17th International Conf. on Machine Learning, pp. 591-598. Morgan Kaufmann, San Francisco, CA.
-
(2000)
Proc. 17th International Conf. on Machine Learning
, pp. 591-598
-
-
McCallum, A.1
Freitag, D.2
Pereira, F.3
-
39
-
-
0035109761
-
What are the baselines for protein fold recognition?
-
McGuffin, L., Bryson, K., & Jones, D. (2001). What are the baselines for protein fold recognition?. Bioinformatics, 17, 63-72.
-
(2001)
Bioinformatics
, vol.17
, pp. 63-72
-
-
McGuffin, L.1
Bryson, K.2
Jones, D.3
-
40
-
-
0029304928
-
A strong version of the redundancy-capacity theorem of universal coding
-
Merhav, N., & Feder, M. (1995). A strong version of the redundancy-capacity theorem of universal coding. IEEE Transactions on Information Theory, 41(3), 714-722.
-
(1995)
IEEE Transactions on Information Theory
, vol.41
, Issue.3
, pp. 714-722
-
-
Merhav, N.1
Feder, M.2
-
41
-
-
0027635149
-
On loss functions which minimize to conditional expected values and posterior probabilities
-
Miller, J., Goodman, R., & Smyth, P. (1993). On loss functions which minimize to conditional expected values and posterior probabilities. IEEE Transactions on Information Theory, 39(4:), 1404-1408.
-
(1993)
IEEE Transactions on Information Theory
, vol.39
, Issue.4
, pp. 1404-1408
-
-
Miller, J.1
Goodman, R.2
Smyth, P.3
-
42
-
-
0025516650
-
Implementing the PPM data compression scheme
-
Moffat, A. (1990). Implementing the PPM data compression scheme. IEEE Transactions on Communications, 38(11), 1917-1921.
-
(1990)
IEEE Transactions on Communications
, vol.38
, Issue.11
, pp. 1917-1921
-
-
Moffat, A.1
-
43
-
-
0028961335
-
SCOP: A structural classification of proteins database for the investigation of sequences and structures
-
Murzin, A., Brenner, S., Hubbard, T., & Chothia, C. (1995). SCOP: A structural classification of proteins database for the investigation of sequences and structures. Journal of Molecular Biology, 247, 536-540.
-
(1995)
Journal of Molecular Biology
, vol.247
, pp. 536-540
-
-
Murzin, A.1
Brenner, S.2
Hubbard, T.3
Chothia, C.4
-
46
-
-
9444225532
-
Towards behaviometric security systems: Learning to identify a typist
-
Nisenson, M., Yariv, I., El-Yaniv, R., & Meir, R. (2003). Towards behaviometric security systems: Learning to identify a typist. In The 7th European Conference on Principles and Practice of Knowledge Discovery in Databases.
-
(2003)
The 7th European Conference on Principles and Practice of Knowledge Discovery in Databases
-
-
Nisenson, M.1
Yariv, I.2
El-Yaniv, R.3
Meir, R.4
-
47
-
-
0142084741
-
Always good turing: Asymptotically optimal probability estimation
-
Orlitsky, A., Santhanam, N., & Zhang, J. (2003). Always Good Turing: Asymptotically optimal probability estimation. Science, 302(5644), 427-431.
-
(2003)
Science
, vol.302
, Issue.5644
, pp. 427-431
-
-
Orlitsky, A.1
Santhanam, N.2
Zhang, J.3
-
48
-
-
0036649406
-
Playing with virtual musicians: The continuator in practice
-
Pachet, F. (2002). Playing with virtual musicians: The continuator in practice. IEEE MultiMedia, 9(3), 77-82.
-
(2002)
IEEE MultiMedia
, vol.9
, Issue.3
, pp. 77-82
-
-
Pachet, F.1
-
49
-
-
0032509105
-
Sequence comparisons using multiple sequences detect three times as many remote homologues as pairwise methods
-
Park, J., Karplus, K., Barrett, C., Hughey, R., Haussler, D., Hubbard, T., & Chothia, C. (1998). Sequence comparisons using multiple sequences detect three times as many remote homologues as pairwise methods. Journal of Molecular Biology, 284, 1201-1210.
-
(1998)
Journal of Molecular Biology
, vol.284
, pp. 1201-1210
-
-
Park, J.1
Karplus, K.2
Barrett, C.3
Hughey, R.4
Haussler, D.5
Hubbard, T.6
Chothia, C.7
-
50
-
-
0024610919
-
A tutorial on hidden Markov models and selected applications in speech recognition
-
Rabiner, L. (1989). A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE, 77(3), 257-286.
-
(1989)
Proceedings of the IEEE
, vol.77
, Issue.3
, pp. 257-286
-
-
Rabiner, L.1
-
53
-
-
0021466584
-
Universal coding, information, prediction, and estimation
-
Rissanen, J. (1984). Universal coding, information, prediction, and estimation. IEEE Transactions on Information Theory, 30(4), 629-636.
-
(1984)
IEEE Transactions on Information Theory
, vol.30
, Issue.4
, pp. 629-636
-
-
Rissanen, J.1
-
54
-
-
0018442815
-
Arithmetic coding
-
Rissanen, J., & Langdon, G. (1979). Arithmetic coding. IBM Journal of Research and Development, 23(2), 149-162.
-
(1979)
IBM Journal of Research and Development
, vol.23
, Issue.2
, pp. 149-162
-
-
Rissanen, J.1
Langdon, G.2
-
55
-
-
0030282113
-
The power of amnesia: Learning probabilistic automata with variable memory length
-
Ron, D., Singer, Y., & Tishby, N. (1996). The power of amnesia: Learning probabilistic automata with variable memory length. Machine Learning, 25(2-3), 117-149.
-
(1996)
Machine Learning
, vol.25
, Issue.2-3
, pp. 117-149
-
-
Ron, D.1
Singer, Y.2
Tishby, N.3
-
56
-
-
0018782641
-
Hierarchic organization of domains in globular proteins
-
Rose, G. (1979). Hierarchic organization of domains in globular proteins. Journal of Molecular Biology, 134, 447-470.
-
(1979)
Journal of Molecular Biology
, vol.134
, pp. 447-470
-
-
Rose, G.1
-
57
-
-
0033901221
-
Implementing the context tree weighting method for text compression
-
Sadakane, K., Okazaki, T., & Imai, H. (2000). Implementing the context tree weighting method for text compression. In Data Compression Conference, pp. 123-132.
-
(2000)
Data Compression Conference
, pp. 123-132
-
-
Sadakane, K.1
Okazaki, T.2
Imai, H.3
-
58
-
-
0030736448
-
Redundancy of the Lempel-Ziv incremental parsing rule
-
Savari, S. (1997). Redundancy of the Lempel-Ziv incremental parsing rule. IEEE Transactions on Information Theory, 43, 9-21.
-
(1997)
IEEE Transactions on Information Theory
, vol.43
, pp. 9-21
-
-
Savari, S.1
-
60
-
-
84948416199
-
PPM: One step to practicality
-
Shkarin, D. (2002). PPM: One step to practicality. In Data Compression Conference, pp. 202-212.
-
(2002)
Data Compression Conference
, pp. 202-212
-
-
Shkarin, D.1
-
61
-
-
0037753853
-
Adaptive mixtures of probabilistic transducers
-
Singer, Y. (1997). Adaptive mixtures of probabilistic transducers. Neural Computation, 9(8), 1711-1733.
-
(1997)
Neural Computation
, vol.9
, Issue.8
, pp. 1711-1733
-
-
Singer, Y.1
-
62
-
-
0028671942
-
Dynamical encoding of cursive handwriting
-
Singer, Y., & Tishby, N. (1994). Dynamical encoding of cursive handwriting. Biological Cybernetics, 71(3), 227-237.
-
(1994)
Biological Cybernetics
, vol.71
, Issue.3
, pp. 227-237
-
-
Singer, Y.1
Tishby, N.2
-
63
-
-
0030925920
-
Pfam: A comprehensive database of protein domain families based on seed alignments
-
Sonnhammer, E., Eddy, S., & Durbin, R. (1997). Pfam: A comprehensive database of protein domain families based on seed alignments. Proteins, 28, 405-420.
-
(1997)
Proteins
, vol.28
, pp. 405-420
-
-
Sonnhammer, E.1
Eddy, S.2
Durbin, R.3
-
64
-
-
0030660581
-
A genomic perspective on protein families
-
Tatusov, R., Eugene, V., & David, J. (1997). A genomic perspective on protein families. Science, 278, 631-637.
-
(1997)
Science
, vol.278
, pp. 631-637
-
-
Tatusov, R.1
Eugene, V.2
David, J.3
-
65
-
-
1242280857
-
Using compression based language models for text categorization
-
Teahan, W., &. Harper, D. (2001). Using compression based language models for text categorization. In Workshop on Language Modeling and Information Retrieval, ARDA, Carnegie Mellon University, pp. 83-88.
-
(2001)
Workshop on Language Modeling and Information Retrieval, ARDA, Carnegie Mellon University
, pp. 83-88
-
-
Teahan, W.1
Harper, D.2
-
67
-
-
0030699597
-
A context-tree weighting method for text generating sources
-
Tjalkens, T., Volf, P., & Willems, F. (1997). A context-tree weighting method for text generating sources. In Data Compression Conference, p. 472.
-
(1997)
Data Compression Conference
, pp. 472
-
-
Tjalkens, T.1
Volf, P.2
Willems, F.3
-
70
-
-
0032022518
-
The context-tree weighting method: Extensions
-
Willems, F. (1998). The context-tree weighting method: Extensions. IEEE Transactions on Information Theory, 44(2), 792-798.
-
(1998)
IEEE Transactions on Information Theory
, vol.44
, Issue.2
, pp. 792-798
-
-
Willems, F.1
-
71
-
-
0029307102
-
The context-tree weighting method: Basic properties
-
Willems, F., Shtarkov, Y., & Tjalkens, T. (1995). The context-tree weighting method: Basic properties. IEEE Transactions on Information Theory, 653-664.
-
(1995)
IEEE Transactions on Information Theory
, pp. 653-664
-
-
Willems, F.1
Shtarkov, Y.2
Tjalkens, T.3
-
72
-
-
0026187945
-
The zero-frequency problem: Estimating the probabilities of novel events in adaptive text compression
-
Witten, I., & Bell, T. (1991). The zero-frequency problem: Estimating the probabilities of novel events in adaptive text compression. IEEE Transactions on Information Theory, 37(4), 1085-1094.
-
(1991)
IEEE Transactions on Information Theory
, vol.37
, Issue.4
, pp. 1085-1094
-
-
Witten, I.1
Bell, T.2
-
73
-
-
0032650194
-
Text mining: A new frontier for lossless compression
-
IEEE Computer Society
-
Witten, I., Bray, Z., Mahoui, M., & Teahan, B. (1999). Text mining: A new frontier for lossless compression. In Proceedings of the Conference on Data Compression, pp. 198-207. IEEE Computer Society.
-
(1999)
Proceedings of the Conference on Data Compression
, pp. 198-207
-
-
Witten, I.1
Bray, Z.2
Mahoui, M.3
Teahan, B.4
-
74
-
-
0036307493
-
Within the twilight zone: A sensitive profile-profile comparison tool based on information theory
-
Yona, G., & Levitt, M. (2002). Within the twilight zone: A sensitive profile-profile comparison tool based on information theory. Journal of Molecular Biology, 315, 1257-1275.
-
(2002)
Journal of Molecular Biology
, vol.315
, pp. 1257-1275
-
-
Yona, G.1
Levitt, M.2
-
75
-
-
0032726692
-
Protomap: Automatic classification of protein sequences, a hierarchy of protein families, and local maps of the protein space
-
Yona, G., Linial, N., & Linial, M. (1999). Protomap: Automatic classification of protein sequences, a hierarchy of protein families, and local maps of the protein space. Proteins, 37, 360-378.
-
(1999)
Proteins
, vol.37
, pp. 360-378
-
-
Yona, G.1
Linial, N.2
Linial, M.3
-
76
-
-
0018019231
-
Compression of individual sequences via variable-rate coding
-
Ziv, J., & Lempel, A. (1978). Compression of individual sequences via variable-rate coding. IEEE Transactions on Information Theory, 24, 530-536.
-
(1978)
IEEE Transactions on Information Theory
, vol.24
, pp. 530-536
-
-
Ziv, J.1
Lempel, A.2
|