Volume 22, 2004, Pages 385-421

On prediction using variable order Markov models

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; CLASSIFICATION (OF INFORMATION); DATA COMPRESSION; PROBLEM SOLVING; PROTEINS; TREES (MATHEMATICS);

EID: 27344458404     PISSN: 10769757     EISSN: 10769757     Source Type: Journal    
DOI: 10.1613/jair.1491     Document Type: Article
Times cited: 364

References (76)
  • 1
    • 0001841122
    • On the computational complexity of approximating probability distributions by probabilistic automata
    • Abe, N., & Warmuth, M. (1992). On the computational complexity of approximating probability distributions by probabilistic automata. Machine Learning, 9, 205-260.
    • (1992) Machine Learning , vol.9 , pp. 205-260
    • Abe, N.1    Warmuth, M.2
  • 2
    • 0030642529
    • Text compression by context tree weighting
    • IEEE Computer Society Press
    • Aberg, J., & Shtarkov, Y. (1997). Text compression by context tree weighting. In Proceedings Data Compression Conference (DCC), pp. 377-386. IEEE Computer Society Press.
    • (1997) Proceedings Data Compression Conference (DCC) , pp. 377-386
    • Aberg, J.1    Shtarkov, Y.2
  • 3
    • 0031043056
    • A new method for analyzing protein sequence relationships based on Sammon maps
    • Agrafiotis, D. (1997). A new method for analyzing protein sequence relationships based on Sammon maps. Protein Science, 6, 287-293.
    • (1997) Protein Science , vol.6 , pp. 287-293
    • Agrafiotis, D.1
  • 4
    • 0030671788
    • A corpus for the evaluation of lossless compression algorithms
    • Arnold, R., & Bell, T. (1997). A corpus for the evaluation of lossless compression algorithms. In Designs, Codes and Cryptography, pp. 201-210.
    • (1997) Designs, Codes and Cryptography , pp. 201-210
    • Arnold, R.1    Bell, T.2
  • 6
    • 0035109647
    • Variations on probabilistic suffix trees: Statistical modeling and the prediction of protein families
    • Bejerano, G., & Yona, G. (2001). Variations on probabilistic suffix trees: Statistical modeling and the prediction of protein families. Bioinformatics, 17(1), 23-43.
    • (2001) Bioinformatics , vol.17 , Issue.1 , pp. 23-43
    • Bejerano, G.1    Yona, G.2
  • 9
    • 0008275693
    • Semantically motivated improvements for PPM variants
    • Bunton, S. (1997). Semantically motivated improvements for PPM variants. The Computer Journal, 40(2/3), 76-92.
    • (1997) The Computer Journal , vol.40 , Issue.2-3 , pp. 76-92
    • Bunton, S.1
  • 10
    • 0003573193
    • A block-sorting lossless data compression algorithm
    • Digital Equipment Corporation
    • Burrows, M., & Wheeler, D. J. (1994). A block-sorting lossless data compression algorithm. Tech. rep. 124, Digital Equipment Corporation.
    • (1994) Tech. Rep. , vol.124
    • Burrows, M.1    Wheeler, D.J.2
  • 11
    • 27344448874
    • Protein structure prediction center
    • CASP (2002). Protein structure prediction center, http://predictioncenter.llnl.gov/.
    • (2002)
  • 12
    • 0035008559
    • Compressing XML with multiplexed hierarchical PPM models
    • Cheney, J. (2001). Compressing XML with multiplexed hierarchical PPM models. In Data Compression Conference, pp. 163-172.
    • (2001) Data Compression Conference , pp. 163-172
    • Cheney, J.1
  • 13
    • 0038083621
    • Predicting daily behavior via wearable sensors
    • MIT
    • Clarkson, B., & Pentland, A. (2001). Predicting daily behavior via wearable sensors. Tech. rep. Vismod TR#540, MIT.
    • (2001) Tech. Rep. Vismod TR#540 , vol.540
    • Clarkson, B.1    Pentland, A.2
  • 14
    • 0003081845
    • Unbounded length contexts for PPM
    • Cleary, J., & Teahan, W. (1997). Unbounded length contexts for PPM. Computer Journal, 40, 67-75.
    • (1997) Computer Journal , vol.40 , pp. 67-75
    • Cleary, J.1    Teahan, W.2
  • 15
    • 0021405335
    • Data compression using adaptive coding and partial string matching
    • Cleary, J., & Witten, I. (1984). Data compression using adaptive coding and partial string matching. IEEE Transactions on Communications, COM-32(4), 396-402.
    • (1984) IEEE Transactions on Communications , vol.COM-32 , Issue.4 , pp. 396-402
    • Cleary, J.1    Witten, I.2
  • 16
    • 0017994420
    • A convergent gambling estimate of the entropy of English
    • Cover, T., & King, R. (1978). A convergent gambling estimate of the entropy of English. IEEE Transactions on Information Theory, 24(4), 413-421.
    • (1978) IEEE Transactions on Information Theory , vol.24 , Issue.4 , pp. 413-421
    • Cover, T.1    King, R.2
  • 18
    • 0016990971
    • The origin and evolution of protein superfamilies
    • Dayhoff, M. (1976). The origin and evolution of protein superfamilies. In Federation Proceedings, pp. 2132-2138.
    • (1976) Federation Proceedings , pp. 2132-2138
    • Dayhoff, M.1
  • 20
    • 0035014847
    • Multi-class protein fold recognition using support vector machines and neural networks
    • Ding, C., & Dubchak, I. (2001). Multi-class protein fold recognition using support vector machines and neural networks. Bioinformatics, 17, 349-358.
    • (2001) Bioinformatics , vol.17 , pp. 349-358
    • Ding, C.1    Dubchak, I.2
  • 22
    • 0142135034
    • Using machine-learning methods for musical style modeling
    • Dubnov, S., Assayag, G., Lartillot, O., & Bejerano, G. (2003). Using machine-learning methods for musical style modeling. IEEE Computer, 36(10), 73-80.
    • (2003) IEEE Computer , vol.36 , Issue.10 , pp. 73-80
    • Dubnov, S.1    Assayag, G.2    Lartillot, O.3    Bejerano, G.4
  • 23
    • 0028273691
    • Relations between entropy and error probability
    • Feder, M., & Merhav, N. (1994). Relations between entropy and error probability. IEEE Transactions on Information Theory, 40(1), 259-266.
    • (1994) IEEE Transactions on Information Theory , vol.40 , Issue.1 , pp. 259-266
    • Feder, M.1    Merhav, N.2
  • 24
    • 0028290005
    • Parser for protein folding units
    • Holm, L., & Sander, C. (1994). Parser for protein folding units. Proteins, 19, 256-268.
    • (1994) Proteins , vol.19 , pp. 256-268
    • Holm, L.1    Sander, C.2
  • 27
    • 0043231338
    • Prediction of protein structural classes by a new measure of information discrepancy
    • Jin, L., Fang, W., & Tang, H. (2003). Prediction of protein structural classes by a new measure of information discrepancy. Computational Biology and Chemistry, 27, 373-380.
    • (2003) Computational Biology and Chemistry , vol.27 , pp. 373-380
    • Jin, L.1    Fang, W.2    Tang, H.3
  • 29
    • 84945308250
    • Improved smoothing for probabilistic suffix trees seen as variable order Markov chains
    • Kermorvant, C., & Dupont, P. (2002). Improved smoothing for probabilistic suffix trees seen as variable order Markov chains. In European Conference on Machine Learning (ECML), pp. 185-194.
    • (2002) European Conference on Machine Learning (ECML) , pp. 185-194
    • Kermorvant, C.1    Dupont, P.2
  • 31
    • 0034672751
    • Structural class prediction: An application of residue distribution along the sequence
    • Kumarevel, T., Gromiha, M., & Ponnuswamy, M. (2000). Structural class prediction: An application of residue distribution along the sequence. Biophys Chem., 88, 81-101.
    • (2000) Biophys Chem. , vol.88 , pp. 81-101
    • Kumarevel, T.1    Gromiha, M.2    Ponnuswamy, M.3
  • 32
    • 0020718764
    • A note on the Ziv-Lempel model for compressing individual sequences
    • Langdon, G. (1983). A note on the Ziv-Lempel model for compressing individual sequences. IEEE Transactions on Information Theory, 29, 284-287.
    • (1983) IEEE Transactions on Information Theory , vol.29 , pp. 284-287
    • Langdon, G.1
  • 34
    • 1542714925
    • Mismatch string kernels for discriminative protein classification
    • Leslie, C., Eskin, E., Cohen, A., Weston, J., & Noble, W. (2004). Mismatch string kernels for discriminative protein classification. Bioinformatics, 20, 467-476.
    • (2004) Bioinformatics , vol.20 , pp. 467-476
    • Leslie, C.1    Eskin, E.2    Cohen, A.3    Weston, J.4    Noble, W.5
  • 35
    • 0017309766
    • Structural patterns in globular proteins
    • Levitt, M., & Chothia, C. (1976). Structural patterns in globular proteins. Nature, 261, 552-558.
    • (1976) Nature , vol.261 , pp. 552-558
    • Levitt, M.1    Chothia, C.2
  • 36
    • 0037967496
    • An analysis of the Burrows-Wheeler transform
    • Manzini, G. (2001). An analysis of the Burrows-Wheeler transform. Journal of the ACM, 48(3), 407-430.
    • (2001) Journal of the ACM , vol.48 , Issue.3 , pp. 407-430
    • Manzini, G.1
  • 37
    • 3142657660
    • Concentration inequalities for the missing mass and for histogram rule error
    • McAllester, D., & Ortiz, L. (2003). Concentration inequalities for the missing mass and for histogram rule error. Journal of Machine Learning Research, 4, 895-911.
    • (2003) Journal of Machine Learning Research , vol.4 , pp. 895-911
    • McAllester, D.1    Ortiz, L.2
  • 38
    • 0000747663
    • Maximum entropy Markov models for information extraction and segmentation
    • Morgan Kaufmann, San Francisco, CA
    • McCallum, A., Freitag, D., & Pereira, F. (2000). Maximum entropy Markov models for information extraction and segmentation. In Proc. 17th International Conf. on Machine Learning, pp. 591-598. Morgan Kaufmann, San Francisco, CA.
    • (2000) Proc. 17th International Conf. on Machine Learning , pp. 591-598
    • McCallum, A.1    Freitag, D.2    Pereira, F.3
  • 39
    • 0035109761
    • What are the baselines for protein fold recognition?
    • McGuffin, L., Bryson, K., & Jones, D. (2001). What are the baselines for protein fold recognition? Bioinformatics, 17, 63-72.
    • (2001) Bioinformatics , vol.17 , pp. 63-72
    • McGuffin, L.1    Bryson, K.2    Jones, D.3
  • 40
    • 0029304928
    • A strong version of the redundancy-capacity theorem of universal coding
    • Merhav, N., & Feder, M. (1995). A strong version of the redundancy-capacity theorem of universal coding. IEEE Transactions on Information Theory, 41(3), 714-722.
    • (1995) IEEE Transactions on Information Theory , vol.41 , Issue.3 , pp. 714-722
    • Merhav, N.1    Feder, M.2
  • 41
    • 0027635149
    • On loss functions which minimize to conditional expected values and posterior probabilities
    • Miller, J., Goodman, R., & Smyth, P. (1993). On loss functions which minimize to conditional expected values and posterior probabilities. IEEE Transactions on Information Theory, 39(4), 1404-1408.
    • (1993) IEEE Transactions on Information Theory , vol.39 , Issue.4 , pp. 1404-1408
    • Miller, J.1    Goodman, R.2    Smyth, P.3
  • 42
    • 0025516650
    • Implementing the PPM data compression scheme
    • Moffat, A. (1990). Implementing the PPM data compression scheme. IEEE Transactions on Communications, 38(11), 1917-1921.
    • (1990) IEEE Transactions on Communications , vol.38 , Issue.11 , pp. 1917-1921
    • Moffat, A.1
  • 43
    • 0028961335
    • SCOP: A structural classification of proteins database for the investigation of sequences and structures
    • Murzin, A., Brenner, S., Hubbard, T., & Chothia, C. (1995). SCOP: A structural classification of proteins database for the investigation of sequences and structures. Journal of Molecular Biology, 247, 536-540.
    • (1995) Journal of Molecular Biology , vol.247 , pp. 536-540
    • Murzin, A.1    Brenner, S.2    Hubbard, T.3    Chothia, C.4
  • 47
    • 0142084741
    • Always Good Turing: Asymptotically optimal probability estimation
    • Orlitsky, A., Santhanam, N., & Zhang, J. (2003). Always Good Turing: Asymptotically optimal probability estimation. Science, 302(5644), 427-431.
    • (2003) Science , vol.302 , Issue.5644 , pp. 427-431
    • Orlitsky, A.1    Santhanam, N.2    Zhang, J.3
  • 48
    • 0036649406
    • Playing with virtual musicians: The continuator in practice
    • Pachet, F. (2002). Playing with virtual musicians: The continuator in practice. IEEE MultiMedia, 9(3), 77-82.
    • (2002) IEEE MultiMedia , vol.9 , Issue.3 , pp. 77-82
    • Pachet, F.1
  • 49
    • 0032509105
    • Sequence comparisons using multiple sequences detect three times as many remote homologues as pairwise methods
    • Park, J., Karplus, K., Barrett, C., Hughey, R., Haussler, D., Hubbard, T., & Chothia, C. (1998). Sequence comparisons using multiple sequences detect three times as many remote homologues as pairwise methods. Journal of Molecular Biology, 284, 1201-1210.
    • (1998) Journal of Molecular Biology , vol.284 , pp. 1201-1210
    • Park, J.1    Karplus, K.2    Barrett, C.3    Hughey, R.4    Haussler, D.5    Hubbard, T.6    Chothia, C.7
  • 50
    • 0024610919
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Rabiner, L. (1989). A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE, 77(3), 257-286.
    • (1989) Proceedings of the IEEE , vol.77 , Issue.3 , pp. 257-286
    • Rabiner, L.1
  • 53
    • 0021466584
    • Universal coding, information, prediction, and estimation
    • Rissanen, J. (1984). Universal coding, information, prediction, and estimation. IEEE Transactions on Information Theory, 30(4), 629-636.
    • (1984) IEEE Transactions on Information Theory , vol.30 , Issue.4 , pp. 629-636
    • Rissanen, J.1
  • 55
    • 0030282113
    • The power of amnesia: Learning probabilistic automata with variable memory length
    • Ron, D., Singer, Y., & Tishby, N. (1996). The power of amnesia: Learning probabilistic automata with variable memory length. Machine Learning, 25(2-3), 117-149.
    • (1996) Machine Learning , vol.25 , Issue.2-3 , pp. 117-149
    • Ron, D.1    Singer, Y.2    Tishby, N.3
  • 56
    • 0018782641
    • Hierarchic organization of domains in globular proteins
    • Rose, G. (1979). Hierarchic organization of domains in globular proteins. Journal of Molecular Biology, 134, 447-470.
    • (1979) Journal of Molecular Biology , vol.134 , pp. 447-470
    • Rose, G.1
  • 57
    • 0033901221
    • Implementing the context tree weighting method for text compression
    • Sadakane, K., Okazaki, T., & Imai, H. (2000). Implementing the context tree weighting method for text compression. In Data Compression Conference, pp. 123-132.
    • (2000) Data Compression Conference , pp. 123-132
    • Sadakane, K.1    Okazaki, T.2    Imai, H.3
  • 58
    • 0030736448
    • Redundancy of the Lempel-Ziv incremental parsing rule
    • Savari, S. (1997). Redundancy of the Lempel-Ziv incremental parsing rule. IEEE Transactions on Information Theory, 43, 9-21.
    • (1997) IEEE Transactions on Information Theory , vol.43 , pp. 9-21
    • Savari, S.1
  • 60
    • 84948416199
    • PPM: One step to practicality
    • Shkarin, D. (2002). PPM: One step to practicality. In Data Compression Conference, pp. 202-212.
    • (2002) Data Compression Conference , pp. 202-212
    • Shkarin, D.1
  • 61
    • 0037753853
    • Adaptive mixtures of probabilistic transducers
    • Singer, Y. (1997). Adaptive mixtures of probabilistic transducers. Neural Computation, 9(8), 1711-1733.
    • (1997) Neural Computation , vol.9 , Issue.8 , pp. 1711-1733
    • Singer, Y.1
  • 62
    • 0028671942
    • Dynamical encoding of cursive handwriting
    • Singer, Y., & Tishby, N. (1994). Dynamical encoding of cursive handwriting. Biological Cybernetics, 71(3), 227-237.
    • (1994) Biological Cybernetics , vol.71 , Issue.3 , pp. 227-237
    • Singer, Y.1    Tishby, N.2
  • 63
    • 0030925920
    • Pfam: A comprehensive database of protein domain families based on seed alignments
    • Sonnhammer, E., Eddy, S., & Durbin, R. (1997). Pfam: A comprehensive database of protein domain families based on seed alignments. Proteins, 28, 405-420.
    • (1997) Proteins , vol.28 , pp. 405-420
    • Sonnhammer, E.1    Eddy, S.2    Durbin, R.3
  • 64
    • 0030660581
    • A genomic perspective on protein families
    • Tatusov, R., Koonin, E., & Lipman, D. (1997). A genomic perspective on protein families. Science, 278, 631-637.
    • (1997) Science , vol.278 , pp. 631-637
    • Tatusov, R.1    Koonin, E.2    Lipman, D.3
  • 67
    • 0030699597
    • A context-tree weighting method for text generating sources
    • Tjalkens, T., Volf, P., & Willems, F. (1997). A context-tree weighting method for text generating sources. In Data Compression Conference, p. 472.
    • (1997) Data Compression Conference , pp. 472
    • Tjalkens, T.1    Volf, P.2    Willems, F.3
  • 70
    • 0032022518
    • The context-tree weighting method: Extensions
    • Willems, F. (1998). The context-tree weighting method: Extensions. IEEE Transactions on Information Theory, 44(2), 792-798.
    • (1998) IEEE Transactions on Information Theory , vol.44 , Issue.2 , pp. 792-798
    • Willems, F.1
  • 72
    • 0026187945
    • The zero-frequency problem: Estimating the probabilities of novel events in adaptive text compression
    • Witten, I., & Bell, T. (1991). The zero-frequency problem: Estimating the probabilities of novel events in adaptive text compression. IEEE Transactions on Information Theory, 37(4), 1085-1094.
    • (1991) IEEE Transactions on Information Theory , vol.37 , Issue.4 , pp. 1085-1094
    • Witten, I.1    Bell, T.2
  • 74
    • 0036307493
    • Within the twilight zone: A sensitive profile-profile comparison tool based on information theory
    • Yona, G., & Levitt, M. (2002). Within the twilight zone: A sensitive profile-profile comparison tool based on information theory. Journal of Molecular Biology, 315, 1257-1275.
    • (2002) Journal of Molecular Biology , vol.315 , pp. 1257-1275
    • Yona, G.1    Levitt, M.2
  • 75
    • 0032726692
    • Protomap: Automatic classification of protein sequences, a hierarchy of protein families, and local maps of the protein space
    • Yona, G., Linial, N., & Linial, M. (1999). Protomap: Automatic classification of protein sequences, a hierarchy of protein families, and local maps of the protein space. Proteins, 37, 360-378.
    • (1999) Proteins , vol.37 , pp. 360-378
    • Yona, G.1    Linial, N.2    Linial, M.3
  • 76
    • 0018019231
    • Compression of individual sequences via variable-rate coding
    • Ziv, J., & Lempel, A. (1978). Compression of individual sequences via variable-rate coding. IEEE Transactions on Information Theory, 24, 530-536.
    • (1978) IEEE Transactions on Information Theory , vol.24 , pp. 530-536
    • Ziv, J.1    Lempel, A.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.