메뉴 건너뛰기




Volumn 44, Issue 3, 1998, Pages 1319-1327

Nonparametric entropy estimation for stationary processesand random fields, with applications to english text

Author keywords

Entropy of english; Entropy rate; Pattern matching; Universal data compression

Indexed keywords

ALGORITHMS; DATA COMPRESSION; ENCODING (SYMBOLS); ESTIMATION; MATHEMATICAL TRANSFORMATIONS; PATTERN RECOGNITION; PROBABILITY; RANDOM PROCESSES;

EID: 0032073870     PISSN: 00189448     EISSN: None     Source Type: Journal    
DOI: 10.1109/18.669425     Document Type: Article
Times cited : (211)

References (33)
  • 1
    • 0028428283 scopus 로고
    • The strong law of large numbers for sequential decisions under uncertainty
    • May
    • P. H. Algoet, "The strong law of large numbers for sequential decisions under uncertainty," IEEE Trans. Inform. Theory, vol. 40, pp. 609-634, May 1994.
    • (1994) IEEE Trans. Inform. Theory , vol.40 , pp. 609-634
    • Algoet, P.H.1
  • 2
    • 0000039074 scopus 로고
    • The strong ergodic theorem for densities: Generalized Shannon-McMillan-Breiman theorem
    • Nov.
    • A. R. Barron, "The strong ergodic theorem for densities: Generalized Shannon-McMillan-Breiman theorem," Ann. Probab., vol. 13, pp. 1292-1303, Nov. 1985.
    • (1985) Ann. Probab. , vol.13 , pp. 1292-1303
    • Barron, A.R.1
  • 3
    • 0001531440 scopus 로고
    • The individual ergodic theorem of information theory
    • Correction: ibid., vol. 31, pp. 809-810, 1960
    • L. Breiman, "The individual ergodic theorem of information theory," Ann. Math. Statist., vol. 28, pp. 809-811, 1957. Correction: ibid., vol. 31, pp. 809-810, 1960.
    • (1957) Ann. Math. Statist. , vol.28 , pp. 809-811
    • Breiman, L.1
  • 4
    • 0027873987 scopus 로고
    • Using difficulty of prediction to decrease computation: Fast sort, priority queue and convex hull on entropy bounded inputs
    • Los Alamitos, CA: IEEE Computer Soc. Press
    • S. Chen and J. H. Reif, "Using difficulty of prediction to decrease computation: Fast sort, priority queue and convex hull on entropy bounded inputs," in Proc. 34th Symp. Foundations of Computer Science. Los Alamitos, CA: IEEE Computer Soc. Press, 1993, pp. 104-112.
    • (1993) Proc. 34th Symp. Foundations of Computer Science , pp. 104-112
    • Chen, S.1    Reif, J.H.2
  • 5
    • 0029228575 scopus 로고
    • Fast pattern matching for entropy bounded text
    • (Snowbird, UT). Los Alamitos, CA: IEEE Computer Soc. Press
    • _, "Fast pattern matching for entropy bounded text," in Proc. DCC'95 Data Compression Conf. (Snowbird, UT). Los Alamitos, CA: IEEE Computer Soc. Press, 1995, pp. 282-291.
    • (1995) Proc. DCC'95 Data Compression Conf. , pp. 282-291
  • 7
    • 0013296475 scopus 로고    scopus 로고
    • Sur le propriétés asymptotiques de mouvement régis par certains types de chaines simples
    • W. Doeblin, "Sur le propriétés asymptotiques de mouvement régis par certains types de chaines simples," Bull. Math. Soc. Raum. Sci., vol. 39, no. 1 pp. 57-115,
    • Bull. Math. Soc. Raum. Sci. , vol.39 , Issue.1 , pp. 57-115
    • Doeblin, W.1
  • 8
    • 33747671280 scopus 로고
    • and Bull. Math. Soc. Raum. Sci., vol. 39, no. 2 pp. 3-61, 1937.
    • (1937) Bull. Math. Soc. Raum. Sci. , vol.39 , Issue.2 , pp. 3-61
  • 9
    • 84994364597 scopus 로고
    • On the entropy of DNA: Algorithms and measurements based on memory and rapid convergence
    • Philadelphia, PA: Soc. Industr. Appl. Math.
    • M. Farach, M. Noordewier, S. Savari, L. Shepp, A. J. Wyner, and J. Ziv, "On the entropy of DNA: Algorithms and measurements based on memory and rapid convergence," in Proc. ACM-SIAM Symp. Discrete Algorithms. Philadelphia, PA: Soc. Industr. Appl. Math., 1995, pp. 48-57.
    • (1995) Proc. ACM-SIAM Symp. Discrete Algorithms , pp. 48-57
    • Farach, M.1    Noordewier, M.2    Savari, S.3    Shepp, L.4    Wyner, A.J.5    Ziv, J.6
  • 10
    • 0024668120 scopus 로고
    • Estimating the information content of symbol sequences and efficient codes
    • May
    • P. Grassberger, "Estimating the information content of symbol sequences and efficient codes," IEEE Trans. Inform. Theory, vol. 35, pp. 669-675, May 1989.
    • (1989) IEEE Trans. Inform. Theory , vol.35 , pp. 669-675
    • Grassberger, P.1
  • 11
    • 33747700190 scopus 로고    scopus 로고
    • What can we do with small corpora: Document categorization via cross-entropy
    • to be published in SimCat'97
    • P. Juola, "What can we do with small corpora: Document categorization via cross-entropy," Comparative Linguistics, to be published in SimCat'97.
    • Comparative Linguistics
    • Juola, P.1
  • 12
    • 0039336710 scopus 로고
    • Prefixes and the entropy rate for long-range sources
    • F. P. Kelly, Ed. Chichester, U.K.: Wiley
    • I. Kontoyiannis and Yu. M. Suhov, "Prefixes and the entropy rate for long-range sources," in Probability Statistics and Optimization, F. P. Kelly, Ed. Chichester, U.K.: Wiley, 1994, pp. 89-98.
    • (1994) Probability Statistics and Optimization , pp. 89-98
    • Kontoyiannis, I.1    Suhov, Yu.M.2
  • 13
    • 33747682061 scopus 로고    scopus 로고
    • Stationary entropy estimation via string matching
    • Snowbird, UT, Apr.
    • _, "Stationary entropy estimation via string matching," in Proc. Data Compression Conf. DCC'96 (Snowbird, UT, Apr. 1996).
    • (1996) Proc. Data Compression Conf. DCC'96
  • 14
  • 15
    • 23944464159 scopus 로고
    • The ergodic theorem for a sequence of functions
    • P. T. Maker, "The ergodic theorem for a sequence of functions," Duke Math. J., vol. 6, pp. 27-30, 1940.
    • (1940) Duke Math. J. , vol.6 , pp. 27-30
    • Maker, P.T.1
  • 16
    • 0027266419 scopus 로고
    • Entropy and data compression schemes
    • Jan.
    • D. S. Ornstein and B. Weiss, "Entropy and data compression schemes," IEEE Trans. Inform. Theory, vol. 39, pp. 78-83, Jan. 1993.
    • (1993) IEEE Trans. Inform. Theory , vol.39 , pp. 78-83
    • Ornstein, D.S.1    Weiss, B.2
  • 18
    • 33747654331 scopus 로고
    • Trondheim, Norway, June 27-July 1
    • also presented at the IEEE Int. Symp. on Information Theory, Trondheim, Norway, June 27-July 1, 1994.
    • (1994) IEEE Int. Symp. on Information Theory
  • 21
    • 0000437595 scopus 로고
    • Asymptotic growth of a class of random trees
    • B. Pittel, "Asymptotic growth of a class of random trees," Ann. Probab., vol. 13, no. 2, pp. 414-427, 1985.
    • (1985) Ann. Probab. , vol.13 , Issue.2 , pp. 414-427
    • Pittel, B.1
  • 22
    • 0040176500 scopus 로고
    • An entropy estimator for a class of infinite alphabet processes
    • Statistical Lab., Univ. of Cambridge, Cambridge, U.K., May
    • A. N. Quas, "An entropy estimator for a class of infinite alphabet processes," Statistical Lab., Univ. of Cambridge, Cambridge, U.K., Tech. Rep. 95-3, May 1995.
    • (1995) Tech. Rep. 95-3
    • Quas, A.N.1
  • 23
    • 0027553276 scopus 로고
    • Universal redundancy rates do not exist
    • Mar.
    • P. C. Shields, "Universal redundancy rates do not exist," IEEE Trans. Inform. Theory, vol. 39, pp. 520-524, Mar. 1993.
    • (1993) IEEE Trans. Inform. Theory , vol.39 , pp. 520-524
    • Shields, P.C.1
  • 24
    • 0009364458 scopus 로고
    • Entropy and prefixes
    • P. C. Shields, "Entropy and prefixes," Ann. Probab., vol. 20, pp. 403-409, 1992.
    • (1992) Ann. Probab. , vol.20 , pp. 403-409
    • Shields, P.C.1
  • 25
    • 0029277594 scopus 로고
    • Universal redundancy rates for the class of B-processes do not exist
    • Mar.
    • P. C. Shields and B. Weiss, "Universal redundancy rates for the class of B-processes do not exist," IEEE Trans. Inform. Theory, vol. 41, pp. 508-512, Mar. 1995.
    • (1995) IEEE Trans. Inform. Theory , vol.41 , pp. 508-512
    • Shields, P.C.1    Weiss, B.2
  • 26
    • 0027666860 scopus 로고
    • Asymptotic properties of data compression and suffix trees
    • Sept.
    • W. Szpankowski, "Asymptotic properties of data compression and suffix trees," IEEE Trans. Inform. Theory, vol. 39, pp. 1647-1659, Sept. 1993.
    • (1993) IEEE Trans. Inform. Theory , vol.39 , pp. 1647-1659
    • Szpankowski, W.1
  • 27
    • 0030642782 scopus 로고    scopus 로고
    • Models of English text
    • (Snowbird, UT). Los Alamitos, CA: IEEE Computer Soc. Press
    • W. J. Teahan and J. G. Cleary, "Models of English text," in Proc. DCC'97 Data Compression Conf. (Snowbird, UT). Los Alamitos, CA: IEEE Computer Soc. Press, 1997, pp. 12-21.
    • (1997) Proc. DCC'97 Data Compression Conf. , pp. 12-21
    • Teahan, W.J.1    Cleary, J.G.2
  • 28
    • 0029716119 scopus 로고    scopus 로고
    • The entropy of English using PPM-based models
    • (Snowbird, UT). Los Alamitos, CA: IEEE Computer Soc. Press
    • W. J. Teahan and J. G. Cleary, "The entropy of English using PPM-based models," in Proc. DCC'96 Data Compression Conf. (Snowbird, UT). Los Alamitos, CA: IEEE Computer Soc. Press, 1996, pp. 53-62.
    • (1996) Proc. DCC'96 Data Compression Conf. , pp. 53-62
    • Teahan, W.J.1    Cleary, J.G.2
  • 30
    • 0031234273 scopus 로고    scopus 로고
    • The redundancy and distribution of the phrase lengths of the fixed-database Lempel-Ziv algorithm
    • Sept.
    • _, "The redundancy and distribution of the phrase lengths of the fixed-database Lempel-Ziv algorithm," IEEE Trans. Inform. Theory, vol. 43, pp. 1452-1464, Sept. 1997.
    • (1997) IEEE Trans. Inform. Theory , vol.43 , pp. 1452-1464
  • 31
    • 0024768981 scopus 로고
    • Some asymptotic properties of entropy of a stationary ergodic data source with applications to data compression
    • Nov.
    • A. D. Wyner and J. Ziv, "Some asymptotic properties of entropy of a stationary ergodic data source with applications to data compression," IEEE Trans. Inform. Theory, vol. 35, pp. 1250-1258, Nov. 1989.
    • (1989) IEEE Trans. Inform. Theory , vol.35 , pp. 1250-1258
    • Wyner, A.D.1    Ziv, J.2
  • 32
    • 0017493286 scopus 로고
    • A universal algorithm for sequential data compression
    • May
    • J. Ziv and A. Lempel, "A universal algorithm for sequential data compression," IEEE Trans. Inform. Theory, vol. IT-23, pp. 337-343, May 1977.
    • (1977) IEEE Trans. Inform. Theory , vol.IT-23 , pp. 337-343
    • Ziv, J.1    Lempel, A.2
  • 33
    • 0018019231 scopus 로고
    • Compression of individual sequences via variable rate coding
    • Sept.
    • _, "Compression of individual sequences via variable rate coding," IEEE Trans. Inform. Theory, vol. IT-24, pp. 530-536, Sept. 1978.
    • (1978) IEEE Trans. Inform. Theory , vol.IT-24 , pp. 530-536


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.