메뉴 건너뛰기




Volumn 26, Issue 3, 2000, Pages 374-393

A compression-based algorithm for Chinese word segmentation

Author keywords

[No Author keywords available]

Indexed keywords

CHINESE LANGUAGE; CHINESE WORD SEGMENTATION; DELIMITERS; FULL-TEXT SEARCH; GENERAL METHOD; KEYPHRASE EXTRACTION; LANGUAGE MODEL; TEXT COMPRESSIONS;

EID: 0001277731     PISSN: 08912017     EISSN: None     Source Type: Journal    
DOI: 10.1162/089120100561746     Document Type: Article
Times cited : (105)

References (27)
  • 1
    • 0021372019 scopus 로고
    • Sequential coding algorithms: A survey and cost analysis
    • Anderson, John B. and Seshadri Mohan. 1984. Sequential coding algorithms: A survey and cost analysis. IEEE Transactions on Communications, 32(2):169-176.
    • (1984) IEEE Transactions on Communications , vol.32 , Issue.2 , pp. 169-176
    • Anderson, J.B.1    Mohan, S.2
  • 3
    • 84867919822 scopus 로고
    • Transformation-based error-driven learning and natural processing: A case study in part-of-speech tagging
    • Brill, Eric. 1995. Transformation-based error-driven learning and natural processing: A case study in part-of-speech tagging. Computational Linguistics, 21(4):543-565.
    • (1995) Computational Linguistics , vol.21 , Issue.4 , pp. 543-565
    • Brill, E.1
  • 5
    • 0039077668 scopus 로고    scopus 로고
    • Technical issues in building an information system for Chinese
    • Center for Intelligent Information Retrieval, University of Massachusetts, Amherst
    • Broglio, John, Jamie P. Callan, and W. Bruce Croft. 1996. Technical issues in building an information system for Chinese. CIIR Technical Report IR-86, Center for Intelligent Information Retrieval, University of Massachusetts, Amherst.
    • (1996) CIIR Technical Report IR-86
    • Broglio, J.1    Callan, J.P.2    Croft, W.B.3
  • 7
    • 0021405335 scopus 로고
    • Data compression using adaptive coding and partial string matching
    • Cleary, John G. and Ian H. Witten. 1984. Data compression using adaptive coding and partial string matching. IEEE Transactions on Communications, 32(4):396-402.
    • (1984) IEEE Transactions on Communications , vol.32 , Issue.4 , pp. 396-402
    • Cleary, J.G.1    Witten, I.H.2
  • 9
    • 85029362060 scopus 로고    scopus 로고
    • A new statistical formula for Chinese text segmentation incorporating contextual information
    • Dai, Yubin, Christopher S. G. Khoo, and Teck Ee Loh. 1999. A new statistical formula for Chinese text segmentation incorporating contextual information. In Proceedings of ACM SIGIR99, pages 82-89.
    • (1999) Proceedings of ACM SIGIR99 , pp. 82-89
    • Dai, Y.1    Khoo, C.S.G.2    Loh, T.E.3
  • 10
    • 0001918328 scopus 로고
    • Stemming algorithms
    • William B. Frakes and Ricardo Baeza-Yates, editors. Prentice Hall, Englewood Cliffs, NJ
    • Frakes, William B. 1992. Stemming algorithms. In William B. Frakes and Ricardo Baeza-Yates, editors, Information Retrieval: Data Structures and Algorithms. Prentice Hall, Englewood Cliffs, NJ, pages 131-160.
    • (1992) Information Retrieval: Data Structures and Algorithms , pp. 131-160
    • Frakes, W.B.1
  • 12
    • 0039077664 scopus 로고    scopus 로고
    • Error driven segmentation of Chinese
    • Hockenmaier, Julia and Chris Brew. 1998. Error driven segmentation of Chinese. Communications of COLIPS, volume 1, number 1, pages 69-84.
    • (1998) Communications of COLIPS , vol.1 , Issue.1 , pp. 69-84
    • Hockenmaier, J.1    Brew, C.2
  • 16
    • 0009107224 scopus 로고
    • New advances in computers and natural language processing in China
    • Liu, Yongquan. 1987. New advances in computers and natural language processing in China. Information Science, 8:64-70.
    • (1987) Information Science , vol.8 , pp. 64-70
    • Liu, Y.1
  • 19
    • 0040261510 scopus 로고    scopus 로고
    • USeg: A retargetable word segmentation procedure for information retrieval
    • UMass Technical Report TR96-2, University of Massachusetts, Amherst, MA
    • Ponte, Jay M. and W. Bruce Croft. 1996. USeg: A retargetable word segmentation procedure for information retrieval. Presented at the Symposium on Document Analysis and Information Retrieval '96 (SDAIR). UMass Technical Report TR96-2, University of Massachusetts, Amherst, MA.
    • (1996) Symposium on Document Analysis and Information Retrieval '96 (SDAIR)
    • Ponte, J.M.1    Croft, W.B.2
  • 20
    • 0024610919 scopus 로고
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Rabiner, Lawrence R. 1989. A tutorial on hidden Markov models and selected applications in speech recognition. In Proceedings of the IEEE, 77(2):257-286.
    • (1989) Proceedings of the IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.R.1
  • 21
    • 0001076101 scopus 로고    scopus 로고
    • A stochastic finite-state word-segmentation algorithm for Chinese
    • Sproat, Richard, Chilin Shih, William Gail, and Nancy Chang. 1996. A stochastic finite-state word-segmentation algorithm for Chinese. Computational Linguistics, 22(3):377-404.
    • (1996) Computational Linguistics , vol.22 , Issue.3 , pp. 377-404
    • Sproat, R.1    Shih, C.2    Gail, W.3    Chang, N.4
  • 22
    • 0004923403 scopus 로고    scopus 로고
    • Ph.D thesis, University of Waikato, NZ
    • Teahan, W. J. 1998. Modelling English Text. Ph.D thesis, University of Waikato, NZ.
    • (1998) Modelling English Text
    • Teahan, W.J.1
  • 23
    • 0031676150 scopus 로고    scopus 로고
    • Correcting english text using PPM models
    • J. A. Storer and J. H. Reif, editors, Los Alamitos, CA. IEEE Computer Society Press
    • Teahan, W. J., S. Inglis, John G. Cleary, and G. Holmes. 1998. Correcting English text using PPM models. In J. A. Storer and J. H. Reif, editors, Proceeding Data Compression Conference, pages 289-298, Los Alamitos, CA. IEEE Computer Society Press.
    • (1998) Proceeding Data Compression Conference , pp. 289-298
    • Teahan, W.J.1    Inglis, S.2    Cleary, J.G.3    Holmes, G.4
  • 25
    • 0039669800 scopus 로고    scopus 로고
    • A position statement on chinese segmentation
    • University of Pennsylvania, PA
    • Wu, Dekai. 1998. A position statement on Chinese segmentation. Presented at the Chinese Language Processing Workshop, University of Pennsylvania, PA.
    • (1998) Chinese Language Processing Workshop
    • Wu, D.1
  • 26
    • 85100864264 scopus 로고
    • Improving Chinese tokenization with linguistic filters on statistical lexical acquisition
    • Stuttgart, October 94
    • Wu, Dekai and Pascale Fung. 1994. Improving Chinese tokenization with linguistic filters on statistical lexical acquisition. In ANLP-94, Fourth Conference on Applied Natural Language Processing, pages 180-181, Stuttgart, October 94.
    • (1994) ANLP-94, Fourth Conference on Applied Natural Language Processing , pp. 180-181
    • Wu, D.1    Fung, P.2
  • 27
    • 84989592173 scopus 로고
    • Chinese text segmentation for text retrieval achievements and problems
    • Wu, Zimin and Gwyneth Tseng. 1993. Chinese text segmentation for text retrieval achievements and problems. JASIS, 44(9)532-542.
    • (1993) JASIS , vol.44 , Issue.9 , pp. 532-542
    • Wu, Z.1    Tseng, G.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.