메뉴 건너뛰기




Volumn 2, Issue , 2007, Pages 846-851

Improving Chinese word segmentation with description length gain

Author keywords

Chinese word segmentation; Conditional random fields; Description length gain

Indexed keywords

CHINESE WORD SEGMENTATION; CONDITIONAL RANDOM FIELD; DESCRIPTION LENGTH GAIN; LEXICAL INFORMATION; SEGMENTATION PERFORMANCE; STATE-OF-THE-ART PERFORMANCE;

EID: 77958111978     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (2)

References (31)
  • 1
    • 0032677683 scopus 로고    scopus 로고
    • An efficient, probabilistically sound algorithm for segmentation and word discovery
    • February
    • M. R. Brent. An efficient, probabilistically sound algorithm for segmentation and word discovery. Machine Learning, 34:71-105, February 1999.
    • (1999) Machine Learning , vol.34 , pp. 71-105
    • Brent, M.R.1
  • 2
    • 85119974890 scopus 로고    scopus 로고
    • Character language models for Chinese word segmentation and named entity recognition
    • Sydney, Australia, July Association for Computational Linguistics
    • B. Carpenter. Character language models for Chinese word segmentation and named entity recognition. In Proceedings of the Fifth SIGH AN Workshop on Chinese Language Processing, pages 169-172. Sydney, Australia, July 2006. Association for Computational Linguistics.
    • (2006) Proceedings of the Fifth SIGH an Workshop on Chinese Language Processing , pp. 169-172
    • Carpenter, B.1
  • 3
    • 0030717280 scopus 로고    scopus 로고
    • PAT-tree-based keyword extraction for Chinese information retrieval
    • L.-F. Chien. PAT-trec-based keyword extraction for Chinese information retrieval. In Proceedings of the 20th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 50-58, Philadelphia, 1997. (Pubitemid 127720304)
    • (1997) SIGIR Forum (ACM Special Interest Group on Information Retrieval) , vol.31 , Issue.1 SPEC. ISS. , pp. 50-58
    • Chien, L.-F.1
  • 6
    • 78650869754 scopus 로고    scopus 로고
    • Contextual dependencies in unsupervised word segmentation
    • Sidney, Australia
    • S. Goldwater, T. L. Griffiths, and M. Johnson. Contextual dependencies in unsupervised word segmentation. In COLING-ACL 2006, pages 673-670, Sidney, Australia, 2006.
    • (2006) COLING-ACL 2006 , pp. 673-670
    • Goldwater, S.1    Griffiths, T.L.2    Johnson, M.3
  • 9
    • 85090807987 scopus 로고    scopus 로고
    • Unsupervised learning of word boundary with description length gain
    • M. Osborne and E. T. K. Sang, editors Bergen, Norway
    • C. Kit and Y. Wilks. Unsupervised learning of word boundary with description length gain. In M. Osborne and E. T. K. Sang, editors, CoNLL-99, pages 1-6, Bergen, Norway, 1999.
    • (1999) CoNLL-99 , pp. 1-6
    • Kit, C.1    Wilks, Y.2
  • 10
    • 0142192295 scopus 로고    scopus 로고
    • Conditional random fields: Probabilistic models for segmenting and labeling sequence data
    • San Francisco, CA, USA Morgan Kaufmann Publishers Inc
    • J. D. Lafferty, A. McCallum, and F. C. N. Pereira. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In IGML'Ol: Proceedings of the 18th International Conference on Machine Learning, pages 282-289, San Francisco, CA, USA, 2001. Morgan Kaufmann Publishers Inc.
    • (2001) IGML'Ol: Proceedings of the 18th International Conference on Machine Learning , pp. 282-289
    • Lafferty, J.D.1    McCallum, A.2    Pereira, F.C.N.3
  • 11
    • 85119995698 scopus 로고    scopus 로고
    • The third international Chinese language processing bakeoff: Word segmentation and named entity recognition
    • Sidney, Australia
    • G.-A. Levow. The third international Chinese language processing bakeoff: Word segmentation and named entity recognition. In Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing, pages 108-117, Sidney, Australia, 2006.
    • (2006) Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing , pp. 108-117
    • Levow, G.-A.1
  • 13
    • 85116342676 scopus 로고    scopus 로고
    • Chinese segmentation and new word detection using conditional random fields
    • Geneva, Switzerland
    • F. Peng, F. Feng, and A. McCallum. Chinese segmentation and new word detection using conditional random fields. In COLING 2004, pages 562-568, Geneva, Switzerland, 2004.
    • (2004) COLING 2004 , pp. 562-568
    • Peng, F.1    Feng, F.2    McCallum, A.3
  • 14
    • 84958533967 scopus 로고    scopus 로고
    • Self-Supervised Chinese Word Segmentation
    • Advances in Intelligent Data Analysis
    • F. Peng and D. Schuurmans. Self-supervised Chinese word segmentation. In The Forth International Symposium on Intelligent Data Analysis, pages 238-247, Lisbon, Portugal, September 2001. (Pubitemid 33348503)
    • (2001) Lecture Notes in Computer Science , Issue.2189 , pp. 238-247
    • Peng, F.1    Schuurmans, D.2
  • 15
    • 0018015137 scopus 로고
    • Modelling by shortest data description
    • J. Rissanen. Modelling by shortest data description. Automatica, 14:465-471, 1978.
    • (1978) Automatica , vol.14 , pp. 465-471
    • Rissanen, J.1
  • 18
    • 84856043672 scopus 로고
    • A mathematical theory of communication
    • 623-656 July, October
    • C. E. Shannon. A mathematical theory of communication. The Bell System Technical Journal, 27:379-423, 623-656, July, October 1948.
    • (1948) The Bell System Technical Journal , vol.27 , pp. 379-423
    • Shannon, C.E.1
  • 21
    • 84872841506 scopus 로고    scopus 로고
    • Chinese word segmentation without using lexicon and hand-crafted training data
    • Montreal, Quebec, Canada
    • M. Sun, D. Shen, and B. K. Tsou. Chinese word segmentation without using lexicon and hand-crafted training data. In COLING-ACL'98, volume 2, pages 1265-1271, Montreal, Quebec, Canada, 1998.
    • (1998) COLING-ACL'98 , vol.2 , pp. 1265-1271
    • Sun, M.1    Shen, D.2    Tsou, B.K.3
  • 22
    • 85120006548 scopus 로고    scopus 로고
    • On closed task of Chinese word segmentation: An improved CRF model coupled with character clustering and automatically generated template matching
    • Sidney, Australia
    • R. T.-H. Tsai, H.-C. Hung, C.-L. Sung, H.-J. Dai, and W.-L. Hsu. On closed task of Chinese word segmentation: An improved CRF model coupled with character clustering and automatically generated template matching. In Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing, pages 108-117, Sidney, Australia, 2006.
    • (2006) Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing , pp. 108-117
    • Tsai, R.T.-H.1    Hung, H.-C.2    Sung, C.-L.3    Dai, H.-J.4    Hsu, W.-L.5
  • 26
    • 0038632285 scopus 로고    scopus 로고
    • Using suffix arrays to compute term frequency and document frequency for all substrings in a corpus
    • M. Yamamoto and K. W. Church. Using suffix arrays to compute term frequency and document frequency for all substrings in a corpus. Computational Linguistics, 27(1):1-30, 2001.
    • (2001) Computational Linguistics , vol.27 , Issue.1 , pp. 1-30
    • Yamamoto, M.1    Church, K.W.2
  • 27
    • 2142726570 scopus 로고    scopus 로고
    • Extraction of Chinese compound words - An experimental study on a very large corpus
    • Hong Kong, China
    • J. Zhang, J. Gao, and M. Zhou. Extraction of Chinese compound words - an experimental study on a very large corpus. In Proceedings of the Second Chinese Language Processing Workshop, pages 132-139, Hong Kong, China, 2000.
    • (2000) Proceedings of the Second Chinese Language Processing Workshop , pp. 132-139
    • Zhang, J.1    Gao, J.2    Zhou, M.3
  • 28
    • 77953769061 scopus 로고    scopus 로고
    • Chinese word segmentation and named entity recognition based on a context-dependent mutual information independence model
    • Sidney, Australia
    • M. Zhang, G.-D. Zhou, L.-P. Yang, and D.-H. Ji. Chinese word segmentation and named entity recognition based on a context-dependent mutual information independence model. In Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing, pages 154-157, Sidney, Australia, 2006.
    • (2006) Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing , pp. 154-157
    • Zhang, M.1    Zhou, G.-D.2    Yang, L.-P.3    Ji, D.-H.4
  • 30
    • 38049001432 scopus 로고    scopus 로고
    • Effective tag set selection in Chinese word segmentation via conditional random field modeling
    • Wuhan, China
    • H. Zhao, C.-N. Huang, M. Li, and B.-L. Lu. Effective tag set selection in Chinese word segmentation via conditional random field modeling. In Proceedings of PACLIC-20, pages 87-94, Wuhan, China, 2006.
    • (2006) Proceedings of PACLIC-20 , pp. 87-94
    • Zhao, H.1    Huang, C.-N.2    Li, M.3    Lu, B.-L.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.