메뉴 건너뛰기




Volumn 22, Issue 3, 1996, Pages 377-404

A Stochastic Finite-State Word-Segmentation Algorithm for Chinese

Author keywords

[No Author keywords available]

Indexed keywords

IMAGE SEGMENTATION; SPEECH SYNTHESIS; STOCHASTIC SYSTEMS;

EID: 0001076101     PISSN: 08912017     EISSN: None     Source Type: Journal    
DOI: None     Document Type: Article
Times cited : (209)

References (50)
  • 1
    • 0003906007 scopus 로고
    • Occasional Publications in Academic Computing, 16. Summer Institute of Linguistics, Dallas, TX
    • Antworth, Evan. 1990. PC-KIMMO: A Two-Level Processor for Morphological Analysis. Occasional Publications in Academic Computing, 16. Summer Institute of Linguistics, Dallas, TX.
    • (1990) PC-KIMMO: A Two-Level Processor for Morphological Analysis
    • Antworth, E.1
  • 5
    • 84886765570 scopus 로고
    • Xianzhishi manzu ji jilu zuijiahua de zhongwen duanci fangfa
    • Taipei. ROCLING
    • Chang, Jyun-Shen, C.-D. Chen, and Shun-De Chen. 1991. Xianzhishi manzu ji jilu zuijiahua de zhongwen duanci fangfa [Chinese word segmentation through constraint satisfaction and statistical optimization]. In Proceedings of ROCLING IV, pages 147-165, Taipei. ROCLING.
    • (1991) Proceedings of ROCLING IV , pp. 147-165
    • Chang, J.-S.1    Chen, C.-D.2    Chen, S.-D.3
  • 7
    • 0003775162 scopus 로고
    • University of California Press, Berkeley, CA
    • Chao, Yuen-Ren. 1968. A Grammar of Spoken Chinese. University of California Press, Berkeley, CA.
    • (1968) A Grammar of Spoken Chinese
    • Chao, Y.-R.1
  • 8
    • 0001029084 scopus 로고
    • Word identification for Mandarin Chinese sentences
    • COLING
    • Chen, Keh-Jiann and Shing-Huan Liu. 1992. Word identification for Mandarin Chinese sentences. In Proceedings of COLING-92, pages 101-107. COLING.
    • (1992) Proceedings of COLING-92 , pp. 101-107
    • Chen, K.-J.1    Liu, S.-H.2
  • 9
    • 0025750735 scopus 로고
    • A comparison of the enhanced Good-Turing and deleted estimation methods for estimating probabilities of English bigrams
    • Church, Kenneth and William Gale. 1991. A comparison of the enhanced Good-Turing and deleted estimation methods for estimating probabilities of English bigrams. Computer Speech and Language, 5(1):19-54.
    • (1991) Computer Speech and Language , vol.5 , Issue.1 , pp. 19-54
    • Church, K.1    Gale, W.2
  • 10
    • 85132028005 scopus 로고
    • Word association norms, mutual information and lexicography
    • Morristown, NJ. Association for Computational Linguistics
    • Church, Kenneth and Patrick Hanks. 1989. Word association norms, mutual information and lexicography. In 27th Annual Meeting of the Association for Computational Linguistics, pages 76-83, Morristown, NJ. Association for Computational Linguistics.
    • (1989) 27th Annual Meeting of the Association for Computational Linguistics , pp. 76-83
    • Church, K.1    Hanks, P.2
  • 11
    • 1542517020 scopus 로고
    • University of Hawaii Press, Honolulu
    • DeFrancis, John. 1984. The Chinese Language. University of Hawaii Press, Honolulu.
    • (1984) The Chinese Language
    • DeFrancis, J.1
  • 15
    • 1542517015 scopus 로고
    • Hanyu zidong fenci de jinlin pipei suanfa ji qi zai QHFY hanying jiqi fanyi xitong zhong de shixian
    • Singapore
    • Gu, Ping and Yuhang Mao. 1994. Hanyu zidong fenci de jinlin pipei suanfa ji qi zai QHFY hanying jiqi fanyi xitong zhong de shixian [The adjacent matching algorithm of Chinese automatic word segmentation and its implementation in the QHFY Chinese-English system]. In International Conference on Chinese Computing, Singapore.
    • (1994) International Conference on Chinese Computing
    • Gu, P.1    Mao, Y.2
  • 16
    • 0027684991 scopus 로고
    • Pitch accent in context: Predicting intonational prominence from text
    • Hirschberg, Julia. 1993. Pitch accent in context: Predicting intonational prominence from text. Artificial Intelligence, 63:305-340.
    • (1993) Artificial Intelligence , vol.63 , pp. 305-340
    • Hirschberg, J.1
  • 17
    • 1542412230 scopus 로고
    • A data-driven approach to psychological reality of the mental lexicon: Two studies on Chinese corpus linguistics
    • December
    • Huang, Chu-Ren, Kathleen Ahrens, and Keh-jiann Chen. 1993. A data-driven approach to psychological reality of the mental lexicon: Two studies on Chinese corpus linguistics. Presented at the conference on Language and its Psychobiological Bases, December.
    • (1993) Conference on Language and Its Psychobiological Bases
    • Huang, C.-R.1    Ahrens, K.2    Chen, K.-J.3
  • 18
    • 84878203695 scopus 로고
    • Regular models of phonological rule systems
    • Kaplan, Ronald and Martin Kay. 1994. Regular models of phonological rule systems. Computational Linguistics, 20:331-378.
    • (1994) Computational Linguistics , vol.20 , pp. 331-378
    • Kaplan, R.1    Kay, M.2
  • 19
    • 0041142320 scopus 로고
    • Two-level morphology with composition
    • COLING
    • Karttunen, Lauri, Ronald Kaplan, and Annie Zaenen. 1992. Two-level morphology with composition. In COLING-92, pages 141-148. COLING.
    • (1992) COLING-92 , pp. 141-148
    • Karttunen, L.1    Kaplan, R.2    Zaenen, A.3
  • 21
    • 0026890725 scopus 로고
    • Robust part-of-speech tagging using a hidden Markov model
    • Submitted
    • Kupiec, Julian. 1992. Robust part-of-speech tagging using a hidden Markov model. Computer Speech and Language. Submitted.
    • (1992) Computer Speech and Language
    • Kupiec, J.1
  • 22
    • 1542726848 scopus 로고
    • Yi zhong zhuyao shiyong yuliaoku biaoji jinxing qiyi jiaozheng de zuida pipei hanyu zidong fenci suanfa sheji
    • Taipei. ROCLING
    • Li, B.Y., S. Lin, C.F. Sun, and M.S. Sun. 1991. Yi zhong zhuyao shiyong yuliaoku biaoji jinxing qiyi jiaozheng de zuida pipei hanyu zidong fenci suanfa sheji [A maximum-matching word segmentation algorithm using corpus tags for disambiguation]. In ROCLlNC IV, pages 135-146, Taipei. ROCLING.
    • (1991) ROCLlNC IV , pp. 135-146
    • Li, B.Y.1    Lin, S.2    Sun, C.F.3    Sun, M.S.4
  • 24
    • 0346479270 scopus 로고
    • Shumian hanyu zidong fenci xitong-CDWS
    • Liang, Nanyuan. 1986. Shumian hanyu zidong fenci xitong-CDWS [A written Chinese automatic segmentation system-CDWS]. Journal of Chinese Information Processing, 1(1):44-52.
    • (1986) Journal of Chinese Information Processing , vol.1 , Issue.1 , pp. 44-52
    • Liang, N.1
  • 25
    • 0348037869 scopus 로고
    • A preliminary study on unknown word problem in Chinese word segmentation
    • ROCLING
    • Lin, Ming-Yu, Tung-Hui Chiang, and Keh-Yi Su. 1993. A preliminary study on unknown word problem in Chinese word segmentation. In ROCLlNG 6, pages 119-141. ROCLING.
    • (1993) ROCLlNG 6 , pp. 119-141
    • Lin, M.-Y.1    Chiang, T.-H.2    Su, K.-Y.3
  • 27
    • 85031617968 scopus 로고
    • Minimization algorithms for sequential transducers
    • Submitted
    • Mohri, Mehryar. 1995. Minimization algorithms for sequential transducers. Theoretical Computer Science. Submitted.
    • (1995) Theoretical Computer Science
    • Mohri, M.1
  • 28
    • 0038293491 scopus 로고
    • A stochastic Japanese morphological analyzer using a forward-DP backward A* N-best search algorithm
    • COLING
    • Nagata, Masaaki. 1994. A stochastic Japanese morphological analyzer using a forward-DP backward A* N-best search algorithm. In Proceedings of COLING-94, pages 201-207. COLING.
    • (1994) Proceedings of COLING-94 , pp. 201-207
    • Nagata, M.1
  • 30
    • 1542621554 scopus 로고
    • Zhongwen cihui qiyi zhi yanjiu - Duanci yu cixing biaoshi
    • ROCLING
    • Peng, Z.-Y. and J-S. Chang. 1993. Zhongwen cihui qiyi zhi yanjiu - duanci yu cixing biaoshi [Research on Chinese lexical ambiguity - segmentation and part-of-speech tagging]. In ROCLING 6, pages 173-193. ROCLING.
    • (1993) ROCLING 6 , pp. 173-193
    • Peng, Z.-Y.1    Chang, J.-S.2
  • 31
    • 0242312781 scopus 로고
    • Weighted rational transductions and their application to human language processing
    • Advanced Research Projects Agency, March 8-11
    • Pereira, Fernando, Michael Riley, and Richard Sproat. 1994. Weighted rational transductions and their application to human language processing. In ARPA Workshop on Human Language Technology, pages 249-254. Advanced Research Projects Agency, March 8-11.
    • (1994) ARPA Workshop on Human Language Technology , pp. 249-254
    • Pereira, F.1    Riley, M.2    Sproat, R.3
  • 36
    • 0028405433 scopus 로고
    • English noun-phrase accent prediction for text-to-speech
    • Sproat, Richard. 1994. English noun-phrase accent prediction for text-to-speech. Computer Speech and Language, 8:79-94.
    • (1994) Computer Speech and Language , vol.8 , pp. 79-94
    • Sproat, R.1
  • 37
    • 0347109434 scopus 로고
    • A finite-state architecture for tokenization and grapheme-to-phoneme conversion for multilingual text analysis
    • Susan Armstrong and Evelyne Tzoukermann, editors, Dublin, Ireland. Association for Computational Linguistics
    • Sproat, Richard. 1995. A finite-state architecture for tokenization and grapheme-to-phoneme conversion for multilingual text analysis. In Susan Armstrong and Evelyne Tzoukermann, editors, Proceedings of the EACL SIGDAT Workshop, pages 65-72, Dublin, Ireland. Association for Computational Linguistics.
    • (1995) Proceedings of the EACL SIGDAT Workshop , pp. 65-72
    • Sproat, R.1
  • 39
    • 1542517009 scopus 로고
    • A corpus-based analysis of Mandarin nominal root compounds
    • Sproat, Richard and Chilin Shih. 1995. A corpus-based analysis of Mandarin nominal root compounds. Journal of East Asian Linguistics, 4(1):1-23.
    • (1995) Journal of East Asian Linguistics , vol.4 , Issue.1 , pp. 1-23
    • Sproat, R.1    Shih, C.2
  • 43
    • 5244299848 scopus 로고
    • A finite-state morphological processor for Spanish
    • COLING
    • Tzoukermann, Evelyne and Mark Liberman. 1990. A finite-state morphological processor for Spanish. In COLING-90, Volume 3, pages 3: 277-286. COLING.
    • (1990) COLING-90, Volume 3 , vol.3 , pp. 3
    • Tzoukermann, E.1    Liberman, M.2
  • 44
    • 0008583550 scopus 로고
    • Recognizing unregistered names for Mandarin word identification
    • COLING
    • Wang, Liang-Jyh, Wei-Chuan Li, and Chao-Huang Chang. 1992. Recognizing unregistered names for Mandarin word identification. In Proceedings of COLING-92, pages 1239-1243. COLING.
    • (1992) Proceedings of COLING-92 , pp. 1239-1243
    • Wang, L.-J.1    Li, W.-C.2    Chang, C.-H.3
  • 45
    • 0026850770 scopus 로고
    • Automatic classification of intonational phrase boundaries
    • Wang, Michelle and Julia Hirschberg. 1992. Automatic classification of intonational phrase boundaries. Computer Speech and Language, 6:175-196.
    • (1992) Computer Speech and Language , vol.6 , pp. 175-196
    • Wang, M.1    Hirschberg, J.2
  • 47
    • 0042738812 scopus 로고
    • Dover, New York. Republication of second edition, published 1927 by Catholic Mission Press
    • Wieger, L. 1965. Chinese Characters. Dover, New York. Republication of second edition, published 1927 by Catholic Mission Press.
    • (1965) Chinese Characters
    • Wieger, L.1
  • 48
    • 85100864264 scopus 로고
    • Improving Chinese tokenization with linguistic filters on statistical lexical acquisition
    • Stuttgart, October
    • Wu, Dekai and Pascale Fung. 1994. Improving Chinese tokenization with linguistic filters on statistical lexical acquisition. In Proceedings of the Fourth Conference on Applied Natural Language Processing, pages 180-181, Stuttgart, October.
    • (1994) Proceedings of the Fourth Conference on Applied Natural Language Processing , pp. 180-181
    • Wu, D.1    Fung, P.2
  • 49
    • 84989592173 scopus 로고
    • Chinese text segmentation for text retrieval: Achievements and problems
    • Wu, Zimin and Gwyneth Tseng. 1993. Chinese text segmentation for text retrieval: Achievements and problems. Journal of the American Society for Information Science, 44(9):532-542.
    • (1993) Journal of the American Society for Information Science , vol.44 , Issue.9 , pp. 532-542
    • Wu, Z.1    Tseng, G.2
  • 50
    • 85120437501 scopus 로고
    • Rule-based word identification for Mandarin Chinese sentences - A unification approach
    • Yeh, Ching-long and Hsi-Jian Lee. 1991. Rule-based word identification for Mandarin Chinese sentences - a unification approach. Computer Processing of Chinese and Oriental Languages, 5(2):97-118.
    • (1991) Computer Processing of Chinese and Oriental Languages , vol.5 , Issue.2 , pp. 97-118
    • Yeh, C.-L.1    Lee, H.-J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.