Volume , Issue , 2010, Pages 9-30

Text preprocessing

Author keywords

[No Author keywords available]

Indexed keywords

NATURAL LANGUAGE PROCESSING SYSTEMS

EID: 84893263751     PISSN: None     EISSN: None     Source Type: Book    
DOI: None     Document Type: Chapter
Times cited : (26)

References (45)
  • 3
    • 0037600401
    • Mostly-unsupervised statistical segmentation of Japanese Kanji sequences
    • Ando, R. K. and L. Lee (2003). Mostly-unsupervised statistical segmentation of Japanese Kanji sequences. Journal of Natural Language Engineering 9, 127-149.
    • (2003) Journal of Natural Language Engineering , vol.9 , pp. 127-149
    • Ando, R.K.1    Lee, L.2
  • 4
    • 33750140618
    • Collocation and Thai word segmentation
    • Bangkok, Thailand
    • Aroonmanakun, W. (2002). Collocation and Thai word segmentation. In Proceedings of SNLPCOCOSDA2002, Bangkok, Thailand.
    • (2002) Proceedings of SNLPCOCOSDA2002
    • Aroonmanakun, W.1
  • 6
    • 85054450868
    • Theoretical and computational linguistics: Toward a mutual understanding
    • J. Lawler and H. A. Dry (Eds.), London, U.K.: Routledge
    • Bayer, S., J. Aberdeen, J. Burger, L. Hirschman, D. Palmer, and M. Vilain (1998). Theoretical and computational linguistics: Toward a mutual understanding. In J. Lawler and H. A. Dry (Eds.), Using Computers in Linguistics. London, U.K.: Routledge.
    • (1998) Using Computers in Linguistics.
    • Bayer, S.1    Aberdeen, J.2    Burger, J.3    Hirschman, L.4    Palmer, D.5    Vilain, M.6
  • 13
    • 0004011637
    • Statistical augmentation of a Chinese machine-readable dictionary
    • (WVLC-94), Kyoto, Japan
    • Fung, P. and D. Wu (1994). Statistical augmentation of a Chinese machine-readable dictionary. In Proceedings of Second Workshop on Very Large Corpora (WVLC-94), Kyoto, Japan.
    • (1994) Proceedings of Second Workshop on Very Large Corpora
    • Fung, P.1    Wu, D.2
  • 14
    • 33646401779
    • Chinese word segmentation and named entity recognition: A pragmatic approach
    • Gao, J., M. Li, A. Wu, and C.-N. Huang (2005). Chinese word segmentation and named entity recognition: A pragmatic approach. Computational Linguistics 31(4), 531-574.
    • (2005) Computational Linguistics , vol.31 , Issue.4 , pp. 531-574
    • Gao, J.1    Li, M.2    Wu, A.3    Huang, C.-N.4
  • 17
    • 0039077664
    • Error driven segmentation of Chinese
    • Hockenmaier, J. and C. Brew (1998). Error driven segmentation of Chinese. Communications of COLIPS 8(1), 69-84.
    • (1998) Communications of COLIPS , vol.8 , Issue.1 , pp. 69-84
    • Hockenmaier, J.1    Brew, C.2
  • 19
    • 33845487544
    • Unsupervised multilingual sentence boundary detection
    • Kiss, T. and J. Strunk (2006). Unsupervised multilingual sentence boundary detection. Computational Linguistics 32(4), 485-525.
    • (2006) Computational Linguistics , vol.32 , Issue.4 , pp. 485-525
    • Kiss, T.1    Strunk, J.2
  • 20
    • 0011078395
    • Text analysis and word pronunciation in text-to-speech synthesis
    • S. Furui and M. M. Sondhi (Eds.), New York: Marcel Dekker, Inc
    • Liberman, M. Y. and K. W. Church (1992). Text analysis and word pronunciation in text-to-speech synthesis. In S. Furui and M. M. Sondhi (Eds.), Advances in Speech Signal Processing, pp. 791-831. New York: Marcel Dekker, Inc.
    • (1992) Advances in Speech Signal Processing , pp. 791-831
    • Liberman, M.Y.1    Church, K.W.2
  • 21
    • 79952149264
    • Bilingually motivated domain-adapted word segmentation for statistical machine translation
    • (EACL 2009), Athens, Greece
    • Ma, Y. and A. Way (2009). Bilingually motivated domain-adapted word segmentation for statistical machine translation. In Proceedings of the 12th Conference of the European Chapter of the ACL (EACL 2009), Athens, Greece, pp. 549-557.
    • (2009) Proceedings of the 12th Conference of the European Chapter of the ACL , pp. 549-557
    • Ma, Y.1    Way, A.2
  • 26
    • 0039484386
    • Periods, capitalized words, etc
    • Mikheev, A. (2002). Periods, capitalized words, etc. Computational Linguistics 28(3), 289-318.
    • (2002) Computational Linguistics , vol.28 , Issue.3 , pp. 289-318
    • Mikheev, A.1
  • 27
    • 85054422737
    • Worterkennungsverfahren als Grundlage einer Universalmethode zur automatischen Segmentierung von Texten in Sätze. Ein Verfahren zur maschinellen Satzgrenzenbestimmung im Englischen
    • Müller, H., V. Amerl, and G. Natalis (1980). Worterkennungsverfahren als Grundlage einer Universalmethode zur automatischen Segmentierung von Texten in Sätze. Ein Verfahren zur maschinellen Satzgrenzenbestimmung im Englischen. Sprache und Datenverarbeitung 1.
    • (1980) Sprache und Datenverarbeitung , vol.1
    • Müller, H.1    Amerl, V.2    Natalis, G.3
  • 28
    • 0038293491
    • A stochastic Japanese morphological analyzer using a Forward-DP backward A* n-best search algorithm
    • Kyoto, Japan
    • Nagata, M. (1994). A stochastic Japanese morphological analyzer using a Forward-DP backward A* n-best search algorithm. In Proceedings of COLING94, Kyoto, Japan.
    • (1994) Proceedings of COLING94
    • Nagata, M.1
  • 30
    • 0004227735
    • C.S.L.I. Lecture Notes, Number 18. Stanford, CA: Center for the Study of Language and Information
    • Nunberg, G. (1990). The Linguistics of Punctuation. C.S.L.I. Lecture Notes, Number 18. Stanford, CA: Center for the Study of Language and Information.
    • (1990) The Linguistics of Punctuation.
    • Nunberg, G.1
  • 32
    • 0347138625
    • Adaptive multilingual sentence boundary disambiguation
    • Palmer, D. D. and M. A. Hearst (1997). Adaptive multilingual sentence boundary disambiguation. Computational Linguistics 23(2), 241-267.
    • (1997) Computational Linguistics , vol.23 , Issue.2 , pp. 241-267
    • Palmer, D.D.1    Hearst, M.A.2
  • 35
    • 85118952085
    • Some applications of tree-based modelling to speech and language indexing
    • San Mateo, CA: Morgan Kaufmann
    • Riley, M. D. (1989). Some applications of tree-based modelling to speech and language indexing. In Proceedings of the DARPA Speech and Natural Language Workshop, San Mateo, CA, pp. 339-352. Morgan Kaufmann.
    • (1989) Proceedings of the DARPA Speech and Natural Language Workshop , pp. 339-352
    • Riley, M.D.1
  • 39
    • 0001076101
    • A stochastic finite-state word-segmentation algorithm for Chinese
    • Sproat, R. W., C. Shih, W. Gale, and N. Chang (1996). A stochastic finite-state word-segmentation algorithm for Chinese. Computational Linguistics 22(3), 377-404.
    • (1996) Computational Linguistics , vol.22 , Issue.3 , pp. 377-404
    • Sproat, R.W.1    Shih, C.2    Gale, W.3    Chang, N.4
  • 40
    • 0001277731
    • A compression-based algorithm for Chinese word segmentation
    • Teahan, W. J., Y. Wen, R. McNab, and I. H. Witten (2000). A compression-based algorithm for Chinese word segmentation. Computational Linguistics 26(3), 375-393.
    • (2000) Computational Linguistics , vol.26 , Issue.3 , pp. 375-393
    • Teahan, W.J.1    Wen, Y.2    McNab, R.3    Witten, I.H.4
  • 44
    • 84989592173
    • Chinese text segmentation for text retrieval: Achievements and problems
    • Wu, Z. and G. Tseng (1993). Chinese text segmentation for text retrieval: Achievements and problems. Journal of the American Society for Information Science 44(9), 532-542.
    • (1993) Journal of the American Society for Information Science , vol.44 , Issue.9 , pp. 532-542
    • Wu, Z.1    Tseng, G.2


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS DB.