Volume 6, Issue 3-4, 2003, Pages 333-362

Applying machine learning to text segmentation for information retrieval

Author keywords

Chinese information retrieval; EM algorithm; Machine learning; Word segmentation


EID: 3843085056     PISSN: 13864564     EISSN: None     Source Type: Journal    
DOI: 10.1023/a:1026028229881     Document Type: Article
Times cited : (65)

References (40)
  • 3
    • 0037956008
    • Chinese text segmentation with MBDP-1: Making the most of training corpora
    • France
    • Brent M and Tao X (2001) Chinese text segmentation with MBDP-1: Making the most of training corpora. In: Proceedings of ACL2001, France.
    • (2001) Proceedings of ACL2001
    • Brent, M.1    Tao, X.2
  • 4
    • 0040260430
    • Using query zoning and correlation within SMART: TREC-5
    • Buckley C, Singhal A and Mitra M (1997) Using query zoning and correlation within SMART: TREC-5. In: Proceedings of TREC-5, pp. 105-118.
    • (1997) Proceedings of TREC-5 , pp. 105-118
    • Buckley, C.1    Singhal, A.2    Mitra, M.3
  • 9
    • 0021405335
    • Data compression using adaptive coding and partial string matching
    • Cleary J and Witten I (1984) Data compression using adaptive coding and partial string matching. IEEE Trans on Communications, 32(4):396-402.
    • (1984) IEEE Trans on Communications , vol.32 , Issue.4 , pp. 396-402
    • Cleary, J.1    Witten, I.2
  • 10
    • 0033146296
    • On the discovery of novel word-like units from utterances: An artificial-language study with implications for native-language acquisition
    • Dahan D and Brent M (1999) On the discovery of novel word-like units from utterances: An artificial-language study with implications for native-language acquisition. Journal of Experimental Psychology: General, 128:165-185.
    • (1999) Journal of Experimental Psychology: General , vol.128 , pp. 165-185
    • Dahan, D.1    Brent, M.2
  • 11
    • 0002629270
    • Maximum-likelihood from incomplete data via the EM algorithm
    • Dempster A, Laird N and Rubin D (1977) Maximum-likelihood from incomplete data via the EM algorithm. J. Royal Statist. Soc. Ser. B, 39.
    • (1977) J. Royal Statist. Soc. Ser. B , vol.39
    • Dempster, A.1    Laird, N.2    Rubin, D.3
  • 12
    • 3843143146
    • Chinese word segmentation accuracy and its effects on information retrieval
    • Foo S and Li H (2001) Chinese word segmentation accuracy and its effects on information retrieval. TEXT Technology.
    • (2001) TEXT Technology
    • Foo, S.1    Li, H.2
  • 15
    • 0343458052
    • Term importance, boolean conjunct training, negative terms, and foreign language retrieval: Probabilistic algorithms at TREC-5
    • NIST special publication
    • Gey F, Chen A, He J, Xu L and Meggs J (1996) Term importance, boolean conjunct training, negative terms, and foreign language retrieval: Probabilistic algorithms at TREC-5. In: Proceedings of the Fifth Text REtrieval Conference, pp. 181-190. NIST special publication.
    • (1996) Proceedings of the Fifth Text REtrieval Conference , pp. 181-190
    • Gey, F.1    Chen, A.2    He, J.3    Xu, L.4    Meggs, J.5
  • 16
    • 0039077664
    • Error driven segmentation of Chinese
    • Hockenmaier J and Brew C (1998) Error driven segmentation of Chinese. Communications of COLIPS, 8(1):69-84.
    • (1998) Communications of COLIPS , vol.8 , Issue.1 , pp. 69-84
    • Hockenmaier, J.1    Brew, C.2
  • 18
    • 0013253788
    • Okapi Chinese text retrieval experiments at TREC-6
    • Huang X and Robertson S (1998) Okapi Chinese text retrieval experiments at TREC-6. In: Proceedings of TREC-6, pp. 137-142.
    • (1998) Proceedings of TREC-6 , pp. 137-142
    • Huang, X.1    Robertson, S.2
  • 20
    • 3843129989
    • Chinese segmentation and its disambiguation
    • New Mexico State University, Las Cruces, New Mexico
    • Jin W (1992) Chinese segmentation and its disambiguation. In: MCCS-92-227, Computing Research Laboratory, New Mexico State University, Las Cruces, New Mexico.
    • (1992) MCCS-92-227, Computing Research Laboratory
    • Jin, W.1
  • 23
    • 0001283644
    • Improving English and Chinese ad-hoc retrieval: A TIPSTER Text Phase 3 project report
    • Kwok KL (2000) Improving English and Chinese ad-hoc retrieval: A TIPSTER Text Phase 3 project report. Information Retrieval, 3(4):313-338.
    • (2000) Information Retrieval , vol.3 , Issue.4 , pp. 313-338
    • Kwok, K.L.1
  • 24
    • 0008633134
    • TREC-5 English and Chinese retrieval experiments using PIRCS
    • Kwok KL and Grunfeld L (1996) TREC-5 English and Chinese retrieval experiments using PIRCS. In: Proceedings of TREC-5.
    • (1996) Proceedings of TREC-5
    • Kwok, K.L.1    Grunfeld, L.2
  • 27
    • 0033330466
    • Chinese information retrieval: Using characters or words?
    • Nie JY and Ren F (1999) Chinese information retrieval: Using characters or words? Information Processing and Management, 35:443-462.
    • (1999) Information Processing and Management , vol.35 , pp. 443-462
    • Nie, J.Y.1    Ren, F.2
  • 32
    • 0024610919
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Rabiner L (1989) A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of IEEE, 77(2).
    • (1989) Proceedings of IEEE , vol.77 , Issue.2
    • Rabiner, L.1
  • 34
    • 0018446997
    • Search relevance weighting given little relevance information
    • Sparck-Jones K (1979) Search relevance weighting given little relevance information. Journal of Documentation, 35(1).
    • (1979) Journal of Documentation , vol.35 , Issue.1
    • Sparck-Jones, K.1
  • 37
    • 0001277731
    • A compression-based algorithm for Chinese word segmentation
    • Teahan WJ, Wen Y, McNab R and Witten IH (2001) A compression-based algorithm for Chinese word segmentation. Computational Linguistics, 26(3):375-393.
    • (2001) Computational Linguistics , vol.26 , Issue.3 , pp. 375-393
    • Teahan, W.J.1    Wen, Y.2    McNab, R.3    Witten, I.H.4
  • 39
  • 40
    • 84989592173
    • Chinese text segmentation for text retrieval: Achievements and problems
    • Wu Z and Tseng G (1993) Chinese text segmentation for text retrieval: Achievements and problems. Journal of the American Society for Information Science, 44(9):532-542.
    • (1993) Journal of the American Society for Information Science , vol.44 , Issue.9 , pp. 532-542
    • Wu, Z.1    Tseng, G.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.