메뉴 건너뛰기




Volumn 34, Issue 2, 2004, Pages 834-844

Mining Pinyin-to-Character Conversion Rules From Large-Scale Corpus: A Rough Set Approach

Author keywords

Data generalization; Data mining; Pinyin to character conversion; Rough set

Indexed keywords

DATA MINING; INFORMATION RETRIEVAL; INTERPOLATION; MATHEMATICAL MODELS; NATURAL LANGUAGE PROCESSING SYSTEMS; ROUGH SET THEORY;

EID: 1842587892     PISSN: 10834419     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSMCB.2003.817101     Document Type: Article
Times cited : (16)

References (25)
  • 1
    • 85095025073 scopus 로고    scopus 로고
    • A new statistical approach to Chinese Pinyin input
    • Hong Kong
    • Z. Chen and K.-F. Lee, "A new statistical approach to Chinese Pinyin input," in ACL-2000, Hong Kong, 2000, pp. 241-247.
    • (2000) ACL-2000 , pp. 241-247
    • Chen, Z.1    Lee, K.-F.2
  • 2
    • 85016472884 scopus 로고    scopus 로고
    • Toward a unified approach to statistical language modeling for Chinese
    • G. Jianfeng, J. Goodman, M. Li, and K.-F. Lee, "Toward a unified approach to statistical language modeling for Chinese," ACM Trans. Asian Lang. Infonn. Process., vol. 1, no. 1, pp. 3-33, 2002.
    • (2002) ACM Trans. Asian Lang. Infonn. Process. , vol.1 , Issue.1 , pp. 3-33
    • Jianfeng, G.1    Goodman, J.2    Li, M.3    Lee, K.-F.4
  • 4
    • 0033873049 scopus 로고    scopus 로고
    • Variable n-grams and extensions for conversational speech language modeling
    • Jan.
    • M. Siu and M. Ostendorf, "Variable n-grams and extensions for conversational speech language modeling, "IEEE Trans. Speech Audio Processing, vol. 8, pp. 63-75, Jan. 2000.
    • (2000) IEEE Trans. Speech Audio Processing , vol.8 , pp. 63-75
    • Siu, M.1    Ostendorf, M.2
  • 5
    • 0025446887 scopus 로고    scopus 로고
    • A cache based natural language model for speech recognition
    • R. Kuhn and R. de Mori, "A cache based natural language model for speech recognition," IEEE Trans. Pattern Anal. Machine Intell., vol. 14, pp. 570-583, 1999.
    • (1999) IEEE Trans. Pattern Anal. Machine Intell. , vol.14 , pp. 570-583
    • Kuhn, R.1    De Mori, R.2
  • 6
    • 0035497388 scopus 로고    scopus 로고
    • A bit of progress in language modeling
    • J. Goodman, "A bit of progress in language modeling," Comput. Speech Lang., vol. 15, pp. 403-434, 2001.
    • (2001) Comput. Speech Lang. , vol.15 , pp. 403-434
    • Goodman, J.1
  • 7
    • 0027929445 scopus 로고
    • On structuring probabilistic dependences in stochastic language modeling
    • H. Ney, U. Essen, and R. Kneser, "On structuring probabilistic dependences in stochastic language modeling," Comput. Speech Lang., vol. 8, pp. 1-38, 1994.
    • (1994) Comput. Speech Lang. , vol.8 , pp. 1-38
    • Ney, H.1    Essen, U.2    Kneser, R.3
  • 8
    • 0030181951 scopus 로고    scopus 로고
    • A maximum entropy approach to adaptive statistical language modeling
    • R. Rosenfeld, "A maximum entropy approach to adaptive statistical language modeling," Comput. Speech Lang., vol. 10, pp. 187-228, 1996.
    • (1996) Comput. Speech Lang. , vol.10 , pp. 187-228
    • Rosenfeld, R.1
  • 9
    • 0041079041 scopus 로고    scopus 로고
    • Probabilistic top-down parsing and language modeling
    • B. Roark, "Probabilistic top-down parsing and language modeling," Computat. Linguist., vol. 27, no. 2, pp. 249-276, 2001.
    • (2001) Computat. Linguist. , vol.27 , Issue.2 , pp. 249-276
    • Roark, B.1
  • 10
    • 0034295822 scopus 로고    scopus 로고
    • Structured language modeling
    • C. Chelba and F. Jelinek, "Structured language modeling," Comput. Speech Lang., vol. 14, no. 4, pp. 283-332, 2001.
    • (2001) Comput. Speech Lang. , vol.14 , Issue.4 , pp. 283-332
    • Chelba, C.1    Jelinek, F.2
  • 12
    • 0032650074 scopus 로고    scopus 로고
    • Variable-length category n-gram language models
    • T. R. Niesler and P. C. Woodland, "Variable-length category n-gram language models," Comput. Speech Lang., vol. 13, pp. 99-124, 1999.
    • (1999) Comput. Speech Lang. , vol.13 , pp. 99-124
    • Niesler, T.R.1    Woodland, P.C.2
  • 13
    • 84867919822 scopus 로고    scopus 로고
    • Transformation-based error-driven learning and natural language processing: A case study in part-of-speech tagging
    • E. Brill, "Transformation-based error-driven learning and natural language processing: a case study in part-of-speech tagging," Computat. Linguist., vol. 21, no. 4, pp. 543-565, 2001.
    • (2001) Computat. Linguist. , vol.21 , Issue.4 , pp. 543-565
    • Brill, E.1
  • 15
    • 0032188308 scopus 로고    scopus 로고
    • Rough set theory and its applications to data analysis
    • _, "Rough set theory and its applications to data analysis," Cybern. Syst., vol. 29, pp. 661-668, 1998.
    • (1998) Cybern. Syst. , vol.29 , pp. 661-668
  • 16
    • 0032205549 scopus 로고    scopus 로고
    • Uncertainty measures of rough set prediction
    • I. Duntsch and G. Gediga, "Uncertainty measures of rough set prediction," Artif. Intell., vol. 106, pp. 109-137, 1998.
    • (1998) Artif. Intell. , vol.106 , pp. 109-137
    • Duntsch, I.1    Gediga, G.2
  • 17
    • 27144441097 scopus 로고    scopus 로고
    • An evaluation of statistical approaches to text categorization
    • Y. Yang, "An evaluation of statistical approaches to text categorization," Inform. Retrieval, vol. 1-2, no. 1, pp. 69-90, 1999.
    • (1999) Inform. Retrieval , vol.1-2 , Issue.1 , pp. 69-90
    • Yang, Y.1
  • 18
    • 0037649373 scopus 로고    scopus 로고
    • A comparative study on chinese text categorization methods
    • Melbourne, Australia
    • J. He, A. H. Tan, and C. L. Tan, "A comparative study on chinese text categorization methods," in PRICA 2000 Workshop on Text and Web Mining, Melbourne, Australia, 2000, pp. 24-35.
    • (2000) PRICA 2000 Workshop on Text and Web Mining , pp. 24-35
    • He, J.1    Tan, A.H.2    Tan, C.L.3
  • 19
    • 0032180332 scopus 로고    scopus 로고
    • Rough computational methods for information systems
    • J. W. Guan, D. Bell, and A. Bell, "Rough computational methods for information systems," Artif. Intell., vol. 105, pp. 77-103, 1998.
    • (1998) Artif. Intell. , vol.105 , pp. 77-103
    • Guan, J.W.1    Bell, D.2    Bell, A.3
  • 21
    • 0029780174 scopus 로고    scopus 로고
    • Mining knowledge rules from databases: A rough set approach
    • New Orleans, LA
    • X. Hu and N. Cercone, "Mining knowledge rules from databases: a rough set approach," in Proce. 12th Int. Conf. Data Engineering (ICDE'96), New Orleans, LA, 1996, pp. 96-105.
    • (1996) Proce. 12th Int. Conf. Data Engineering (ICDE'96) , pp. 96-105
    • Hu, X.1    Cercone, N.2
  • 22
    • 0033329799 scopus 로고    scopus 로고
    • An empirical study of smoothing techniques for language modeling
    • S. F. Chen and J. T. Goodman, "An empirical study of smoothing techniques for language modeling," Comput. Speech Lang., vol. 13, pp. 359-394, 1999.
    • (1999) Comput. Speech Lang. , vol.13 , pp. 359-394
    • Chen, S.F.1    Goodman, J.T.2
  • 24
    • 0030235637 scopus 로고    scopus 로고
    • Error reduction through learning multiple descriptions
    • K. M. Ali and M. J. Pazzani, "Error reduction through learning multiple descriptions," Mach. Learn., vol. 23, no. 3, pp. 173-202, 1996.
    • (1996) Mach. Learn. , vol.23 , Issue.3 , pp. 173-202
    • Ali, K.M.1    Pazzani, M.J.2
  • 25
    • 0033106616 scopus 로고    scopus 로고
    • Interpolation of n-grams and mutual-information based trigger pair language models for mandarin speech recognition
    • Z. GuoDong and L. KimTeng, "Interpolation of n-grams and mutual-information based trigger pair language models for mandarin speech recognition," Comput. Speech Lang., vol. 13, pp. 125-141, 1998.
    • (1998) Comput. Speech Lang. , vol.13 , pp. 125-141
    • Guodong, Z.1    Kimteng, L.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.