메뉴 건너뛰기




Volumn 17, Issue 1, 2009, Pages 13-23

An iterative relative entropy minimization-based data selection approach for n-gram model adaptation

Author keywords

Data selection; Language model adaptation; Relative entropy

Indexed keywords

COMMUNITY IS; DATA SELECTION; DIALOG SYSTEMS; LANGUAGE MODEL; LANGUAGE MODEL ADAPTATION; LANGUAGE MODELING; LARGE VOCABULARY; MEDICAL DOMAINS; N-GRAM LANGUAGE MODELS; N-GRAM MODELS; PERFORMANCE IMPROVEMENTS; RELATIVE ENTROPY; RELATIVE-ENTROPY MINIMIZATION; SPECIFIC NATURE; SPEECH RECOGNITION SYSTEMS; STATE OF THE ART; SUBSET SELECTION; TEXT MATERIALS; WORD ERROR RATE;

EID: 70350780149     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2008.2006654     Document Type: Article
Times cited : (23)

References (40)
  • 1
    • 85010217018 scopus 로고    scopus 로고
    • Web-based models for natural language processing
    • M. Lapata and F. Keller, "Web-based models for natural language processing," ACM Trans. Speech Lang. Process., vol.2, pp. 1-31, 2005.
    • (2005) ACM Trans. Speech Lang. Process , vol.2 , pp. 1-31
    • Lapata, M.1    Keller, F.2
  • 2
    • 85146431257 scopus 로고    scopus 로고
    • Using the web to overcome data sparseness
    • F. Keller, M. Lapata, and O. Ourioupina, "Using the web to overcome data sparseness," in Proc. EMNLP, 2002, pp. 230-237.
    • (2002) Proc. EMNLP , pp. 230-237
    • Keller, F.1    Lapata, M.2    Ourioupina, O.3
  • 3
    • 0345376175 scopus 로고    scopus 로고
    • The web as a parallel corpus
    • P. Resnik and N. A. Smith, "The web as a parallel corpus," Comput. Linguist., vol.29, pp. 349-380, 2003.
    • (2003) Comput. Linguist. , vol.29 , pp. 349-380
    • Resnik, P.1    Smith, N.A.2
  • 5
    • 29144436747 scopus 로고    scopus 로고
    • Web-data augmented language model for Mandarin speech recognition
    • T. Ng, M. Ostendorf, M.-Y. Hwang, M. Siu, I. Bulyko, and X. Lei, "Web-data augmented language model for Mandarin speech recognition," in Proc. ICASSP, 2005, pp. 589-592.
    • (2005) Proc. ICASSP , pp. 589-592
    • Ng, T.1    Ostendorf, M.2    Hwang, M.-Y.3    Siu, M.4    Bulyko, I.5    Lei, X.6
  • 6
    • 33646762871 scopus 로고    scopus 로고
    • Rapid language model development using external resources for new spoken dialog domains
    • R. Sarikaya, A. Gravano, and Y. Gao, "Rapid language model development using external resources for new spoken dialog domains," in Proc. ICASSP, 2005, pp. 573-576.
    • (2005) Proc. ICASSP , pp. 573-576
    • Sarikaya, R.1    Gravano, A.2    Gao, Y.3
  • 7
    • 0033886806 scopus 로고    scopus 로고
    • Text classification from labeled and unlabeled documents using EM
    • K. Nigam, A. K. McCallum, S. Thrun, and T. Mitchell, "Text classification from labeled and unlabeled documents using EM," J. Mach. Learn., vol.39, pp. 103-134, 2000.
    • (2000) J. Mach. Learn. , vol.39 , pp. 103-134
    • Nigam, K.1    McCallum, A.K.2    Thrun, S.3    Mitchell, T.4
  • 8
    • 33745456231 scopus 로고    scopus 로고
    • Semi-supervised learning literature survey
    • Dec., [Online]. Available
    • X. Zhu, "Semi-supervised learning literature survey," Univ. of Wisconsin- Madison, Comput. Sci., Tech. Rep. 1530, Dec. 2005 [Online]. Available: http://www.cs.wisc.edu/jerryzhu/pub/ssl-survey.pdf.
    • (2005) Univ. of Wisconsin- Madison, Comput. Sci., Tech. Rep. , vol.1530
    • Zhu, X.1
  • 9
    • 80053375672 scopus 로고    scopus 로고
    • Text data acquisition for domain-specific language models
    • A. Sethy, P. G. Georgiou, and S. Narayanan, "Text data acquisition for domain-specific language models," in Proc. EMNLP, 2006, pp. 382-389.
    • (2006) Proc. EMNLP , pp. 382-389
    • Sethy, A.1    Georgiou, P.G.2    Narayanan, S.3
  • 12
    • 34547534437 scopus 로고    scopus 로고
    • A bootstrapping approach for developing language model of new spoken dialogue systems by selecting web texts
    • T. Misu and T. Kawahara, "A bootstrapping approach for developing language model of new spoken dialogue systems by selecting web texts," in Proc. ICSLP, 2006, pp. 9-12.
    • (2006) Proc. ICSLP , pp. 9-12
    • Misu, T.1    Kawahara, T.2
  • 13
    • 33745184882 scopus 로고    scopus 로고
    • Building topic specific language models from web-data using competitive models
    • A. Sethy, P. Georgiou, and S. Narayanan, "Building topic specific language models from web-data using competitive models," in Proc. Eurospeech, 2005, pp. 1293-1296.
    • (2005) Proc. Eurospeech , pp. 1293-1296
    • Sethy, A.1    Georgiou, P.2    Narayanan, S.3
  • 14
    • 34547527296 scopus 로고    scopus 로고
    • Bootstrapping language models for dialogue systems
    • K. Weilhammer, M. N. Stuttlem, and S. Young, "Bootstrapping language models for dialogue systems," in Proc. ICSLP, 2006, pp. 1482-1485.
    • (2006) Proc. ICSLP , pp. 1482-1485
    • Weilhammer, K.1    Stuttlem, M.N.2    Young, S.3
  • 15
    • 78149306870 scopus 로고    scopus 로고
    • Building text classifiers using positive and unlabeled examples
    • B. Liu, Y. Dai, X. Li, W. S. Lee, and P. Yu, "Building text classifiers using positive and unlabeled examples," in Proc. ICDM, 2003, pp. 179-189.
    • (2003) Proc. ICDM , pp. 179-189
    • Liu, B.1    Dai, Y.2    Li, X.3    Lee, W.S.4    Yu, P.5
  • 17
    • 85149114506 scopus 로고    scopus 로고
    • Measures of distributional similarity
    • L. Lee, "Measures of distributional similarity," in Proc. ACL, 1999, pp. 25-32.
    • (1999) Proc. ACL , pp. 25-32
    • Lee, L.1
  • 18
    • 85024115120 scopus 로고    scopus 로고
    • An empirical study of smoothing techniques for language modeling
    • S. F. Chen and J. Goodman, "An empirical study of smoothing techniques for language modeling," in Proc. ACL, 1996, pp. 310-318.
    • (1996) Proc. ACL , pp. 310-318
    • Chen, S.F.1    Goodman, J.2
  • 23
    • 84959118000 scopus 로고    scopus 로고
    • The Fisher Corpus: A resource for the next generations of speech-to-text
    • C. Cieri, D. Miller, and K. Walker, "The Fisher Corpus: A resource for the next generations of speech-to-text," in Proc. LREC, 2004, pp. 69-71.
    • (2004) Proc. LREC , pp. 69-71
    • Cieri, C.1    Miller, D.2    Walker, K.3
  • 27
    • 78649250427 scopus 로고    scopus 로고
    • Measuring convergence in language model estimation using relative entropy
    • A. Sethy, B. Ramabhadran, and S. Narayanan, "Measuring convergence in language model estimation using relative entropy," in Proc. ICSLP, 2004, pp. 1057-1060.
    • (2004) Proc. ICSLP , pp. 1057-1060
    • Sethy, A.1    Ramabhadran, B.2    Narayanan, S.3
  • 28
    • 44949090835 scopus 로고    scopus 로고
    • Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures
    • I. Bulyko, M. Ostendorf, and A. Stolcke, "Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures," in Proc. HLT, 2003, pp. 7-9.
    • (2003) Proc. HLT , pp. 7-9
    • Bulyko, I.1    Ostendorf, M.2    Stolcke, A.3
  • 29
    • 70350746981 scopus 로고    scopus 로고
    • Language Modeling in the ICSI-SRI Spring 2005 Meeting Speech Recognition Evaluation
    • July
    • O. Cetin and A. Stolcke, "Language Modeling in the ICSI-SRI Spring 2005 Meeting Speech Recognition Evaluation," ICSI, Tech. Rep. TR-05-006, July 2005.
    • (2005) ICSI, Tech. Rep. TR-05-006
    • Cetin, O.1    Stolcke, A.2
  • 30
    • 0030635419 scopus 로고    scopus 로고
    • Analyzing and predicting language model improvements
    • R. Iyer, M. Ostendorf, and M. Meteer, "Analyzing and predicting language model improvements," in Proc. ASRU, 1997, pp. 254-261.
    • (1997) Proc. ASRU , pp. 254-261
    • Iyer, R.1    Ostendorf, M.2    Meteer, M.3
  • 33
    • 85135271674 scopus 로고    scopus 로고
    • Finding consensus among words: Lattice-based word error minimization
    • L. Mangu, E. Brill, and A. Stolcke, "Finding consensus among words: Lattice-based word error minimization," in Proc. Eurospeech, 1999, pp. 495-498.
    • (1999) Proc. Eurospeech , pp. 495-498
    • Mangu, L.1    Brill, E.2    Stolcke, A.3
  • 34
    • 80053267981 scopus 로고    scopus 로고
    • Mining key phrase translations from web corpora
    • F. Huang, Y. Zhang, and S. Vogel, "Mining key phrase translations from web corpora," in Proc. EMNLP, 2005, pp. 483-490.
    • (2005) Proc. EMNLP , pp. 483-490
    • Huang, F.1    Zhang, Y.2    Vogel, S.3
  • 35
    • 0033894473 scopus 로고    scopus 로고
    • Large vocabulary speech recognition with multispan statistical language models
    • Jan.
    • J. Bellegarda, "Large vocabulary speech recognition with multispan statistical language models," IEEE Trans. Speech Audio Process., vol.8, no.1, pp. 76-84, Jan. 2000.
    • (2000) IEEE Trans. Speech Audio Process , vol.8 , Issue.1 , pp. 76-84
    • Bellegarda, J.1
  • 36
    • 80053341989 scopus 로고    scopus 로고
    • Style and topic language model adaptation using HMM-LDA
    • B. J. Hsu and J. Glass, "Style and topic language model adaptation using HMM-LDA," in Proc. EMNLP, 2006, pp. 373-381.
    • (2006) Proc. EMNLP , pp. 373-381
    • Hsu, B.J.1    Glass, J.2
  • 39
    • 0348198473 scopus 로고    scopus 로고
    • Finite-state transducers in language and speech processing
    • M. Mohri, "Finite-state transducers in language and speech processing," Comput. Linguist., vol.23, pp. 269-311, 1997.
    • (1997) Comput. Linguist. , vol.23 , pp. 269-311
    • Mohri, M.1
  • 40
    • 0031321299 scopus 로고    scopus 로고
    • Accurate computation of the relative entropy between stochastic regular grammars
    • R. C. Carrasco, "Accurate computation of the relative entropy between stochastic regular grammars," RAIRO (Theoretical Informatics and Applications), vol.31, pp. 437-444, 1997
    • (1997) RAIRO (Theoretical Informatics and Applications) , vol.31 , pp. 437-444
    • Carrasco, R.C.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.