메뉴 건너뛰기




Volumn 12, Issue 4, 2004, Pages 391-400

Language model and speaking rate adaptation for spontaneous presentation speech recognition

Author keywords

Acoustic modeling; Language model adaptation; Pronunciation modeling; Speaking rate; Spontaneous speech recognition

Indexed keywords

ACOUSTICS; CONTEXT FREE GRAMMARS; DATABASE SYSTEMS; DECODING; FORMAL LANGUAGES; HUMAN COMPUTER INTERACTION; MATHEMATICAL MODELS; SPEECH SYNTHESIS;

EID: 3042704466     PISSN: 10636676     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSA.2004.828641     Document Type: Conference Paper
Times cited : (42)

References (37)
  • 9
    • 0034848039 scopus 로고    scopus 로고
    • Duration normalization for improved recognition of spontaneous and read speech via missing feature methods
    • J. Nedel and R. Stern, "Duration normalization for improved recognition of spontaneous and read speech via missing feature methods," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, 2001, pp. 313-316.
    • (2001) Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP) , vol.1 , pp. 313-316
    • Nedel, J.1    Stern, R.2
  • 11
    • 85027454087 scopus 로고    scopus 로고
    • Speaking mode dependent pronunciation modeling in large vocabulary conversational speech recognition
    • M. Finke and A. Waibel, "Speaking mode dependent pronunciation modeling in large vocabulary conversational speech recognition," in Proc. European Conf. Speech Communication and Technology (EUROSPEECH), 1997, pp. 2379-2382.
    • (1997) Proc. European Conf. Speech Communication and Technology (EUROSPEECH) , pp. 2379-2382
    • Finke, M.1    Waibel, A.2
  • 19
    • 0033318198 scopus 로고    scopus 로고
    • Improving the performance of a Dutch CSR by modeling within-word and cross-word pronunciation variation
    • J. M. Kessens, M. Wester, and H. Strik, "Improving the performance of a Dutch CSR by modeling within-word and cross-word pronunciation variation," Speech Commun., vol. 29, no. 2-4, pp. 193-207, 1999.
    • (1999) Speech Commun. , vol.29 , Issue.2-4 , pp. 193-207
    • Kessens, J.M.1    Wester, M.2    Strik, H.3
  • 20
    • 0030376346 scopus 로고    scopus 로고
    • Improved spontaneous dialogue recognition using dialogue and utterance triggers by adaptive probability boosting
    • Philadelphia, PA
    • R. R. Sarukkai and D. H. Ballard, "Improved spontaneous dialogue recognition using dialogue and utterance triggers by adaptive probability boosting," in Proc. Int. Conf. Spoken Language Processing (ICSLP), vol. 1, Philadelphia, PA, 1996, pp. 208-211.
    • (1996) Proc. Int. Conf. Spoken Language Processing (ICSLP) , vol.1 , pp. 208-211
    • Sarukkai, R.R.1    Ballard, D.H.2
  • 21
    • 0030369272 scopus 로고    scopus 로고
    • Modeling long distance dependence in language: Topic mixtures vs. dynamic cache models
    • Philadelphia, PA
    • R. Iyer and M. Ostendorf, "Modeling long distance dependence in language: Topic mixtures vs. dynamic cache models," in Proc. Int. Conf. Spoken Language Processing (ICSLP), vol. 1, Philadelphia, PA, 1996, pp. 236-239.
    • (1996) Proc. Int. Conf. Spoken Language Processing (ICSLP) , vol.1 , pp. 236-239
    • Iyer, R.1    Ostendorf, M.2
  • 23
    • 84962808249 scopus 로고    scopus 로고
    • Automatic transcription of lecture speech using topic-independent language modeling
    • K. Kato, H. Nanjo, and T. Kawahara, "Automatic transcription of lecture speech using topic-independent language modeling," in Proc. Int. Conf. Spoken Language Processing (ICSLP), vol. 1, 2000, pp. 162-165.
    • (2000) Proc. Int. Conf. Spoken Language Processing (ICSLP) , vol.1 , pp. 162-165
    • Kato, K.1    Nanjo, H.2    Kawahara, T.3
  • 25
    • 85009274873 scopus 로고    scopus 로고
    • Unsupervised language model adaptation for lecture speech transcription
    • Denver, CO
    • T. Niesler and D. Willett, "Unsupervised language model adaptation for lecture speech transcription," in Proc. Int. Conf. Spoken Language Processing (ICSLP), Denver, CO, 2002, pp. 1413-1416.
    • (2002) Proc. Int. Conf. Spoken Language Processing (ICSLP) , pp. 1413-1416
    • Niesler, T.1    Willett, D.2
  • 27
    • 85009250844 scopus 로고    scopus 로고
    • Speaking rate compensation based on likelihood criterion in acoustic model training and decoding
    • K. Okuda, T. Kawahara, and S. Nakamura, "Speaking rate compensation based on likelihood criterion in acoustic model training and decoding," in Proc. Int. Conf. Spoken Language Processing (ICSLP), 2002, pp. 2589-2592.
    • (2002) Proc. Int. Conf. Spoken Language Processing (ICSLP) , pp. 2589-2592
    • Okuda, K.1    Kawahara, T.2    Nakamura, S.3
  • 32
    • 0033709101 scopus 로고    scopus 로고
    • Detection of prosodic word boundaries by statistical modeling of mora transitions of fundamental frequency contours and its use for continuous speech recognition
    • K. Hirose and K. Iwano, "Detection of prosodic word boundaries by statistical modeling of mora transitions of fundamental frequency contours and its use for continuous speech recognition," in Proc. IEEE Int. Conf. Acoustics, Speech and Signal Processing (ICASSP), vol. 3, 2000, pp. 1763-1766.
    • (2000) Proc. IEEE Int. Conf. Acoustics, Speech and Signal Processing (ICASSP) , vol.3 , pp. 1763-1766
    • Hirose, K.1    Iwano, K.2
  • 35
    • 85009148152 scopus 로고    scopus 로고
    • Dealing with out-of-vocabulary words and speech disfluencies in an n-gram based speech understanding system
    • A. Kai, Y. Hirose, and S. Nakagawa, "Dealing with out-of-vocabulary words and speech disfluencies in an n-gram based speech understanding system," in Proc. Int. Conf. Spoken Language Processing (ICSLP), vol. 6, 1998, pp. 2427-2430.
    • (1998) Proc. Int. Conf. Spoken Language Processing (ICSLP) , vol.6 , pp. 2427-2430
    • Kai, A.1    Hirose, Y.2    Nakagawa, S.3
  • 37
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech and Lang., vol. 9, no. 2, pp. 171-185, 1995.
    • (1995) Comput. Speech and Lang. , vol.9 , Issue.2 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.