메뉴 건너뛰기




Volumn 46, Issue 2, 2005, Pages 171-188

Implicit modelling of pronunciation variation in automatic speech recognition

Author keywords

Acoustic modelling; Automatic speech recognition; Conversational speech recognition; Hidden markov models; Parameter tying; Phonetic decision trees; Pronunciation dictionaries; Pronunciation modelling; Single pronunciations; State clustering

Indexed keywords

DECISION THEORY; MARKOV PROCESSES; MATHEMATICAL MODELS; OPTIMIZATION; SPEECH ANALYSIS; TREES (MATHEMATICS);

EID: 19944415893     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2005.03.008     Document Type: Conference Paper
Times cited : (33)

References (34)
  • 3
    • 0025629882 scopus 로고
    • Tied mixture continuous parameter modeling for speech recognition
    • J.R. Bellegarda, and D. Nahamoo Tied mixture continuous parameter modeling for speech recognition IEEE Trans. ASSP 38 12 1990 2033 2045
    • (1990) IEEE Trans. ASSP , vol.38 , Issue.12 , pp. 2033-2045
    • Bellegarda, J.R.1    Nahamoo, D.2
  • 5
    • 0343367219 scopus 로고    scopus 로고
    • Automatic rule-based generation of word pronunciation networks
    • Cremelie, N., Martens, J.-P., 1997. Automatic rule-based generation of word pronunciation networks. In: Proceedings of EUROSPEECH'97. pp. 2459-2462.
    • (1997) Proceedings of EUROSPEECH'97 , pp. 2459-2462
    • Cremelie, N.1    Martens, J.-P.2
  • 6
    • 85027454087 scopus 로고    scopus 로고
    • Speaking mode dependent pronunciation modelling in large vocabulary continuous speech recognition
    • Rhodes
    • Finke, M., Waibel, A., 1997. Speaking mode dependent pronunciation modelling in large vocabulary continuous speech recognition. In: Proceedings of EUROSPEECH' 97, Vol. 5. Rhodes, pp. 2379-2382.
    • (1997) Proceedings of EUROSPEECH' 97 , vol.5 , pp. 2379-2382
    • Finke, M.1    Waibel, A.2
  • 9
    • 0342931765 scopus 로고    scopus 로고
    • The switchboard transcription project
    • Center for Language and Speech Processing, Johns Hopkins University
    • Greenberg, S., 1996. The Switchboard transcription project. 1996 LVCSR summer workshop technical reports, Center for Language and Speech Processing, Johns Hopkins University. Available from: < http://www.icsi.berkeley.edu/ real/stp>.
    • (1996) 1996 LVCSR Summer Workshop Technical Reports
    • Greenberg, S.1
  • 12
    • 85135269907 scopus 로고    scopus 로고
    • Dynamic HMM selection for continuous speech recognition
    • September
    • Hain, T., Woodland, P.C., September 1999. Dynamic HMM selection for continuous speech recognition. In: Proceedings of EUROSPEECH'99, Vol. 3. pp. 1327-1330.
    • (1999) Proceedings of EUROSPEECH'99 , vol.3 , pp. 1327-1330
    • Hain, T.1    Woodland, P.C.2
  • 15
    • 0034847002 scopus 로고    scopus 로고
    • New features in the cu-htk system for transcription of conversational telephone speech
    • Hain, T., Woodland, P.C., Evermann, G., Povey, D., 2001. New features in the cu-htk system for transcription of conversational telephone speech. In: Proceedings of ICASSP'01. pp. 57-60.
    • (2001) Proceedings of ICASSP'01 , pp. 57-60
    • Hain, T.1    Woodland, P.C.2    Evermann, G.3    Povey, D.4
  • 16
    • 0000250399 scopus 로고
    • Semi-continuous hidden Markov models for speech signals
    • X.D. Huang, and M.A. Jack Semi-continuous hidden Markov models for speech signals Computer Speech and Language 3 1989 239 251
    • (1989) Computer Speech and Language , vol.3 , pp. 239-251
    • Huang, X.D.1    Jack, M.A.2
  • 19
    • 0002237531 scopus 로고    scopus 로고
    • Probabilistic classification of HMM states for large vocabulary continuous speech recognition
    • April
    • Luo, X., Jelinek, F., April 1999. Probabilistic classification of HMM states for large vocabulary continuous speech recognition. In: Proceedings of ICASSP'99. pp. 2044-2047.
    • (1999) Proceedings of ICASSP'99 , pp. 2044-2047
    • Luo, X.1    Jelinek, F.2
  • 26
    • 0000114416 scopus 로고    scopus 로고
    • Pronunciation modelling by sharing Gaussian densities across phonetic models
    • M. Saraçlar, H.J. Nock, and S. Khudanpur Pronunciation modelling by sharing Gaussian densities across phonetic models Computer Speech and Language 14 2000 137 160
    • (2000) Computer Speech and Language , vol.14 , pp. 137-160
    • Saraçlar, M.1    Nock, H.J.2    Khudanpur, S.3
  • 28
    • 0033335618 scopus 로고    scopus 로고
    • Modelling pronunciation variation for ASR: A survey of the literature
    • H. Strik, and C. Cucchiarini Modelling pronunciation variation for ASR: a survey of the literature Speech Communication 29 1999 225 246
    • (1999) Speech Communication , vol.29 , pp. 225-246
    • Strik, H.1    Cucchiarini, C.2
  • 32
    • 0343367122 scopus 로고
    • Multiple-pronunciation lexical modeling in a speaker-independent speech understanding system
    • Wooters, C., Stolcke, A., 1994. Multiple-pronunciation lexical modeling in a speaker-independent speech understanding system. In: Proceedings of ICSLP'94, Vol. 3. pp. 1363-1367.
    • (1994) Proceedings of ICSLP'94 , vol.3 , pp. 1363-1367
    • Wooters, C.1    Stolcke, A.2
  • 33
    • 0028530231 scopus 로고
    • State clustering in hidden Markov model-based continuous speech recognition
    • S.J. Young, and P.C Woodland State clustering in hidden Markov model-based continuous speech recognition Computer Speech and Language 8 1994 369 383
    • (1994) Computer Speech and Language , vol.8 , pp. 369-383
    • Young, S.J.1    Woodland, P.C.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.