Volume 46, Issue 2, 2005, Pages 153-170

A framework for predicting speech recognition errors

Author keywords

Automatic speech recognition; Error prediction; Lexical adaptation; Lexicon optimization; Pronunciation modeling

Indexed keywords

ALGORITHMS; ERROR ANALYSIS; FORMAL LANGUAGES; MATHEMATICAL MODELS; MAXIMUM LIKELIHOOD ESTIMATION; OPTIMIZATION;

EID: 19944369427     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2005.03.003     Document Type: Conference Paper
Times cited: 22

References (40)
  • 1
    • 0036293559
    • The graphical models toolkit: An open source software system for speech and time-series processing
    • Orlando, FL, June 2002
    • Bilmes, J., Zweig, G., 2002. The graphical models toolkit: An open source software system for speech and time-series processing. In: IEEE Internat. Conf. on Acoustics, Speech, and Signal Processing, Orlando, FL, June 2002, pp. 3916-3919.
    • (2002) IEEE Internat. Conf. on Acoustics, Speech, and Signal Processing , pp. 3916-3919
    • Bilmes, J.1    Zweig, G.2
  • 5
    • 0029915359
    • Assessing transcription agreement: Methodological aspects
    • Cucchiarini, C., 1996. Assessing transcription agreement: methodological aspects. Clin. Linguist. Phonet. 10 (2), 131-155.
    • (1996) Clin. Linguist. Phonet. , vol.10 , Issue.2 , pp. 131-155
    • Cucchiarini, C.1
  • 6
    • 85009171168
    • Estimating speech recognition error rate without acoustic test data
    • Geneva, Switzerland, September 2003
    • Deng, Y., Mahajan, M., Acero, A., 2003. Estimating speech recognition error rate without acoustic test data. In: Proc. Eurospeech, Geneva, Switzerland, September 2003, pp. 929-932.
    • (2003) Proc. Eurospeech , pp. 929-932
    • Deng, Y.1    Mahajan, M.2    Acero, A.3
  • 8
    • 0037906252
    • Not just what, but also when: Guided automatic pronunciation modeling for broadcast news
    • Herndon, Virginia, March 1999
    • Fosler-Lussier, E., Williams, G., 1999. Not just what, but also when: Guided automatic pronunciation modeling for broadcast news. In: DARPA Broadcast News Workshop, Herndon, Virginia, March 1999.
    • (1999) DARPA Broadcast News Workshop
    • Fosler-Lussier, E.1    Williams, G.2
  • 10
    • 2442562479
    • Segmental minimum Bayes-risk decoding for automatic speech recognition
    • Goel, V., Kumar, S., Byrne, W., 2004. Segmental minimum Bayes-risk decoding for automatic speech recognition. IEEE Trans. Speech Audio Process. 12, 234-249.
    • (2004) IEEE Trans. Speech Audio Process. , vol.12 , pp. 234-249
    • Goel, V.1    Kumar, S.2    Byrne, W.3
  • 11
    • 0345414074
    • An efficient image similarity measure based on approximations of KL-divergence between two Gaussian mixtures
    • Goldberger, J., Gordon, S., Greenspan, H., 2003. An efficient image similarity measure based on approximations of KL-divergence between two Gaussian mixtures. In: Internat. Conf. on Computer Vision, pp. 487-493.
    • (2003) Internat. Conf. on Computer Vision , pp. 487-493
    • Goldberger, J.1    Gordon, S.2    Greenspan, H.3
  • 12
    • 0002076795
    • Insights into spoken language gleaned from phonetic transcription of the Switchboard Corpus
    • Philadelphia, Pennsylvania, October 1996
    • Greenberg, S., Hollenbach, J., Ellis, D., 1996. Insights into spoken language gleaned from phonetic transcription of the Switchboard Corpus. In: Proc. 4th Internat. Conf. on Spoken Language Processing (ICSLP-96), Philadelphia, Pennsylvania, October 1996, pp. S24-S27.
    • (1996) Proc. 4th Internat. Conf. on Spoken Language Processing (ICSLP-96)
    • Greenberg, S.1    Hollenbach, J.2    Ellis, D.3
  • 13
    • 19944363612
    • An introduction to the diagnostic evaluation of the Switchboard-Corpus automatic speech recognition systems
    • College Park, Maryland, May 2000
    • Greenberg, S., Chang, S., Hollenback, J., 2000. An introduction to the diagnostic evaluation of the Switchboard-Corpus automatic speech recognition systems. In: Proc. NIST Speech Transcription Workshop, College Park, Maryland, May 2000.
    • (2000) Proc. NIST Speech Transcription Workshop
    • Greenberg, S.1    Chang, S.2    Hollenback, J.3
  • 14
    • 85031583096
    • New words: Effect on recognition performance and incorporation issues
    • Madrid, Spain, 1995
    • Hetherington, L., 1995. New words: Effect on recognition performance and incorporation issues. In: Proc. Eurospeech, Madrid, Spain, 1995, pp. 1645-1648.
    • (1995) Proc. Eurospeech , pp. 1645-1648
    • Hetherington, L.1
  • 16
    • 0033335617
    • Maximum likelihood modelling of pronunciation variation
    • Holter, T., Svendsen, T., 1999. Maximum likelihood modelling of pronunciation variation. Speech Comm. 29, 177-191.
    • (1999) Speech Comm. , vol.29 , pp. 177-191
    • Holter, T.1    Svendsen, T.2
  • 17
    • 0004149277
    • Preliminaries to speech analysis
    • Acoustics Laboratory, Massachusetts Institute of Technology
    • Jakobson, R., Fant, G., Halle, M., 1952. Preliminaries to speech analysis, Tech. Rep. 13, Acoustics Laboratory, Massachusetts Institute of Technology.
    • (1952) Tech. Rep. , vol.13
    • Jakobson, R.1    Fant, G.2    Halle, M.3
  • 18
    • 0033318198
    • Improving the performance of a Dutch CSR by modeling within-word and cross-word pronunciation variation
    • Kessens, J.M., Wester, M., Strik, H., 1999. Improving the performance of a Dutch CSR by modeling within-word and cross-word pronunciation variation. Speech Comm. 29 (2-4), 193-207.
    • (1999) Speech Comm. , vol.29 , Issue.2-4 , pp. 193-207
    • Kessens, J.M.1    Wester, M.2    Strik, H.3
  • 20
  • 21
    • 0034296009
    • Finding consensus in speech recognition: Word error minimization and other applications of confusion networks
    • Mangu, L., Brill, E., Stolcke, A., 2000. Finding consensus in speech recognition: Word error minimization and other applications of confusion networks. Comput. Speech Lang. 14 (4), 373-400.
    • (2000) Comput. Speech Lang. , vol.14 , Issue.4 , pp. 373-400
    • Mangu, L.1    Brill, E.2    Stolcke, A.3
  • 22
    • 84943154470
    • Fabricating conversational speech data with acoustic models: A program to examine model-data mismatch
    • Sydney, Australia, December 1998
    • McAllaster, D., Gillick, L., Scattone, F., Newman, M., 1998. Fabricating conversational speech data with acoustic models: A program to examine model-data mismatch. In: Proc. 5th Internat. Conf. on Spoken Language Processing (ICSLP-98), Sydney, Australia, December 1998, pp. 1847-1850.
    • (1998) Proc. 5th Internat. Conf. on Spoken Language Processing (ICSLP-98) , pp. 1847-1850
    • McAllaster, D.1    Gillick, L.2    Scattone, F.3    Newman, M.4
  • 23
    • 84955022115
    • An analysis of perceptual confusions among some English consonants
    • Miller, G., Nicely, P., 1955. An analysis of perceptual confusions among some English consonants. J. Acoust. Soc. Amer. 27, 338-352.
    • (1955) J. Acoust. Soc. Amer. , vol.27 , pp. 338-352
    • Miller, G.1    Nicely, P.2
  • 25
    • 19944420325
    • Context-dependent probabilistic hierarchical sub-lexical modelling using finite state transducers
    • Aalborg, Denmark
    • Mou, X., Seneff, S., Zue, V., 2000. Context-dependent probabilistic hierarchical sub-lexical modelling using finite state transducers. In: Proc. Eurospeech, Aalborg, Denmark, pp. 451-454.
    • (2000) Proc. Eurospeech , pp. 451-454
    • Mou, X.1    Seneff, S.2    Zue, V.3
  • 26
    • 19944411931
    • National Institute of Standards and Technology, 2001. SCLITE scoring software. Available as part of the SCTK package from: .
    • (2001) SCLITE Scoring Software
  • 30
    • 0036460984
    • Theory and practice of acoustic confusability
    • Printz, H., Olsen, P.A., 2002. Theory and practice of acoustic confusability. Comput. Speech Lang. 16, 131-164.
    • (2002) Comput. Speech Lang. , vol.16 , pp. 131-164
    • Printz, H.1    Olsen, P.A.2
  • 37
    • 0020588285
    • Evaluating processed speech using the diagnostic rhyme test
    • Voiers, W., 1983. Evaluating processed speech using the diagnostic rhyme test. Speech Technol. (1), 30-39.
    • (1983) Speech Technol. , Issue.1 , pp. 30-39
    • Voiers, W.1
  • 38
    • 84968911025
    • A comparison of data-derived and knowledge-based modeling of pronunciation variation
    • Beijing, China, October 2000
    • Wester, M., Fosler-Lussier, E., 2000. A comparison of data-derived and knowledge-based modeling of pronunciation variation. In: Proc. Internat. Conf. on Spoken Language Processing, Beijing, China, October 2000, pp. 270-273.
    • (2000) Proc. Internat. Conf. on Spoken Language Processing , pp. 270-273
    • Wester, M.1    Fosler-Lussier, E.2
  • 40
    • 85009085100
    • Speech technology integration and research platform: A system study
    • Rhodes, Greece
    • Zhou, Q., Lee, C.-H., Chou, W., Pargellis, A., 1997. Speech technology integration and research platform: a system study. In: Proc. Eurospeech, Rhodes, Greece, pp. 621-624.
    • (1997) Proc. Eurospeech , pp. 621-624
    • Zhou, Q.1    Lee, C.-H.2    Chou, W.3    Pargellis, A.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.