SCOPUS 정보 검색 플랫폼

Speech Communication

Volumn 46, Issue 2, 2005, Pages 153-170

A framework for predicting speech recognition errors

(3) Fosler Lussier, Eric a Amdal, Ingunn b Kuo, Hong Kwang Jeff c

a Ohio State University (United States)

b NORWEGIAN UNIVERSITY OF SCIENCE AND TECHNOLOGY (Norway)

c IBM T J WATSON RESEARCH CENTER (United States)

Author keywords

Automatic speech recognition; Error prediction; Lexical adaptation; Lexicon optimization; Pronunciation modeling

Indexed keywords

ALGORITHMS; ERROR ANALYSIS; FORMAL LANGUAGES; MATHEMATICAL MODELS; MAXIMUM LIKELIHOOD ESTIMATION; OPTIMIZATION;

AUTOMATIC SPEECH RECOGNITION (ASR); ERROR PREDICTION; LEXICAL ADAPTATION; LEXICON OPTIMIZATION; PRONUNCIATION MODELING;

SPEECH RECOGNITION;

EID: 19944369427 PISSN: 01676393 EISSN: None Source Type: Journal
DOI: 10.1016/j.specom.2005.03.003 Document Type: Conference Paper

Times cited : (22)

References (40)

1
- 0036293559
- The graphical models toolkit: An open source software system for speech and time-series processing
- Orlando, FL, June 2002
- Bilmes, J., Zweig, G., 2002. The graphical models toolkit: An open source software system for speech and time-series processing. In: IEEE Internat. Conf. on Acoustics, Speech, and Signal Processing, Orlando, FL, June 2002, pp. 3916-3919.
- (2002) IEEE Internat. Conf. on Acoustics, Speech, and Signal Processing , pp. 3916-3919
- Bilmes, J.¹ Zweig, G.²

2
- 0003987751
- PhD thesis, Carnegie Melon University
- Chase, L., 1997. Error-responsive feedback mechanisms for speech recognizers. PhD thesis, Carnegie Melon University.
- (1997) Error-responsive Feedback Mechanisms for Speech Recognizers
- Chase, L.¹

3
- 85009129587
- Discriminative training on language model
- Bejing, China, October 2000
- Chen, Z., Lee, K.-F., Li, M.J., 2000. Discriminative training on language model. In: Proc. Internat. Conf. on Spoken Language Processing (ICSLP 2000), Bejing, China, October 2000, pp. 493-496.
- (2000) Proc. Internat. Conf. on Spoken Language Processing (ICSLP 2000) , pp. 493-496
- Chen, Z.¹ Lee, K.-F.² Li, M.J.³

4
- 0004119259
- Harper and Row New York
- N. Chomsky, and M. Halle The sound pattern of English 1968 Harper and Row New York
- (1968) The Sound Pattern of English
- Chomsky, N.¹ Halle, M.²

5
- 0029915359
- Assessing transcription agreement: Methodological aspects
- C. Cucchiarini Assessing transcription agreement: methodological aspects Clin. Linguist. Phonet. 10 2 1996 131 155
- (1996) Clin. Linguist. Phonet. , vol.10 , Issue.2 , pp. 131-155
- Cucchiarini, C.¹

6
- 85009171168
- Estimating speech recognition error rate without acoustic test data
- Geneva, Switzerland' September 2003
- Deng, Y., Mahajan, M., Acero, A., 2003. Estimating speech recognition error rate without acoustic test data. In: Proc. Eurospeech, Geneva, Switzerland', pp. 929-932, September 2003.
- (2003) Proc. Eurospeech , pp. 929-932
- Deng, Y.¹ Mahajan, M.² Acero, A.³

7
- 0004119132
- PhD thesis, University of California, Berkeley
- Fosler-Lussier, J.E., 1999. Dynamic pronunciation models for automatic speech recognition. PhD thesis, University of California, Berkeley.
- (1999) Dynamic Pronunciation Models for Automatic Speech Recognition
- Fosler-Lussier, J.E.¹

8
- 0037906252
- Not just what, but also when: Guided automatic pronunciation modeling for broadcast news
- Herndon, Virginia, March 1999
- Fosler-Lussier, E., Williams, G., 1999. Not just what, but also when: Guided automatic pronunciation modeling for broadcast news. In: DARPA Broadcast News Workshop, Herndon, Virginia, March 1999.
- (1999) DARPA Broadcast News Workshop
- Fosler-Lussier, E.¹ Williams, G.²

9
- 67349144946
- On the road to improved lexical confusability metrics
- Estes Park, Colorado, 2002
- Fosler-Lussier, E., Amdal, I., Kuo, H.-K.J., 2002. On the road to improved lexical confusability metrics. In: ISCA Tutorial and Research Workshop on Pronunciation Modeling and Lexicon Adaptation (PMLA-2002), Estes Park, Colorado, 2002.
- (2002) ISCA Tutorial and Research Workshop on Pronunciation Modeling and Lexicon Adaptation (PMLA-2002)
- Fosler-Lussier, E.¹ Amdal, I.² Kuo, H.-K.J.³

10
- 2442562479
- Segmental minimum Bayes-risk decoding for automatic speech recognition
- V. Goel, S. Kumar, and W. Byrne Segmental minimum Bayes-risk decoding for automatic speech recognition IEEE Trans. Speech Audio Process. 12 2004 234 249
- (2004) IEEE Trans. Speech Audio Process. , vol.12 , pp. 234-249
- Goel, V.¹ Kumar, S.² Byrne, W.³

11
- 0345414074
- An efficient image similarity measure based on approximations of kl-divergence between two Gaussian mixtures
- Goldberger, J., Gordon, S., Greenspan, H., 2003. An efficient image similarity measure based on approximations of kl-divergence between two Gaussian mixtures. In: Internat. Conf. on Computer Vision, pp. 487-493.
- (2003) Internat. Conf. on Computer Vision , pp. 487-493
- Goldberger, J.¹ Gordon, S.² Greenspan, H.³

12
- 0002076795
- Insights into spoken language gleaned from phonetic transcription of the Switchboard Corpus
- Philadelphia, Pennsylvania, October 1996
- Greenberg, S., Hollenbach, J., Ellis, D., 1996. Insights into spoken language gleaned from phonetic transcription of the Switchboard Corpus. In: Proc. 4th Internat. Conf. on Spoken Language Processing (ICSLP-96), Philadelphia, Pennsylvania, October 1996, pp. S24-S27.
- (1996) Proc. 4th Internat. Conf. on Spoken Language Processing (ICSLP-96)
- Greenberg, S.¹ Hollenbach, J.² Ellis, D.³

13
- 19944363612
- An introduction to the diagnostic evaluation of the Switchboard-Corpus automatic speech recognition systems
- College Park, Maryland, May 2000
- Greenberg, S., Chang, S., Hollenback, J., 2000. An introduction to the diagnostic evaluation of the Switchboard-Corpus automatic speech recognition systems. In: Proc. NIST Speech Transcription Workshop, College Park, Maryland, May 2000.
- (2000) Proc. NIST Speech Transcription Workshop
- Greenberg, S.¹ Chang, S.² Hollenback, J.³

14
- 85031583096
- New words: Effect on recognition performance and incorporation issues
- Madrid, Spain, 1995
- Hetherington, L., 1995. New words: Effect on recognition performance and incorporation issues. In: Proc. Eurospeech, Madrid, Spain, 1995, pp. 1645-1648.
- (1995) Proc. Eurospeech , pp. 1645-1648
- Hetherington, L.¹

15
- 0002899718
- Prosodic cues to recognition errors
- Keystone, CO
- Hirschberg, J., Litman, D., Swerts, M., 1999. Prosodic cues to recognition errors. In: Proc. Automatic Speech Recognition and Understanding Workshop (ASRU'99), Keystone, CO.
- (1999) Proc. Automatic Speech Recognition and Understanding Workshop (ASRU'99)
- Hirschberg, J.¹ Litman, D.² Swerts, M.³

16
- 0033335617
- Maximum likelihood modelling of pronunciation variation
- T. Holter, and T. Svendsen Maximum likelihood modelling of pronunciation variation Speech Comm. 29 1999 177 191
- (1999) Speech Comm. , vol.29 , pp. 177-191
- Holter, T.¹ Svendsen, T.²

17
- 0004149277
- Preliminaries to speech analysis
- Acoustics Laboratory, Massachusetts Instutite of Technology
- Jakobson, R., Fant, G., Halle, M., 1952. Preliminaries to speech analysis, Tech. Rep. 13, Acoustics Laboratory, Massachusetts Instutite of Technology.
- (1952) Tech. Rep. , vol.13
- Jakobson, R.¹ Fant, G.² Halle, M.³

18
- 0033318198
- Improving the performance of a Dutch CSR by modeling within-word and cross-word pronunciation variation
- J.M. Kessens, M. Wester, and H. Strik Improving the performance of a Dutch CSR by modeling within-word and cross-word pronunciation variation Speech Comm. 29 2-4 1999 193 207
- (1999) Speech Comm. , vol.29 , Issue.2-4 , pp. 193-207
- Kessens, J.M.¹ Wester, M.² Strik, H.³

19
- 0036293856
- Discriminative training of language models for speech recognition
- Orlando, Florida, May 2002
- Kuo, H.-K., Fosler-Lussier, E., Jiang, H., Lee, C.-H., 2002. Discriminative training of language models for speech recognition. In: Proc. Internat. Conf. on Acoustics, Speech, and Signal Processing, Orlando, Florida, May 2002, pp. 325-328.
- (2002) Proc. Internat. Conf. on Acoustics, Speech, and Signal Processing , pp. 325-328
- Kuo, H.-K.¹ Fosler-Lussier, E.² Jiang, H.³ Lee, C.-H.⁴

20
- 0033705981
- Lexical modeling of non-native speech for automatic speech recognition
- Istanbul, Turkey, 2000
- Livescu, K., Glass, J., 2000. Lexical modeling of non-native speech for automatic speech recognition. In: Proc. Internat. Conf. on Acoustics, Speech, and Signal Processing, Istanbul, Turkey, 2000, pp. 1683-1686.
- (2000) Proc. Internat. Conf. on Acoustics, Speech, and Signal Processing , pp. 1683-1686
- Livescu, K.¹ Glass, J.²

21
- 0034296009
- Finding consensus in speech recognition: Word error minimization and other applications of confusion networks
- L. Mangu, E. Brill, and A. Stolcke Finding consensus in speech recognition: Word error minimization and other applications of confusion networks Comput. Speech Lang. 14 4 2000 373 400
- (2000) Comput. Speech Lang. , vol.14 , Issue.4 , pp. 373-400
- Mangu, L.¹ Brill, E.² Stolcke, A.³

22
- 84943154470
- Fabricating conversational speech data with acoustic models: A program to examine model-data mismatch
- Sydney, Australia, December 1998
- McAllaster, D., Gillick, L., Scattone, F., Newman, M., 1998. Fabricating conversational speech data with acoustic models: A program to examine model-data mismatch. In: Proc. 5th Internat. Conf. on Spoken Language Processing (ICSLP-98), Sydney, Australia, December 1998, pp. 1847-1850.
- (1998) Proc. 5th Internat. Conf. on Spoken Language Processing (ICSLP-98) , pp. 1847-1850
- McAllaster, D.¹ Gillick, L.² Scattone, F.³ Newman, M.⁴

23
- 84955022115
- An analysis of perceptual confusions among some English consonants
- G. Miller, and P. Nicely An analysis of perceptual confusions among some English consonants J. Acoust. Soc. Amer. 27 1955 338 352
- (1955) J. Acoust. Soc. Amer. , vol.27 , pp. 338-352
- Miller, G.¹ Nicely, P.²

24
- 84892168937
- Full expansion of context-dependent networks in large vocabulary speech recognition
- Seattle, Washington
- Mohri, M., Riley, M., Hindle, D., Ljolje, A., Pereira, F., 1998. Full expansion of context-dependent networks in large vocabulary speech recognition. In: Proc. Internat. Conf. on Acoustics, Speech, and Signal Processing, Seattle, Washington, pp. 665-668.
- (1998) Proc. Internat. Conf. on Acoustics, Speech, and Signal Processing , pp. 665-668
- Mohri, M.¹ Riley, M.² Hindle, D.³ Ljolje, A.⁴ Pereira, F.⁵

25
- 19944420325
- Context-dependent probabilistic hierarchical sub-lexical modelling using finite state transducers
- Alborg, Denmark
- Mou, X., Seneff, S., Zue, V., 2000. Context-dependent probabilistic hierarchical sub-lexical modelling using finite state transducers. In: Proc. Eurospeech, Alborg, Denmark, pp. 451-454.
- (2000) Proc. Eurospeech , pp. 451-454
- Mou, X.¹ Seneff, S.² Zue, V.³

26
- 19944411931
- National Institute of Standards and Technology, 2001. SCLITE scoring software. Available as part of the SCTK package from: .
- (2001) SCLITE Scoring Software

27
- 19944372491
- Linguistic Data Consortium Catalog Number LDC95S27
- Pitrelli, J., Fong, C., 1995. PHONEBOOK: NYNEX isolated words. Linguistic Data Consortium Catalog Number LDC95S27.
- (1995) PHONEBOOK: NYNEX Isolated Words
- Pitrelli, J.¹ Fong, C.²

28
- 34547245961
- Dialogue management in the Bell Labs Communicator system
- Bejing, China, October 2000
- Potamianos, A., Ammicht, E., Kuo, H.-K.J., 2000. Dialogue management in the Bell Labs Communicator system. In: Proc. Internat. Conf. on Spoken Language Processing (ICSLP 2000), Bejing, China, October 2000, pp. 603-606.
- (2000) Proc. Internat. Conf. on Spoken Language Processing (ICSLP 2000) , pp. 603-606
- Potamianos, A.¹ Ammicht, E.² Kuo, H.-K.J.³

29
- 0004161838
- second ed. Cambridge University Press
- W.H. Press, S. Teukolsky, W.T. Vetterling, and B.P. Flannery Numerical recipes in C: The art of scientific computing second ed. 1999 Cambridge University Press
- (1999) Numerical Recipes in C: The Art of Scientific Computing
- Press, W.H.¹ Teukolsky, S.² Vetterling, W.T.³ Flannery, B.P.⁴

30
- 0036460984
- Theory and practice of acoustic confusability
- H. Printz, and P.A. Olsen Theory and practice of acoustic confusability Comput. Speech Lang. 16 2002 131 164
- (2002) Comput. Speech Lang. , vol.16 , pp. 131-164
- Printz, H.¹ Olsen, P.A.²

31
- 0026405248
- A statistical model for generating pronunciation networks
- Riley, M., 1991. A statistical model for generating pronunciation networks. In: Proc. Internat. Conf. on Acoustics, Speech, and Signal Processing, pp. 737-740.
- (1991) Proc. Internat. Conf. on Acoustics, Speech, and Signal Processing , pp. 737-740
- Riley, M.¹

32
- 0002802333
- Stochastic pronunciation modelling from hand-labelled phonetic corpora
- Kerkrade, Netherlands, April 1998
- Riley, M., Byrne, W., Finke, M., Khudanpur, S., Ljolje, A., McDonough, J., Nock, H., Saraclar, M., Wooters, C., Zavaliagkos, G., 1998. Stochastic pronunciation modelling from hand-labelled phonetic corpora. In: ESCA Tutorial and Research Workshop on Modeling Pronunciation Variation for Automatic Speech Recognition, Kerkrade, Netherlands, April 1998, pp. 109-116.
- (1998) ESCA Tutorial and Research Workshop on Modeling Pronunciation Variation for Automatic Speech Recognition , pp. 109-116
- Riley, M.¹ Byrne, W.² Finke, M.³ Khudanpur, S.⁴ Ljolje, A.⁵ McDonough, J.⁶ Nock, H.⁷ Saraclar, M.⁸ Wooters, C.⁹ Zavaliagkos, G.¹⁰

33
- 0030715426
- Confidence measures for spontaneous speech recognition
- Munich, Germany
- Schaaf, T., Kemp, T., 1997. Confidence measures for spontaneous speech recognition. In: Proc. Internat. Conf. on Acoustics, Speech, and Signal Processing, Munich, Germany, pp. 875-878.
- (1997) Proc. Internat. Conf. on Acoustics, Speech, and Signal Processing , pp. 875-878
- Schaaf, T.¹ Kemp, T.²

34
- 19944410257
- Discriminative optimization of the lexical model
- Estes Park, Colorado
- Schramm, H., Beyerlein, P., 2002. Discriminative optimization of the lexical model. In: ISCA Tutorial and Research Workshop on Pronunciation Modeling and Lexicon Adaptation (PMLA-2002), Estes Park, Colorado.
- (2002) ISCA Tutorial and Research Workshop on Pronunciation Modeling and Lexicon Adaptation (PMLA-2002)
- Schramm, H.¹ Beyerlein, P.²

35
- 0030363039
- Dictionary learning for spontaneous speech recognition
- Sloboda, T., Waibel, A., 1996. Dictionary learning for spontaneous speech recognition. In: Proc. 4th Internat. Conf. on Spoken Language Processing (ICSLP-96), pp. 2328-2331.
- (1996) Proc. 4th Internat. Conf. on Spoken Language Processing (ICSLP-96) , pp. 2328-2331
- Sloboda, T.¹ Waibel, A.²

36
- 0004161686
- Kluwer Dordrecht
- R. Sproat Multilingual Text-to-Speech synthesis: The Bell Labs approach 1998 Kluwer Dordrecht
- (1998) Multilingual Text-to-Speech Synthesis: The Bell Labs Approach
- Sproat, R.¹

37
- 0020588285
- Evaluationg processed speech using the diagnostic rhyme test
- W. Voiers Evaluationg processed speech using the diagnostic rhyme test Speech Technol. 1 1983 30 39
- (1983) Speech Technol. , Issue.1 , pp. 30-39
- Voiers, W.¹

38
- 84968911025
- A comparison of data-derived and knowledge-based modeling of pronunciation variation
- Bejing, China, October 2000
- Wester, M., Fosler-Lussier, E., 2000. A comparison of data-derived and knowledge-based modeling of pronunciation variation. In: Proc. Internat. Conf. on Spoken Language Processing, Bejing, China, October 2000, pp. 270-273.
- (2000) Proc. Internat. Conf. on Spoken Language Processing , pp. 270-273
- Wester, M.¹ Fosler-Lussier, E.²

39
- 0037795517
- PhD thesis, University of Sheffield, Sheffield, England
- Williams, D.A.G., 1999. Knowing what you don't know: roles for confidence measures in automatic speech recognition. PhD thesis, University of Sheffield, Sheffield, England.
- (1999) Knowing what you don't Know: Roles for Confidence Measures in Automatic Speech Recognition
- Williams, D.A.G.¹

40
- 85009085100
- Speech technology integration and research platform: A system study
- Rhodes, Greece
- Zhou, Q., Lee, C.-H., Chou, W., Pargellis, A., 1997. Speech technology integration and research platform: a system study. In: Proc. Eurospeech, Rhodes, Greece, pp. 621-624.
- (1997) Proc. Eurospeech , pp. 621-624
- Zhou, Q.¹ Lee, C.-H.² Chou, W.³ Pargellis, A.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.