SCOPUS Information Retrieval Platform

Computer Speech and Language

Volume 17, Issue 1, 2003, Pages 69-85

Pronunciation modeling for ASR - Knowledge-based and data-derived methods

(1) Wester, Mirjam a

a RADBOUD UNIVERSITY NIJMEGEN (Netherlands)

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATIONAL GRAMMARS; DECISION THEORY; ERROR ANALYSIS; KNOWLEDGE BASED SYSTEMS; MARKOV PROCESSES; MATHEMATICAL MODELS; NEURAL NETWORKS; PATTERN RECOGNITION SYSTEMS; PROBABILITY;

AUTOMATIC SPEECH RECOGNITION SYSTEM; FINITE STATE GRAMMARS; PHONE RECOGNITION;

SPEECH RECOGNITION;

EID: 0038141324 PISSN: 08852308 EISSN: None Source Type: Journal
DOI: 10.1016/S0885-2308(02)00030-X Document Type: Article

Times cited : (45)

References (35)

1
- 85009062592
- Joint pronunciation modelling of non-native speakers using data-driven methods
- Amdal, I., Korkmazskiy, F., Surendran, A., 2000. Joint pronunciation modelling of non-native speakers using data-driven methods. In: Proc. ICSLP '00, Beijing, vol. III, pp. 622-625.
- (2000) Proc. ICSLP '00, Beijing , vol.3 , pp. 622-625
- Amdal, I.¹ Korkmazskiy, F.² Surendran, A.³

2
- 0038582437
- De CELEX lexicale databank
- Baayen, H., 1991. De CELEX lexicale databank. Forum Lett. 32 (3), 221-231.
- (1991) Forum Lett. , vol.32 , Issue.3 , pp. 221-231
- Baayen, H.¹

3
- 0003573244
- Connectionist speech recognition: A hybrid approach
- Kluwer Academic Publishers, Dordrecht
- Bourlard, H., Morgan, N., 1993. Connectionist Speech Recognition: A Hybrid Approach. Kluwer Academic Publishers, Dordrecht.
- (1993)
- Bourlard, H.¹ Morgan, N.²

4
- 0003721728
- Phonological structures for speech recognition
- Ph.D. thesis, University of California, Berkeley, CA
- Cohen, M., 1989. Phonological structures for speech recognition. Ph.D. thesis, University of California, Berkeley, CA.
- (1989)
- Cohen, M.¹

5
- 0033329496
- In search of better pronunciation models for speech recognition
- Cremelie, N., Martens, J.-P., 1999. In search of better pronunciation models for speech recognition. Speech Commun. 29, 115-136.
- (1999) Speech Commun. , vol.29 , pp. 115-136
- Cremelie, N.¹ Martens, J.-P.²

6
- 0033185227
- Improving continuous speech recognition in Spanish by phone-class semicontinuous HMMs with pausing and multiple pronunciations
- Ferreiros, J., Pardo, J., 1999. Improving continuous speech recognition in Spanish by phone-class semicontinuous HMMs with pausing and multiple pronunciations. Speech Commun. 29, 65-76.
- (1999) Speech Commun. , vol.29 , pp. 65-76
- Ferreiros, J.¹ Pardo, J.²

7
- 0037568163
- Modelling pronunciation variability for special domains
- Flach, G., 1995. Modelling pronunciation variability for special domains. In: Proc. EUROSPEECH '95, Madrid, pp. 1743-1746.
- (1995) Proc. EUROSPEECH '95, Madrid , pp. 1743-1746
- Flach, G.¹

8
- 0004119132
- Dynamic pronunciation models for automatic speech recognition
- Ph.D. thesis, University of California, Berkeley, CA
- Fosler-Lussier, E., 1999. Dynamic pronunciation models for automatic speech recognition. Ph.D. thesis, University of California, Berkeley, CA.
- (1999)
- Fosler-Lussier, E.¹

9
- 0037906252
- Not just what, but also when: Guided automatic pronunciation modeling for broadcast news
- Fosler-Lussier, E., Williams, G., 1999. Not just what, but also when: guided automatic pronunciation modeling for Broadcast News. In: DARPA Broadcast News Workshop, Herndon, VA., pp. 171-174.
- (1999) DARPA Broadcast News Workshop, Herndon, VA , pp. 171-174
- Fosler-Lussier, E.¹ Williams, G.²

10
- 0033357399
- Speaking in shorthand - A syllable-centric perspective for understanding pronunciation variation
- Greenberg, S., 1999. Speaking in shorthand - a syllable-centric perspective for understanding pronunciation variation. Speech Commun. 29, 159-176.
- (1999) Speech Commun. , vol.29 , pp. 159-176
- Greenberg, S.¹

11
- 0033335617
- Maximum likelihood modeling of pronunciation variation
- Holter, T., Svendsen, T., 1999. Maximum likelihood modeling of pronunciation variation. Speech Commun. 29, 177-191.
- (1999) Speech Commun. , vol.29 , pp. 177-191
- Holter, T.¹ Svendsen, T.²

12
- 0037795515
- Prosody in NIROS with FONPARS and ALFEIOS
- In: de Haan, P., Oostdijk, N. (Eds.)
- Kerkhoff, J., Rietveld, T., 1994. Prosody in NIROS with FONPARS and ALFEIOS. In: de Haan, P., Oostdijk, N. (Eds.), Proc. Dept. Lang. Speech, Univ. Nijmegen, vol. 18, pp. 107-119.
- (1994) Proc. Dept. Lang. Speech, Univ. Nijmegen , vol.18 , pp. 107-119
- Kerkhoff, J.¹ Rietveld, T.²

13
- 0038243730
- A data-driven method for modeling pronunciation variation
- (submitted)
- Kessens, J., Cucchiarini, C., Strik, H., 2001. A data-driven method for modeling pronunciation variation. Speech Commun. (submitted).
- (2001) Speech Commun.
- Kessens, J.¹ Cucchiarini, C.² Strik, H.³

14
- 0033318198
- Improving the performance of a Dutch CSR by modeling within-word and cross-word pronunciation variation
- Kessens, J., Wester, M., Strik, H., 1999. Improving the performance of a Dutch CSR by modeling within-word and cross-word pronunciation variation. Speech Commun. 29, 193-207.
- (1999) Speech Commun. , vol.29 , pp. 193-207
- Kessens, J.¹ Wester, M.² Strik, H.³

15
- 0030351374
- On designing pronunciation lexicons for large vocabulary, continuous speech recognition
- Lamel, L., Adda, G., 1996. On designing pronunciation lexicons for large vocabulary, continuous speech recognition. In: Proc. ICSLP '96, Philadelphia, PA, pp. 6-9.
- (1996) Proc. ICSLP '96, Philadelphia, PA , pp. 6-9
- Lamel, L.¹ Adda, G.²

16
- 23144431835
- Pronunciation modeling for large vocabulary conversational speech recognition
- Ma, K., Zavaliagkos, G., Iyer, R., 1998. Pronunciation modeling for large vocabulary conversational speech recognition. In: Proc. ICSLP '98, Sydney, pp. 2455-2458.
- (1998) Proc. ICSLP '98, Sydney , pp. 2455-2458
- Ma, K.¹ Zavaliagkos, G.² Iyer, R.³

17
- 84943154470
- Fabricating conversational speech data with acoustic models: A program to examine model-data mismatch
- McAllaster, D., Gillick, L., Scattone, F., Newman, M. 1998. Fabricating conversational speech data with acoustic models: a program to examine model-data mismatch. In: Proc. ICSLP '98, Sydney, pp. 1847-1850.
- (1998) Proc. ICSLP '98, Sydney , pp. 1847-1850
- McAllaster, D.¹ Gillick, L.² Scattone, F.³ Newman, M.⁴

18
- 0038243729
- The use of lexicons in text-to-speech-systems
- In: van Eynde, F., Gibbon, D. (Eds.); Kluwer Academic Publishers, Dordrecht; (Chapter 7)
- Quazza, S., van den Heuvel, H., 2000. The use of lexicons in text-to-speech-systems. In: van Eynde, F., Gibbon, D. (Eds.), Lexicon Development for Speech and Language Processing. Kluwer Academic Publishers, Dordrecht, pp. 207-233 (Chapter 7).
- (2000) Lexicon Development for Speech and Language Processing , pp. 207-233
- Quazza, S.¹ Van Den Heuvel, H.²

19
- 0033353288
- Stochastic pronunciation modelling from hand-labelled phonetic corpora
- Riley, M., Byrne, W., Finke, M., Khudanpur, S., Ljolje, A., McDonough, J., Nock, H., Saraçlar, M., Wooters, C., Zavaliagkos, G., 1999. Stochastic pronunciation modelling from hand-labelled phonetic corpora. Speech Commun. 29, 209-224.
- (1999) Speech Commun. , vol.29 , pp. 209-224
- Riley, M.¹ Byrne, W.² Finke, M.³ Khudanpur, S.⁴ Ljolje, A.⁵ McDonough, J.⁶ Nock, H.⁷ Saraçlar, M.⁸ Wooters, C.⁹ Zavaliagkos, G.¹⁰

20
- 0003921935
- Automatic generation of detailed pronunciation lexicons
- In: Lee, C.-H., Soong, F., Paliwal, K. (Eds.); Kluwer Academic Publishers, Dordrecht; Chapter 12
- Riley, M., Ljolje, A., 1996. Automatic generation of detailed pronunciation lexicons. In: Lee, C.-H., Soong, F., Paliwal, K. (Eds.), Automatic Speech and Speaker Recognition: Advanced Topics. Kluwer Academic Publishers, Dordrecht, pp. 285-302, Chapter 12.
- (1996) Automatic Speech and Speaker Recognition: Advanced Topics , pp. 285-302
- Riley, M.¹ Ljolje, A.²

21
- 0036567797
- Connectionist speech recognition of broadcast news
- Robinson, A., Cook, G., Ellis, D., Fosler-Lussier, E., Renals, S., Williams, D., 2001. Connectionist speech recognition of Broadcast News. Speech Commun. 37, 27-45.
- (2001) Speech Commun. , vol.37 , pp. 27-45
- Robinson, A.¹ Cook, G.² Ellis, D.³ Fosler-Lussier, E.⁴ Renals, S.⁵ Williams, D.⁶

22
- 0037906244
- Modeling pronunciation variations and coarticulation with finite-state transducers in CSR
- Safra, S., Lehtinen, G., Huber, K., 1998. Modeling pronunciation variations and coarticulation with finite-state transducers in CSR. In: Proc. ESCA Workshop Model. Pronunciation Variation Autom. Speech Recognit., Kerkrade, pp. 125-130.
- (1998) Proc. ESCA Workshop Model. Pronunciation Variation Autom. Speech Recognit., Kerkrade , pp. 125-130
- Safra, S.¹ Lehtinen, G.² Huber, K.³

23
- 0000114416
- Pronunciation modeling by sharing Gaussian densities across phonetic models
- Saraçlar, M., Nock, H., Khudanpur, S., 2000. Pronunciation modeling by sharing Gaussian densities across phonetic models. Comput. Speech Lang. 14, 137-160.
- (2000) Comput. Speech Lang. , vol.14 , pp. 137-160
- Saraçlar, M.¹ Nock, H.² Khudanpur, S.³

24
- 0030363039
- Dictionary learning for spontaneous speech recognition
- Sloboda, T., Waibel, A., 1996. Dictionary learning for spontaneous speech recognition. In: Proc. ICSLP '96, Philadelphia, PA, pp. 2328-2331.
- (1996) Proc. ICSLP '96, Philadelphia, PA , pp. 2328-2331
- Sloboda, T.¹ Waibel, A.²

25
- 85135378383
- The Philips research system for large-vocabulary continuous-speech recognition
- Steinbiss, V., Ney, H., Haeb-Umbach, R., Tran, B.-H., Essen, U., Kneser, R., Oerder, M., Meier, H.-G., Aubert, X., Dugast, C., Geller, D., 1993. The Philips research system for large-vocabulary continuous-speech recognition. In: Proc. EUROSPEECH '93, Berlin, pp. 2125-2128.
- (1993) Proc. EUROSPEECH '93, Berlin , pp. 2125-2128
- Steinbiss, V.¹ Ney, H.² Haeb-Umbach, R.³ Tran, B.-H.⁴ Essen, U.⁵ Kneser, R.⁶ Oerder, M.⁷ Meier, H.-G.⁸ Aubert, X.⁹ Dugast, C.¹⁰ Geller, D.¹¹

26
- 0033335618
- Modeling pronunciation variation for ASR: A survey of the literature
- Strik, H., Cucchiarini, C., 1999. Modeling pronunciation variation for ASR: a survey of the literature. Speech Commun. 29, 225-246.
- (1999) Speech Commun. , vol.29 , pp. 225-246
- Strik, H.¹ Cucchiarini, C.²

27
- 0031364891
- A spoken dialogue system for the Dutch public transport information service
- Strik, H., Russel, A., van den Heuvel, H., Cucchiarini, C., Boves, L., 1997. A spoken dialogue system for the Dutch public transport information service. Int. J. Speech Technol. 2 (2), 119-129.
- (1997) Int. J. Speech Technol. , vol.2 , Issue.2 , pp. 119-129
- Strik, H.¹ Russel, A.² Van Den Heuvel, H.³ Cucchiarini, C.⁴ Boves, L.⁵

28
- 0030672090
- Automatic alternative transcription generation and vocabulary selection for flexible word recognizers
- Torre, D., Villarrubia, L., Hernández, L., Elvira, J., 1996. Automatic alternative transcription generation and vocabulary selection for flexible word recognizers. In: Proc. ICASSP '96, Munich, pp. 1463-1466.
- (1996) Proc. ICASSP '96, Munich , pp. 1463-1466
- Torre, D.¹ Villarrubia, L.² Hernández, L.³ Elvira, J.⁴

29
- 0033096914
- Acoustic characteristics of lexical stress in continuous telephone speech
- van Kuijk, D., Boves, L., 1999. Acoustic characteristics of lexical stress in continuous telephone speech. Speech Commun. 27, 95-111.
- (1999) Speech Commun. , vol.27 , pp. 95-111
- Van Kuijk, D.¹ Boves, L.²

30
- 84968911025
- A comparison of data-derived and knowledge-based modeling of pronunciation variation
- Wester, M., Fosler-Lussier, E., 2000. A comparison of data-derived and knowledge-based modeling of pronunciation variation. In: Proc. ICSLP '00, Beijing, vol. I, pp. 270-273.
- (2000) Proc. ICSLP '00, Beijing , vol.1 , pp. 270-273
- Wester, M.¹ Fosler-Lussier, E.²

31
- 85009084203
- Pronunciation variation in ASR: Which variation to model?
- Wester, M., Kessens, J., Strik, H., 2000. Pronunciation variation in ASR: Which variation to model? In: Proc. ICSLP '00, Beijing, vol. IV, pp. 488-491.
- (2000) Proc. ICSLP '00, Beijing , vol.4 , pp. 488-491
- Wester, M.¹ Kessens, J.² Strik, H.³

32
- 0006274455
- Confidence measures for evaluating pronunciation models
- Williams, G., Renals, S., 1998. Confidence measures for evaluating pronunciation models. In: Proc. ESCA Workshop Model. Pronunciation Variation Autom. Speech Recognit., Kerkrade, pp. 151-155.
- (1998) Proc. ESCA Workshop Model. Pronunciation Variation Autom. Speech Recognit., Kerkrade , pp. 151-155
- Williams, G.¹ Renals, S.²

33
- 0347960618
- Dynamic and static improvements to lexical baseforms
- Wiseman, R., Downey, S., 1998. Dynamic and static improvements to lexical baseforms. In: Proc. ESCA Workshop Model. Pronunciation Variation Autom. Speech Recognit., Kerkrade, pp. 157-162.
- (1998) Proc. ESCA Workshop Model. Pronunciation Variation Autom. Speech Recognit., Kerkrade , pp. 157-162
- Wiseman, R.¹ Downey, S.²

34
- 0003957032
- Data mining, practical machine learning tools and techniques with Java implementations
- Morgan Kaufmann Publishers, Los Altos, CA
- Witten, I., Frank, E., 2000. Data Mining, Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann Publishers, Los Altos, CA.
- (2000)
- Witten, I.¹ Frank, E.²

35
- 0038810063
- On the importance of exception and cross-word rules for the data-driven creation of lexica for ASR
- Yang, Q., Martens, J.-P., 2000. On the importance of exception and cross-word rules for the data-driven creation of lexica for ASR. In: Proc. 11 ProRisc Workshop, Veldhoven, The Netherlands, pp. 589-593.
- (2000) Proc. 11 ProRisc Workshop, Veldhoven, The Netherlands , pp. 589-593
- Yang, Q.¹ Martens, J.-P.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.