Volume 17, Issue 1, 2003, Pages 69-85

Pronunciation modeling for ASR - Knowledge-based and data-derived methods

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATIONAL GRAMMARS; DECISION THEORY; ERROR ANALYSIS; KNOWLEDGE BASED SYSTEMS; MARKOV PROCESSES; MATHEMATICAL MODELS; NEURAL NETWORKS; PATTERN RECOGNITION SYSTEMS; PROBABILITY

EID: 0038141324     PISSN: 08852308     EISSN: None     Source Type: Journal    
DOI: 10.1016/S0885-2308(02)00030-X     Document Type: Article
Times cited: (45)

References (35)
  • 1
    • 85009062592
    • Joint pronunciation modelling of non-native speakers using data-driven methods
    • Amdal, I., Korkmazskiy, F., Surendran, A., 2000. Joint pronunciation modelling of non-native speakers using data-driven methods. In: Proc. ICSLP '00, Beijing, vol. III, pp. 622-625.
    • (2000) Proc. ICSLP '00, Beijing , vol.3 , pp. 622-625
    • Amdal, I.1    Korkmazskiy, F.2    Surendran, A.3
  • 2
    • 0038582437
    • De CELEX lexicale databank
    • Baayen, H., 1991. De CELEX lexicale databank. Forum Lett. 32 (3), 221-231.
    • (1991) Forum Lett. , vol.32 , Issue.3 , pp. 221-231
    • Baayen, H.1
  • 3
    • 0003573244
    • Connectionist speech recognition: A hybrid approach
    • Kluwer Academic Publishers, Dordrecht
    • Bourlard, H., Morgan, N., 1993. Connectionist Speech Recognition: A Hybrid Approach. Kluwer Academic Publishers, Dordrecht.
    • (1993)
    • Bourlard, H.1    Morgan, N.2
  • 4
    • 0003721728
    • Phonological structures for speech recognition
    • Ph.D. thesis, University of California, Berkeley, CA
    • Cohen, M., 1989. Phonological structures for speech recognition. Ph.D. thesis, University of California, Berkeley, CA.
    • (1989)
    • Cohen, M.1
  • 5
    • 0033329496
    • In search of better pronunciation models for speech recognition
    • Cremelie, N., Martens, J.-P., 1999. In search of better pronunciation models for speech recognition. Speech Commun. 29, 115-136.
    • (1999) Speech Commun. , vol.29 , pp. 115-136
    • Cremelie, N.1    Martens, J.-P.2
  • 6
    • 0033185227
    • Improving continuous speech recognition in Spanish by phone-class semicontinuous HMMs with pausing and multiple pronunciations
    • Ferreiros, J., Pardo, J., 1999. Improving continuous speech recognition in Spanish by phone-class semicontinuous HMMs with pausing and multiple pronunciations. Speech Commun. 29, 65-76.
    • (1999) Speech Commun. , vol.29 , pp. 65-76
    • Ferreiros, J.1    Pardo, J.2
  • 7
    • 0037568163
    • Modelling pronunciation variability for special domains
    • Flach, G., 1995. Modelling pronunciation variability for special domains. In: Proc. EUROSPEECH '95, Madrid, pp. 1743-1746.
    • (1995) Proc. EUROSPEECH '95, Madrid , pp. 1743-1746
    • Flach, G.1
  • 8
    • 0004119132
    • Dynamic pronunciation models for automatic speech recognition
    • Ph.D. thesis, University of California, Berkeley, CA
    • Fosler-Lussier, E., 1999. Dynamic pronunciation models for automatic speech recognition. Ph.D. thesis, University of California, Berkeley, CA.
    • (1999)
    • Fosler-Lussier, E.1
  • 9
    • 0037906252
    • Not just what, but also when: Guided automatic pronunciation modeling for broadcast news
    • Fosler-Lussier, E., Williams, G., 1999. Not just what, but also when: guided automatic pronunciation modeling for Broadcast News. In: DARPA Broadcast News Workshop, Herndon, VA, pp. 171-174.
    • (1999) DARPA Broadcast News Workshop, Herndon, VA , pp. 171-174
    • Fosler-Lussier, E.1    Williams, G.2
  • 10
    • 0033357399
    • Speaking in shorthand - A syllable-centric perspective for understanding pronunciation variation
    • Greenberg, S., 1999. Speaking in shorthand - a syllable-centric perspective for understanding pronunciation variation. Speech Commun. 29, 159-176.
    • (1999) Speech Commun. , vol.29 , pp. 159-176
    • Greenberg, S.1
  • 11
    • 0033335617
    • Maximum likelihood modeling of pronunciation variation
    • Holter, T., Svendsen, T., 1999. Maximum likelihood modeling of pronunciation variation. Speech Commun. 29, 177-191.
    • (1999) Speech Commun. , vol.29 , pp. 177-191
    • Holter, T.1    Svendsen, T.2
  • 12
    • 0037795515
    • Prosody in NIROS with FONPARS and ALFEIOS
    • In: de Haan, P., Oostdijk, N. (Eds.)
    • Kerkhoff, J., Rietveld, T., 1994. Prosody in NIROS with FONPARS and ALFEIOS. In: de Haan, P., Oostdijk, N. (Eds.), Proc. Dept. Lang. Speech, Univ. Nijmegen, vol. 18, pp. 107-119.
    • (1994) Proc. Dept. Lang. Speech, Univ. Nijmegen , vol.18 , pp. 107-119
    • Kerkhoff, J.1    Rietveld, T.2
  • 13
    • 0038243730
    • A data-driven method for modeling pronunciation variation
    • (submitted)
    • Kessens, J., Cucchiarini, C., Strik, H., 2001. A data-driven method for modeling pronunciation variation. Speech Commun. (submitted).
    • (2001) Speech Commun.
    • Kessens, J.1    Cucchiarini, C.2    Strik, H.3
  • 14
    • 0033318198
    • Improving the performance of a Dutch CSR by modeling within-word and cross-word pronunciation variation
    • Kessens, J., Wester, M., Strik, H., 1999. Improving the performance of a Dutch CSR by modeling within-word and cross-word pronunciation variation. Speech Commun. 29, 193-207.
    • (1999) Speech Commun. , vol.29 , pp. 193-207
    • Kessens, J.1    Wester, M.2    Strik, H.3
  • 15
    • 0030351374
    • On designing pronunciation lexicons for large vocabulary, continuous speech recognition
    • Lamel, L., Adda, G., 1996. On designing pronunciation lexicons for large vocabulary, continuous speech recognition. In: Proc. ICSLP '96, Philadelphia, PA, pp. 6-9.
    • (1996) Proc. ICSLP '96, Philadelphia, PA , pp. 6-9
    • Lamel, L.1    Adda, G.2
  • 16
    • 23144431835
    • Pronunciation modeling for large vocabulary conversational speech recognition
    • Ma, K., Zavaliagkos, G., Iyer, R., 1998. Pronunciation modeling for large vocabulary conversational speech recognition. In: Proc. ICSLP '98, Sydney, pp. 2455-2458.
    • (1998) Proc. ICSLP '98, Sydney , pp. 2455-2458
    • Ma, K.1    Zavaliagkos, G.2    Iyer, R.3
  • 17
    • 84943154470
    • Fabricating conversational speech data with acoustic models: A program to examine model-data mismatch
    • McAllaster, D., Gillick, L., Scattone, F., Newman, M., 1998. Fabricating conversational speech data with acoustic models: a program to examine model-data mismatch. In: Proc. ICSLP '98, Sydney, pp. 1847-1850.
    • (1998) Proc. ICSLP '98, Sydney , pp. 1847-1850
    • McAllaster, D.1    Gillick, L.2    Scattone, F.3    Newman, M.4
  • 18
    • 0038243729
    • The use of lexicons in text-to-speech-systems
    • In: van Eynde, F., Gibbon, D. (Eds.); Kluwer Academic Publishers, Dordrecht; (Chapter 7)
    • Quazza, S., van den Heuvel, H., 2000. The use of lexicons in text-to-speech-systems. In: van Eynde, F., Gibbon, D. (Eds.), Lexicon Development for Speech and Language Processing. Kluwer Academic Publishers, Dordrecht, pp. 207-233 (Chapter 7).
    • (2000) Lexicon Development for Speech and Language Processing , pp. 207-233
    • Quazza, S.1    Van Den Heuvel, H.2
  • 20
    • 0003921935
    • Automatic generation of detailed pronunciation lexicons
    • In: Lee, C.-H., Soong, F., Paliwal, K. (Eds.); Kluwer Academic Publishers, Dordrecht; Chapter 12
    • Riley, M., Ljolje, A., 1996. Automatic generation of detailed pronunciation lexicons. In: Lee, C.-H., Soong, F., Paliwal, K. (Eds.), Automatic Speech and Speaker Recognition: Advanced Topics. Kluwer Academic Publishers, Dordrecht, pp. 285-302, Chapter 12.
    • (1996) Automatic Speech and Speaker Recognition: Advanced Topics , pp. 285-302
    • Riley, M.1    Ljolje, A.2
  • 23
    • 0000114416
    • Pronunciation modeling by sharing Gaussian densities across phonetic models
    • Saraçlar, M., Nock, H., Khudanpur, S., 2000. Pronunciation modeling by sharing Gaussian densities across phonetic models. Comput. Speech Lang. 14, 137-160.
    • (2000) Comput. Speech Lang. , vol.14 , pp. 137-160
    • Saraçlar, M.1    Nock, H.2    Khudanpur, S.3
  • 24
    • 0030363039
    • Dictionary learning for spontaneous speech recognition
    • Sloboda, T., Waibel, A., 1996. Dictionary learning for spontaneous speech recognition. In: Proc. ICSLP '96, Philadelphia, PA, pp. 2328-2331.
    • (1996) Proc. ICSLP '96, Philadelphia, PA , pp. 2328-2331
    • Sloboda, T.1    Waibel, A.2
  • 26
    • 0033335618
    • Modeling pronunciation variation for ASR: A survey of the literature
    • Strik, H., Cucchiarini, C., 1999. Modeling pronunciation variation for ASR: a survey of the literature. Speech Commun. 29, 225-246.
    • (1999) Speech Commun. , vol.29 , pp. 225-246
    • Strik, H.1    Cucchiarini, C.2
  • 28
    • 0030672090
    • Automatic alternative transcription generation and vocabulary selection for flexible word recognizers
    • Torre, D., Villarrubia, L., Hernández, L., Elvira, J., 1996. Automatic alternative transcription generation and vocabulary selection for flexible word recognizers. In: Proc. ICASSP '96, Munich, pp. 1463-1466.
    • (1996) Proc. ICASSP '96, Munich , pp. 1463-1466
    • Torre, D.1    Villarrubia, L.2    Hernández, L.3    Elvira, J.4
  • 29
    • 0033096914
    • Acoustic characteristics of lexical stress in continuous telephone speech
    • van Kuijk, D., Boves, L., 1999. Acoustic characteristics of lexical stress in continuous telephone speech. Speech Commun. 27, 95-111.
    • (1999) Speech Commun. , vol.27 , pp. 95-111
    • Van Kuijk, D.1    Boves, L.2
  • 30
    • 84968911025
    • A comparison of data-derived and knowledge-based modeling of pronunciation variation
    • Wester, M., Fosler-Lussier, E., 2000. A comparison of data-derived and knowledge-based modeling of pronunciation variation. In: Proc. ICSLP '00, Beijing, vol. I, pp. 270-273.
    • (2000) Proc. ICSLP '00, Beijing , vol.1 , pp. 270-273
    • Wester, M.1    Fosler-Lussier, E.2
  • 31
    • 85009084203
    • Pronunciation variation in ASR: Which variation to model?
    • Wester, M., Kessens, J., Strik, H., 2000. Pronunciation variation in ASR: Which variation to model? In: Proc. ICSLP '00, Beijing, vol. IV, pp. 488-491.
    • (2000) Proc. ICSLP '00, Beijing , vol.4 , pp. 488-491
    • Wester, M.1    Kessens, J.2    Strik, H.3
  • 34
    • 0003957032
    • Data mining, practical machine learning tools and techniques with java implementations
    • Morgan Kaufmann Publishers, Los Altos, CA
    • Witten, I., Frank, E., 2000. Data Mining, Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann Publishers, Los Altos, CA.
    • (2000)
    • Witten, I.1    Frank, E.2
  • 35
    • 0038810063
    • On the importance of exception and cross-word rules for the data-driven creation of lexica for ASR
    • Yang, Q., Martens, J.-P., 2000. On the importance of exception and cross-word rules for the data-driven creation of lexica for ASR. In: Proc. 11 ProRisc Workshop, Veldhoven, The Netherlands, pp. 589-593.
    • (2000) Proc. 11 ProRisc Workshop, Veldhoven, The Netherlands , pp. 589-593
    • Yang, Q.1    Martens, J.-P.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.