메뉴 건너뛰기




Volumn 21, Issue 2, 2013, Pages 357-366

Learning lexicons from speech using a pronunciation mixture model

Author keywords

Baseform generation; dictionary training with acoustics via EM; pronunciation learning; stochastic lexicon

Indexed keywords

ACHILLES HEEL; AUTOMATIC SPEECH RECOGNIZERS; BASEFORM GENERATION; BASEFORM PRONUNCIATION; CONTINUOUS SPEECH; HIGH QUALITY; LANGUAGE MODEL; MANUAL INTERVENTION; MIXTURE MODEL; PARAMETER SETTING; PRONUNCIATION LEARNING; SPEECH DATA; STOCHASTIC LEXICON; TRAINING DATA; WEATHER INFORMATION;

EID: 84871369973     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2012.2226158     Document Type: Article
Times cited : (40)

References (34)
  • 1
    • 79959854710 scopus 로고    scopus 로고
    • Learning new word pronunciations from spoken examples
    • I. Badr, I. McGraw, and J. R. Glass, "Learning new word pronunciations from spoken examples, " in Proc. INTERSPEECH, 2010, pp. 2294-2297.
    • (2010) Proc. INTERSPEECH , pp. 2294-2297
    • Badr, I.1    McGraw, I.2    Glass, J.R.3
  • 2
    • 84865763465 scopus 로고    scopus 로고
    • Pronunciation learning from continuous speech
    • I. Badr, I. McGraw, and J. R. Glass, "Pronunciation learning from continuous speech, " in Proc. INTERSPEECH, 2011, pp. 549-552.
    • (2011) Proc. INTERSPEECH , pp. 549-552
    • Badr, I.1    McGraw, I.2    Glass, J.R.3
  • 3
    • 41049105254 scopus 로고    scopus 로고
    • Joint-sequence models for grapheme-tophoneme conversion
    • May
    • M. Bisani and H. Ney, "Joint-sequence models for grapheme-tophoneme conversion, " Speech Commun., vol. 50, no. 5, pp. 434-451, May 2008.
    • (2008) Speech Commun , vol.50 , Issue.5 , pp. 434-451
    • Bisani, M.1    Ney, H.2
  • 4
    • 0002629270 scopus 로고
    • Maximum likelihood from incomplete data via the em algorithm
    • A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm, " J. R. Statist. Soc., Ser. B, vol. 39, no. 1, pp. 1-38, 1977.
    • (1977) J. R. Statist. Soc., Ser. B , vol.39 , Issue.1 , pp. 1-38
    • Dempster, A.P.1    Laird, N.M.2    Rubin, D.B.3
  • 6
    • 0038359548 scopus 로고    scopus 로고
    • A probabilistic framework for segment-based speech recognition
    • J. R. Glass, "A probabilistic framework for segment-based speech recognition, " Comput. Speech Lang., vol. 17, no. 2-3, pp. 137-152, 2003.
    • (2003) Comput. Speech Lang , vol.17 , Issue.2-3 , pp. 137-152
    • Glass, J.R.1
  • 7
    • 85009152019 scopus 로고    scopus 로고
    • The MIT finite-state transducer toolkit for speech and language processing
    • I. L. Hetherington, "The MIT finite-state transducer toolkit for speech and language processing, " in Proc. INTERSPEECH, 2004, pp. 2609-2612.
    • (2004) Proc. INTERSPEECH , pp. 2609-2612
    • Hetherington, I.L.1
  • 8
    • 85009074656 scopus 로고    scopus 로고
    • An efficient implementation of phonological rules using finite-state transducers
    • I. L. Hetherington, "An efficient implementation of phonological rules using finite-state transducers, " in Proc. EuroSpeech, 2001, pp. 1599-1602.
    • (2001) Proc. EuroSpeech , pp. 1599-1602
    • Hetherington, I.L.1
  • 9
    • 19944423811 scopus 로고    scopus 로고
    • Pronunciation modeling using a finite-state transducer representation
    • DOI 10.1016/j.specom.2005.03.004, PII S0167639305000361, Pronunciation Modeling and Lexicon Adaptation
    • T. J. Hazen, I. L. Hetherington, H. Shu, and K. Livescu, "Pronunciation modeling using a finite-state transducer representation, " Speech Commun., vol. 46, no. 2, pp. 189-203, 2005. (Pubitemid 40753202)
    • (2005) Speech Communication , vol.46 , Issue.2 , pp. 189-203
    • Hazen, T.J.1    Hetherington, I.L.2    Shu, H.3    Livescu, K.4
  • 10
    • 0033335618 scopus 로고    scopus 로고
    • Modeling pronunciation variation for ASR: A survey of the literature
    • DOI 10.1016/S0167-6393(99)00038-2
    • H. Strik and C. Cucchiarini, "Modeling pronunciation variation for ASR: A survey of the literature, " Speech Commun., vol. 29, no. 2-4, pp. 225-246, 1999. (Pubitemid 30514833)
    • (1999) Speech Communication , vol.29 , Issue.2 , pp. 225-246
    • Strik, H.1    Cucchiarini, C.2
  • 12
    • 0035278951 scopus 로고    scopus 로고
    • Confidence measures for large vocabulary continuous speech recognition
    • DOI 10.1109/89.906002, PII S1063667601013281
    • F. Wessel, R. Schlüter, K. Macherey, and H. Ney, "Confidence measures for large vocabulary continuous speech recognition, " IEEE Trans. Speech Audio Process., vol. 9, no. 3, pp. 288-298, Mar. 2001. (Pubitemid 32286598)
    • (2001) IEEE Transactions on Speech and Audio Processing , vol.9 , Issue.3 , pp. 288-298
    • Wessel, F.1    Schluter, R.2    Macherey, K.3    Ney, H.4
  • 13
    • 33745202406 scopus 로고    scopus 로고
    • Open vocabulary speech recognition with flat hybrid models
    • 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
    • M. Bisani and H. Ney, "Open vocabulary speech recognition with flat hybrid models, " in Proc. INTERSPEECH, 2005, pp. 725-728. (Pubitemid 43908165)
    • (2005) 9th European Conference on Speech Communication and Technology , pp. 725-728
    • Bisani, M.1    Ney, H.2
  • 14
    • 85009227369 scopus 로고    scopus 로고
    • Conditional and joint models for grapheme-to-phoneme conversion
    • S. F. Chen, "Conditional and joint models for grapheme-to-phoneme conversion, " in Proc. INTERSPEECH, 2003, pp. 2033-2036.
    • (2003) Proc. INTERSPEECH , pp. 2033-2036
    • Chen, S.F.1
  • 15
    • 84878203695 scopus 로고
    • Regular models of phonological rule systems
    • R. M. Kaplan and M. Kay, "Regular models of phonological rule systems, " Comput. Linguist., vol. 20, pp. 331-378, 1994.
    • (1994) Comput. Linguist , vol.20 , pp. 331-378
    • Kaplan, R.M.1    Kay, M.2
  • 17
    • 0039255896 scopus 로고    scopus 로고
    • A multi-strategy approach to improving pronunciation by analogy
    • Y. Marchand and R. I. Damper, "A multi-strategy approach to improving pronunciation by analogy, " Comput. Linguist., vol. 26, pp. 195-219, 2000.
    • (2000) Comput. Linguist , vol.26 , pp. 195-219
    • Marchand, Y.1    Damper, R.I.2
  • 18
    • 19944409831 scopus 로고    scopus 로고
    • Unsupervised, language-independent grapheme-to-phoneme conversion by latent analogy
    • DOI 10.1016/j.specom.2005.03.002, PII S0167639305000336, Pronunciation Modeling and Lexicon Adaptation
    • J. Bellegarda, "Unsupervised, language-independent grapheme-tophoneme conversion by latent analogy, " Speech Commun., vol. 46, no. 2, pp. 140-152, 2005. (Pubitemid 40753199)
    • (2005) Speech Communication , vol.46 , Issue.2 , pp. 140-152
    • Bellegarda, J.R.1
  • 19
    • 70450194704 scopus 로고    scopus 로고
    • Grapheme to phoneme conversion using an SMT system
    • A. Laurent, P. Delglise, and S. Meignier, "Grapheme to phoneme conversion using an SMT system, " in Proc. INTERSPEECH, 2009, pp. 708-711.
    • (2009) Proc. INTERSPEECH , pp. 708-711
    • Laurent, A.1    Delglise, P.2    Meignier, S.3
  • 20
    • 70450186703 scopus 로고    scopus 로고
    • Online discriminative training for grapheme-to-phoneme conversion
    • S. Jiampojamarn and G. Kondrak, "Online discriminative training for grapheme-to-phoneme conversion, " in Proc. INTERSPEECH, 2009, pp. 1303-1306.
    • (2009) Proc. INTERSPEECH , pp. 1303-1306
    • Jiampojamarn, S.1    Kondrak, G.2
  • 22
    • 18244423993 scopus 로고    scopus 로고
    • Assessing text-to-phoneme mapping strategies in speaker independent isolated word recognition
    • J. Häkkinen, J. Suontausta, S. Riis, and K. J. Jensen, "Assessing text-to-phoneme mapping strategies in speaker independent isolated word recognition, " Speech Commun., vol. 41, no. 2-3, pp. 455-467, 2003.
    • (2003) Speech Commun , vol.41 , Issue.2-3 , pp. 455-467
    • Häkkinen, J.1    Suontausta, J.2    Riis, S.3    Jensen, K.J.4
  • 25
    • 84956975318 scopus 로고    scopus 로고
    • Automatic baseform generation from acoustic data
    • B. Maison, "Automatic baseform generation from acoustic data, " in Proc. INTERSPEECH, 2003, pp. 2545-2548.
    • (2003) Proc. INTERSPEECH , pp. 2545-2548
    • Maison, B.1
  • 29
    • 70349209414 scopus 로고    scopus 로고
    • Discriminative pronounciation learning using phonetic decoder and minimum-classification-error criterion
    • O. Vinyals, L. Deng, D. Yu, and A. Acero, "Discriminative pronounciation learning using phonetic decoder and minimum-classification-error criterion, " in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2009, pp. 4445-4448.
    • (2009) Proc IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP , pp. 4445-4448
    • Vinyals, O.1    Deng, L.2    Yu, D.3    Acero, A.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.