메뉴 건너뛰기




Volumn 49, Issue 1, 2007, Pages 59-70

Acoustic model adaptation based on pronunciation variability analysis for non-native speech recognition

Author keywords

Acoustic model adaptation; Data driven pronunciation variability; Decision tree; Knowledge based pronunciation variability; Non native speech; Speech recognition; State clustering; State tying

Indexed keywords

ACOUSTIC PROPERTIES; DATA STORAGE EQUIPMENT; DECISION THEORY; KNOWLEDGE BASED SYSTEMS; MATHEMATICAL MODELS; SPEECH ANALYSIS;

EID: 33845875676     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2006.10.006     Document Type: Article
Times cited : (26)

References (19)
  • 1
    • 33845908073 scopus 로고    scopus 로고
    • Binder, N., Gruhn, R., Nakamura, S., 2002. Recognition of non-native speech using dynamic phoneme lattice processing. In: Proc. Spring Meeting of the Acoustical Society of Japan, Yokohama, Japan, pp. 203-204.
  • 2
    • 0035427204 scopus 로고    scopus 로고
    • Recognizing speech of goats, wolves, sheep and ... non-natives
    • Compernolle D.V. Recognizing speech of goats, wolves, sheep and ... non-natives. Speech Comm. 35 (2001) 71-79
    • (2001) Speech Comm. , vol.35 , pp. 71-79
    • Compernolle, D.V.1
  • 3
    • 85009143806 scopus 로고    scopus 로고
    • Gruhn, R., Markov, K., Nakamura, S., 2004. A statistical lexicon for non-native speech recognition. In: Proc. ICSLP, Jeju Island, Korea, pp. 1497-1500.
  • 4
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • Leggetter C.J., and Woodland P.C. Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models. Comput. Speech Lang. 9 2 (1995) 171-185
    • (1995) Comput. Speech Lang. , vol.9 , Issue.2 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 5
    • 0141591508 scopus 로고    scopus 로고
    • Matsunaga, S., Ogawa, A., Yamaguchi, Y., Imamura, A., 2003. Non-native English speech recognition using bilingual English lexicon and acoustic models. In: Proc. ICASSP, Hong Kong, China, pp. 340-343.
  • 6
    • 33845893472 scopus 로고    scopus 로고
    • Morgan, J., 2004. Making a speech recognizer tolerate non-native speech through Gaussian mixture merging. In: Proc. InSTIL/ICALL Symposium on Computer-Assisted Language Learning, Venice, Italy, pp. 213-216.
  • 7
    • 33845872937 scopus 로고    scopus 로고
    • Paul, D., Baker, J., 1992. The design for the Wall Street Journal-based CSR corpus. In: Proc. DARPA Speech and Language Workshop, Arden House, NY, pp. 357-362.
  • 8
    • 33947614696 scopus 로고    scopus 로고
    • Rhee, S.-C., Lee, S.-H., Kang, S.-K., Lee, Y.-J., 2004. Design and construction of Korean-Spoken English Corpus (K-SEC). In: Proc. ICSLP, Jeju Island, Korea, pp. 2769-2772.
  • 9
    • 33845885255 scopus 로고
    • A comparison of English and Korean for teaching English consonants in the Korea KSL class
    • Ryu S.Y. A comparison of English and Korean for teaching English consonants in the Korea KSL class. Jungang J. English Literature Linguist. 35 (1994) 145-160
    • (1994) Jungang J. English Literature Linguist. , vol.35 , pp. 145-160
    • Ryu, S.Y.1
  • 10
    • 85009080645 scopus 로고    scopus 로고
    • Steidl, S., Stemmer, G., Hacker, C., Noth, E., 2004. Adaptation in the pronunciation space for non-native speech recognition. In: Proc. ICSLP, Jeju Island, Korea, pp. 2901-2904.
  • 11
    • 0033335618 scopus 로고    scopus 로고
    • Modeling pronunciation variation for ASR: a survey of the literature
    • Strik H., and Cucchiarini C. Modeling pronunciation variation for ASR: a survey of the literature. Speech Comm. 29 (1999) 225-246
    • (1999) Speech Comm. , vol.29 , pp. 225-246
    • Strik, H.1    Cucchiarini, C.2
  • 12
    • 85009094256 scopus 로고    scopus 로고
    • Tomokiyo, L.M., 2000. Lexical and acoustic modeling of non-native speech in LVCSR. In: Proc. ICSLP, Beijing, China, pp. 346-349.
  • 13
    • 85009216453 scopus 로고    scopus 로고
    • Wang, Z., Schultz, T., 2003. Non-native spontaneous speech recognition through polyphone decision tree specialization. In: Proc. EUROSPEECH, Geneva, Switzerland, pp. 1449-1452.
  • 14
    • 33845889879 scopus 로고    scopus 로고
    • Weide, H., 1998. The CMU Pronunciation Dictionary, release 0.6, Carnegie Mellon University.
  • 15
    • 28044453682 scopus 로고    scopus 로고
    • A survey of the Korean learners' problems in mastering English pronunciation
    • Youe H.-M. A survey of the Korean learners' problems in mastering English pronunciation. Malsori 42 (2001) 47-56
    • (2001) Malsori , vol.42 , pp. 47-56
    • Youe, H.-M.1
  • 16
    • 33845876468 scopus 로고    scopus 로고
    • Young, S. et al., 2002. The HTK Book (for HTK Version 3.2), Microsoft Corporation, Cambridge University Engineering Department.
  • 17
    • 33845913368 scopus 로고    scopus 로고
    • Young, S., Odell, J., Woodland, P., 1994. Tree-based state tying for high accuracy acoustic modeling. In: Proc. ARPA Human Language Technology Workshop, Princeton, NJ, pp. 307-312.
  • 18
    • 33845880258 scopus 로고    scopus 로고
    • An analysis of English vowels in the middle school textbooks: in comparison with the Korean vowels
    • Yun H.S. An analysis of English vowels in the middle school textbooks: in comparison with the Korean vowels. J. English Lang. Literature 47 2 (2005) 307-328
    • (2005) J. English Lang. Literature , vol.47 , Issue.2 , pp. 307-328
    • Yun, H.S.1
  • 19
    • 0029745232 scopus 로고    scopus 로고
    • Zavagliakos, G., Schwartz, R., McDonough, J., 1996. Maximum a posteriori adaptation for large scale HMM recognizers. In: Proc. ICASSP, Atlanta, GA, pp. 725-728.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.