SCOPUS 정보 검색 플랫폼

Volumn 49, Issue 1, 2007, Pages 59-70

Acoustic model adaptation based on pronunciation variability analysis for non-native speech recognition

(3) Oh, Yoo Rhee a Yoon, Jae Sam a Kim, Hong Kook a

a Gwangju Institute of Science and Technology (GIST) (South Korea)

Author keywords

Acoustic model adaptation; Data driven pronunciation variability; Decision tree; Knowledge based pronunciation variability; Non native speech; Speech recognition; State clustering; State tying

Indexed keywords

ACOUSTIC PROPERTIES; DATA STORAGE EQUIPMENT; DECISION THEORY; KNOWLEDGE BASED SYSTEMS; MATHEMATICAL MODELS; SPEECH ANALYSIS;

ACOUSTIC MODEL ADAPTATION; DATA-DRIVEN PRONUNCIATION VARIABILITY; DECISION TREE; KNOWLEDGE-BASED PRONUNCIATION VARIABILITY; NON-NATIVE SPEECH; STATE-CLUSTERING; STATE-TYING;

SPEECH RECOGNITION;

EID: 33845875676 PISSN: 01676393 EISSN: None Source Type: Journal
DOI: 10.1016/j.specom.2006.10.006 Document Type: Article

Times cited : (26)

References (19)

1
- 33845908073
- Binder, N., Gruhn, R., Nakamura, S., 2002. Recognition of non-native speech using dynamic phoneme lattice processing. In: Proc. Spring Meeting of the Acoustical Society of Japan, Yokohama, Japan, pp. 203-204.

2
- 0035427204
- Recognizing speech of goats, wolves, sheep and ... non-natives
- Compernolle D.V. Recognizing speech of goats, wolves, sheep and ... non-natives. Speech Comm. 35 (2001) 71-79
- (2001) Speech Comm. , vol.35 , pp. 71-79
- Compernolle, D.V.¹

3
- 85009143806
- Gruhn, R., Markov, K., Nakamura, S., 2004. A statistical lexicon for non-native speech recognition. In: Proc. ICSLP, Jeju Island, Korea, pp. 1497-1500.

4
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
- Leggetter C.J., and Woodland P.C. Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models. Comput. Speech Lang. 9 2 (1995) 171-185
- (1995) Comput. Speech Lang. , vol.9 , Issue.2 , pp. 171-185
- Leggetter, C.J.¹ Woodland, P.C.²

5
- 0141591508
- Matsunaga, S., Ogawa, A., Yamaguchi, Y., Imamura, A., 2003. Non-native English speech recognition using bilingual English lexicon and acoustic models. In: Proc. ICASSP, Hong Kong, China, pp. 340-343.

6
- 33845893472
- Morgan, J., 2004. Making a speech recognizer tolerate non-native speech through Gaussian mixture merging. In: Proc. InSTIL/ICALL Symposium on Computer-Assisted Language Learning, Venice, Italy, pp. 213-216.

7
- 33845872937
- Paul, D., Baker, J., 1992. The design for the Wall Street Journal-based CSR corpus. In: Proc. DARPA Speech and Language Workshop, Arden House, NY, pp. 357-362.

8
- 33947614696
- Rhee, S.-C., Lee, S.-H., Kang, S.-K., Lee, Y.-J., 2004. Design and construction of Korean-Spoken English Corpus (K-SEC). In: Proc. ICSLP, Jeju Island, Korea, pp. 2769-2772.

9
- 33845885255
- A comparison of English and Korean for teaching English consonants in the Korea KSL class
- Ryu S.Y. A comparison of English and Korean for teaching English consonants in the Korea KSL class. Jungang J. English Literature Linguist. 35 (1994) 145-160
- (1994) Jungang J. English Literature Linguist. , vol.35 , pp. 145-160
- Ryu, S.Y.¹

10
- 85009080645
- Steidl, S., Stemmer, G., Hacker, C., Noth, E., 2004. Adaptation in the pronunciation space for non-native speech recognition. In: Proc. ICSLP, Jeju Island, Korea, pp. 2901-2904.

11
- 0033335618
- Modeling pronunciation variation for ASR: a survey of the literature
- Strik H., and Cucchiarini C. Modeling pronunciation variation for ASR: a survey of the literature. Speech Comm. 29 (1999) 225-246
- (1999) Speech Comm. , vol.29 , pp. 225-246
- Strik, H.¹ Cucchiarini, C.²

12
- 85009094256
- Tomokiyo, L.M., 2000. Lexical and acoustic modeling of non-native speech in LVCSR. In: Proc. ICSLP, Beijing, China, pp. 346-349.

13
- 85009216453
- Wang, Z., Schultz, T., 2003. Non-native spontaneous speech recognition through polyphone decision tree specialization. In: Proc. EUROSPEECH, Geneva, Switzerland, pp. 1449-1452.

14
- 33845889879
- Weide, H., 1998. The CMU Pronunciation Dictionary, release 0.6, Carnegie Mellon University.

15
- 28044453682
- A survey of the Korean learners' problems in mastering English pronunciation
- Youe H.-M. A survey of the Korean learners' problems in mastering English pronunciation. Malsori 42 (2001) 47-56
- (2001) Malsori , vol.42 , pp. 47-56
- Youe, H.-M.¹

16
- 33845876468
- Young, S. et al., 2002. The HTK Book (for HTK Version 3.2), Microsoft Corporation, Cambridge University Engineering Department.

17
- 33845913368
- Young, S., Odell, J., Woodland, P., 1994. Tree-based state tying for high accuracy acoustic modeling. In: Proc. ARPA Human Language Technology Workshop, Princeton, NJ, pp. 307-312.

18
- 33845880258
- An analysis of English vowels in the middle school textbooks: in comparison with the Korean vowels
- Yun H.S. An analysis of English vowels in the middle school textbooks: in comparison with the Korean vowels. J. English Lang. Literature 47 2 (2005) 307-328
- (2005) J. English Lang. Literature , vol.47 , Issue.2 , pp. 307-328
- Yun, H.S.¹

19
- 0029745232
- Zavagliakos, G., Schwartz, R., McDonough, J., 1996. Maximum a posteriori adaptation for large scale HMM recognizers. In: Proc. ICASSP, Atlanta, GA, pp. 725-728.

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.