SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 18, Issue 5, 2010, Pages 1030-1040

Developing objective measures of foreign-accent conversion

(2) Felps, Daniel a Gutierrez Osuna, Ricardo a

a TEXAS A AND M UNIVERSITY (United States)

Author keywords

Accent conversion; Foreign accent recognition; Speaker recognition; Voice conversion

Indexed keywords

ACCENTED SPEECH; ACOUSTIC QUALITY; ACOUSTIC VECTORS; AUTOMATIC SPEECH RECOGNIZERS; CEPSTRAL; COMPUTER ASSISTED; CONVERSION METHODS; DEGREE OF CORRELATIONS; LINEAR DISCRIMINANTS; LISTENING TESTS; MATCH SCORE; NARROW BANDS; OBJECTIVE MEASURE; PERCEPTUAL TEST; SINGLE-ENDED; SPEAKER RECOGNITION; SPECTRAL DISTORTIONS; SPEECH QUALITY; SPEECH SIGNALS; SUBJECTIVE RATING; TARGET SPEAKER; VOICE CONVERSION;

SPEECH PROCESSING; WAVELET TRANSFORMS;

SPEECH RECOGNITION;

EID: 77953714655 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2009.2038818 Document Type: Article

Times cited : (29)

References (69)

1
- 84929064443
- Advances in computer-based speech training: Aids for the profoundly hearing impaired
- C. Watson and D. Kewley-Port, "Advances in computer-based speech training: Aids for the profoundly hearing impaired," Volta-Rev., vol.91, pp. 29-45, 1989.
- (1989) Volta-Rev. , vol.91 , pp. 29-45
- Watson, C.¹ Kewley-Port, D.²

2
- 67650668659
- Intonational foreign accent: Speech technology and foreign language teaching
- M. Jilka and G. Möhler, "Intonational foreign accent: Speech technology and foreign language teaching," in Proc. ESCA Workshop Speech Tech. Lang. Learn., 1998, pp. 115-118.
- (1998) Proc. ESCA Workshop Speech Tech. Lang. Learn. , pp. 115-118
- Jilka, M.¹ Möhler, G.²

3
- 0345274547
- Mahwah NJ: Erlbaum
- R. C. Major, Foreign Accent: The Ontogeny and Phylogeny of Second Language Phonology. Mahwah, NJ: Erlbaum, 2001.
- (2001) Foreign Accent: The Ontogeny and Phylogeny of Second Language Phonology
- Major, R.C.¹

4
- 0032026483
- Continuous probabilistic transform for voice conversion
- Mar.
- Y. Stylianou, O. Cappe, and E. Moulines, "Continuous probabilistic transform for voice conversion," IEEE Trans. Speech Audio Process., vol.6, no.2, pp. 131-142, Mar. 1998.
- (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.2 , pp. 131-142
- Stylianou, Y.¹ Cappe, O.² Moulines, E.³

5
- 0034841948
- Design and evaluation of a voice conversion algorithm based on spectral envelope mapping and residual prediction
- Salt Lake City, UT
- A. Kain and M. Macon, "Design and evaluation of a voice conversion algorithm based on spectral envelope mapping and residual prediction," in Proc. ICASSP 2001, Salt Lake City, UT, 2001, pp. 813-816.
- (2001) Proc. ICASSP 2001 , pp. 813-816
- Kain, A.¹ MacOn, M.²

6
- 36949014554
- The effect of listener accent background on accent perception and comprehension
- A. Ikeno and J. H. L. Hansen, "The effect of listener accent background on accent perception and comprehension," EURASIP J. Audio, Speech, Music Process., vol.2007, pp. 1-8, 2007.
- (2007) EURASIP J. Audio, Speech, Music Process. , vol.2007 , pp. 1-8
- Ikeno, A.¹ Hansen, J.H.L.²

7
- 18744409456
- Feedback in computer assisted pronunciation training: Technology push or demand pull?
- A. Neri, C. Cucchiarini, and H. Strik, "Feedback in computer assisted pronunciation training: Technology push or demand pull?," in Proc. CALL Conf., 2002, pp. 179-188.
- (2002) Proc. CALL Conf. , pp. 179-188
- Neri, A.¹ Cucchiarini, C.² Strik, H.³

8
- 67650673714
- Computer assisted pronunciation training: The four 'K's of feedback
- T. K. Hansen, "Computer assisted pronunciation training: The four 'K's of feedback," in Proc. 4rth Int. Conf. Multimedia Inf. Technol. in Education, 2006, pp. 342-346.
- (2006) Proc. 4rth Int. Conf. Multimedia Inf. Technol. in Education , pp. 342-346
- Hansen, T.K.¹

9
- 84979339593
- The role of intonation in foreign accent
- T. van Els and K. de Bot, "The role of intonation in foreign accent," Modern Lang. J., vol.71, pp. 147-155, 1987.
- (1987) Modern Lang. J. , vol.71 , pp. 147-155
- Van Els, T.¹ De Bot, K.²

10
- 36248978278
- Foreign accent
- and Methods, J. G. Carbonell and J. Siekmann, Eds. New York: Springer
- U. Gut, "Foreign accent," in Speaker Classification I: Fundamentals, Features, and Methods, J. G. Carbonell and J. Siekmann, Eds. New York: Springer, 2007, pp. 75-87.
- (2007) Speaker Classification I: Fundamentals, Features , pp. 75-87
- Gut, U.¹

11
- 0030757418
- A study of temporal features and frequency characteristics in American english foreign accent
- L. M. Arslan and J. H. L. Hansen, "A study of temporal features and frequency characteristics in American english foreign accent," JASA, vol.102, pp. 28-40, 1997.
- (1997) JASA , vol.102 , pp. 28-40
- Arslan, L.M.¹ Hansen, J.H.L.²

12
- 84971878476
- Non-segmental factors in foreign accent: Ratings of filtered speech
- M. Munro, "Non-segmental factors in foreign accent: Ratings of filtered speech," Studies in Second Lang. Acquisition, vol.17, pp. 17-34, 1995.
- (1995) Studies in Second Lang. Acquisition , vol.17 , pp. 17-34
- Munro, M.¹

13
- 0029254163
- Non-parametric techniques for pitchscale and time-scale modification of speech
- E. Moulines and J. Laroche, "Non-parametric techniques for pitchscale and time-scale modification of speech," Speech Commun., vol.16, pp. 175-205, 1995.
- (1995) Speech Commun. , vol.16 , pp. 175-205
- Moulines, E.¹ Laroche, J.²

14
- 0030642434
- Effects of temporal correction on intelligibility of foreign-accented English
- K. Tajima, R. Port, and J. Dalby, "Effects of temporal correction on intelligibility of foreign-accented English," J. Phon., vol.25, pp. 1-24, 1997.
- (1997) J. Phon. , vol.25 , pp. 1-24
- Tajima, K.¹ Port, R.² Dalby, J.³

15
- 77953699659
- Towards an automatic foreign accent reduction tool
- K. Cho and J. G. Harris, "Towards an automatic foreign accent reduction tool," in Proc. 3rd Int. Conf. Speech Prosody, 2006.
- (2006) Proc. 3rd Int. Conf. Speech Prosody
- Cho, K.¹ Harris, J.G.²

16
- 0025543906
- Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
- E. Moulines and F. Charpentier, "Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones," Speech Commun., vol.9, pp. 453-467, 1990.
- (1990) Speech Commun. , vol.9 , pp. 453-467
- Moulines, E.¹ Charpentier, F.²

17
- 67650668657
- English speech training using voice conversion
- K. Nagano and K. Ozawa, "English speech training using voice conversion," in Proc. ICSLP, 1990, pp. 1169-1172.
- (1990) Proc. ICSLP , pp. 1169-1172
- Nagano, K.¹ Ozawa, K.²

18
- 67650602764
- Lexical stress training of German compounds for Italian speakers by means of resynthesis and emphasis
- M. P. Bissiri, H. R. Pfitzinger, and H. G. Tillmann, "Lexical stress training of German compounds for Italian speakers by means of resynthesis and emphasis," in Proc 11th Australian Int. Conf. Speech Sci. Technol., 2006, pp. 24-29.
- (2006) Proc 11th Australian Int. Conf. Speech Sci. Technol. , pp. 24-29
- Bissiri, M.P.¹ Pfitzinger, H.R.² Tillmann, H.G.³

19
- 4544353416
- Analysis by synthesis of acoustic correlates of British, Australian and American accents
- Q. Yan, S. Vaseghi, D. Rentzos, and C.-H. Ho, "Analysis by synthesis of acoustic correlates of British, Australian and American accents," in Proc. ICASSP, 2004, pp. 637-640.
- (2004) Proc. ICASSP , pp. 637-640
- Yan, Q.¹ Vaseghi, S.² Rentzos, D.³ Ho, C.-H.⁴

20
- 77956396053
- Perception of foreign accentedness in L2 prosody and segments: L1 Japanese speakers learning L2 French
- T. Kamiyama, "Perception of foreign accentedness in L2 prosody and segments: L1 Japanese speakers learning L2 French," in Proc. Speech Prosody: ISCA, 2004.
- (2004) Proc. Speech Prosody: ISCA
- Kamiyama, T.¹

21
- 0030355972
- The MBROLA project: Towards a set of high-quality speech synthesizers free of use for non-commercial purposes
- T. Dutoit, V. Pagel, N. Pierret, F. Bataille, and O. v. d. Vreken, "The MBROLA project: Towards a set of high-quality speech synthesizers free of use for non-commercial purposes," in Proc. ICSLP, 1996, vol.3, pp. 1393-1396.
- (1996) Proc. ICSLP , vol.3 , pp. 1393-1396
- Dutoit, T.¹ Pagel, V.² Pierret, N.³ Bataille, F.⁴ Vreken, O.V.D.⁵

22
- 0038120523
- Institute of Phonetics Sciences Universiteit van Amsterdam
- P. Boersma and D. Weenink, Praat: Doing Phonetics by Computer (Version 4.5.15) Institute of Phonetics Sciences, Universiteit van Amsterdam, 2007.
- (2007) Praat: Doing Phonetics by Computer (Version 4.5.15)
- Boersma, P.¹ Weenink, D.²

23
- 67650666088
- Spoken language conversion with accent morphing
- M. Huckvale and K. Yanagisawa, "Spoken language conversion with accent morphing," in Proc. ISCA Speech Synth. Workshop, 2007, pp. 64-70.
- (2007) Proc. ISCA Speech Synth. Workshop , pp. 64-70
- Huckvale, M.¹ Yanagisawa, K.²

24
- 67650657780
- Foreign accent conversion in computer assisted pronunciation training
- D. Felps, H. Bortfeld, and R. Gutierrez-Osuna, "Foreign accent conversion in computer assisted pronunciation training," Speech Commun., vol.51, pp. 920-932, 2009.
- (2009) Speech Commun. , vol.51 , pp. 920-932
- Felps, D.¹ Bortfeld, H.² Gutierrez-Osuna, R.³

25
- 84994985638
- Foreign accent, comprehensibility, and intelligibility in the speech of second language learners
- M. Munro and T. Derwing, "Foreign accent, comprehensibility, and intelligibility in the speech of second language learners," Lang. Learn. Technol., vol.45, pp. 73-97, 1995.
- (1995) Lang. Learn. Technol. , vol.45 , pp. 73-97
- Munro, M.¹ Derwing, T.²

26
- 0028736841
- Output-based objective speech quality
- L. Jin and R. Kubichek, "Output-based objective speech quality," in Proc. IEEE Veh. Technol. Conf., 1994, pp. 1719-1723.
- (1994) Proc. IEEE Veh. Technol. Conf. , pp. 1719-1723
- Jin, L.¹ Kubichek, R.²

27
- 39649083007
- P.563 - The ITU-T standard for single-ended speech quality assessment
- Nov.
- L. Malfait, J. Berger, and M. Kastner, "P.563-the ITU-T standard for single-ended speech quality assessment," IEEE Trans. Audio, Speech, Lang. Process., vol.14, no.6, pp. 1924-1934, Nov. 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.6 , pp. 1924-1934
- Malfait, L.¹ Berger, J.² Kastner, M.³

28
- 0034428801
- Non-intrusive speechquality assessment using vocal-tract models
- P. Gray, M. P. Hollier, and R. E. Massara, "Non-intrusive speechquality assessment using vocal-tract models," IEE Proc. Vis, Image, Signal Process., vol.147, pp. 493-501, 2000.
- (2000) IEE Proc. Vis, Image, Signal Process. , vol.147 , pp. 493-501
- Gray, P.¹ Hollier, M.P.² Massara, R.E.³

29
- 4544278506
- Perceptual model for non-intrusive speech quality assessment
- K. Doh-Suk and A. Tarraf, "Perceptual model for non-intrusive speech quality assessment," in Proc. ICASSP, 2004, pp. 1060-1063.
- (2004) Proc. ICASSP , pp. 1060-1063
- Doh-Suk, K.¹ Tarraf, A.²

30
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
- C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol.9, pp. 171-185, 1995.
- (1995) Comput. Speech Lang. , vol.9 , pp. 171-185
- Leggetter, C.J.¹ Woodland, P.C.²

31
- 1842739879
- Accent issues in large vocabulary continuous speech recognition
- C. Huang, T. Chen, and E. Chang, "Accent issues in large vocabulary continuous speech recognition," Int. J. Speech Technol., vol.7, pp. 141-153, 2004.
- (2004) Int. J. Speech Technol. , vol.7 , pp. 141-153
- Huang, C.¹ Chen, T.² Chang, E.³

32
- 85009063603
- ACCDIST: A metric for comparing speakers' accents
- M. Huckvale, "ACCDIST: A metric for comparing speakers' accents," in Proc. ICSLP, 2004.
- (2004) Proc. ICSLP
- Huckvale, M.¹

33
- 64349124465
- Analysis and synthesis of formant spaces of British, Australian, and American accents
- Feb.
- Q. Yan, S. Vaseghi, D. Rentzos, and C. H. Ho, "Analysis and synthesis of formant spaces of British, Australian, and American accents," IEEE Trans. Audio, Speech, Lang. Process., vol.15, no.2, pp. 676-689, Feb. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.2 , pp. 676-689
- Yan, Q.¹ Vaseghi, S.² Rentzos, D.³ Ho, C.H.⁴

34
- 33750965925
- Accent classification in speech
- S. Deshpande, S. Chikkerur, and V. Govindaraju, "Accent classification in speech," in Proc. 4th IEEEWorkshop Autom. Identification Adv. Technol., 2005, pp. 139-143.
- (2005) Proc. 4th IEEEWorkshop Autom. Identification Adv. Technol. , pp. 139-143
- Deshpande, S.¹ Chikkerur, S.² Govindaraju, V.³

35
- 84962855264
- Automatic accent identification using Gaussian mixture models
- T. Chen, C. Huang, E. Chang, and J. Wang, "Automatic accent identification using Gaussian mixture models," in Proc. ASRU, 2001.
- (2001) Proc. ASRU
- Chen, T.¹ Huang, C.² Chang, E.³ Wang, J.⁴

36
- 0030165438
- Language accent classification in American english
- L. M. Arslan and J. H. L. Hansen, "Language accent classification in American english," Speech Commun., vol.18, pp. 353-367, 1996.
- (1996) Speech Commun. , vol.18 , pp. 353-367
- Arslan, L.M.¹ Hansen, J.H.L.²

37
- 0036298772
- A comparative analysis of UK and US English accents in recognition and synthesis
- Q. Yan and S. Vaseghi, "A comparative analysis of UK and US English accents in recognition and synthesis," in Proc. ICASSP, 2002, pp. 413-416.
- (2002) Proc. ICASSP , pp. 413-416
- Yan, Q.¹ Vaseghi, S.²

38
- 0004656024
- An approach to the problem of regional accent in automatic speech recognition
- W. Barry, C. Hoequist, and F. Nolan, "An approach to the problem of regional accent in automatic speech recognition," Comput. Speech, Lang., vol.3, pp. 355-366, 1989.
- (1989) Comput. Speech, Lang. , vol.3 , pp. 355-366
- Barry, W.¹ Hoequist, C.² Nolan, F.³

39
- 85009114400
- Visualization of pronunciation habits based upon abstract representation of acoustic observations
- N. Minematsu and S. Nakagawa, "Visualization of pronunciation habits based upon abstract representation of acoustic observations," in Proc. Integration Speech Technol. Into Learn., 2000, pp. 130-137.
- (2000) Proc. Integration Speech Technol. into Learn. , pp. 130-137
- Minematsu, N.¹ Nakagawa, S.²

40
- 0026386218
- Acoustic parameters of voice individuality and voice-quality control by analysis-synthesis method
- H. Kuwabara and T. Takagi, "Acoustic parameters of voice individuality and voice-quality control by analysis-synthesis method," Speech Commun., vol.10, pp. 491-495, 1991.
- (1991) Speech Commun. , vol.10 , pp. 491-495
- Kuwabara, H.¹ Takagi, T.²

41
- 0015677419
- Multidimensional representation of personal quality of vowels and its acoustical correlates
- Oct.
- H. Matsumoto, S. Hiki, T. Sone, and T. Nimura, "Multidimensional representation of personal quality of vowels and its acoustical correlates," IEEE Trans. Audio Electroacoust., vol.AE-21, no.5, pp. 428-436, Oct. 1973.
- (1973) IEEE Trans. Audio Electroacoust. , vol.AE-21 , Issue.5 , pp. 428-436
- Matsumoto, H.¹ Hiki, S.² Sone, T.³ Nimura, T.⁴

42
- 33646771442
- Towards decomposing the sources of variability in speech
- N. Malayath, H. Hermansky, and A. Kain, "Towards decomposing the sources of variability in speech," in Proc. Eurospeech, 1997, pp. 497-500.
- (1997) Proc. Eurospeech , pp. 497-500
- Malayath, N.¹ Hermansky, H.² Kain, A.³

43
- 0033883193
- The effects of acoustic modifications on the identification of familiar voices speaking isolated vowels
- Y. Lavner, I. Gath, and J. Rosenhouse, "The effects of acoustic modifications on the identification of familiar voices speaking isolated vowels," Speech Commun., vol.30, pp. 9-26, 2000.
- (2000) Speech Commun. , vol.30 , pp. 9-26
- Lavner, Y.¹ Gath, I.² Rosenhouse, J.³

44
- 2942594475
- A tutorial on text-independent speaker verification
- F. Bimbot, "A tutorial on text-independent speaker verification," EURASIP J. Appl. Signal Process., vol.2004, pp. 430-451, 2004.
- (2004) EURASIP J. Appl. Signal Process. , vol.2004 , pp. 430-451
- Bimbot, F.¹

45
- 0033154052
- Speaker transformation algorithm using segmental codebooks (STASC)
- L. M. Arslan, "Speaker Transformation Algorithm using Segmental Codebooks (STASC)," Speech Commun., vol.28, pp. 211-226, 1999.
- (1999) Speech Commun. , vol.28 , pp. 211-226
- Arslan, L.M.¹

46
- 0028247534
- Conventional, biological and environmental factors in speech communication: A modulation theory
- H. Traunmüller, "Conventional, biological and environmental factors in speech communication: A modulation theory," Phonetica, vol.51, pp. 170-183, 1994.
- (1994) Phonetica , vol.51 , pp. 170-183
- Traunmüller, H.¹

47
- 0003418124
- s'Gravenhage The Netherlands: Mouton
- G. Fant, Acoustic Theory of Speech Production. s'Gravenhage, The Netherlands: Mouton, 1960.
- (1960) Acoustic Theory of Speech Production
- Fant, G.¹

48
- 0003483593
- Cambridge, U.K.: Dept. of Eng., Cambridge Univ.
- S. J. Young, The HTK Hidden Markov Model Toolkit: Design and Philosophy. Cambridge, U.K.: Dept. of Eng., Cambridge Univ., 1993.
- (1993) The HTK Hidden Markov Model Toolkit: Design and Philosophy
- Young, S.J.¹

49
- 0016494495
- Selection of acoustic features for speaker identification
- Apr.
- M. Sambur, "Selection of acoustic features for speaker identification," IEEE Trans. Acoust., Speech, Signal Process., vol.ASSP-23, no.2, pp. 176-182, Apr. 1975.
- (1975) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-23 , Issue.2 , pp. 176-182
- Sambur, M.¹

50
- 67650678075
- Contribution of prosody to the perception of a foreign accent: A study based on Spanish/Italian modified speech
- London, U.K.
- B. Vieru-Dimulescu and P. B. d. Mareüil, "Contribution of prosody to the perception of a foreign accent: A study based on Spanish/Italian modified speech," in Proc. ISCA Workshop Plasticity in Speech Perception, London, U.K., 2005, pp. 66-68.
- (2005) Proc. ISCA Workshop Plasticity in Speech Perception , pp. 66-68
- Vieru-Dimulescu, B.¹ Mareüil, P.B.D.²

51
- 0019606564
- The spectral envelope estimation vocoder
- D. Paul, "The spectral envelope estimation vocoder," IEEE Trans. Acoustics, Speech, and Signal Processing, vol.29, pp. 786-794, 1981.
- (1981) IEEE Trans. Acoustics, Speech, and Signal Processing , vol.29 , pp. 786-794
- Paul, D.¹

52
- 84946753271
- VTLN-based cross-language voice conversion
- D. Sundermann, H. Ney, and H. Hoge, "VTLN-based cross-language voice conversion," in Proc. ASRU, 2003, pp. 676-681.
- (2003) Proc. ASRU , pp. 676-681
- Sundermann, D.¹ Ney, H.² Hoge, H.³

53
- 0021407831
- Signal estimation from modified short-time Fourier transform
- D. Griffin and J. Lim, "Signal estimation from modified short-time Fourier transform," IEEE Trans. Acoust., Speech, Signal Process., vol.ASSP-32, no.2, pp. 236-243, Apr. 1984. (Pubitemid 14608418)
- (1984) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.ASSP-32 , Issue.2 , pp. 236-243
- Griffin, D.¹ Lim, J.²

54
- 0031623661
- Spectral voice conversion for text-tospeech synthesis
- A. Kain and M. W. Macon, "Spectral voice conversion for text-tospeech synthesis," in Proc. ICASSP, 1998, pp. 285-288.
- (1998) Proc. ICASSP , pp. 285-288
- Kain, A.¹ MacOn, M.W.²

55
- 84965511190
- Evaluations of foreign accent in extemporaneous and read material
- M. Munro and T. Derwing, "Evaluations of foreign accent in extemporaneous and read material," Lang. Testing, vol.11, pp. 253-266, 1994.
- (1994) Lang. Testing , vol.11 , pp. 253-266
- Munro, M.¹ Derwing, T.²

56
- 33244478680
- 3rd ed. Belmont, CA: Thomson Higher Education
- B. Pelham and H. Blanton, Conducting Research in Psychology, Measuring the Weight of Smoke, 3rd ed. Belmont, CA: Thomson Higher Education, 2007.
- (2007) Conducting Research in Psychology, Measuring the Weight of Smoke
- Pelham, B.¹ Blanton, H.²

57
- 0026206653
- Comparing discrimination and recognition of unfamiliar voices
- J. Kreiman and G. Papcun, "Comparing discrimination and recognition of unfamiliar voices," Speech Commun., vol.10, pp. 265-275, 1991.
- (1991) Speech Commun. , vol.10 , pp. 265-275
- Kreiman, J.¹ Papcun, G.²

58
- 85047674605
- Learning to recognize talkers from natural, sinewave, and reversed speech samples
- S. M. Sheffert, D. B. Pisoni, J. M. Fellowes, and R. E. Remez, "Learning to recognize talkers from natural, sinewave, and reversed speech samples," J. Exp. Psychol. Hum. Percept. Perform., vol.28, pp. 1447-1469, 2002.
- (2002) J. Exp. Psychol. Hum. Percept. Perform. , vol.28 , pp. 1447-1469
- Sheffert, S.M.¹ Pisoni, D.B.² Fellowes, J.M.³ Remez, R.E.⁴

59
- 51449115975
- Cambridge U.K.: Univ. of Cambridge
- K. Vertanen, Baseline WSJ Acoustic Models for HTK and Sphinx: Training Recipes and Recognition Experiments. Cambridge, U.K.: Univ. of Cambridge, 2006.
- (2006) Baseline WSJ Acoustic Models for HTK and Sphinx: Training Recipes and Recognition Experiments
- Vertanen, K.¹

60
- 51449095035
- Release 0.6. Pittsburgh, PA: Carnegie Mellon Univ.
- R. Weide, The CMU Pronunciation Dictionary, Release 0.6. Pittsburgh, PA: Carnegie Mellon Univ., 1998.
- (1998) The CMU Pronunciation Dictionary
- Weide, R.¹

61
- 0003822743
- Cambridge U.K.: Cambridge Univ.
- S. Young, G. Evermann, D. Kershaw, G. Moore, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, The HTK Book. Cambridge, U.K.: Cambridge Univ., 1995, vol.1996.
- (1995) The HTK Book , vol.1996
- Young, S.¹ Evermann, G.² Kershaw, D.³ Moore, G.⁴ Odell, J.⁵ Ollason, D.⁶ Valtchev, V.⁷ Woodland, P.⁸

62
- 33645461463
- [Online]. Available:
- P. Meier and S. Muller, "IDEA: International dialects of English archive," [Online]. Available: http://web.ku.edu/∼idea/index.htm 2009
- (2009) IDEA: International Dialects of English Archive
- Meier, P.¹ Muller, S.²

63
- 0018455310
- Suppression of acoustic noise in speech using spectral subtraction
- S. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Process., vol.ASSP-27, no.2, pp. 113-120, Apr. 1979. (Pubitemid 9467471)
- (1979) IEEE Trans Acoust Speech Signal Process , vol.ASSP-27 , Issue.2 , pp. 113-120
- Boll Steven, F.¹

64
- 33646773080
- Pittsburgh PA: Carnegie Mellon Univ. Lang. Technol. Inst.
- J. Kominek and A. Black, CMU ARCTIC Databases for Speech Synthesis. Pittsburgh, PA: Carnegie Mellon Univ. Lang. Technol. Inst., 2003.
- (2003) CMU ARCTIC Databases for Speech Synthesis
- Kominek, J.¹ Black, A.²

65
- 33846516584
- New York: Springer
- C. M. Bishop, Pattern Recognition and Machine Learning. New York: Springer, 2006.
- (2006) Pattern Recognition and Machine Learning
- Bishop, C.M.¹

66
- 0031632634
- Separation of non-spontaneous and spontaneous speech
- O. P. Kenny, D. J. Nelson, J. S. Bodenschatz, and H. A. McMonagle, "Separation of non-spontaneous and spontaneous speech," in Proc. ICASSP, 1998, vol.1, pp. 573-576.
- (1998) Proc. ICASSP , vol.1 , pp. 573-576
- Kenny, O.P.¹ Nelson, D.J.² Bodenschatz, J.S.³ McMonagle, H.A.⁴

67
- 0034704229
- A global geometric framework for nonlinear dimensionality reduction
- J. B. Tenenbaum, V. d. Silva, and J. C. Langford, "A global geometric framework for nonlinear dimensionality reduction," Science, vol.290, pp. 2319-2323, 2000.
- (2000) Science , vol.290 , pp. 2319-2323
- Tenenbaum, J.B.¹ Silva, V.D.² Langford, J.C.³

68
- 84863647359
- Donor selection for voice conversion
- O. Turk and L. M. Arslan, "Donor selection for voice conversion," in Proc. EUSIPCO, 2005.
- (2005) Proc. EUSIPCO
- Turk, O.¹ Arslan, L.M.²

69
- 84867192517
- Improving mispronunciation detection and diagnosis of learners' speech with context-sensitive phonological rules based on language transfer
- Brisbane, Australia
- A. Harrison, W. Lau, H. Meng, and L.Wang, "Improving mispronunciation detection and diagnosis of learners' speech with context-sensitive phonological rules based on language transfer," in Proc. Interspeech, Brisbane, Australia, 2008, pp. 2787-2790.
- (2008) Proc. Interspeech , pp. 2787-2790
- Harrison, A.¹ Lau, W.² Meng, H.³ Wang, L.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.