Volume 18, Issue 5, 2010, Pages 1030-1040

Developing objective measures of foreign-accent conversion

Author keywords

Accent conversion; Foreign accent recognition; Speaker recognition; Voice conversion

Indexed keywords

ACCENTED SPEECH; ACOUSTIC QUALITY; ACOUSTIC VECTORS; AUTOMATIC SPEECH RECOGNIZERS; CEPSTRAL; COMPUTER ASSISTED; CONVERSION METHODS; DEGREE OF CORRELATIONS; LINEAR DISCRIMINANTS; LISTENING TESTS; MATCH SCORE; NARROW BANDS; OBJECTIVE MEASURE; PERCEPTUAL TEST; SINGLE-ENDED; SPEAKER RECOGNITION; SPECTRAL DISTORTIONS; SPEECH QUALITY; SPEECH SIGNALS; SUBJECTIVE RATING; TARGET SPEAKER; VOICE CONVERSION;

EID: 77953714655     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2009.2038818     Document Type: Article
Times cited : (29)

References (69)
  • 1
    • 84929064443
    • Advances in computer-based speech training: Aids for the profoundly hearing impaired
    • C. Watson and D. Kewley-Port, "Advances in computer-based speech training: Aids for the profoundly hearing impaired," Volta-Rev., vol.91, pp. 29-45, 1989.
    • (1989) Volta-Rev. , vol.91 , pp. 29-45
    • Watson, C.1    Kewley-Port, D.2
  • 2
    • 67650668659
    • Intonational foreign accent: Speech technology and foreign language teaching
    • M. Jilka and G. Möhler, "Intonational foreign accent: Speech technology and foreign language teaching," in Proc. ESCA Workshop Speech Tech. Lang. Learn., 1998, pp. 115-118.
    • (1998) Proc. ESCA Workshop Speech Tech. Lang. Learn. , pp. 115-118
    • Jilka, M.1    Möhler, G.2
  • 4
    • 0032026483
    • Continuous probabilistic transform for voice conversion
    • Y. Stylianou, O. Cappe, and E. Moulines, "Continuous probabilistic transform for voice conversion," IEEE Trans. Speech Audio Process., vol.6, no.2, pp. 131-142, Mar. 1998.
    • (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.2 , pp. 131-142
    • Stylianou, Y.1    Cappe, O.2    Moulines, E.3
  • 5
    • 0034841948
    • Design and evaluation of a voice conversion algorithm based on spectral envelope mapping and residual prediction
    • A. Kain and M. Macon, "Design and evaluation of a voice conversion algorithm based on spectral envelope mapping and residual prediction," in Proc. ICASSP 2001, Salt Lake City, UT, 2001, pp. 813-816.
    • (2001) Proc. ICASSP 2001 , pp. 813-816
    • Kain, A.1    Macon, M.2
  • 6
    • 36949014554
    • The effect of listener accent background on accent perception and comprehension
    • A. Ikeno and J. H. L. Hansen, "The effect of listener accent background on accent perception and comprehension," EURASIP J. Audio, Speech, Music Process., vol.2007, pp. 1-8, 2007.
    • (2007) EURASIP J. Audio, Speech, Music Process. , vol.2007 , pp. 1-8
    • Ikeno, A.1    Hansen, J.H.L.2
  • 7
    • 18744409456
    • Feedback in computer assisted pronunciation training: Technology push or demand pull?
    • A. Neri, C. Cucchiarini, and H. Strik, "Feedback in computer assisted pronunciation training: Technology push or demand pull?," in Proc. CALL Conf., 2002, pp. 179-188.
    • (2002) Proc. CALL Conf. , pp. 179-188
    • Neri, A.1    Cucchiarini, C.2    Strik, H.3
  • 9
    • 84979339593
    • The role of intonation in foreign accent
    • T. van Els and K. de Bot, "The role of intonation in foreign accent," Modern Lang. J., vol.71, pp. 147-155, 1987.
    • (1987) Modern Lang. J. , vol.71 , pp. 147-155
    • Van Els, T.1    De Bot, K.2
  • 10
    • 36248978278
    • Foreign accent
    • U. Gut, "Foreign accent," in Speaker Classification I: Fundamentals, Features, and Methods, J. G. Carbonell and J. Siekmann, Eds. New York: Springer, 2007, pp. 75-87.
    • (2007) Speaker Classification I: Fundamentals, Features, and Methods , pp. 75-87
    • Gut, U.1
  • 11
    • 0030757418
    • A study of temporal features and frequency characteristics in American English foreign accent
    • L. M. Arslan and J. H. L. Hansen, "A study of temporal features and frequency characteristics in American English foreign accent," JASA, vol.102, pp. 28-40, 1997.
    • (1997) JASA , vol.102 , pp. 28-40
    • Arslan, L.M.1    Hansen, J.H.L.2
  • 12
    • 84971878476
    • Non-segmental factors in foreign accent: Ratings of filtered speech
    • M. Munro, "Non-segmental factors in foreign accent: Ratings of filtered speech," Studies in Second Lang. Acquisition, vol.17, pp. 17-34, 1995.
    • (1995) Studies in Second Lang. Acquisition , vol.17 , pp. 17-34
    • Munro, M.1
  • 13
    • 0029254163
    • Non-parametric techniques for pitch-scale and time-scale modification of speech
    • E. Moulines and J. Laroche, "Non-parametric techniques for pitch-scale and time-scale modification of speech," Speech Commun., vol.16, pp. 175-205, 1995.
    • (1995) Speech Commun. , vol.16 , pp. 175-205
    • Moulines, E.1    Laroche, J.2
  • 14
    • 0030642434
    • Effects of temporal correction on intelligibility of foreign-accented English
    • K. Tajima, R. Port, and J. Dalby, "Effects of temporal correction on intelligibility of foreign-accented English," J. Phon., vol.25, pp. 1-24, 1997.
    • (1997) J. Phon. , vol.25 , pp. 1-24
    • Tajima, K.1    Port, R.2    Dalby, J.3
  • 16
    • 0025543906
    • Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
    • E. Moulines and F. Charpentier, "Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones," Speech Commun., vol.9, pp. 453-467, 1990.
    • (1990) Speech Commun. , vol.9 , pp. 453-467
    • Moulines, E.1    Charpentier, F.2
  • 17
    • 67650668657
    • English speech training using voice conversion
    • K. Nagano and K. Ozawa, "English speech training using voice conversion," in Proc. ICSLP, 1990, pp. 1169-1172.
    • (1990) Proc. ICSLP , pp. 1169-1172
    • Nagano, K.1    Ozawa, K.2
  • 19
    • 4544353416
    • Analysis by synthesis of acoustic correlates of British, Australian and American accents
    • Q. Yan, S. Vaseghi, D. Rentzos, and C.-H. Ho, "Analysis by synthesis of acoustic correlates of British, Australian and American accents," in Proc. ICASSP, 2004, pp. 637-640.
    • (2004) Proc. ICASSP , pp. 637-640
    • Yan, Q.1    Vaseghi, S.2    Rentzos, D.3    Ho, C.-H.4
  • 20
    • 77956396053
    • Perception of foreign accentedness in L2 prosody and segments: L1 Japanese speakers learning L2 French
    • T. Kamiyama, "Perception of foreign accentedness in L2 prosody and segments: L1 Japanese speakers learning L2 French," in Proc. Speech Prosody: ISCA, 2004.
    • (2004) Proc. Speech Prosody: ISCA
    • Kamiyama, T.1
  • 21
    • 0030355972
    • The MBROLA project: Towards a set of high-quality speech synthesizers free of use for non-commercial purposes
    • T. Dutoit, V. Pagel, N. Pierret, F. Bataille, and O. v. d. Vreken, "The MBROLA project: Towards a set of high-quality speech synthesizers free of use for non-commercial purposes," in Proc. ICSLP, 1996, vol.3, pp. 1393-1396.
    • (1996) Proc. ICSLP , vol.3 , pp. 1393-1396
    • Dutoit, T.1    Pagel, V.2    Pierret, N.3    Bataille, F.4    Vreken, O.V.D.5
  • 24
    • 67650657780
    • Foreign accent conversion in computer assisted pronunciation training
    • D. Felps, H. Bortfeld, and R. Gutierrez-Osuna, "Foreign accent conversion in computer assisted pronunciation training," Speech Commun., vol.51, pp. 920-932, 2009.
    • (2009) Speech Commun. , vol.51 , pp. 920-932
    • Felps, D.1    Bortfeld, H.2    Gutierrez-Osuna, R.3
  • 25
    • 84994985638
    • Foreign accent, comprehensibility, and intelligibility in the speech of second language learners
    • M. Munro and T. Derwing, "Foreign accent, comprehensibility, and intelligibility in the speech of second language learners," Lang. Learn. Technol., vol.45, pp. 73-97, 1995.
    • (1995) Lang. Learn. Technol. , vol.45 , pp. 73-97
    • Munro, M.1    Derwing, T.2
  • 27
    • 39649083007
    • P.563 - The ITU-T standard for single-ended speech quality assessment
    • L. Malfait, J. Berger, and M. Kastner, "P.563-the ITU-T standard for single-ended speech quality assessment," IEEE Trans. Audio, Speech, Lang. Process., vol.14, no.6, pp. 1924-1934, Nov. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.6 , pp. 1924-1934
    • Malfait, L.1    Berger, J.2    Kastner, M.3
  • 28
  • 29
    • 4544278506
    • Perceptual model for non-intrusive speech quality assessment
    • K. Doh-Suk and A. Tarraf, "Perceptual model for non-intrusive speech quality assessment," in Proc. ICASSP, 2004, pp. 1060-1063.
    • (2004) Proc. ICASSP , pp. 1060-1063
    • Doh-Suk, K.1    Tarraf, A.2
  • 30
    • 0029288633
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol.9, pp. 171-185, 1995.
    • (1995) Comput. Speech Lang. , vol.9 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 31
    • 1842739879
    • Accent issues in large vocabulary continuous speech recognition
    • C. Huang, T. Chen, and E. Chang, "Accent issues in large vocabulary continuous speech recognition," Int. J. Speech Technol., vol.7, pp. 141-153, 2004.
    • (2004) Int. J. Speech Technol. , vol.7 , pp. 141-153
    • Huang, C.1    Chen, T.2    Chang, E.3
  • 32
    • 85009063603
    • ACCDIST: A metric for comparing speakers' accents
    • M. Huckvale, "ACCDIST: A metric for comparing speakers' accents," in Proc. ICSLP, 2004.
    • (2004) Proc. ICSLP
    • Huckvale, M.1
  • 33
    • 64349124465
    • Analysis and synthesis of formant spaces of British, Australian, and American accents
    • Q. Yan, S. Vaseghi, D. Rentzos, and C. H. Ho, "Analysis and synthesis of formant spaces of British, Australian, and American accents," IEEE Trans. Audio, Speech, Lang. Process., vol.15, no.2, pp. 676-689, Feb. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.2 , pp. 676-689
    • Yan, Q.1    Vaseghi, S.2    Rentzos, D.3    Ho, C.H.4
  • 35
    • 84962855264
    • Automatic accent identification using Gaussian mixture models
    • T. Chen, C. Huang, E. Chang, and J. Wang, "Automatic accent identification using Gaussian mixture models," in Proc. ASRU, 2001.
    • (2001) Proc. ASRU
    • Chen, T.1    Huang, C.2    Chang, E.3    Wang, J.4
  • 36
    • 0030165438
    • Language accent classification in American English
    • L. M. Arslan and J. H. L. Hansen, "Language accent classification in American English," Speech Commun., vol.18, pp. 353-367, 1996.
    • (1996) Speech Commun. , vol.18 , pp. 353-367
    • Arslan, L.M.1    Hansen, J.H.L.2
  • 37
    • 0036298772
    • A comparative analysis of UK and US English accents in recognition and synthesis
    • Q. Yan and S. Vaseghi, "A comparative analysis of UK and US English accents in recognition and synthesis," in Proc. ICASSP, 2002, pp. 413-416.
    • (2002) Proc. ICASSP , pp. 413-416
    • Yan, Q.1    Vaseghi, S.2
  • 38
    • 0004656024
    • An approach to the problem of regional accent in automatic speech recognition
    • W. Barry, C. Hoequist, and F. Nolan, "An approach to the problem of regional accent in automatic speech recognition," Comput. Speech, Lang., vol.3, pp. 355-366, 1989.
    • (1989) Comput. Speech, Lang. , vol.3 , pp. 355-366
    • Barry, W.1    Hoequist, C.2    Nolan, F.3
  • 39
    • 85009114400
    • Visualization of pronunciation habits based upon abstract representation of acoustic observations
    • N. Minematsu and S. Nakagawa, "Visualization of pronunciation habits based upon abstract representation of acoustic observations," in Proc. Integration Speech Technol. Into Learn., 2000, pp. 130-137.
    • (2000) Proc. Integration Speech Technol. into Learn. , pp. 130-137
    • Minematsu, N.1    Nakagawa, S.2
  • 40
    • 0026386218
    • Acoustic parameters of voice individuality and voice-quality control by analysis-synthesis method
    • H. Kuwabara and T. Takagi, "Acoustic parameters of voice individuality and voice-quality control by analysis-synthesis method," Speech Commun., vol.10, pp. 491-495, 1991.
    • (1991) Speech Commun. , vol.10 , pp. 491-495
    • Kuwabara, H.1    Takagi, T.2
  • 41
    • 0015677419
    • Multidimensional representation of personal quality of vowels and its acoustical correlates
    • H. Matsumoto, S. Hiki, T. Sone, and T. Nimura, "Multidimensional representation of personal quality of vowels and its acoustical correlates," IEEE Trans. Audio Electroacoust., vol.AE-21, no.5, pp. 428-436, Oct. 1973.
    • (1973) IEEE Trans. Audio Electroacoust. , vol.AE-21 , Issue.5 , pp. 428-436
    • Matsumoto, H.1    Hiki, S.2    Sone, T.3    Nimura, T.4
  • 42
    • 33646771442
    • Towards decomposing the sources of variability in speech
    • N. Malayath, H. Hermansky, and A. Kain, "Towards decomposing the sources of variability in speech," in Proc. Eurospeech, 1997, pp. 497-500.
    • (1997) Proc. Eurospeech , pp. 497-500
    • Malayath, N.1    Hermansky, H.2    Kain, A.3
  • 43
    • 0033883193
    • The effects of acoustic modifications on the identification of familiar voices speaking isolated vowels
    • Y. Lavner, I. Gath, and J. Rosenhouse, "The effects of acoustic modifications on the identification of familiar voices speaking isolated vowels," Speech Commun., vol.30, pp. 9-26, 2000.
    • (2000) Speech Commun. , vol.30 , pp. 9-26
    • Lavner, Y.1    Gath, I.2    Rosenhouse, J.3
  • 44
    • 2942594475
    • A tutorial on text-independent speaker verification
    • F. Bimbot, "A tutorial on text-independent speaker verification," EURASIP J. Appl. Signal Process., vol.2004, pp. 430-451, 2004.
    • (2004) EURASIP J. Appl. Signal Process. , vol.2004 , pp. 430-451
    • Bimbot, F.1
  • 45
    • 0033154052
    • Speaker transformation algorithm using segmental codebooks (STASC)
    • L. M. Arslan, "Speaker Transformation Algorithm using Segmental Codebooks (STASC)," Speech Commun., vol.28, pp. 211-226, 1999.
    • (1999) Speech Commun. , vol.28 , pp. 211-226
    • Arslan, L.M.1
  • 46
    • 0028247534
    • Conventional, biological and environmental factors in speech communication: A modulation theory
    • H. Traunmüller, "Conventional, biological and environmental factors in speech communication: A modulation theory," Phonetica, vol.51, pp. 170-183, 1994.
    • (1994) Phonetica , vol.51 , pp. 170-183
    • Traunmüller, H.1
  • 49
    • 0016494495
    • Selection of acoustic features for speaker identification
    • M. Sambur, "Selection of acoustic features for speaker identification," IEEE Trans. Acoust., Speech, Signal Process., vol.ASSP-23, no.2, pp. 176-182, Apr. 1975.
    • (1975) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-23 , Issue.2 , pp. 176-182
    • Sambur, M.1
  • 50
    • 67650678075
    • Contribution of prosody to the perception of a foreign accent: A study based on Spanish/Italian modified speech
    • B. Vieru-Dimulescu and P. B. d. Mareüil, "Contribution of prosody to the perception of a foreign accent: A study based on Spanish/Italian modified speech," in Proc. ISCA Workshop Plasticity in Speech Perception, London, U.K., 2005, pp. 66-68.
    • (2005) Proc. ISCA Workshop Plasticity in Speech Perception , pp. 66-68
    • Vieru-Dimulescu, B.1    Mareüil, P.B.D.2
  • 52
    • 84946753271
    • VTLN-based cross-language voice conversion
    • D. Sundermann, H. Ney, and H. Hoge, "VTLN-based cross-language voice conversion," in Proc. ASRU, 2003, pp. 676-681.
    • (2003) Proc. ASRU , pp. 676-681
    • Sundermann, D.1    Ney, H.2    Hoge, H.3
  • 54
    • 0031623661
    • Spectral voice conversion for text-to-speech synthesis
    • A. Kain and M. W. Macon, "Spectral voice conversion for text-to-speech synthesis," in Proc. ICASSP, 1998, pp. 285-288.
    • (1998) Proc. ICASSP , pp. 285-288
    • Kain, A.1    Macon, M.W.2
  • 55
    • 84965511190
    • Evaluations of foreign accent in extemporaneous and read material
    • M. Munro and T. Derwing, "Evaluations of foreign accent in extemporaneous and read material," Lang. Testing, vol.11, pp. 253-266, 1994.
    • (1994) Lang. Testing , vol.11 , pp. 253-266
    • Munro, M.1    Derwing, T.2
  • 57
    • 0026206653
    • Comparing discrimination and recognition of unfamiliar voices
    • J. Kreiman and G. Papcun, "Comparing discrimination and recognition of unfamiliar voices," Speech Commun., vol.10, pp. 265-275, 1991.
    • (1991) Speech Commun. , vol.10 , pp. 265-275
    • Kreiman, J.1    Papcun, G.2
  • 60
    • 51449095035
    • The CMU Pronunciation Dictionary
    • R. Weide, The CMU Pronunciation Dictionary, Release 0.6. Pittsburgh, PA: Carnegie Mellon Univ., 1998.
    • (1998) The CMU Pronunciation Dictionary
    • Weide, R.1
  • 63
    • 0018455310
    • Suppression of acoustic noise in speech using spectral subtraction
    • S. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Process., vol.ASSP-27, no.2, pp. 113-120, Apr. 1979.
    • (1979) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-27 , Issue.2 , pp. 113-120
    • Boll, S.F.1
  • 67
    • 0034704229
    • A global geometric framework for nonlinear dimensionality reduction
    • J. B. Tenenbaum, V. d. Silva, and J. C. Langford, "A global geometric framework for nonlinear dimensionality reduction," Science, vol.290, pp. 2319-2323, 2000.
    • (2000) Science , vol.290 , pp. 2319-2323
    • Tenenbaum, J.B.1    Silva, V.D.2    Langford, J.C.3
  • 68
    • 84863647359
    • Donor selection for voice conversion
    • O. Turk and L. M. Arslan, "Donor selection for voice conversion," in Proc. EUSIPCO, 2005.
    • (2005) Proc. EUSIPCO
    • Turk, O.1    Arslan, L.M.2
  • 69
    • 84867192517
    • Improving mispronunciation detection and diagnosis of learners' speech with context-sensitive phonological rules based on language transfer
    • A. Harrison, W. Lau, H. Meng, and L. Wang, "Improving mispronunciation detection and diagnosis of learners' speech with context-sensitive phonological rules based on language transfer," in Proc. Interspeech, Brisbane, Australia, 2008, pp. 2787-2790.
    • (2008) Proc. Interspeech , pp. 2787-2790
    • Harrison, A.1    Lau, W.2    Meng, H.3    Wang, L.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.