메뉴 건너뛰기




Volumn 20, Issue 8, 2012, Pages 2301-2312

Foreign accent conversion through concatenative synthesis in the articulatory domain

Author keywords

Accent conversion; speaker recognition; speech perception; speech synthesis

Indexed keywords

ACOUSTIC FEATURES; ARTICULATORY FEATURES; ELECTROMAGNETIC ARTICULOGRAPHY; LISTENING TESTS; MEL-FREQUENCY CEPSTRAL COEFFICIENTS; NON-NATIVE SPEAKERS; SPEAKER DEPENDENTS; SPEAKER RECOGNITION; SPEECH PERCEPTION; STRONG COUPLING; VOCAL TRACT LENGTHS;

EID: 84865392230     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2012.2201474     Document Type: Article
Times cited : (34)

References (43)
  • 2
    • 84972167927 scopus 로고
    • Native reactions to non-native speech: A review of empirical research
    • M. Eisenstein, "Native reactions to non-native speech: A review of empirical research," Studies in Second Lang. Acquisit., vol. 5, no. 02, pp. 160-176, 1983.
    • (1983) Studies in Second Lang. Acquisit. , vol.5 , Issue.2 , pp. 160-176
    • Eisenstein, M.1
  • 3
    • 67650581742 scopus 로고    scopus 로고
    • An overview of spoken language technology for education
    • M. Eskenazi, "An overview of spoken language technology for education," Speech Commun., vol. 51, no. 10, pp. 832-844, 2009.
    • (2009) Speech Commun. , vol.51 , Issue.10 , pp. 832-844
    • Eskenazi, M.1
  • 4
    • 84937381349 scopus 로고    scopus 로고
    • The pedagogy-technology interface in computer assisted pronunciation training
    • A. Neri, C. Cucchiarini, and H. Strik et al., "The pedagogy-technology interface in computer assisted pronunciation training," Comput. Assist. Lang. Learn., vol. 15, no. 5, pp. 441-467, 2002.
    • (2002) Comput. Assist. Lang. Learn. , vol.15 , Issue.5 , pp. 441-467
    • Neri, A.1    Cucchiarini, C.2    Strik, H.3
  • 5
    • 4544358888 scopus 로고    scopus 로고
    • Automatic speech recognition for second language learning: How and why it actually works
    • A. Neri, C. Cucchiarini, and H. Strik, "Automatic speech recognition for second language learning: How and why it actually works," in Proc. Int. Congr. Phon. Sci., 2003, pp. 1157-1160.
    • (2003) Proc. Int. Congr. Phon. Sci. , pp. 1157-1160
    • Neri, A.1    Cucchiarini, C.2    Strik, H.3
  • 6
    • 23844449782 scopus 로고    scopus 로고
    • Software that listens: It's not a question of whether, it's a question of how
    • K. A. Wachowicz and B. Scott, "Software that listens: It's not a question of whether, it's a question of how," CALICO J., vol. 16, no. 3, pp. 253-276, 1999.
    • (1999) CALICO J. , vol.16 , Issue.3 , pp. 253-276
    • Wachowicz, K.A.1    Scott, B.2
  • 7
    • 0040485015 scopus 로고    scopus 로고
    • Negotiation of form, recasts, and explicit correction in relation to error types and learner repair in immersion classrooms
    • R. Lyster, "Negotiation of form, recasts, and explicit correction in relation to error types and learner repair in immersion classrooms," Lang. Learn., vol. 51, no. s1, pp. 265-301, 2001. (Pubitemid 33281959)
    • (2001) Language Learning , vol.51 , Issue.SUPPL. 1 , pp. 265-301
    • Lyster, R.1
  • 8
    • 67650668657 scopus 로고
    • English speech training using voice conversion
    • K. Nagano and K. Ozawa, "English speech training using voice conversion," in Proc. ICSLP, 1990, pp. 1169-1172.
    • (1990) Proc. ICSLP , pp. 1169-1172
    • Nagano, K.1    Ozawa, K.2
  • 9
    • 67650602764 scopus 로고    scopus 로고
    • Lexical stress training of German compounds for Italian speakers by means of resynthesis and emphasis
    • M. P. Bissiri, H. R. Pfitzinger, and H. G. Tillmann, "Lexical stress training of German compounds for Italian speakers by means of resynthesis and emphasis," in Proc. Austral. Int. Conf. Speech Sci. Tech., 2006, pp. 24-29.
    • (2006) Proc. Austral. Int. Conf. Speech Sci. Tech. , pp. 24-29
    • Bissiri, M.P.1    Pfitzinger, H.R.2    Tillmann, H.G.3
  • 10
    • 0036642569 scopus 로고    scopus 로고
    • Enhancing foreign language tutors - In search of the golden speaker
    • DOI 10.1016/S0167-6393(01)00009-7, PII S0167639301000097
    • K. Probst, Y. Ke, and M. Eskenazi, "Enhancing foreign language tutors -In search of the golden speaker," Speech Commun., vol. 37, no. 3-4, pp. 161-173, 2002. (Pubitemid 34524837)
    • (2002) Speech Communication , vol.37 , Issue.3-4 , pp. 161-173
    • Probst, K.1    Ke, Y.2    Eskenazi, M.3
  • 11
    • 67650657780 scopus 로고    scopus 로고
    • Foreign accent conversion in computer assisted pronunciation training
    • D. Felps, H. Bortfeld, and R. Gutierrez-Osuna, "Foreign accent conversion in computer assisted pronunciation training," Speech Commun., vol. 51, no. 10, pp. 920-932, 2009.
    • (2009) Speech Commun. , vol.51 , Issue.10 , pp. 920-932
    • Felps, D.1    Bortfeld, H.2    Gutierrez-Osuna, R.3
  • 12
    • 25044464569 scopus 로고
    • The front-cavity/F2[prime] hypothesis tested by data on tongue movements
    • D. J. Broad and H. Hermansky, "The front-cavity/F2[prime] hypothesis tested by data on tongue movements," J. Acoust. Soc. Amer., vol. 86, no. S1, pp. S113-S114, 1989.
    • (1989) J. Acoust. Soc. Amer. , vol.86 , Issue.S1
    • Broad, D.J.1    Hermansky, H.2
  • 13
    • 85027461504 scopus 로고    scopus 로고
    • Using articulatory position data in voice transformation
    • A. Toth and A. Black, "Using articulatory position data in voice transformation," in Proc. ISCA Speech Synth.Workshop, 2007, pp. 182-185.
    • (2007) Proc. ISCA Speech Synth.Workshop , pp. 182-185
    • Toth, A.1    Black, A.2
  • 14
    • 84937181324 scopus 로고    scopus 로고
    • Listeners and disguised voices: The imitation and perception of dialectal accent
    • D. Markham, "Listeners and disguised voices: The imitation and perception of dialectal accent," Forensic Linguist., vol. 6, no. 2, pp. 290-299, 1999.
    • (1999) Forensic Linguist. , vol.6 , Issue.2 , pp. 290-299
    • Markham, D.1
  • 15
    • 84937385165 scopus 로고    scopus 로고
    • Passing for a native speaker: Identity and success in second language learning
    • I. Piller, "Passing for a native speaker: Identity and success in second language learning," J. Sociolinguist., vol. 6, no. 2, pp. 179-208, 2002.
    • (2002) J. Sociolinguist. , vol.6 , Issue.2 , pp. 179-208
    • Piller, I.1
  • 18
    • 64349124465 scopus 로고    scopus 로고
    • Analysis and synthesis of formant spaces of british, australian, and american accents
    • Q. Yan, S. Vaseghi, and D. Rentzos et al., "Analysis and synthesis of formant spaces of british, australian, and american accents," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 2, pp. 676-689, 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.2 , pp. 676-689
    • Yan, Q.1    Vaseghi, S.2    Rentzos, D.3
  • 19
    • 0032680858 scopus 로고    scopus 로고
    • Implications of glottal source for speaker and dialect identification
    • Phoenix, AZ
    • L. R. Yanguas, T. F. Quatieri, and F. Goodman, "Implications of glottal source for speaker and dialect identification," in Proc. ICASSP, Phoenix, AZ, 1999, pp. 813-816.
    • (1999) Proc. ICASSP , pp. 813-816
    • Yanguas, L.R.1    Quatieri, T.F.2    Goodman, F.3
  • 20
    • 84865357645 scopus 로고    scopus 로고
    • [Online]. Available:
    • A. Wrench, MOCHA-TIMIT. [Online]. Available: http://www.cstr.ed.ac.uk/ research/projects/artic/mocha.html
    • MOCHA-TIMIT
    • Wrench, A.1
  • 22
    • 33846669825 scopus 로고    scopus 로고
    • Beyond 2D in articulatory data acquisition and analysis
    • P. Hoole, A. Zierdt, and C. Geng, "Beyond 2D in articulatory data acquisition and analysis," in Proc. Int. Conf. Phon. Sci., 2003, pp. 265-268.
    • (2003) Proc. Int. Conf. Phon. Sci. , pp. 265-268
    • Hoole, P.1    Zierdt, A.2    Geng, C.3
  • 23
    • 79960575108 scopus 로고    scopus 로고
    • Five-dimensional articulography
    • P. Hoole and A. Zierdt, "Five-dimensional articulography," Speech Motor Control, pp. 331-349, 2010.
    • (2010) Speech Motor Control , pp. 331-349
    • Hoole, P.1    Zierdt, A.2
  • 24
    • 51449085174 scopus 로고    scopus 로고
    • Analysis-by-synthesis features for speech recognition
    • Z. Al Bawab, R. Bhiksha, and R. M. Stern, "Analysis-by-synthesis features for speech recognition," in Proc. ICASSP, 2008, pp. 4185-4188.
    • (2008) Proc. ICASSP , pp. 4185-4188
    • Al Bawab, Z.1    Bhiksha, R.2    Stern, R.M.3
  • 25
    • 34247634965 scopus 로고
    • An articulatory model of the tongue based on a statistical analysis
    • S. Maeda, "An articulatory model of the tongue based on a statistical analysis," J. Acoust. Soc. Amer., vol. 65, p. S22, 1979.
    • (1979) J. Acoust. Soc. Amer. , vol.65
    • Maeda, S.1
  • 26
    • 0030677481 scopus 로고    scopus 로고
    • Speech representation and transformation using adaptive interpolation of weighted spectrum: Vocoder revisited
    • H. Kawahara, "Speech representation and transformation using adaptive interpolation of weighted spectrum: Vocoder revisited," in Proc. ICASSP, 1997, pp. 1303-1306.
    • (1997) Proc. ICASSP , pp. 1303-1306
    • Kawahara, H.1
  • 27
    • 0032141206 scopus 로고    scopus 로고
    • Cepstral domain segmental feature vector normalization for noise robust speech recognition
    • PII S0167639398000338
    • O. Viikki and K. Laurila, "Cepstral domain segmental feature vector normalization for noise robust speech recognition," Speech Commun., vol. 25, no. 1-3, pp. 133-147, 1998. (Pubitemid 128413638)
    • (1998) Speech Communication , vol.25 , Issue.1-3 , pp. 133-147
    • Viikki, O.1    Laurila, K.2
  • 30
    • 0029765811 scopus 로고    scopus 로고
    • Unit selection in a concatenative speech synthesis system using a large speech database
    • A. J. Hunt and A. W. Black, "Unit selection in a concatenative speech synthesis system using a large speech database," in Proc. ICASSP, 1996, pp. 373-376.
    • (1996) Proc. ICASSP , pp. 373-376
    • Hunt, A.J.1    Black, A.W.2
  • 31
    • 34047123652 scopus 로고    scopus 로고
    • Multisyn: Open-domain unit selection for the Festival speech synthesis system
    • DOI 10.1016/j.specom.2007.01.014, PII S0167639307000398
    • R. A. J. Clark, K. Richmond, and S. King, "Multisyn: Open-domain unit selection for the festival speech synthesis system," Speech Commun., vol. 49, no. 4, pp. 317-330, 2007. (Pubitemid 46517714)
    • (2007) Speech Communication , vol.49 , Issue.4 , pp. 317-330
    • Clark, R.A.J.1    Richmond, K.2    King, S.3
  • 33
    • 0036497601 scopus 로고    scopus 로고
    • A comparison of spectral smoothing methods for segment concatenation based speech synthesis
    • D. T. Chappell and J. H. L. Hansen, "A comparison of spectral smoothing methods for segment concatenation based speech synthesis," Speech Commun., vol. 36, no. 3-4, pp. 343-373, 2002.
    • (2002) Speech Commun. , vol.36 , Issue.3-4 , pp. 343-373
    • Chappell, D.T.1    Hansen, J.H.L.2
  • 34
    • 70450161677 scopus 로고    scopus 로고
    • Pulse density representation of spectrum for statistical speech processing
    • Y. Shiga, "Pulse density representation of spectrum for statistical speech processing," in Proc. Interspeech, 2009, pp. 1771-1774.
    • (2009) Proc. Interspeech , pp. 1771-1774
    • Shiga, Y.1
  • 35
    • 84965511190 scopus 로고
    • Evaluations of foreign accent in extemporaneous and read material
    • M. Munro and T. Derwing, "Evaluations of foreign accent in extemporaneous and read material," Lang. Testing, vol. 11, pp. 253-266, 1994.
    • (1994) Lang. Testing , vol.11 , pp. 253-266
    • Munro, M.1    Derwing, T.2
  • 36
    • 0031647824 scopus 로고    scopus 로고
    • A frequencywarping approach to speaker normalization
    • Jan
    • L. Lee and R. Rose, "A frequencywarping approach to speaker normalization," IEEE Trans. Speech Audio Process., vol. 6, no. 1, pp. 49-60, Jan. 1998.
    • (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.1 , pp. 49-60
    • Lee, L.1    Rose, R.2
  • 37
    • 84936526529 scopus 로고
    • On the quantal nature of speech
    • K. N. Stevens, "On the quantal nature of speech," Phonetics, vol. 17, no. 1, pp. 3-45, 1989.
    • (1989) Phonetics , vol.17 , Issue.1 , pp. 3-45
    • Stevens, K.N.1
  • 38
    • 77955426516 scopus 로고    scopus 로고
    • Automatic voice onset time detection for unvoiced stops (/p/,/t/,/k/) with application to accent classification
    • J. H. L. Hansen, S. S. Gray, and W. Kim, "Automatic voice onset time detection for unvoiced stops (/p/,/t/,/k/) with application to accent classification," Speech Commun., vol. 52, no. 10, pp. 777-789, 2010.
    • (2010) Speech Commun. , vol.52 , Issue.10 , pp. 777-789
    • Hansen, J.H.L.1    Gray, S.S.2    Kim, W.3
  • 39
    • 84966440972 scopus 로고    scopus 로고
    • Integration of rule-based formant synthesis and waveform concatenation: A hybrid approach to text-to-speech synthesis
    • S. R. Hertz, "Integration of rule-based formant synthesis and waveform concatenation: A hybrid approach to text-to-speech synthesis," in Proc. IEEE Workshop Speech Synth., 2002, pp. 87-90.
    • (2002) Proc. IEEE Workshop Speech Synth. , pp. 87-90
    • Hertz, S.R.1
  • 40
    • 84865372824 scopus 로고    scopus 로고
    • Evaluation of cross-language voice conversion using bilingual and non-bilingual databases
    • M. Mashimo, T. Toda, and H. Kawanami et al., "Evaluation of cross-language voice conversion using bilingual and non-bilingual databases," in Proc. Interspeech, 2002.
    • (2002) Proc. Interspeech
    • Mashimo, M.1    Toda, T.2    Kawanami, H.3
  • 42
    • 68149181313 scopus 로고    scopus 로고
    • A comparison of acoustic features for articulatory inversion
    • M. Á. Carreira-Perpiñán and C. Qin, "A comparison of acoustic features for articulatory inversion," in Proc. Interspeech, 2007, pp. 2469-2472.
    • (2007) Proc. Interspeech , pp. 2469-2472
    • Carreira-Perpiñán, M.A.1    Qin, C.2
  • 43
    • 79959852489 scopus 로고    scopus 로고
    • Estimating missing data sequences in X-ray microbeam recordings
    • Makuhari, Japan
    • C. Qin and M. A. Carreira-Perpinán, "Estimating missing data sequences in X-ray microbeam recordings," in Proc. Interspeech, Makuhari, Japan, 2010, pp. 1592-1595.
    • (2010) Proc. Interspeech , pp. 1592-1595
    • Qin, C.1    Carreira-Perpinán, M.A.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.