메뉴 건너뛰기




Volumn , Issue , 2011, Pages 2769-2772

Speaker-adaptive speech synthesis based on eigenvoice conversion and language-dependent prosodic conversion in speech-to-speech translation

Author keywords

Eigenvoice conversion; Prosodic conversion; Speaker adaptation; Speech synthesis; Speech to speech translation

Indexed keywords

CONTROL METHODS; CROSS-LINGUAL; EIGENVOICES; EXPERIMENTAL EVALUATION; INPUT AND OUTPUTS; PROSODIC PARAMETER; SPEAKER ADAPTATION; SPECTRAL PARAMETERS; SPEECH-TO-SPEECH TRANSLATION; TEXT-TO-SPEECH SYSTEM; TRANSLATION SYSTEMS; UNSUPERVISED SPEAKER ADAPTATION; VOICE CONVERSION; VOICE QUALITY;

EID: 84865743435     PISSN: None     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (9)

References (13)
  • 2
    • 67651002140 scopus 로고    scopus 로고
    • Statistical parametric speech synthesis
    • H. Zen, K. Tokuda, and A.W. Black. Statistical parametric speech synthesis. Speech Communication, vol.51, no.11, pp.1039-1064, 2009.
    • (2009) Speech Communication , vol.51 , Issue.11 , pp. 1039-1064
    • Zen, H.1    Tokuda, K.2    Black, A.W.3
  • 3
    • 84867203039 scopus 로고    scopus 로고
    • Unsupervised adaptation for HMM-based speech synthesis
    • Brisbane, Australia
    • S. King, K. Tokuda, H. Zen, and J. Yamagishi. Unsupervised adaptation for HMM-based speech synthesis, Proc. INTERSPEECH, pp.1869-1872, Brisbane, Australia, 2008.
    • (2008) Proc. INTERSPEECH , pp. 1869-1872
    • King, S.1    Tokuda, K.2    Zen, H.3    Yamagishi, J.4
  • 4
    • 70349218937 scopus 로고    scopus 로고
    • State mapping for cross-language speaker adaptation in TTS
    • Y.-N. Chen, Y. Jiao, Y. Qian, and F.K. Soong. State mapping for cross-language speaker adaptation in TTS. Proc. of ICASSP, pp.4273-4276, 2009.
    • (2009) Proc. of ICASSP , pp. 4273-4276
    • Chen, Y.-N.1    Jiao, Y.2    Qian, Y.3    Soong, F.K.4
  • 5
    • 79953289255 scopus 로고    scopus 로고
    • Unsupervised intralingual and crosslingual speaker adaptation for HMM-based speech synthesis using two-pass decision tree construction
    • M. Gibson and W. Byrne. Unsupervised intralingual and crosslingual speaker adaptation for HMM-based speech synthesis using two-pass decision tree construction. IEEE Trans. ASLP, vol.19, no.4, pp.895-904, 2011.
    • (2011) IEEE Trans. ASLP , vol.19 , Issue.4 , pp. 895-904
    • Gibson, M.1    Byrne, W.2
  • 6
    • 0032026483 scopus 로고    scopus 로고
    • Continuous probabilistic transform for voice conversion
    • Y. Stylianou, O. Cappé, and E. Moulines. Continuous probabilistic transform for voice conversion. IEEE Trans. SAP, vol.6, no.2, pp.131-142, 1998.
    • (1998) IEEE Trans. SAP , vol.6 , Issue.2 , pp. 131-142
    • Stylianou, Y.1    Cappé, O.2    Moulines, E.3
  • 7
    • 57749193836 scopus 로고    scopus 로고
    • Voice conversion based on maximum likelihood estimation of spectral parameter trajectory
    • T. Toda, A.W. Black, and K. Tokuda. Voice conversion based on maximum likelihood estimation of spectral parameter trajectory. IEEE Trans. ASLP, vol.15, no.8, pp.2222-2235, 2007.
    • (2007) IEEE Trans. ASLP , vol.15 , Issue.8 , pp. 2222-2235
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 8
    • 0025892924 scopus 로고
    • Statistical analysis of bilingual speaker's speech for cross-language voice conversion
    • M. Abe, K. Shikano, and H. Kuwabara. Statistical analysis of bilingual speaker's speech for cross-language voice conversion. J. Acoust. Soc. Am., vol.90, no.1, pp.76-82, 1991.
    • (1991) J. Acoust. Soc. Am. , vol.90 , Issue.1 , pp. 76-82
    • Abe, M.1    Shikano, K.2    Kuwabara, H.3
  • 9
    • 4544306344 scopus 로고    scopus 로고
    • Cross-language voice conversion evaluation using bilingual databases
    • July
    • M. Mashimo, T. Toda, H. Kawanami. K. Shikano, and N. Campbell. Cross-language voice conversion evaluation using bilingual databases. IPSJ Journal, vol.43, no.7, pp.2177-2185, July 2002.
    • (2002) IPSJ Journal , vol.43 , Issue.7 , pp. 2177-2185
    • Mashimo, M.1    Toda, T.2    Kawanami, H.3    Shikano, K.4    Campbell, N.5
  • 10
    • 77953725318 scopus 로고    scopus 로고
    • INCA algorithm for training voice conversion systems from nonparallel corpora
    • D. Erro, A. Moreno, and A. Bonafonte. INCA algorithm for training voice conversion systems from nonparallel corpora. IEEE Trans. ASLP, vol.18, no.5, pp.944-953, 2010.
    • (2010) IEEE Trans. ASLP , vol.18 , Issue.5 , pp. 944-953
    • Erro, D.1    Moreno, A.2    Bonafonte, A.3
  • 11
    • 34547496175 scopus 로고    scopus 로고
    • One-to-many and manyto- one voice conversion based on eigenvoices
    • Hawaii, USA, Apr.
    • T. Toda, Y. Ohtani, and K. Shikano. One-to-many and manyto- one voice conversion based on eigenvoices. Proc. ICASSP, pp.1249-1252, Hawaii, USA, Apr. 2007.
    • (2007) Proc. ICASSP , pp. 1249-1252
    • Toda, T.1    Ohtani, Y.2    Shikano, K.3
  • 12
    • 70450205902 scopus 로고    scopus 로고
    • Crosslanguage voice conversion based on eigenvoices
    • Brighton, UK, Sep.
    • M. Charlier, Y. Ohtani, T. Toda, A. Moinet, and T. Dutoit. Crosslanguage voice conversion based on eigenvoices. Proc. INTERSPEECH, pp.1635-1638, Brighton, UK, Sep. 2009.
    • (2009) Proc. INTERSPEECH , pp. 1635-1638
    • Charlier, M.1    Ohtani, Y.2    Toda, T.3    Moinet, A.4    Dutoit, T.5
  • 13
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitch-adaptive timefrequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A.de Cheveigné. Restructuring speech representations using a pitch-adaptive timefrequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds. Speech Communication, vol.27, no.3-4, pp.187-207, 1999.
    • (1999) Speech Communication , vol.27 , Issue.3-4 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    De Cheveigné, A.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.