SCOPUS Information Search Platform

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volume , Issue , 2011, Pages 2769-2772

Speaker-adaptive speech synthesis based on eigenvoice conversion and language-dependent prosodic conversion in speech-to-speech translation

Authors (5): Hattori, Nobuhiko (a); Toda, Tomoki (a, b); Kawai, Hisashi (b); Saruwatari, Hiroshi (a); Shikano, Kiyohiro (a)

a NARA INSTITUTE OF SCIENCE AND TECHNOLOGY (Japan)

b NATIONAL INSTITUTE OF INFORMATION AND COMMUNICATIONS TECHNOLOGY (Japan)

Author keywords

Eigenvoice conversion; Prosodic conversion; Speaker adaptation; Speech synthesis; Speech to speech translation

Indexed keywords

CONTROL METHODS; CROSS-LINGUAL; EIGENVOICES; EXPERIMENTAL EVALUATION; INPUT AND OUTPUTS; PROSODIC PARAMETER; SPEAKER ADAPTATION; SPECTRAL PARAMETERS; SPEECH-TO-SPEECH TRANSLATION; TEXT-TO-SPEECH SYSTEM; TRANSLATION SYSTEMS; UNSUPERVISED SPEAKER ADAPTATION; VOICE CONVERSION; VOICE QUALITY;

SPEECH PROCESSING; SPEECH SYNTHESIS;

TRANSLATION (LANGUAGES);

EID: 84865743435; PISSN: None; EISSN: 19909772; Source Type: Conference Proceeding
DOI: None; Document Type: Conference Paper

Times cited : (9)

References (13)

1
- 33751057590
- The ATR multilingual speech-to-speech translation system
- S. Nakamura, K. Markov, H. Nakaiwa, G. Kikui, H. Kawai, T. Jitsuhiro, J.-S. Zhang, H. Yamamoto, E. Sumita, and S. Yamamoto. The ATR multilingual speech-to-speech translation system. IEEE Trans. ASLP, vol.14, no.2, pp.365-376, 2006.
- (2006) IEEE Trans. ASLP , vol.14 , Issue.2 , pp. 365-376
- Nakamura, S.¹ Markov, K.² Nakaiwa, H.³ Kikui, G.⁴ Kawai, H.⁵ Jitsuhiro, T.⁶ Zhang, J.-S.⁷ Yamamoto, H.⁸ Sumita, E.⁹ Yamamoto, S.¹⁰

2
- 67651002140
- Statistical parametric speech synthesis
- H. Zen, K. Tokuda, and A.W. Black. Statistical parametric speech synthesis. Speech Communication, vol.51, no.11, pp.1039-1064, 2009.
- (2009) Speech Communication , vol.51 , Issue.11 , pp. 1039-1064
- Zen, H.¹ Tokuda, K.² Black, A.W.³

3
- 84867203039
- Unsupervised adaptation for HMM-based speech synthesis
- Brisbane, Australia
- S. King, K. Tokuda, H. Zen, and J. Yamagishi. Unsupervised adaptation for HMM-based speech synthesis, Proc. INTERSPEECH, pp.1869-1872, Brisbane, Australia, 2008.
- (2008) Proc. INTERSPEECH , pp. 1869-1872
- King, S.¹ Tokuda, K.² Zen, H.³ Yamagishi, J.⁴

4
- 70349218937
- State mapping for cross-language speaker adaptation in TTS
- Y.-N. Chen, Y. Jiao, Y. Qian, and F.K. Soong. State mapping for cross-language speaker adaptation in TTS. Proc. of ICASSP, pp.4273-4276, 2009.
- (2009) Proc. of ICASSP , pp. 4273-4276
- Chen, Y.-N.¹ Jiao, Y.² Qian, Y.³ Soong, F.K.⁴

5
- 79953289255
- Unsupervised intralingual and cross-lingual speaker adaptation for HMM-based speech synthesis using two-pass decision tree construction
- M. Gibson and W. Byrne. Unsupervised intralingual and cross-lingual speaker adaptation for HMM-based speech synthesis using two-pass decision tree construction. IEEE Trans. ASLP, vol.19, no.4, pp.895-904, 2011.
- (2011) IEEE Trans. ASLP , vol.19 , Issue.4 , pp. 895-904
- Gibson, M.¹ Byrne, W.²

6
- 0032026483
- Continuous probabilistic transform for voice conversion
- Y. Stylianou, O. Cappé, and E. Moulines. Continuous probabilistic transform for voice conversion. IEEE Trans. SAP, vol.6, no.2, pp.131-142, 1998.
- (1998) IEEE Trans. SAP , vol.6 , Issue.2 , pp. 131-142
- Stylianou, Y.¹ Cappé, O.² Moulines, E.³

7
- 57749193836
- Voice conversion based on maximum likelihood estimation of spectral parameter trajectory
- T. Toda, A.W. Black, and K. Tokuda. Voice conversion based on maximum likelihood estimation of spectral parameter trajectory. IEEE Trans. ASLP, vol.15, no.8, pp.2222-2235, 2007.
- (2007) IEEE Trans. ASLP , vol.15 , Issue.8 , pp. 2222-2235
- Toda, T.¹ Black, A.W.² Tokuda, K.³

8
- 0025892924
- Statistical analysis of bilingual speaker's speech for cross-language voice conversion
- M. Abe, K. Shikano, and H. Kuwabara. Statistical analysis of bilingual speaker's speech for cross-language voice conversion. J. Acoust. Soc. Am., vol.90, no.1, pp.76-82, 1991.
- (1991) J. Acoust. Soc. Am. , vol.90 , Issue.1 , pp. 76-82
- Abe, M.¹ Shikano, K.² Kuwabara, H.³

9
- 4544306344
- Cross-language voice conversion evaluation using bilingual databases
- July
- M. Mashimo, T. Toda, H. Kawanami, K. Shikano, and N. Campbell. Cross-language voice conversion evaluation using bilingual databases. IPSJ Journal, vol.43, no.7, pp.2177-2185, July 2002.
- (2002) IPSJ Journal , vol.43 , Issue.7 , pp. 2177-2185
- Mashimo, M.¹ Toda, T.² Kawanami, H.³ Shikano, K.⁴ Campbell, N.⁵

10
- 77953725318
- INCA algorithm for training voice conversion systems from nonparallel corpora
- D. Erro, A. Moreno, and A. Bonafonte. INCA algorithm for training voice conversion systems from nonparallel corpora. IEEE Trans. ASLP, vol.18, no.5, pp.944-953, 2010.
- (2010) IEEE Trans. ASLP , vol.18 , Issue.5 , pp. 944-953
- Erro, D.¹ Moreno, A.² Bonafonte, A.³

11
- 34547496175
- One-to-many and many-to-one voice conversion based on eigenvoices
- Hawaii, USA, Apr.
- T. Toda, Y. Ohtani, and K. Shikano. One-to-many and many-to-one voice conversion based on eigenvoices. Proc. ICASSP, pp.1249-1252, Hawaii, USA, Apr. 2007.
- (2007) Proc. ICASSP , pp. 1249-1252
- Toda, T.¹ Ohtani, Y.² Shikano, K.³

12
- 70450205902
- Cross-language voice conversion based on eigenvoices
- Brighton, UK, Sep.
- M. Charlier, Y. Ohtani, T. Toda, A. Moinet, and T. Dutoit. Cross-language voice conversion based on eigenvoices. Proc. INTERSPEECH, pp.1635-1638, Brighton, UK, Sep. 2009.
- (2009) Proc. INTERSPEECH , pp. 1635-1638
- Charlier, M.¹ Ohtani, Y.² Toda, T.³ Moinet, A.⁴ Dutoit, T.⁵

13
- 0032673049
- Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
- H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné. Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds. Speech Communication, vol.27, no.3-4, pp.187-207, 1999.
- (1999) Speech Communication , vol.27 , Issue.3-4 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² De Cheveigné, A.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.