SCOPUS 정보 검색 플랫폼

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volumn , Issue , 2009, Pages 528-531

State mapping based method for cross-lingual speaker adaptation in HMM-based speech synthesis

(3) Wu, Yi Jian a,b Nankaku, Yoshihiko a Tokuda, Keiichi a

a NAGOYA INSTITUTE OF TECHNOLOGY (Japan)

b TTS Group (China)

Author keywords

Cross lingual speaker adaptation; HMM; Speech synthesis; State mapping

Indexed keywords

CROSS-LINGUAL; DATA MAPPINGS; HMM; HMM-BASED SPEECH SYNTHESIS; KULLBACK-LEIBLER DIVERGENCE; MAPPING INFORMATION; SPEAKER ADAPTATION; SPEECH QUALITY; TARGET LANGUAGE; VOICE MODEL;

HIDDEN MARKOV MODELS; SPEECH COMMUNICATION; SPEECH SYNTHESIS; TELEPHONE SETS;

MAPPING;

EID: 70450192740 PISSN: None EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (70)

References (17)

1
- 84867199514
- EMIME project
- EMIME project: http://www.emime.org

2
- 60849126020
- TC-Star: Cross-language voice conversion revisited
- Spain
- D. Sundermann, H. Hoge, A. Bonafonte, H. Ney and J. Hirschberg, "TC-Star: Cross-language voice conversion revisited," in Proc. of the TC-Star Workshop 2006, Spain, 2006.
- (2006) Proc. of the TC-Star Workshop 2006
- Sundermann, D.¹ Hoge, H.² Bonafonte, A.³ Ney, H.⁴ Hirschberg, J.⁵

3
- 0029725605
- Speech synthesis from HMMs using dynamic features
- T. Masuko, K. Tokuda, T. Kobayashi and S. Imai, "Speech synthesis from HMMs using dynamic features," in Proc. of ICASSP, pp. 389-392, 1996.
- (1996) Proc. of ICASSP , pp. 389-392
- Masuko, T.¹ Tokuda, K.² Kobayashi, T.³ Imai, S.⁴

4
- 85009139544
- Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
- T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis," in Proc. of ICASSP, vol. 5, pp. 2347-2350, 1999.
- (1999) Proc. of ICASSP , vol.5 , pp. 2347-2350
- Yoshimura, T.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

5
- 60849092922
- Cross-Lingual Speaker Adaptation for HMM-based Speech Synthesis
- Y.-J. Wu, S. King and K. Tokuda, "Cross-Lingual Speaker Adaptation for HMM-based Speech Synthesis," in Proc. of ISCSLP, pp. 9-12, 2008.
- (2008) Proc. of ISCSLP , pp. 9-12
- Wu, Y.-J.¹ King, S.² Tokuda, K.³

6
- 70349218937
- State mapping for cross-language speaker adaptation in TTS
- Y.-N. Chen, Y. Jiao, Y. Qian and F.K. Soong, "State mapping for cross-language speaker adaptation in TTS," in Proc. of ICASSP, 2009.
- (2009) Proc. of ICASSP
- Chen, Y.-N.¹ Jiao, Y.² Qian, Y.³ Soong, F.K.⁴

7
- 0032050110
- Maximum likelihood linear transformations for HMM-based speech recognition
- M.J.F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," in Computer Speech and Language, vol. 12, no. 2, pp. 75-98, 1998.
- (1998) Computer Speech and Language , vol.12 , Issue.2 , pp. 75-98
- Gales, M.J.F.¹

8
- 0007985533
- Speaker adaptation for HMM-based speech synthesis system using MLLR
- T. Masuko, K. Tokuda, T. Kobayashi and S. Imai, "Speaker adaptation for HMM-based speech synthesis system using MLLR," in The Third ESCA/COCOSDA Workshop on Speech Synthesis, pp. 273-276, 1998.
- (1998) The Third ESCA/COCOSDA Workshop on Speech Synthesis , pp. 273-276
- Masuko, T.¹ Tokuda, K.² Kobayashi, T.³ Imai, S.⁴

9
- 0142007308
- A training method of average voice model for HMM-based speech synthesis
- J. Yamagishi, M. Tamura, T. Masuko, K. Tokuda and T. Kobayashi, "A training method of average voice model for HMM-based speech synthesis," in IEICE Trans. of Fundamentals, vol. E86-A, no. 8, pp. 1956-1963, 2003.
- (2003) IEICE Trans. of Fundamentals , vol.E86-A , Issue.8 , pp. 1956-1963
- Yamagishi, J.¹ Tamura, M.² Masuko, T.³ Tokuda, K.⁴ Kobayashi, T.⁵

10
- 84867203039
- Unsupervised adaptation for HMM-based speech synthesis
- S. King, K. Tokuda, H. Zen and J. Yamagishi, "Unsupervised adaptation for HMM-based speech synthesis," in Proc. of Interspeech, pp. 1869-1872, 2008.
- (2008) Proc. of Interspeech , pp. 1869-1872
- King, S.¹ Tokuda, K.² Zen, H.³ Yamagishi, J.⁴

11
- 51449111086
- A cross-language state mapping approach to bilingual (Mandarin-English) TTS
- H. Liang, Y. Qian, F. Soong and G. Liu, "A cross-language state mapping approach to bilingual (Mandarin-English) TTS," in Proc. of ICASSP, pp. 4641-4644, 2008.
- (2008) Proc. of ICASSP , pp. 4641-4644
- Liang, H.¹ Qian, Y.² Soong, F.³ Liu, G.⁴

12
- 34547507876
- Divergence-based similarity measure for spoken document retrieval
- P. Liu, F.K. Soong and J.-L. Zhou, "Divergence-based similarity measure for spoken document retrieval," in Proc. of ICASSP, pp. 89-92, 2007.
- (2007) Proc. of ICASSP , pp. 89-92
- Liu, P.¹ Soong, F.K.² Zhou, J.-L.³

13
- 70450202069
- J. Kominek and A. Black, The CMU ARCTIC speech databases for speech synthesis research, Language Technologies Institute, Carnegie Mellon University, Pittsburgh, PA, Tech. Rep. CMULTI-03-177, http://festvox.org/cmu arctic/, 2003.
- J. Kominek and A. Black, "The CMU ARCTIC speech databases for speech synthesis research," Language Technologies Institute, Carnegie Mellon University, Pittsburgh, PA, Tech. Rep. CMULTI-03-177, http://festvox.org/cmu arctic/, 2003.

14
- 0032678076
- Hidden markov models based on multi-space probability distribution for pitch pattern modeling
- K. Tokuda, T. Masuko, N. Miyazaki and T. Kobayashi, "Hidden markov models based on multi-space probability distribution for pitch pattern modeling," in Proc. of ICASSP, pp. 229-232, 1999.
- (1999) Proc. of ICASSP , pp. 229-232
- Tokuda, K.¹ Masuko, T.² Miyazaki, N.³ Kobayashi, T.⁴

15
- 84867203662
- http://hts.sp.nitech.ac.jp/

16
- 0020596154
- Cepstral analysis synthesis on the mel frequency scale
- S. Imai, "Cepstral analysis synthesis on the mel frequency scale," in Proc. of ICASSP, pp. 93-96, 1983.
- (1983) Proc. of ICASSP , pp. 93-96
- Imai, S.¹

17
- 33745200051
- Speech parameter generation algorithm considering global variance for HMM-based speech synthesis
- T. Toda and K. Tokuda, "Speech parameter generation algorithm considering global variance for HMM-based speech synthesis," in Proc. of Interspeech, pp. 2801-2804, 2005.
- (2005) Proc. of Interspeech , pp. 2801-2804
- Toda, T.¹ Tokuda, K.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.