SCOPUS Information Search Platform

Proceedings - 2008 6th International Symposium on Chinese Spoken Language Processing, ISCSLP 2008

2008, Pages 9-12

Cross-lingual speaker adaptation for HMM-based speech synthesis

Authors (3): Wu, Yi Jian (a); King, Simon (b); Tokuda, Keiichi (a)

(a) Nagoya Institute of Technology (Japan)

(b) University of Edinburgh (United Kingdom)

Author keywords

Cross lingual; HMM based speech synthesis; Speaker adaptation

Indexed keywords

HIDDEN MARKOV MODELS; LINGUISTICS; QUERY LANGUAGES; SPEECH SYNTHESIS; TARGETS; TELEPHONE SETS;

CONTEXT DEPENDENTS; CROSS-LINGUAL; HMM-BASED SPEECH SYNTHESIS; MANDARIN CHINESE; MAPPING RULES; ONE-TO-ONE MAPPINGS; PROSODIC FEATURES; SPEAKER ADAPTATION; TARGET SPEAKERS; TRIPHONE; VOICE MODELS;

SPEECH RECOGNITION;

EID: 60849092922; PISSN: None; EISSN: None; Source Type: Conference Proceeding
DOI: 10.1109/CHINSL.2008.ECP.14; Document Type: Conference Paper

Times cited: 44

References (21)

1
- 60849122466
- EMIME project
- EMIME project: http://www.emime.org

2
- 60849118010
- TC-Star project
- TC-Star project: http://www.tc-star.org

3
- 60849126020
- TC-Star: Cross-language voice conversion revisited
- D. Sundermann, H. Hoge, A. Bonafonte, H. Ney and J. Hirschberg, "TC-Star: Cross-language voice conversion revisited," in Proc. of the TC-Star Workshop 2006, Spain, 2006.
- (2006) Proc. of the TC-Star Workshop 2006
- Sundermann, D.¹ Hoge, H.² Bonafonte, A.³ Ney, H.⁴ Hirschberg, J.⁵

4
- 47949118319
- New approach to polyglot synthesis: How to speak any language with anyone's voice
- J. Latorre, K. Iwano and S. Furui, "New approach to polyglot synthesis: how to speak any language with anyone's voice," in Proc. of Multilingual Speech and Language Processing, 2006.
- (2006) Proc. of Multilingual Speech and Language Processing
- Latorre, J.¹ Iwano, K.² Furui, S.³

5
- 0029725605
- Speech synthesis from HMMs using dynamic features
- T. Masuko, K. Tokuda, T. Kobayashi and S. Imai, "Speech synthesis from HMMs using dynamic features," in Proc. of ICASSP, pp. 389-392, 1996.
- (1996) Proc. of ICASSP , pp. 389-392
- Masuko, T.¹ Tokuda, K.² Kobayashi, T.³ Imai, S.⁴

6
- 85009139544
- Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
- T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis," in Proc. of ICASSP, vol. 5, pp. 2347-2350, 1999.
- (1999) Proc. of ICASSP , vol.5 , pp. 2347-2350
- Yoshimura, T.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

7
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
- C.J. Leggetter and P.C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," in Computer Speech and Language, vol. 9, no. 2, pp. 171-185, 1995.
- (1995) Computer Speech and Language , vol.9 , Issue.2 , pp. 171-185
- Leggetter, C.J.¹ Woodland, P.C.²

8
- 0032050110
- Maximum likelihood linear transformations for HMM-based speech recognition
- M.J.F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," in Computer Speech and Language, vol. 12, no. 2, pp. 75-98, 1998.
- (1998) Computer Speech and Language , vol.12 , Issue.2 , pp. 75-98
- Gales, M.J.F.¹

9
- 0007985533
- Speaker adaptation for HMM-based speech synthesis system using MLLR
- T. Masuko, K. Tokuda, T. Kobayashi and S. Imai, "Speaker adaptation for HMM-based speech synthesis system using MLLR," in The Third ESCA/COCOSDA Workshop on Speech Synthesis, pp. 273-276, 1998.
- (1998) The Third ESCA/COCOSDA Workshop on Speech Synthesis , pp. 273-276
- Masuko, T.¹ Tokuda, K.² Kobayashi, T.³ Imai, S.⁴

10
- 33947669452
- HSMM-based model adaptation algorithms for average-voice-based speech synthesis
- J. Yamagishi, K. Ogata, Y. Nakano, J. Isogai and T. Kobayashi, "HSMM-based model adaptation algorithms for average-voice-based speech synthesis," in Proc. of ICASSP, pp. 77-80, May 2006.
- (2006) Proc. of ICASSP , pp. 77-80
- Yamagishi, J.¹ Ogata, K.² Nakano, Y.³ Isogai, J.⁴ Kobayashi, T.⁵

11
- 0142007308
- A training method of average voice model for HMM-based speech synthesis
- J. Yamagishi, M. Tamura, T. Masuko, K. Tokuda and T. Kobayashi, "A training method of average voice model for HMM-based speech synthesis," in IEICE Trans. of Fundamentals, vol. E86-A, no. 8, pp. 1956-1963, 2003.
- (2003) IEICE Trans. of Fundamentals , vol.E86-A , Issue.8 , pp. 1956-1963
- Yamagishi, J.¹ Tamura, M.² Masuko, T.³ Tokuda, K.⁴ Kobayashi, T.⁵

12
- 60849136241
- International Phonetic Alphabet
- http://en.wikipedia.org/wiki/International_Phonetic_Alphabet

13
- 51449098031
- Minimum generation error linear regression based model adaptation for HMM-based speech synthesis
- L. Qin, Y.-J. Wu, Z.-H. Ling, R.-H. Wang and L.-R. Dai, "Minimum generation error linear regression based model adaptation for HMM-based speech synthesis," in Proc. of ICASSP, pp. 3953-3956, Mar. 2008.
- (2008) Proc. of ICASSP , pp. 3953-3956
- Qin, L.¹ Wu, Y.-J.² Ling, Z.-H.³ Wang, R.-H.⁴ Dai, L.-R.⁵

14
- 0141479047
- A Training Method for Average Voice Model Based on Shared Decision Tree Context Clustering and Speaker Adaptive Training
- J. Yamagishi, M. Tamura, T. Masuko, K. Tokuda and T. Kobayashi, "A Training Method for Average Voice Model Based on Shared Decision Tree Context Clustering and Speaker Adaptive Training," in Proc. ICASSP 2003, vol. 1, pp. 716-719, 2003.
- (2003) Proc. ICASSP 2003 , vol.1 , pp. 716-719
- Yamagishi, J.¹ Tamura, M.² Masuko, T.³ Tokuda, K.⁴ Kobayashi, T.⁵

15
- 84867203039
- Unsupervised adaptation for HMM-based speech synthesis
- S. King, K. Tokuda, H. Zen and J. Yamagishi, "Unsupervised adaptation for HMM-based speech synthesis," in Proc. of Interspeech (accepted), 2008.
- (2008) Proc. of Interspeech
- King, S.¹ Tokuda, K.² Zen, H.³ Yamagishi, J.⁴

16
- 60849132933
- The CMU ARCTIC speech databases for speech synthesis research
- J. Kominek and A. Black, "The CMU ARCTIC speech databases for speech synthesis research," Language Technologies Institute, Carnegie Mellon University, Pittsburgh, PA, Tech. Rep. CMU-LTI-03-177, http://festvox.org/cmu_arctic/, 2003.

17
- 60849119188
- http://www.synsig.org/index.php/Blizzard_Challenge_2008
- (2008)

18
- 0032678076
- Hidden Markov models based on multi-space probability distribution for pitch pattern modeling
- K. Tokuda, T. Masuko, N. Miyazaki and T. Kobayashi, "Hidden Markov models based on multi-space probability distribution for pitch pattern modeling," in Proc. of ICASSP, pp. 229-232, 1999.
- (1999) Proc. of ICASSP , pp. 229-232
- Tokuda, K.¹ Masuko, T.² Miyazaki, N.³ Kobayashi, T.⁴

19
- 60849139326
- http://hts.sp.nitech.ac.jp/

20
- 0020596154
- Cepstral analysis synthesis on the mel frequency scale
- S. Imai, "Cepstral analysis synthesis on the mel frequency scale," in Proc. of ICASSP, pp. 93-96, 1983.
- (1983) Proc. of ICASSP , pp. 93-96
- Imai, S.¹

21
- 33745200051
- Speech parameter generation algorithm considering global variance for HMM-based speech synthesis
- T. Toda and K. Tokuda, "Speech parameter generation algorithm considering global variance for HMM-based speech synthesis," in Proc. of Interspeech, pp. 2801-2804, 2005.
- (2005) Proc. of Interspeech , pp. 2801-2804
- Toda, T.¹ Tokuda, K.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.