메뉴 건너뛰기




Volumn , Issue , 2008, Pages 9-12

Cross-lingual speaker adaptation for HMM-based speech synthesis

Author keywords

Cross lingual; HMM based speech synthesis; Speaker adaptation

Indexed keywords

HIDDEN MARKOV MODELS; LINGUISTICS; QUERY LANGUAGES; SPEECH SYNTHESIS; TARGETS; TELEPHONE SETS;

EID: 60849092922     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/CHINSL.2008.ECP.14     Document Type: Conference Paper
Times cited : (44)

References (21)
  • 1
    • 60849122466 scopus 로고    scopus 로고
    • EMIME project
    • EMIME project: http://www.emime.org
  • 2
    • 60849118010 scopus 로고    scopus 로고
    • TC-Star project
    • TC-Star project: http://www.tc-star.org
  • 5
  • 6
    • 85009139544 scopus 로고    scopus 로고
    • Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
    • T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis," in Proc. of ICASSP, vol. 5, pp. 2347-2350, 1999.
    • (1999) Proc. of ICASSP , vol.5 , pp. 2347-2350
    • Yoshimura, T.1    Tokuda, K.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 7
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C.J. Leggetter and P.C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," in Computer Speech and Language, vol. 9, no. 2, pp. 171-185, 1995.
    • (1995) Computer Speech and Language , vol.9 , Issue.2 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 8
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • M.J.F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," in Computer Speech and Language, vol. 12, no. 2, pp. 75-98, 1998.
    • (1998) Computer Speech and Language , vol.12 , Issue.2 , pp. 75-98
    • Gales, M.J.F.1
  • 10
    • 33947669452 scopus 로고    scopus 로고
    • HSMM-based model adaptation algorithms for average-voice-based speech synthesis
    • May
    • J. Yamagishi, K. Ogata, Y. Nakano, J. Isogai and T. Kobayashi, "HSMM-based model adaptation algorithms for average-voice-based speech synthesis," in Proc. of ICASSP, pp. 77-80, May 2006.
    • (2006) Proc. of ICASSP , pp. 77-80
    • Yamagishi, J.1    Ogata, K.2    Nakano, Y.3    Isogai, J.4    Kobayashi, T.5
  • 11
    • 0142007308 scopus 로고    scopus 로고
    • A training method of average voice model for HMM-based speech synthesis
    • J. Yamagishi, M. Tamura, T. Masuko, K. Tokuda and T. Kobayashi, "A training method of average voice model for HMM-based speech synthesis," in IEICE Trans. of Fundamentals, vol. E86-A, no. 8, pp. 1956-1963, 2003.
    • (2003) IEICE Trans. of Fundamentals , vol.E86-A , Issue.8 , pp. 1956-1963
    • Yamagishi, J.1    Tamura, M.2    Masuko, T.3    Tokuda, K.4    Kobayashi, T.5
  • 12
    • 60849136241 scopus 로고    scopus 로고
    • Alphabet
    • http://en.wikipedia.org/wiki/International Phonetic Alphabet
    • Phonetic
  • 13
    • 51449098031 scopus 로고    scopus 로고
    • Minimum generation error lineal regression based model adaptation for HMM-based speech synthesis
    • Mar
    • L. Qin, Y.-J. Wu, Z.-H. Ling, R.-H. Wang and L.-R. Dai, "Minimum generation error lineal regression based model adaptation for HMM-based speech synthesis," in Proc. of ICASSP, pp. 3953-3956, Mar. 2008.
    • (2008) Proc. of ICASSP , pp. 3953-3956
    • Qin, L.1    Wu, Y.-J.2    Ling, Z.-H.3    Wang, R.-H.4    Dai, L.-R.5
  • 14
    • 0141479047 scopus 로고    scopus 로고
    • A Training Method for Average Voice Model Based on Shared Decision Tree Context Clustering and Speaker Adaptive Training
    • J. Yamagishi, M. Tamura, T. Masuko, K. Tokuda and T. Kobayashi, "A Training Method for Average Voice Model Based on Shared Decision Tree Context Clustering and Speaker Adaptive Training," in Proc. ICASSP 2003, vol. 1, pp. 716-719, 2003.
    • (2003) Proc. ICASSP 2003 , vol.1 , pp. 716-719
    • Yamagishi, J.1    Tamura, M.2    Masuko, T.3    Tokuda, K.4    Kobayashi, T.5
  • 15
  • 16
    • 60849132933 scopus 로고    scopus 로고
    • J. Kominek and A. Black, The CMU ARCTIC speech databases for speech synthesis research, Language Technologies Institute, Carnegie Mellon University, Pittsburgh, PA, Tech. Rep. CMULTI-03-177, http://festvox.org/cmu arctic/, 2003.
    • J. Kominek and A. Black, "The CMU ARCTIC speech databases for speech synthesis research," Language Technologies Institute, Carnegie Mellon University, Pittsburgh, PA, Tech. Rep. CMULTI-03-177, http://festvox.org/cmu arctic/, 2003.
  • 17
    • 60849119188 scopus 로고    scopus 로고
    • http://www.synsig.org/index.php/Blizzard Challenge 2008
    • (2008)
  • 18
    • 0032678076 scopus 로고    scopus 로고
    • Hidden markov models based on multi-space probability distribution for pitch pattern modeling
    • K. Tokuda, T. Masuko, N. Miyazaki and T. Kobayashi, "Hidden markov models based on multi-space probability distribution for pitch pattern modeling," in Proc. of ICASSP, pp. 229-232, 1999.
    • (1999) Proc. of ICASSP , pp. 229-232
    • Tokuda, K.1    Masuko, T.2    Miyazaki, N.3    Kobayashi, T.4
  • 19
    • 60849139326 scopus 로고    scopus 로고
    • http://hts.sp.nitech.ac.jp/
  • 20
    • 0020596154 scopus 로고
    • Cepstral analysis synthesis on the mel frequency scale
    • S. Imai, "Cepstral analysis synthesis on the mel frequency scale," in Proc. of ICASSP, pp. 93-96, 1983.
    • (1983) Proc. of ICASSP , pp. 93-96
    • Imai, S.1
  • 21
    • 33745200051 scopus 로고    scopus 로고
    • Speech parameter generation algorithm considering global variance for HMM-based speech synthesis
    • T. Toda and K. Tokuda, "Speech parameter generation algorithm considering global variance for HMM-based speech synthesis," in Proc. of Interspeech, pp. 2801-2804, 2005.
    • (2005) Proc. of Interspeech , pp. 2801-2804
    • Toda, T.1    Tokuda, K.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.