메뉴 건너뛰기




Volumn 19, Issue 4, 2011, Pages 895-904

Unsupervised intralingual and cross-lingual speaker adaptation for HMM-Based speech synthesis using two-pass decision tree construction

Author keywords

Cross lingual; hidden Markov model (HMM) based speech synthesis; unsupervised speaker adaptation

Indexed keywords

ACOUSTIC MODEL; AUTOMATIC SPEECH RECOGNITION; CROSS-LINGUAL; DECISION TREE CONSTRUCTION; HMM-BASED SPEECH SYNTHESIS; LINGUISTIC ANALYSIS; SPEAKER ADAPTATION; SPEECH SYNTHESIS SYSTEM; SYNTHESIS MODELS; TRAINING DATASET; UNSUPERVISED ADAPTATION; UNSUPERVISED SPEAKER ADAPTATION;

EID: 79953289255     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2010.2066968     Document Type: Article
Times cited : (7)

References (27)
  • 3
    • 24144497811 scopus 로고    scopus 로고
    • Acoustic modeling of speaking styles and emotional expressions in HMM-based speech synthesis
    • J.Yamagishi, K. Onishi, T. Masuko, and T.Kobayashi, "Acoustic modeling of speaking styles and emotional expressions in HMM-based speech synthesis," IEICE Trans. Inf. Syst., vol. E88-D, no. 3, pp. 503-509, 2005.
    • (2005) IEICE Trans. Inf. Syst. , vol.E88-D , Issue.3 , pp. 503-509
    • Yamagishi, J.1    Onishi, K.2    Masuko, T.3    Kobayashi, T.4
  • 4
    • 67650854725 scopus 로고    scopus 로고
    • Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm
    • Jan.
    • J. Yamagishi, T. Kobayashi, Y. Nakano, K. Ogata, and J. Isogai, "Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm," IEEE Audio, Speech, Lang. Process., vol. 17, no. 1, pp. 66-83, Jan. 2009.
    • (2009) IEEE Audio, Speech, Lang. Process. , vol.17 , Issue.1 , pp. 66-83
    • Yamagishi, J.1    Kobayashi, T.2    Nakano, Y.3    Ogata, K.4    Isogai, J.5
  • 5
    • 84867203039 scopus 로고    scopus 로고
    • Unsupervised adaptation for HMM-based speech synthesis
    • S. King, K. Tokuda, H. Zen, and J. Yamagishi, "Unsupervised adaptation for HMM-based speech synthesis," in Proc. Interspeech, 2008, pp. 1869-1872.
    • (2008) Proc. Interspeech , pp. 1869-1872
    • King, S.1    Tokuda, K.2    Zen, H.3    Yamagishi, J.4
  • 6
    • 84856280064 scopus 로고
    • An evaluation of cross-language adaptation for rapid HMM development in a new language
    • B. Wheatley, K. Kondo, W. Anderson, and Y. Muthusamy, "An evaluation of cross-language adaptation for rapid HMM development in a new language," in Proc. ICASSP, 1994, vol. 1, pp. 237-240.
    • (1994) Proc. ICASSP , vol.1 , pp. 237-240
    • Wheatley, B.1    Kondo, K.2    Anderson, W.3    Muthusamy, Y.4
  • 7
    • 0004659972 scopus 로고    scopus 로고
    • MAP-based crosslanguage adaptation augmented by linguistic knowledge: From English to Chinese
    • P. Fung, C. Y. Ma, and W. K. Liu, "MAP-based crosslanguage adaptation augmented by linguistic knowledge: From English to Chinese," in Proc. Eurospeech, 1999, pp. 871-874.
    • (1999) Proc. Eurospeech , pp. 871-874
    • Fung, P.1    Ma, C.Y.2    Liu, W.K.3
  • 8
    • 60849092922 scopus 로고    scopus 로고
    • Cross-lingual speaker adaptation for HMM-based speech synthesis
    • Y. Wu, S. King, and K. Tokuda, "Cross-lingual speaker adaptation for HMM-based speech synthesis," in Proc. ISCSLP, 2008, pp. 1-4.
    • (2008) Proc. ISCSLP , pp. 1-4
    • Wu, Y.1    King, S.2    Tokuda, K.3
  • 9
    • 70450192740 scopus 로고    scopus 로고
    • State mapping based method for cross-lingual speaker adaptation in HMM-based speech synthesis
    • Y.Wu, Y. Nankaku, and K. Tokuda, "State mapping based method for cross-lingual speaker adaptation in HMM-based speech synthesis," in Proc. Interspeech, 2009, pp. 528-531.
    • (2009) Proc. Interspeech , pp. 528-531
    • Wu, Y.1    Nankaku, Y.2    Tokuda, K.3
  • 10
    • 70449126171 scopus 로고    scopus 로고
    • The HTS- 2008 system: Yet another evaluation of the speaker-adaptive HMMbased speech synthesis system in the 2008 blizzard challenge
    • J. Yamagishi, H. Zen, Y.-J. Wu, T. Toda, and T. Tokuda, "The HTS- 2008 system: Yet another evaluation of the speaker-adaptive HMMbased speech synthesis system in the 2008 blizzard challenge," in Proc. Blizzard, 2008, p.
    • (2008) Proc. Blizzard
    • Yamagishi, J.1    Zen, H.2    Wu, Y.-J.3    Toda, T.4    Tokuda, T.5
  • 12
    • 70450169407 scopus 로고    scopus 로고
    • Speech recognition with speech synthesis models by marginalising over decision tree leaves
    • J. Dines, L. Saheer, and H. Liang, "Speech recognition with speech synthesis models by marginalising over decision tree leaves," in Proc. Interspeech, 2009, pp. 1395-1398.
    • (2009) Proc. Interspeech , pp. 1395-1398
    • Dines, J.1    Saheer, L.2    Liang, H.3
  • 13
    • 70450185735 scopus 로고    scopus 로고
    • Two-pass decision tree construction for unsupervised adaptation of HMM-based synthesis models
    • M. Gibson, "Two-pass decision tree construction for unsupervised adaptation of HMM-based synthesis models," in Proc. Interspeech, 2009, pp. 1791-1794.
    • (2009) Proc. Interspeech , pp. 1791-1794
    • Gibson, M.1
  • 14
    • 33846405723 scopus 로고    scopus 로고
    • Details of the nitech HMM-based speech synthesis system for the blizzard challenge 2005
    • DOI 10.1093/ietisy/e90-1.1.325
    • H. Zen, T. Toda, M. Nakamura, and K. Tokuda, "Details of nitech HMM-based speech synthesis system for the blizzard challenge 2005," IEICE Trans. Inf. Syst., vol. E90-D, no. 1, pp. 325-333, 2007. (Pubitemid 46145336)
    • (2007) IEICE Transactions on Information and Systems , vol.E90-D , Issue.1 , pp. 325-333
    • Zen, H.1    Toda, T.2    Nakamura, M.3    Tokuda, K.4
  • 15
    • 78049411002 scopus 로고    scopus 로고
    • Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis using two-pass decision tree construction
    • M. Gibson, T. Hirsimaki, R. Karhila, M. Kurimo, and W. Byrne, "Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis using two-pass decision tree construction," in Proc. ICASSP, 2010, pp. 4642-4645.
    • (2010) Proc. ICASSP , pp. 4642-4645
    • Gibson, M.1    Hirsimaki, T.2    Karhila, R.3    Kurimo, M.4    Byrne, W.5
  • 18
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitch adaptive time-frequency smoothing and an instantaneous frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. Cheveigne, "Restructuring speech representations using a pitch adaptive time-frequency smoothing and an instantaneous frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Speech Commun., vol. 27, pp. 187-207, 1999.
    • (1999) Speech Commun. , vol.27 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    Cheveigne, A.3
  • 21
    • 0033708106 scopus 로고    scopus 로고
    • Speech parameter generation algorithms for HMM-based speech synthesis
    • K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, "Speech parameter generation algorithms for HMM-based speech synthesis," in Proc. ICASSP, 2000, pp. 1315-1318.
    • (2000) Proc. ICASSP , pp. 1315-1318
    • Tokuda, K.1    Yoshimura, T.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 22
    • 38549096029 scopus 로고    scopus 로고
    • A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
    • T. Toda and K. Tokuda, "A speech parameter generation algorithm considering global variance for HMM-based speech synthesis," IEICE Trans., vol. E90-D, no. 5, pp. 816-824, 2007.
    • (2007) IEICE Trans. , vol.E90-D , Issue.5 , pp. 816-824
    • Tokuda T. Toda1    Tokuda, K.2
  • 24
    • 0029375590 scopus 로고
    • Speaker adaptation using constrained estimation of Gaussian mixtures
    • Sep.
    • V. Digalakis, D. Rtischev, and L. Neumeyer, "Speaker adaptation using constrained estimation of Gaussian mixtures," IEEE Trans. Speech Audio Process., vol. 3, no. 5, pp. 357-366, Sep. 1995.
    • (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.5 , pp. 357-366
    • Digalakis, V.1    Rtischev, D.2    Neumeyer, L.3
  • 25
    • 0001859044 scopus 로고
    • A technique for the measurement of attitudes
    • R. Likert, "A technique for the measurement of attitudes," Arch. Psychol., vol. 140, pp. 1-55, 1932.
    • (1932) Arch. Psychol. , vol.140 , pp. 1-55
    • Likert, R.1
  • 26
    • 67650832556 scopus 로고    scopus 로고
    • Statistical analysis of the Blizzard Challenge 2007 listening test results
    • R. Clark, M. Podsiadlo, M. Fraser, C. Mayo, and S. King, "Statistical analysis of the Blizzard Challenge 2007 listening test results," in Proc. Blizzard, 2007.
    • (2007) Proc. Blizzard
    • Clark, R.1    Podsiadlo, M.2    Fraser, M.3    Mayo, C.4    King, S.5
  • 27
    • 44949230930 scopus 로고    scopus 로고
    • Europarl: A parallel corpus for statistical machine translation
    • P. Koehn, "Europarl: A parallel corpus for statistical machine translation," in Proc. MT Summit, 2005.
    • (2005) Proc. MT Summit
    • Koehn, P.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.