메뉴 건너뛰기




Volumn 2015-August, Issue , 2015, Pages 4220-4224

Attributing modelling errors in HMM synthesis by stepping gradually from natural to modelled speech

Author keywords

hidden Markov modelling; speech synthesis; vocoding

Indexed keywords

HIDDEN MARKOV MODELS; SPEECH COMMUNICATION; SPEECH SYNTHESIS;

EID: 84946042252     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2015.7178766     Document Type: Conference Paper
Times cited : (15)

References (25)
  • 1
    • 84878419996 scopus 로고    scopus 로고
    • The blizzard challenge 2010
    • Kansai Science City, Japan
    • Simon King and Vasilis Karaiskos, 'The Blizzard Challenge 2010," in Proc. Blizzard Challenge, Kansai Science City, Japan, 2010
    • (2010) Proc. Blizzard Challenge
    • King, S.1    Karaiskos, V.2
  • 4
    • 84910105608 scopus 로고    scopus 로고
    • Measuring a decade of progress in text-tospeech
    • Simon King, "Measuring a decade of progress in text-tospeech," Loquens, vol. 1, no. 1,2014
    • (2014) Loquens , vol.1 , Issue.1
    • King, S.1
  • 6
    • 33745215669 scopus 로고    scopus 로고
    • An overview of nitech HMM-based speech synthesis system for blizzard challenge 2005
    • H. Zen and T. Toda, "An overview of Nitech HMM-based speech synthesis system for Blizzard challenge 2005," in Proc. of Interspeech, 2005, pp. 93-96
    • (2005) Proc. of Interspeech , pp. 93-96
    • Zen, H.1    Toda, T.2
  • 8
    • 38549096029 scopus 로고    scopus 로고
    • A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
    • Toda Tomoki and Keiichi Tokuda, "A speech parameter generation algorithm considering global variance for HMM-based speech synthesis," IEICE TRANSACTIONS on Information and Systems, vol. 90, no. 5, pp. 816-824,2007
    • (2007) IEICE TRANSACTIONS on Information and Systems , vol.90 , Issue.5 , pp. 816-824
    • Tomoki, T.1    Tokuda, K.2
  • 9
    • 84856237844 scopus 로고    scopus 로고
    • An introduction to statistical parametric speech synthesis
    • Simon King, "An introduction to statistical parametric speech synthesis," Sadhana, vol. 36, no. 5, pp. 837-852,2011
    • (2011) Sadhana , vol.36 , Issue.5 , pp. 837-852
    • King, S.1
  • 10
    • 0033708106 scopus 로고    scopus 로고
    • Speech parameter generation algorithms for HMM-based speech synthesis
    • Keiichi Tokuda, Takayoshi Yoshimura, Takashi Masuko, Takao Kobayashi, and Tadashi Kitamura, "Speech parameter generation algorithms for HMM-based speech synthesis," in ICASSP 2000. IEEE, 2000, vol. 3, pp. 1315-1318
    • (2000) ICASSP 2000. IEEE , vol.3 , pp. 1315-1318
    • Tokuda, K.1    Yoshimura, T.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 11
    • 38549178971 scopus 로고    scopus 로고
    • Trajectory mixture density networks with multiple mixtures for acoustic-articulatory inversion
    • Springer
    • Korin Richmond, "Trajectory mixture density networks with multiple mixtures for acoustic-articulatory inversion," in Advances in Nonlinear Speech Processing, pp. 263-272. Springer, 2007
    • (2007) Advances in Nonlinear Speech Processing , pp. 263-272
    • Richmond, K.1
  • 12
    • 67651002140 scopus 로고    scopus 로고
    • Statistical parametric speech synthesis
    • Heiga Zen, Keiichi Tokuda, and Alan W. Black, "Statistical parametric speech synthesis," Speech Communication, vol. 51, no. 11, pp. 1039-1064, 2009
    • (2009) Speech Communication , vol.51 , Issue.11 , pp. 1039-1064
    • Zen, H.1    Tokuda, K.2    Black, A.W.3
  • 14
    • 84910070288 scopus 로고    scopus 로고
    • Investigating source and filter contributions, and their interaction, to statistical parametric speech synthesis
    • Thomas Merritt, Thomo Raitio, and Simon King, "Investigating source and filter contributions, and their interaction, to statistical parametric speech synthesis," in Proc. Interspeech, 2014
    • (2014) Proc. Interspeech
    • Merritt, T.1    Raitio, T.2    King, S.3
  • 15
    • 84910028520 scopus 로고    scopus 로고
    • Measuring the perceptual effects of modelling assumptions in speech synthesis using stimuli constructed from repeated natural speech
    • Gustav Eje Henter, Thomas Merritt, Matt Shannon, Catherine Mayo, and Simon King, "Measuring the perceptual effects of modelling assumptions in speech synthesis using stimuli constructed from repeated natural speech," in Proc. Interspeech, 2014
    • (2014) Proc. Interspeech
    • Eje Henter, G.1    Merritt, T.2    Shannon, M.3    Mayo, C.4    King, S.5
  • 17
    • 85131821539 scopus 로고
    • Melgeneralized cepstral analysis-A unified approach to speech spectral estimation
    • K. Tokuda, T. Kobayashi, T. Masuko, and S. Imai, "Melgeneralized cepstral analysis-A unified approach to speech spectral estimation," in ICSLP, 1994
    • (1994) ICSLP
    • Tokuda, K.1    Kobayashi, T.2    Masuko, T.3    Imai, S.4
  • 19
    • 0035472456 scopus 로고    scopus 로고
    • Pitch-scaled estimation of simultaneous voiced and turbulence-noise components in speech
    • Oct
    • P. J B Jackson and C.H. Shadle, "Pitch-scaled estimation of simultaneous voiced and turbulence-noise components in speech," IEEE Transactions on Speech and Audio Processing, vol. 9, no. 7, pp. 713-726, Oct 2001
    • (2001) IEEE Transactions on Speech and Audio Processing , vol.9 , Issue.7 , pp. 713-726
    • Jackson, P.J.B.1    Shadle, C.H.2
  • 21
    • 80051615235 scopus 로고    scopus 로고
    • Decision tree-based context clustering based on cross validation and hierarchical priors
    • H. Zen and MJ.F Gales, "Decision tree-based context clustering based on cross validation and hierarchical priors," in Proc. ICASSP, 2011,pp. 4560-4563
    • (2011) Proc. ICASSP , pp. 4560-4563
    • Zen, H.1    Gales, M.J.F.2
  • 22
    • 80051606114 scopus 로고    scopus 로고
    • Continuous FO in the sourceexcitation generation for HMM-based TTS: Do we need voiced/unvoiced classification?
    • J. Latorre, M.J.F. Gales, S. Buchholz, K. Knill, M. Tamura, Y. Ohtani, and M. Akamine, "Continuous FO in the sourceexcitation generation for HMM-based TTS: Do we need voiced/unvoiced classification?," in Proc. ICASSP, 2011, pp. 4724-4727
    • (2011) Proc. ICASSP , pp. 4724-4727
    • Latorre, J.1    Gales, M.J.F.2    Buchholz, S.3    Knill, K.4    Tamura, M.5    Ohtani, Y.6    Akamine, M.7
  • 23
    • 79551495380 scopus 로고    scopus 로고
    • Listeners weighting of acoustic cues to synthetic speech naturalness: A multidimensional scaling analysis
    • Catherine Mayo, Robert A. Clark, and Simon King, "Listeners weighting of acoustic cues to synthetic speech naturalness: A multidimensional scaling analysis," Speech Communication, vol. 53, no. 3, pp. 311-326, 2011
    • (2011) Speech Communication , vol.53 , Issue.3 , pp. 311-326
    • Mayo, C.1    Clark, R.A.2    King, S.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.