SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn 2015-August, Issue , 2015, Pages 4220-4224

Attributing modelling errors in HMM synthesis by stepping gradually from natural to modelled speech

(3) Merritt, Thomas a Latorre, Javier b King, Simon a

a UNIVERSITY OF EDINBURGH (United Kingdom)

b TOSHIBA CORPORATION (Japan)

Author keywords

hidden Markov modelling; speech synthesis; vocoding

Indexed keywords

HIDDEN MARKOV MODELS; SPEECH COMMUNICATION; SPEECH SYNTHESIS;

LINE SPECTRAL PAIRS; MODELLING ERROR; MULTI-DIMENSIONAL SCALING; SPECTRAL PARAMETERS; STATISTICAL PARAMETRIC SPEECH SYNTHESIS; SYNTHETIC SPEECH; TEXT-TO-SPEECH SYSTEM; VOCODING;

AUDIO SIGNAL PROCESSING;

EID: 84946042252 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2015.7178766 Document Type: Conference Paper

Times cited : (15)

References (25)

1
- 84878419996
- The blizzard challenge 2010
- Kansai Science City, Japan
- Simon King and Vasilis Karaiskos, 'The Blizzard Challenge 2010," in Proc. Blizzard Challenge, Kansai Science City, Japan, 2010
- (2010) Proc. Blizzard Challenge
- King, S.¹ Karaiskos, V.²

2
- 84878419996
- The blizzard challenge 2011
- Thrin, Italy
- Simon King and Vasilis Karaiskos, "The Blizzard Challenge 2011," in Proc. Blizzard Challenge, Thrin, Italy, 2011
- (2011) Proc. Blizzard Challenge
- King, S.¹ Karaiskos, V.²

3
- 84890516589
- The blizzard challenge 2012
- Portland, USA
- Simon King and Vasilis Karaiskos, "The Blizzard Challenge 2012," in Proc. Blizzard Challenge, Portland, USA, 2012
- (2012) Proc. Blizzard Challenge
- King, S.¹ Karaiskos, V.²

4
- 84910105608
- Measuring a decade of progress in text-tospeech
- Simon King, "Measuring a decade of progress in text-tospeech," Loquens, vol. 1, no. 1,2014
- (2014) Loquens , vol.1 , Issue.1
- King, S.¹

5
- 84897830964
- Introduction to the issue on statistical parametric speech synthesis
- Jianhua Tao,K eikichi Hirose,K eiichi Tokuda, Alan W. Black, and Simon King, "Introduction to the issue on statistical parametric speech synthesis," IEEE lournal of Selected Topics in Signal Processing, vol. 8, no. 2, pp. 170-172,2014
- (2014) IEEE Lournal of Selected Topics in Signal Processing , vol.8 , Issue.2 , pp. 170-172
- Tao, J.¹ Eikichi Hirose, K.² Eiichi Tokuda, K.³ Black, A.W.⁴ King, S.⁵

6
- 33745215669
- An overview of nitech HMM-based speech synthesis system for blizzard challenge 2005
- H. Zen and T. Toda, "An overview of Nitech HMM-based speech synthesis system for Blizzard challenge 2005," in Proc. of Interspeech, 2005, pp. 93-96
- (2005) Proc. of Interspeech , pp. 93-96
- Zen, H.¹ Toda, T.²

7
- 84876687945
- Speech synthesis based on hidden Markov models
- May
- K. Tokuda, Y. Nankaku, T. Toda, H. Zen, J. Yamagishi, and K. Oura, "Speech synthesis based on hidden Markov models," Proceedings of the IEEE, vol. 101, no. 5, pp. 1234-1252, May 2013
- (2013) Proceedings of the IEEE , vol.101 , Issue.5 , pp. 1234-1252
- Tokuda, K.¹ Nankaku, Y.² Toda, T.³ Zen, H.⁴ Yamagishi, J.⁵ Oura, K.⁶

8
- 38549096029
- A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
- Toda Tomoki and Keiichi Tokuda, "A speech parameter generation algorithm considering global variance for HMM-based speech synthesis," IEICE TRANSACTIONS on Information and Systems, vol. 90, no. 5, pp. 816-824,2007
- (2007) IEICE TRANSACTIONS on Information and Systems , vol.90 , Issue.5 , pp. 816-824
- Tomoki, T.¹ Tokuda, K.²

9
- 84856237844
- An introduction to statistical parametric speech synthesis
- Simon King, "An introduction to statistical parametric speech synthesis," Sadhana, vol. 36, no. 5, pp. 837-852,2011
- (2011) Sadhana , vol.36 , Issue.5 , pp. 837-852
- King, S.¹

10
- 0033708106
- Speech parameter generation algorithms for HMM-based speech synthesis
- Keiichi Tokuda, Takayoshi Yoshimura, Takashi Masuko, Takao Kobayashi, and Tadashi Kitamura, "Speech parameter generation algorithms for HMM-based speech synthesis," in ICASSP 2000. IEEE, 2000, vol. 3, pp. 1315-1318
- (2000) ICASSP 2000. IEEE , vol.3 , pp. 1315-1318
- Tokuda, K.¹ Yoshimura, T.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

11
- 38549178971
- Trajectory mixture density networks with multiple mixtures for acoustic-articulatory inversion
- Springer
- Korin Richmond, "Trajectory mixture density networks with multiple mixtures for acoustic-articulatory inversion," in Advances in Nonlinear Speech Processing, pp. 263-272. Springer, 2007
- (2007) Advances in Nonlinear Speech Processing , pp. 263-272
- Richmond, K.¹

12
- 67651002140
- Statistical parametric speech synthesis
- Heiga Zen, Keiichi Tokuda, and Alan W. Black, "Statistical parametric speech synthesis," Speech Communication, vol. 51, no. 11, pp. 1039-1064, 2009
- (2009) Speech Communication , vol.51 , Issue.11 , pp. 1039-1064
- Zen, H.¹ Tokuda, K.² Black, A.W.³

13
- 84910063941
- Investigating the shortcomings of HMM synthesis
- Thomas Merritt and Simon King, "Investigating the shortcomings of HMM synthesis," in Proc. 8th ISCA Workshop on Speech Synthesis (SSW8), pp. 185-190
- Proc. 8th ISCA Workshop on Speech Synthesis (SSW8) , pp. 185-190
- Merritt, T.¹ King, S.²

14
- 84910070288
- Investigating source and filter contributions, and their interaction, to statistical parametric speech synthesis
- Thomas Merritt, Thomo Raitio, and Simon King, "Investigating source and filter contributions, and their interaction, to statistical parametric speech synthesis," in Proc. Interspeech, 2014
- (2014) Proc. Interspeech
- Merritt, T.¹ Raitio, T.² King, S.³

15
- 84910028520
- Measuring the perceptual effects of modelling assumptions in speech synthesis using stimuli constructed from repeated natural speech
- Gustav Eje Henter, Thomas Merritt, Matt Shannon, Catherine Mayo, and Simon King, "Measuring the perceptual effects of modelling assumptions in speech synthesis using stimuli constructed from repeated natural speech," in Proc. Interspeech, 2014
- (2014) Proc. Interspeech
- Eje Henter, G.¹ Merritt, T.² Shannon, M.³ Mayo, C.⁴ King, S.⁵

16
- 84897847967
- Building HMM-TTS models on diverse data
- V. Wan, J. Latorre, K. Yanagisawa, N. Braunschweilers, L. Chen, M. Gales, and M. Akamine, "Building HMM-TTS models on diverse data," IEEE lournal of Selected Topics in Signal Processing, vol. 8, no. 2, 2014
- (2014) IEEE Lournal of Selected Topics in Signal Processing , vol.8 , Issue.2
- Wan, V.¹ Latorre, J.² Yanagisawa, K.³ Braunschweilers, N.⁴ Chen, L.⁵ Gales, M.⁶ Akamine, M.⁷

17
- 85131821539
- Melgeneralized cepstral analysis-A unified approach to speech spectral estimation
- K. Tokuda, T. Kobayashi, T. Masuko, and S. Imai, "Melgeneralized cepstral analysis-A unified approach to speech spectral estimation," in ICSLP, 1994
- (1994) ICSLP
- Tokuda, K.¹ Kobayashi, T.² Masuko, T.³ Imai, S.⁴

18
- 84863739383
- S Imai, T Kobayashi, K Tokuda, T Masuko, K Koishida, S Sako, and H Zen, "Speech signal processing toolkit (SPTK), version 3.3," 2009
- (2009) Speech Signal Processing Toolkit (SPTK), Version 3.3
- Imai, S.¹ Kobayashi, T.² Tokuda, K.³ Masuko, T.⁴ Koishida, K.⁵ Sako, S.⁶ Zen, H.⁷

19
- 0035472456
- Pitch-scaled estimation of simultaneous voiced and turbulence-noise components in speech
- Oct
- P. J B Jackson and C.H. Shadle, "Pitch-scaled estimation of simultaneous voiced and turbulence-noise components in speech," IEEE Transactions on Speech and Audio Processing, vol. 9, no. 7, pp. 713-726, Oct 2001
- (2001) IEEE Transactions on Speech and Audio Processing , vol.9 , Issue.7 , pp. 713-726
- Jackson, P.J.B.¹ Shadle, C.H.²

20
- 84878384520
- Ways to implement global variance in statistical speech synthesis
- Hanna Silen, Elina Helander, Jani Nurminen, and Moncef Gabbouj, "Ways to implement global variance in statistical speech synthesis," in Proc. Interspeech, 2012
- (2012) Proc. Interspeech
- Silen, H.¹ Helander, E.² Nurminen, J.³ Moncef Gabbouj⁴

21
- 80051615235
- Decision tree-based context clustering based on cross validation and hierarchical priors
- H. Zen and MJ.F Gales, "Decision tree-based context clustering based on cross validation and hierarchical priors," in Proc. ICASSP, 2011,pp. 4560-4563
- (2011) Proc. ICASSP , pp. 4560-4563
- Zen, H.¹ Gales, M.J.F.²

22
- 80051606114
- Continuous FO in the sourceexcitation generation for HMM-based TTS: Do we need voiced/unvoiced classification?
- J. Latorre, M.J.F. Gales, S. Buchholz, K. Knill, M. Tamura, Y. Ohtani, and M. Akamine, "Continuous FO in the sourceexcitation generation for HMM-based TTS: Do we need voiced/unvoiced classification?," in Proc. ICASSP, 2011, pp. 4724-4727
- (2011) Proc. ICASSP , pp. 4724-4727
- Latorre, J.¹ Gales, M.J.F.² Buchholz, S.³ Knill, K.⁴ Tamura, M.⁵ Ohtani, Y.⁶ Akamine, M.⁷

23
- 79551495380
- Listeners weighting of acoustic cues to synthetic speech naturalness: A multidimensional scaling analysis
- Catherine Mayo, Robert A. Clark, and Simon King, "Listeners weighting of acoustic cues to synthetic speech naturalness: A multidimensional scaling analysis," Speech Communication, vol. 53, no. 3, pp. 311-326, 2011
- (2011) Speech Communication , vol.53 , Issue.3 , pp. 311-326
- Mayo, C.¹ Clark, R.A.² King, S.³

24
- 0003562364
- Springer
- Ingwer Borg and Patrick J.F. Groenen, Modern Multidimensional Scaling, Springer, 2005
- (2005) Modern Multidimensional Scaling
- Borg, I.¹ Groenen, F.P.J.²

25
- 0003608388
- Sage
- Joseph B Kruskal and Myron Wish, Multidimensional scaling, vol. 11, Sage, 1978
- (1978) Multidimensional Scaling , vol.11
- Kruskal, J.B.¹ Wish, M.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.