SCOPUS 정보 검색 플랫폼

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volumn , Issue , 2014, Pages 2917-2921

Analysis of spectral enhancement using global variance in HMM-based speech synthesis

(2) Nose, Takashi a Ito, Akinori a

a TOHOKU UNIVERSITY (Japan)

Author keywords

Global variance; HMM based speech synthesis; Over smoothing; Parameter generation; Variance compensation

Indexed keywords

SPEECH SYNTHESIS;

GLOBAL VARIANCE; HMM-BASED SPEECH SYNTHESIS; OVER-SMOOTHING; PARAMETER GENERATION; VARIANCE COMPENSATIONS;

SPEECH COMMUNICATION;

EID: 84910088495 PISSN: 2308457X EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (9)

References (24)

1
- 67651002140
- Statistical parametric speech synthesis
- H. Zen, K. Tokuda, and A. Black, "Statistical parametric speech synthesis, " Speech Communication, vol. 51, no. 11, pp. 1039-1064, 2009.
- (2009) Speech Communication , vol.51 , Issue.11 , pp. 1039-1064
- Zen, H.¹ Tokuda, K.² Black, A.³

2
- 84866846705
- Recent development of HMM-based expressive speech synthesis and its applications
- T. Nose and T. Kobayashi, "Recent development of HMM-based expressive speech synthesis and its applications, " in Proc. APSIPA ASC 2011, 2011, http://www.apsipa.org/proceedings2011/pdf/APSIPA189.pdf.
- (2011) Proc. APSIPA ASC 2011
- Nose, T.¹ Kobayashi, T.²

3
- 85009097254
- Mixed excitation for HMM-based speech synthesis
- T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Mixed excitation for HMM-based speech synthesis, " in Proc. EUROSPEECH 2001, vol. 3, 2001, pp. 2263-2266.
- (2001) Proc. EUROSPEECH 2001 , vol.3 , pp. 2263-2266
- Yoshimura, T.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

4
- 38549096029
- A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
- T. Toda and K. Tokuda, "A speech parameter generation algorithm considering global variance for HMM-based speech synthesis, " IEICE Trans. Inf. & Syst., vol. E90-D, no. 5, pp. 816-824, 2007.
- (2007) IEICE Trans. Inf. & Syst. , vol.E90-D , Issue.5 , pp. 816-824
- Toda, T.¹ Tokuda, K.²

5
- 77953715694
- Statistical textto-speech synthesis based on segment-wise representation with a norm constraint
- S. Tiomkin, D. Malah, and S. Shechtman, "Statistical textto-speech synthesis based on segment-wise representation with a norm constraint, " IEEE Trans. Audio, Speech, and Language Process., vol. 18, no. 5, pp. 1077-1082, 2010.
- (2010) IEEE Trans. Audio, Speech, and Language Process. , vol.18 , Issue.5 , pp. 1077-1082
- Tiomkin, S.¹ Malah, D.² Shechtman, S.³

6
- 84878387899
- Histogram-based spectral equalization for HMM-based speech synthesis using MEL-LSP
- Y. Ohtani, M. Tamura, M. Morita, T. Kagoshima, and M. Akamine, "Histogram-based spectral equalization for HMM-based speech synthesis using mel-LSP, " in Proc. INTERSPEECH 2012, 2012, pp. 1155-1158.
- (2012) Proc. INTERSPEECH 2012 , pp. 1155-1158
- Ohtani, Y.¹ Tamura, M.² Morita, M.³ Kagoshima, T.⁴ Akamine, M.⁵

7
- 51449106803
- Minimum generation error criterion considering global/local variance for HMM-based speech synthesis
- Y. Wu, H. Zen, Y. Nankaku, and K. Tokuda, "Minimum generation error criterion considering global/local variance for HMM-based speech synthesis, " in Proc. ICASSP 2008, 2008, pp. 4621-4624.
- (2008) Proc. ICASSP 2008 , pp. 4621-4624
- Wu, Y.¹ Zen, H.² Nankaku, Y.³ Tokuda, K.⁴

8
- 67650826181
- Trajectory training considering global variance for HMM-based speech synthesis
- T. Toda and S. Young, "Trajectory training considering global variance for HMM-based speech synthesis, " in Proc. ICASSP 2009, 2009, pp. 4025-4028.
- (2009) Proc. ICASSP 2009 , pp. 4025-4028
- Toda, T.¹ Young, S.²

9
- 79959847301
- Global variance modeling on the log power spectrum of LSPS for HMM-based speech synthesis
- Z. Ling, Y. Hu, and L. Dai, "Global variance modeling on the log power spectrum of LSPs for HMM-based speech synthesis, " in Proc. INTERSPEECH 2010, 2010, pp. 825-828.
- (2010) Proc. INTERSPEECH 2010 , pp. 825-828
- Ling, Z.¹ Hu, Y.² Dai, L.³

10
- 80051648616
- Global variance modeling on frequency domain delta LSP for HMM based speech synthesis
- S. Pan, Y. Nankaku, K. Tokuda, and J. Tao, "Global variance modeling on frequency domain delta LSP for HMMbased speech synthesis, " in Proc. ICASSP 2011, 2011, pp. 4716-4719.
- (2011) Proc. ICASSP 2011 , pp. 4716-4719
- Pan, S.¹ Nankaku, Y.² Tokuda, K.³ Tao, J.⁴

11
- 85008525798
- Product of experts for statistical parametric speech synthesis
- H. Zen, M. Gales, Y. Nankaku, and K. Tokuda, "Product of experts for statistical parametric speech synthesis, " IEEE Trans. Audio, Speech, and Language Process., vol. 20, no. 3, pp. 794-805, 2012.
- (2012) IEEE Trans. Audio, Speech, and Language Process. , vol.20 , Issue.3 , pp. 794-805
- Zen, H.¹ Gales, M.² Nankaku, Y.³ Tokuda, K.⁴

12
- 77957917902
- Minimum generation error training for HMM-based speech synthesis
- Y. Wu and R. Wang, "Minimum generation error training for HMM-based speech synthesis, " in Proc. ICASSP 2006, 2006, pp. 889-892.
- (2006) Proc. ICASSP 2006 , pp. 889-892
- Wu, Y.¹ Wang, R.²

13
- 33749573927
- Reformulating the HMM as a trajectory model by imposing explicit relationships between static and dynamic feature vector sequences
- H. Zen, K. Tokuda, and T. Kitamura, "Reformulating the HMM as a trajectory model by imposing explicit relationships between static and dynamic feature vector sequences, " Computer Speech & Language, vol. 21, no. 1, pp. 153-173, 2007.
- (2007) Computer Speech & Language , vol.21 , Issue.1 , pp. 153-173
- Zen, H.¹ Tokuda, K.² Kitamura, T.³

14
- 84897832343
- A parameter generation algorithm using local variance for HMM-based speech synthesis
- T. Nose, V. Chunwijitra, and T. Kobayashi, "A parameter generation algorithm using local variance for HMM-based speech synthesis, " IEEE Trans. Audio, Speech, and Language Process., pp. 221-228, 2014.
- (2014) IEEE Trans. Audio, Speech, and Language Process. , pp. 221-228
- Nose, T.¹ Chunwijitra, V.² Kobayashi, T.³

15
- 0028996993
- Speech parameter generation from HMM using dynamic features
- K. Tokuda, T. Kobayashi, and S. Imai, "Speech parameter generation from HMM using dynamic features, " in Proc. ICASSP-95, 1995, pp. 660-663.
- (1995) Proc. ICASSP-95 , pp. 660-663
- Tokuda, K.¹ Kobayashi, T.² Imai, S.³

16
- 84890495160
- Fast, low-artifact speech synthesis considering global variance
- M. Shannon and W. Byrne, "Fast, low-artifact speech synthesis considering global variance, " in Proc. ICASSP 2013, 2013, pp. 7869-7873.
- (2013) Proc. ICASSP 2013 , pp. 7869-7873
- Shannon, M.¹ Byrne, W.²

17
- 84865754815
- Voice conversion using GMM with enhanced global variance
- H. Benisty and D. Malah, "Voice conversion using GMM with enhanced global variance, " in INTERSPEECH 2011, 2011, pp. 669-672.
- (2011) INTERSPEECH 2011 , pp. 669-672
- Benisty, H.¹ Malah, D.²

18
- 84901793334
- Minimum kullback-leibler divergence parameter generation for HMM-based speech synthesis
- Z.-H. Ling and L.-R. Dai, "Minimum Kullback-Leibler divergence parameter generation for HMM-based speech synthesis, " IEEE Trans. Audio, Speech, and Language Process., vol. 20, no. 5, pp. 1492-1502, 2012.
- (2012) IEEE Trans. Audio, Speech, and Language Process. , vol.20 , Issue.5 , pp. 1492-1502
- Ling, Z.-H.¹ Dai, L.-R.²

19
- 0025475528
- ATR Japanese speech database as a tool of speech recognition and synthesis
- A. Kurematsu, K. Takeda, Y. Sagisaka, S. Katagiri, H. Kuwabara, and K. Shikano, "ATR Japanese speech database as a tool of speech recognition and synthesis, " Speech Communication, vol. 9, no. 4, pp. 357-363, 1990.
- (1990) Speech Communication , vol.9 , Issue.4 , pp. 357-363
- Kurematsu, A.¹ Takeda, K.² Sagisaka, Y.³ Katagiri, S.⁴ Kuwabara, H.⁵ Shikano, K.⁶

20
- 0032673049
- Restructuring speech representations using a pitchadaptive time-frequency smoothing and an instantaneousfrequency-based F0 extraction: Possible role of a repetitive structure in sounds
- H. Kawahara, I. Masuda-Katsuse, and A. deCheveigne, "Restructuring speech representations using a pitchadaptive time-frequency smoothing and an instantaneousfrequency-based F0 extraction: Possible role of a repetitive structure in sounds, " Speech Communication, vol. 27, no. 3-4, pp. 187-207, 1999.
- (1999) Speech Communication , vol.27 , Issue.3-4 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² Decheveigne, A.³

21
- 44449177634
- A hidden semi-markov model-based speech synthesis system
- H. Zen, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "A hidden semi-Markov model-based speech synthesis system, " IEICE Trans. Inf. & Syst., vol. E90-D, no. 5, pp. 825-834, 2007.
- (2007) IEICE Trans. Inf. & Syst. , vol.E90-D , Issue.5 , pp. 825-834
- Zen, H.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

22
- 0033906251
- MDL-based contextdependent subword modeling for speech recognition
- K. Shinoda and T. Watanabe, "MDL-based contextdependent subword modeling for speech recognition, " J. Acoust. Soc. Jpn. (E), vol. 21, no. 2, pp. 79-86, 2000.
- (2000) J. Acoust. Soc. Jpn. (E) , vol.21 , Issue.2 , pp. 79-86
- Shinoda, K.¹ Watanabe, T.²

23
- 84890490547
- Statistical parametric speech synthesis using deep neural networks
- H. Zen, A. Senior, and M. Schuster, "Statistical parametric speech synthesis using deep neural networks, " in Proc. ICASSP 2013, 2013, pp. 7962-7966.
- (2013) Proc. ICASSP 2013 , pp. 7962-7966
- Zen, H.¹ Senior, A.² Schuster, M.³

24
- 84897902941
- Statistical parametric speech synthesis based on gaussian process regression
- T. Koriyama, T. Nose, and T. Kobayashi, "Statistical parametric speech synthesis based on Gaussian process regression, " IEEE Trans. Audio, Speech, and Language Process., pp. 173-183, 2013.
- (2013) IEEE Trans. Audio, Speech, and Language Process. , pp. 173-183
- Koriyama, T.¹ Nose, T.² Kobayashi, T.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.