



2014, Pages 2917-2921

Analysis of spectral enhancement using global variance in HMM-based speech synthesis

Author keywords

Global variance; HMM based speech synthesis; Over smoothing; Parameter generation; Variance compensation

Indexed keywords

SPEECH SYNTHESIS;

EID: 84910088495     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 9

References (24)
  • 1
    • 67651002140
    • Statistical parametric speech synthesis
    • H. Zen, K. Tokuda, and A. Black, "Statistical parametric speech synthesis," Speech Communication, vol. 51, no. 11, pp. 1039-1064, 2009.
    • (2009) Speech Communication, vol. 51, no. 11, pp. 1039-1064
    • Zen, H.; Tokuda, K.; Black, A.
  • 2
    • 84866846705
    • Recent development of HMM-based expressive speech synthesis and its applications
    • T. Nose and T. Kobayashi, "Recent development of HMM-based expressive speech synthesis and its applications," in Proc. APSIPA ASC 2011, 2011, http://www.apsipa.org/proceedings2011/pdf/APSIPA189.pdf.
    • (2011) Proc. APSIPA ASC 2011
    • Nose, T.; Kobayashi, T.
  • 4
    • 38549096029
    • A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
    • T. Toda and K. Tokuda, "A speech parameter generation algorithm considering global variance for HMM-based speech synthesis," IEICE Trans. Inf. & Syst., vol. E90-D, no. 5, pp. 816-824, 2007.
    • (2007) IEICE Trans. Inf. & Syst., vol. E90-D, no. 5, pp. 816-824
    • Toda, T.; Tokuda, K.
  • 5
    • 77953715694
    • Statistical text-to-speech synthesis based on segment-wise representation with a norm constraint
    • S. Tiomkin, D. Malah, and S. Shechtman, "Statistical text-to-speech synthesis based on segment-wise representation with a norm constraint," IEEE Trans. Audio, Speech, and Language Process., vol. 18, no. 5, pp. 1077-1082, 2010.
    • (2010) IEEE Trans. Audio, Speech, and Language Process., vol. 18, no. 5, pp. 1077-1082
    • Tiomkin, S.; Malah, D.; Shechtman, S.
  • 6
    • 84878387899
    • Histogram-based spectral equalization for HMM-based speech synthesis using mel-LSP
    • Y. Ohtani, M. Tamura, M. Morita, T. Kagoshima, and M. Akamine, "Histogram-based spectral equalization for HMM-based speech synthesis using mel-LSP," in Proc. INTERSPEECH 2012, 2012, pp. 1155-1158.
    • (2012) Proc. INTERSPEECH 2012, pp. 1155-1158
    • Ohtani, Y.; Tamura, M.; Morita, M.; Kagoshima, T.; Akamine, M.
  • 7
    • 51449106803
    • Minimum generation error criterion considering global/local variance for HMM-based speech synthesis
    • Y. Wu, H. Zen, Y. Nankaku, and K. Tokuda, "Minimum generation error criterion considering global/local variance for HMM-based speech synthesis," in Proc. ICASSP 2008, 2008, pp. 4621-4624.
    • (2008) Proc. ICASSP 2008, pp. 4621-4624
    • Wu, Y.; Zen, H.; Nankaku, Y.; Tokuda, K.
  • 8
    • 67650826181
    • Trajectory training considering global variance for HMM-based speech synthesis
    • T. Toda and S. Young, "Trajectory training considering global variance for HMM-based speech synthesis," in Proc. ICASSP 2009, 2009, pp. 4025-4028.
    • (2009) Proc. ICASSP 2009, pp. 4025-4028
    • Toda, T.; Young, S.
  • 9
    • 79959847301
    • Global variance modeling on the log power spectrum of LSPs for HMM-based speech synthesis
    • Z. Ling, Y. Hu, and L. Dai, "Global variance modeling on the log power spectrum of LSPs for HMM-based speech synthesis," in Proc. INTERSPEECH 2010, 2010, pp. 825-828.
    • (2010) Proc. INTERSPEECH 2010, pp. 825-828
    • Ling, Z.; Hu, Y.; Dai, L.
  • 10
    • 80051648616
    • Global variance modeling on frequency domain delta LSP for HMM-based speech synthesis
    • S. Pan, Y. Nankaku, K. Tokuda, and J. Tao, "Global variance modeling on frequency domain delta LSP for HMM-based speech synthesis," in Proc. ICASSP 2011, 2011, pp. 4716-4719.
    • (2011) Proc. ICASSP 2011, pp. 4716-4719
    • Pan, S.; Nankaku, Y.; Tokuda, K.; Tao, J.
  • 12
    • 77957917902
    • Minimum generation error training for HMM-based speech synthesis
    • Y. Wu and R. Wang, "Minimum generation error training for HMM-based speech synthesis," in Proc. ICASSP 2006, 2006, pp. 889-892.
    • (2006) Proc. ICASSP 2006, pp. 889-892
    • Wu, Y.; Wang, R.
  • 13
    • 33749573927
    • Reformulating the HMM as a trajectory model by imposing explicit relationships between static and dynamic feature vector sequences
    • H. Zen, K. Tokuda, and T. Kitamura, "Reformulating the HMM as a trajectory model by imposing explicit relationships between static and dynamic feature vector sequences," Computer Speech & Language, vol. 21, no. 1, pp. 153-173, 2007.
    • (2007) Computer Speech & Language, vol. 21, no. 1, pp. 153-173
    • Zen, H.; Tokuda, K.; Kitamura, T.
  • 15
    • 0028996993
    • Speech parameter generation from HMM using dynamic features
    • K. Tokuda, T. Kobayashi, and S. Imai, "Speech parameter generation from HMM using dynamic features," in Proc. ICASSP-95, 1995, pp. 660-663.
    • (1995) Proc. ICASSP-95, pp. 660-663
    • Tokuda, K.; Kobayashi, T.; Imai, S.
  • 16
    • 84890495160
    • Fast, low-artifact speech synthesis considering global variance
    • M. Shannon and W. Byrne, "Fast, low-artifact speech synthesis considering global variance," in Proc. ICASSP 2013, 2013, pp. 7869-7873.
    • (2013) Proc. ICASSP 2013, pp. 7869-7873
    • Shannon, M.; Byrne, W.
  • 17
    • 84865754815
    • Voice conversion using GMM with enhanced global variance
    • H. Benisty and D. Malah, "Voice conversion using GMM with enhanced global variance," in Proc. INTERSPEECH 2011, 2011, pp. 669-672.
    • (2011) Proc. INTERSPEECH 2011, pp. 669-672
    • Benisty, H.; Malah, D.
  • 18
    • 84901793334
    • Minimum Kullback-Leibler divergence parameter generation for HMM-based speech synthesis
    • Z.-H. Ling and L.-R. Dai, "Minimum Kullback-Leibler divergence parameter generation for HMM-based speech synthesis," IEEE Trans. Audio, Speech, and Language Process., vol. 20, no. 5, pp. 1492-1502, 2012.
    • (2012) IEEE Trans. Audio, Speech, and Language Process., vol. 20, no. 5, pp. 1492-1502
    • Ling, Z.-H.; Dai, L.-R.
  • 20
    • 0032673049
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Speech Communication, vol. 27, no. 3-4, pp. 187-207, 1999.
    • (1999) Speech Communication, vol. 27, no. 3-4, pp. 187-207
    • Kawahara, H.; Masuda-Katsuse, I.; de Cheveigné, A.
  • 22
    • 0033906251
    • MDL-based context-dependent subword modeling for speech recognition
    • K. Shinoda and T. Watanabe, "MDL-based context-dependent subword modeling for speech recognition," J. Acoust. Soc. Jpn. (E), vol. 21, no. 2, pp. 79-86, 2000.
    • (2000) J. Acoust. Soc. Jpn. (E), vol. 21, no. 2, pp. 79-86
    • Shinoda, K.; Watanabe, T.
  • 23
    • 84890490547
    • Statistical parametric speech synthesis using deep neural networks
    • H. Zen, A. Senior, and M. Schuster, "Statistical parametric speech synthesis using deep neural networks," in Proc. ICASSP 2013, 2013, pp. 7962-7966.
    • (2013) Proc. ICASSP 2013, pp. 7962-7966
    • Zen, H.; Senior, A.; Schuster, M.


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.