메뉴 건너뛰기




Volumn 8, Issue 2, 2014, Pages 221-228

A parameter generation algorithm using local variance for HMM-Based speech synthesis

Author keywords

HMM based speech synthesis; local variance; over smoothing problem; spectral parameter generation

Indexed keywords

CONVENTIONAL TECHNIQUES; DYNAMIC CHARACTERISTICS; HMM-BASED SPEECH SYNTHESIS; LOCAL VARIANCE; OBJECTIVE EVALUATION; OVER-SMOOTHING PROBLEM; SPECTRAL PARAMETERS; SUBJECTIVE EVALUATIONS;

EID: 84897832343     PISSN: 19324553     EISSN: None     Source Type: Journal    
DOI: 10.1109/JSTSP.2013.2283459     Document Type: Article
Times cited : (13)

References (21)
  • 1
    • 67651002140 scopus 로고    scopus 로고
    • Statistical parametric speech synthesis
    • H. Zen, K. Tokuda, and A. W. Black, "Statistical parametric speech synthesis," Speech Commun., vol. 51, no. 11, pp. 1039-1064, 2009
    • (2009) Speech Commun , vol.51 , Issue.11 , pp. 1039-1064
    • Zen, H.1    Tokuda, K.2    Black, A.W.3
  • 2
    • 33847129573 scopus 로고    scopus 로고
    • Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training
    • DOI 10.1093/ietisy/e90-d.2.533
    • J. Yamagishi and T. Kobayashi, "Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training," IEICE Trans. Inf. Syst., vol. E90-D, no. 2, pp. 533-543, Feb. 2007 (Pubitemid 46279829)
    • (2007) IEICE Transactions on Information and Systems , vol.E90-D , Issue.2 , pp. 533-543
    • Yamagishi, J.1    Kobayashi, T.2
  • 3
    • 84866846705 scopus 로고    scopus 로고
    • Recent development of HMM-based expressive speech synthesis and its applications
    • T. Nose and T. Kobayashi, "Recent development of HMM-based expressive speech synthesis and its applications," in Proc. APSIPA ASC 2011, 2011 [Online]. Available: http://www.apsipa.org/proceedings-2011/pdf/ APSIPA189.pdf
    • (2011) Proc. APSIPA ASC 2011
    • Nose, T.1    Kobayashi, T.2
  • 4
    • 85009139544 scopus 로고    scopus 로고
    • Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
    • Sep
    • T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis," in Proc. EUROSPEECH, Sep. 1999, pp. 2347-2350
    • (1999) Proc. EUROSPEECH , pp. 2347-2350
    • Yoshimura, T.1    Tokuda, K.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 5
    • 0028996993 scopus 로고
    • Speech parameter generation from HMM using dynamic features
    • May
    • K. Tokuda, T. Kobayashi, and S. Imai, "Speech parameter generation from HMM using dynamic features," in Proc. ICASSP'95, May 1995, pp. 660-663
    • (1995) Proc. ICASSP , vol.95 , pp. 660-663
    • Tokuda, K.1    Kobayashi, T.2    Imai, S.3
  • 6
    • 27144515530 scopus 로고    scopus 로고
    • Incorporating a mixed excitation model and postfilter into HMM-based text-to-speech synthesis
    • DOI 10.1002/scj.20354
    • T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Incorporating a mixed excitation model and postfilter into HMMbased text-to-speech synthesis," Syst. Comput. Jpn., vol. 36, no. 12, pp. 43-50, 2005 (Pubitemid 41495150)
    • (2005) Systems and Computers in Japan , vol.36 , Issue.12 , pp. 43-50
    • Yoshimura, T.1    Tokuda, K.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 7
    • 33745200051 scopus 로고    scopus 로고
    • Speech parameter generation algorithm considering global variance for HMM-based speech synthesis
    • T. Toda and K. Tokuda, "Speech parameter generation algorithm considering global variance for HMM-based speech synthesis," in Proc. INTERSPEECH '05-Eurospeech, 2005, pp. 2801-2804
    • (2005) Proc. INTERSPEECH '05-Eurospeech , pp. 2801-2804
    • Toda, T.1    Tokuda, K.2
  • 8
    • 38549096029 scopus 로고    scopus 로고
    • A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
    • May
    • T. Toda and K. Tokuda, "A speech parameter generation algorithm considering global variance for HMM-based speech synthesis," IEICE Trans. Inf. Syst., vol. E90-D, no. 5, pp. 816-824, May 2007
    • (2007) IEICE Trans. Inf. Syst , vol.90 , Issue.5 , pp. 816-824
    • Toda, T.1    Tokuda, K.2
  • 9
    • 67650819492 scopus 로고    scopus 로고
    • The HTS-2008 system: Yet another evaluation of the speaker-adaptive HMM-based speech synthesis system in the 2008 Blizzard Challenge
    • J. Yamagishi, H. Zen, Y.Wu, T. Toda, and K. Tokuda, "The HTS-2008 system: Yet another evaluation of the speaker-adaptive HMM-based speech synthesis system in the 2008 Blizzard Challenge," in Proc. Blizzard Challenge Workshop, 2008
    • (2008) Proc. Blizzard Challenge Workshop
    • Yamagishi, J.1    Zen, H.2    Wu, Y.3    Toda, T.4    Tokuda, K.5
  • 10
    • 79959847301 scopus 로고    scopus 로고
    • Global variancemodeling on the log power spectrum of LSPs for HMM-based speech synthesis
    • Sep
    • Z. Ling,Y.Hu, and L.Dai, "Global variancemodeling on the log power spectrum of LSPs for HMM-based speech synthesis," in Proc. INTERSPEECH '10, Sep. 2010, pp. 825-828
    • (2010) Proc. INTERSPEECH , vol.10 , pp. 825-828
    • Lingy, Z.1    Hu, Y.2    Dai, L.3
  • 11
    • 80051648616 scopus 로고    scopus 로고
    • Global variance modeling on frequency domain delta LSP for HMM-based speech synthesis
    • May
    • S. Pan, Y. Nankaku, K. Tokuda, and J. Tao, "Global variance modeling on frequency domain delta LSP for HMM-based speech synthesis," in Proc. ICASSP '11, May 2011, pp. 4716-4719
    • (2011) Proc. ICASSP , vol.11 , pp. 4716-4719
    • Pan, S.1    Nankaku, Y.2    Tokuda, K.3    Tao, J.4
  • 12
    • 85008525798 scopus 로고    scopus 로고
    • Product of experts for statistical parametric speech synthesis
    • Mar
    • H. Zen, M. Gales, Y. Nankaku, and K. Tokuda, "Product of experts for statistical parametric speech synthesis," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 3, pp. 794-805, Mar. 2012
    • (2012) IEEE Trans. Audio, Speech, Lang. Process , vol.20 , Issue.3 , pp. 794-805
    • Zen, H.1    Gales, M.2    Nankaku, Y.3    Tokuda, K.4
  • 13
    • 0033350721 scopus 로고    scopus 로고
    • Products of experts
    • G. E. Hinton, "Products of experts," in Proc. ICANN 99, 1999, vol. 1, pp. 1-6
    • (1999) Proc. ICANN 99 , vol.1 , pp. 1-6
    • Hinton, G.E.1
  • 14
    • 33749573927 scopus 로고    scopus 로고
    • Reformulating the HMM as a trajectory model by imposing explicit relationships between static and dynamic feature vector sequences
    • DOI 10.1016/j.csl.2006.01.002, PII S0885230806000052
    • H. Zen, K. Tokuda, and T. Kitamura, "Reformulating the HMM as a trajectory model by imposing explicit relationships between static and dynamic feature vector sequences," Comput. Speech Lang., vol. 21, no. 1, pp. 153-173, 2007 (Pubitemid 44537647)
    • (2007) Computer Speech and Language , vol.21 , Issue.1 , pp. 153-173
    • Zen, H.1    Tokuda, K.2    Kitamura, T.3
  • 15
    • 51449106803 scopus 로고    scopus 로고
    • Minimum generation error criterion considering global/local variance for HMM-based speech synthesis
    • Mar
    • Y. Wu, H. Zen, Y. Nankaku, and K. Tokuda, "Minimum generation error criterion considering global/local variance for HMM-based speech synthesis," in Proc. ICASSP '08, Mar. 2008, pp. 4621-4624
    • (2008) Proc. ICASSP , vol.8 , pp. 4621-4624
    • Wu, Y.1    Zen, H.2    Nankaku, Y.3    Tokuda, K.4
  • 16
    • 77957917902 scopus 로고    scopus 로고
    • Minimum generation error training for HMMbased speech synthesis
    • May
    • Y. Wu and R. Wang, "Minimum generation error training for HMMbased speech synthesis," in Proc. ICASSP '06,May 2006, pp. 889-892
    • (2006) Proc. ICASSP , vol.6 , pp. 889-892
    • Wu, Y.1    Wang, R.2
  • 17
    • 84878412344 scopus 로고    scopus 로고
    • A speech parameter generation algorithm using local variance for HMM-based speech synthesis
    • V. Chunwijitra, T. Nose, and T. Kobayashi, "A speech parameter generation algorithm using local variance for HMM-based speech synthesis," in Proc. INTERSPEECH '12, 2012, pp. 1151-1154
    • (2012) Proc. INTERSPEECH , vol.12 , pp. 1151-1154
    • Chunwijitra, V.1    Nose, T.2    Kobayashi, T.3
  • 18
    • 44449177634 scopus 로고    scopus 로고
    • A hidden semi-Markov model-based speech synthesis system
    • May
    • H. Zen, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "A hidden semi-Markov model-based speech synthesis system," IEICE Trans. Inf. Syst., vol. E90-D, no. 5, pp. 825-834, May 2007
    • (2007) IEICE Trans. Inf. Syst , vol.90 , Issue.5 , pp. 825-834
    • Zen, H.1    Tokuda, K.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 19
    • 0025475528 scopus 로고
    • ATR Japanese speech database as a tool of speech recognition and synthesis
    • A. Kurematsu, K. Takeda, Y. Sagisaka, S. Katagiri, H. Kuwabara, and K. Shikano, "ATR Japanese speech database as a tool of speech recognition and synthesis," Speech Commun., vol. 9, no. 4, pp. 357-363, 1990
    • (1990) Speech Commun , vol.9 , Issue.4 , pp. 357-363
    • Kurematsu, A.1    Takeda, K.2    Sagisaka, Y.3    Katagiri, S.4    Kuwabara, H.5    Shikano, K.6
  • 20
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigne, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Speech Commun., vol. 27, no. 3-4, pp. 187-207, 1999
    • (1999) Speech Commun , vol.27 , Issue.3-4 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    De Cheveigne, A.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.