SCOPUS 정보 검색 플랫폼

IEEE Journal on Selected Topics in Signal Processing

Volumn 8, Issue 2, 2014, Pages 221-228

A parameter generation algorithm using local variance for HMM-Based speech synthesis

(3) Nose, Takashi a Chunwijitra, Vataya a Kobayashi, Takao a

a TOKYO INSTITUTE OF TECHNOLOGY (Japan)

Author keywords

HMM based speech synthesis; local variance; over smoothing problem; spectral parameter generation

Indexed keywords

CONVENTIONAL TECHNIQUES; DYNAMIC CHARACTERISTICS; HMM-BASED SPEECH SYNTHESIS; LOCAL VARIANCE; OBJECTIVE EVALUATION; OVER-SMOOTHING PROBLEM; SPECTRAL PARAMETERS; SUBJECTIVE EVALUATIONS;

ALGORITHMS; SPEECH SYNTHESIS; TRAJECTORIES;

PARAMETER ESTIMATION;

EID: 84897832343 PISSN: 19324553 EISSN: None Source Type: Journal
DOI: 10.1109/JSTSP.2013.2283459 Document Type: Article

Times cited : (13)

References (21)

1
- 67651002140
- Statistical parametric speech synthesis
- H. Zen, K. Tokuda, and A. W. Black, "Statistical parametric speech synthesis," Speech Commun., vol. 51, no. 11, pp. 1039-1064, 2009
- (2009) Speech Commun , vol.51 , Issue.11 , pp. 1039-1064
- Zen, H.¹ Tokuda, K.² Black, A.W.³

2
- 33847129573
- Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training
- DOI 10.1093/ietisy/e90-d.2.533
- J. Yamagishi and T. Kobayashi, "Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training," IEICE Trans. Inf. Syst., vol. E90-D, no. 2, pp. 533-543, Feb. 2007 (Pubitemid 46279829)
- (2007) IEICE Transactions on Information and Systems , vol.E90-D , Issue.2 , pp. 533-543
- Yamagishi, J.¹ Kobayashi, T.²

3
- 84866846705
- Recent development of HMM-based expressive speech synthesis and its applications
- T. Nose and T. Kobayashi, "Recent development of HMM-based expressive speech synthesis and its applications," in Proc. APSIPA ASC 2011, 2011 [Online]. Available: http://www.apsipa.org/proceedings-2011/pdf/ APSIPA189.pdf
- (2011) Proc. APSIPA ASC 2011
- Nose, T.¹ Kobayashi, T.²

4
- 85009139544
- Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
- Sep
- T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis," in Proc. EUROSPEECH, Sep. 1999, pp. 2347-2350
- (1999) Proc. EUROSPEECH , pp. 2347-2350
- Yoshimura, T.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

5
- 0028996993
- Speech parameter generation from HMM using dynamic features
- May
- K. Tokuda, T. Kobayashi, and S. Imai, "Speech parameter generation from HMM using dynamic features," in Proc. ICASSP'95, May 1995, pp. 660-663
- (1995) Proc. ICASSP , vol.95 , pp. 660-663
- Tokuda, K.¹ Kobayashi, T.² Imai, S.³

6
- 27144515530
- Incorporating a mixed excitation model and postfilter into HMM-based text-to-speech synthesis
- DOI 10.1002/scj.20354
- T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Incorporating a mixed excitation model and postfilter into HMMbased text-to-speech synthesis," Syst. Comput. Jpn., vol. 36, no. 12, pp. 43-50, 2005 (Pubitemid 41495150)
- (2005) Systems and Computers in Japan , vol.36 , Issue.12 , pp. 43-50
- Yoshimura, T.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

7
- 33745200051
- Speech parameter generation algorithm considering global variance for HMM-based speech synthesis
- T. Toda and K. Tokuda, "Speech parameter generation algorithm considering global variance for HMM-based speech synthesis," in Proc. INTERSPEECH '05-Eurospeech, 2005, pp. 2801-2804
- (2005) Proc. INTERSPEECH '05-Eurospeech , pp. 2801-2804
- Toda, T.¹ Tokuda, K.²

8
- 38549096029
- A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
- May
- T. Toda and K. Tokuda, "A speech parameter generation algorithm considering global variance for HMM-based speech synthesis," IEICE Trans. Inf. Syst., vol. E90-D, no. 5, pp. 816-824, May 2007
- (2007) IEICE Trans. Inf. Syst , vol.90 , Issue.5 , pp. 816-824
- Toda, T.¹ Tokuda, K.²

9
- 67650819492
- The HTS-2008 system: Yet another evaluation of the speaker-adaptive HMM-based speech synthesis system in the 2008 Blizzard Challenge
- J. Yamagishi, H. Zen, Y.Wu, T. Toda, and K. Tokuda, "The HTS-2008 system: Yet another evaluation of the speaker-adaptive HMM-based speech synthesis system in the 2008 Blizzard Challenge," in Proc. Blizzard Challenge Workshop, 2008
- (2008) Proc. Blizzard Challenge Workshop
- Yamagishi, J.¹ Zen, H.² Wu, Y.³ Toda, T.⁴ Tokuda, K.⁵

10
- 79959847301
- Global variancemodeling on the log power spectrum of LSPs for HMM-based speech synthesis
- Sep
- Z. Ling,Y.Hu, and L.Dai, "Global variancemodeling on the log power spectrum of LSPs for HMM-based speech synthesis," in Proc. INTERSPEECH '10, Sep. 2010, pp. 825-828
- (2010) Proc. INTERSPEECH , vol.10 , pp. 825-828
- Lingy, Z.¹ Hu, Y.² Dai, L.³

11
- 80051648616
- Global variance modeling on frequency domain delta LSP for HMM-based speech synthesis
- May
- S. Pan, Y. Nankaku, K. Tokuda, and J. Tao, "Global variance modeling on frequency domain delta LSP for HMM-based speech synthesis," in Proc. ICASSP '11, May 2011, pp. 4716-4719
- (2011) Proc. ICASSP , vol.11 , pp. 4716-4719
- Pan, S.¹ Nankaku, Y.² Tokuda, K.³ Tao, J.⁴

12
- 85008525798
- Product of experts for statistical parametric speech synthesis
- Mar
- H. Zen, M. Gales, Y. Nankaku, and K. Tokuda, "Product of experts for statistical parametric speech synthesis," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 3, pp. 794-805, Mar. 2012
- (2012) IEEE Trans. Audio, Speech, Lang. Process , vol.20 , Issue.3 , pp. 794-805
- Zen, H.¹ Gales, M.² Nankaku, Y.³ Tokuda, K.⁴

13
- 0033350721
- Products of experts
- G. E. Hinton, "Products of experts," in Proc. ICANN 99, 1999, vol. 1, pp. 1-6
- (1999) Proc. ICANN 99 , vol.1 , pp. 1-6
- Hinton, G.E.¹

14
- 33749573927
- Reformulating the HMM as a trajectory model by imposing explicit relationships between static and dynamic feature vector sequences
- DOI 10.1016/j.csl.2006.01.002, PII S0885230806000052
- H. Zen, K. Tokuda, and T. Kitamura, "Reformulating the HMM as a trajectory model by imposing explicit relationships between static and dynamic feature vector sequences," Comput. Speech Lang., vol. 21, no. 1, pp. 153-173, 2007 (Pubitemid 44537647)
- (2007) Computer Speech and Language , vol.21 , Issue.1 , pp. 153-173
- Zen, H.¹ Tokuda, K.² Kitamura, T.³

15
- 51449106803
- Minimum generation error criterion considering global/local variance for HMM-based speech synthesis
- Mar
- Y. Wu, H. Zen, Y. Nankaku, and K. Tokuda, "Minimum generation error criterion considering global/local variance for HMM-based speech synthesis," in Proc. ICASSP '08, Mar. 2008, pp. 4621-4624
- (2008) Proc. ICASSP , vol.8 , pp. 4621-4624
- Wu, Y.¹ Zen, H.² Nankaku, Y.³ Tokuda, K.⁴

16
- 77957917902
- Minimum generation error training for HMMbased speech synthesis
- May
- Y. Wu and R. Wang, "Minimum generation error training for HMMbased speech synthesis," in Proc. ICASSP '06,May 2006, pp. 889-892
- (2006) Proc. ICASSP , vol.6 , pp. 889-892
- Wu, Y.¹ Wang, R.²

17
- 84878412344
- A speech parameter generation algorithm using local variance for HMM-based speech synthesis
- V. Chunwijitra, T. Nose, and T. Kobayashi, "A speech parameter generation algorithm using local variance for HMM-based speech synthesis," in Proc. INTERSPEECH '12, 2012, pp. 1151-1154
- (2012) Proc. INTERSPEECH , vol.12 , pp. 1151-1154
- Chunwijitra, V.¹ Nose, T.² Kobayashi, T.³

18
- 44449177634
- A hidden semi-Markov model-based speech synthesis system
- May
- H. Zen, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "A hidden semi-Markov model-based speech synthesis system," IEICE Trans. Inf. Syst., vol. E90-D, no. 5, pp. 825-834, May 2007
- (2007) IEICE Trans. Inf. Syst , vol.90 , Issue.5 , pp. 825-834
- Zen, H.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

19
- 0025475528
- ATR Japanese speech database as a tool of speech recognition and synthesis
- A. Kurematsu, K. Takeda, Y. Sagisaka, S. Katagiri, H. Kuwabara, and K. Shikano, "ATR Japanese speech database as a tool of speech recognition and synthesis," Speech Commun., vol. 9, no. 4, pp. 357-363, 1990
- (1990) Speech Commun , vol.9 , Issue.4 , pp. 357-363
- Kurematsu, A.¹ Takeda, K.² Sagisaka, Y.³ Katagiri, S.⁴ Kuwabara, H.⁵ Shikano, K.⁶

20
- 0032673049
- Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
- H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigne, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Speech Commun., vol. 27, no. 3-4, pp. 187-207, 1999
- (1999) Speech Commun , vol.27 , Issue.3-4 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² De Cheveigne, A.³

21
- 0033906251
- MDL-based context-dependent subword modeling for speech recognition
- K. Shinoda and T.Watanabe, "MDL-based context-dependent subword modeling for speech recognition," J. Acoust. Soc. Jpn. (E), vol. 21, no. 2, pp. 79-86, Mar. 2000. (Pubitemid 30594111)
- (2000) Journal of the Acoustical Society of Japan (E) (English translation of Nippon Onkyo Gakkaishi) , vol.21 , Issue.2 , pp. 79-86
- Shinoda Koichi¹ Watanabe Takao²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.