SCOPUS Information Search Platform

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volume , Issue , 2008, Pages 577-580

Minimum generation error training with direct log spectral distortion on LSPs for HMM-based speech synthesis

a NAGOYA INSTITUTE OF TECHNOLOGY (Japan)

Author keywords

HMM; Line spectral pairs; Log spectral distortion; Minimum generation error; Speech synthesis

Indexed keywords

EUCLIDEAN DISTANCE; HMM; HMM TRAINING; HMM-BASED SPEECH SYNTHESIS; LINE SPECTRAL PAIRS; LOG SPECTRAL DISTORTIONS; MINIMUM GENERATION ERROR; SAMPLING STRATEGIES; SYNTHESIZED SPEECH;

MAXIMUM LIKELIHOOD; SPEECH SYNTHESIS; VECTOR QUANTIZATION;

SPEECH COMMUNICATION;

EID: 84867214032 PISSN: None EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (12)

References (13)

1
- 0029725605
- Speech synthesis from HMMs using dynamic features
- T. Masuko, K. Tokuda, T. Kobayashi, and S. Imai, "Speech synthesis from HMMs using dynamic features," in Proc. of ICASSP, 1996, pp. 389-392.
- Proc. of ICASSP, 1996 , pp. 389-392
- Masuko, T.¹ Tokuda, K.² Kobayashi, T.³ Imai, S.⁴

2
- 85009139544
- Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
- T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis," in Proc. of ICASSP, 1999, vol. 5, pp. 2347-2350.
- Proc. of ICASSP, 1999 , vol.5 , pp. 2347-2350
- Yoshimura, T.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

3
- 0028996993
- Speech parameter generation from HMM using dynamic features
- K. Tokuda, T. Kobayashi, and S. Imai, "Speech parameter generation from HMM using dynamic features," in Proc. of ICASSP, 1995, pp. 660-663.
- Proc. of ICASSP, 1995 , pp. 660-663
- Tokuda, K.¹ Kobayashi, T.² Imai, S.³

4
- 33745215669
- An Overview of Nitech HMM-based Speech Synthesis System for Blizzard Challenge 2005
- H. Zen and T. Toda, "An Overview of Nitech HMM-based Speech Synthesis System for Blizzard Challenge 2005," in Proc. of Eurospeech, 2005, pp. 93-96.
- (2005) Proc. of Eurospeech , pp. 93-96
- Zen, H.¹ Toda, T.²

5
- 67650851754
- USTC System for Blizzard Challenge 2006 - An Improved HMM-based Speech Synthesis Method
- Z.-H. Ling, Y.-J. Wu, Y. P. Wang, L. Qin and R. H. Wang, "USTC System for Blizzard Challenge 2006 - an Improved HMM-based Speech Synthesis Method," in Interspeech 2006 satellite meeting, Blizzard Challenge 2006.
- (2006) Interspeech 2006 Satellite Meeting, Blizzard Challenge
- Ling, Z.-H.¹ Wu, Y.-J.² Wang, Y.P.³ Qin, L.⁴ Wang, R.H.⁵

6
- 53049106512
- Speaker-independent HMM-based speech synthesis system - HTS-2007 system for the Blizzard Challenge 2007
- J. Yamagishi, H. Zen, T. Toda, and K. Tokuda, "Speaker-independent HMM-based speech synthesis system - HTS-2007 system for the Blizzard Challenge 2007", in Blizzard Challenge 2007.
- (2007) Blizzard Challenge
- Yamagishi, J.¹ Zen, H.² Toda, T.³ Tokuda, K.⁴

7
- 33846429403
- Minimum generation error training for HMM-based speech synthesis
- Y.-J. Wu and R.H. Wang, "Minimum generation error training for HMM-based speech synthesis," in Proc. of ICASSP, 2006, vol. 1, pp. 889-892.
- Proc. of ICASSP, 2006 , vol.1 , pp. 889-892
- Wu, Y.-J.¹ Wang, R.H.²

8
- 0000920843
- A theory of adaptive pattern classifiers
- S. Amari, "A theory of adaptive pattern classifiers," IEEE Trans. Electron. Comput., vol. EC-16, no. 3, pp. 299-307, 1967.
- (1967) IEEE Trans. Electron. Comput. , vol.EC-16 , Issue.3 , pp. 299-307
- Amari, S.¹

9
- 34547517493
- Full HMM training for minimizing generation error in synthesis
- Y.-J. Wu, R.H. Wang, and F. Soong, "Full HMM training for minimizing generation error in synthesis," in Proc. of ICASSP, 2007, pp. 517-520.
- Proc. of ICASSP, 2007 , pp. 517-520
- Wu, Y.-J.¹ Wang, R.H.² Soong, F.³

10
- 0001810975
- Line spectrum representation of linear predictive coefficients of speech signals
- F. Itakura, "Line spectrum representation of linear predictive coefficients of speech signals," J. Acoust. Soc. Amer., vol. 57, p. S35(A), 1975.
- (1975) J. Acoust. Soc. Amer. , vol.57 , p. S35(A)
- Itakura, F.¹

11
- 0004244302
- Fundamentals of Speech Recognition
- L. Rabiner and B. H. Juang, Fundamentals of Speech Recognition. Englewood Cliffs, NJ, USA: Prentice-Hall, 1993.
- (1993) Fundamentals of Speech Recognition , Prentice-Hall, Englewood Cliffs, NJ, USA
- Rabiner, L.¹ Juang, B.H.²

12
- 0032678076
- Hidden Markov models based on multi-space probability distribution for pitch pattern modeling
- K. Tokuda, T. Masuko, N. Miyazaki, and T. Kobayashi, "Hidden Markov models based on multi-space probability distribution for pitch pattern modeling," in Proc. of ICASSP, 1999, pp. 229-232.
- Proc. of ICASSP, 1999 , pp. 229-232
- Tokuda, K.¹ Masuko, T.² Miyazaki, N.³ Kobayashi, T.⁴

13
- 0032673049
- Restructuring speech representations using pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
- H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, "Restructuring speech representations using pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Speech Communication, vol. 27, pp. 187-207, 1999.
- (1999) Speech Communication , vol.27 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² de Cheveigné, A.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.