SCOPUS Information Search Platform

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volume , Issue , 2009, Pages 4013-4016

Minimum generation error training by using original spectrum as reference for log spectral distortion measure

(2) Wu, Yi Jian a Tokuda, Keiichi a

a NAGOYA INSTITUTE OF TECHNOLOGY (Japan)

Author keywords

HMM; Log spectral distortion; Minimum generation error; Speech synthesis

Indexed keywords

FFT SPECTRUM; HARMONIC FREQUENCY; HMM; HMM TRAINING; HMM-BASED SPEECH SYNTHESIS; LINE SPECTRAL PAIRS; LOG SPECTRAL DISTORTION; LOG SPECTRAL DISTORTIONS; MINIMUM GENERATION ERROR; REFERENCE SPECTRUM; SAMPLING STRATEGIES; SPECTRAL ENVELOPES; SPEECH WAVEFORMS; WEIGHTING FUNCTIONS;

ACOUSTICS; HIDDEN MARKOV MODELS; SIGNAL PROCESSING; SPEECH SYNTHESIS; VECTOR QUANTIZATION;

FAST FOURIER TRANSFORMS;

EID: 67650797362 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2009.4960508 Document Type: Conference Paper

Times cited : (8)

References (12)

1
- 0029725605
- Speech synthesis from HMMs using dynamic features
- T. Masuko, K. Tokuda, T. Kobayashi, and S. Imai, "Speech synthesis from HMMs using dynamic features," in Proc. of ICASSP, pp. 389-392, 1996.
- (1996) Proc. of ICASSP , pp. 389-392
- Masuko, T.¹ Tokuda, K.² Kobayashi, T.³ Imai, S.⁴

2
- 53049106512
- Speaker-independent HMM-based speech synthesis system - HTS-2007 system for the Blizzard Challenge 2007
- J. Yamagishi, H. Zen, T. Toda, and K. Tokuda, "Speaker-independent HMM-based speech synthesis system - HTS-2007 system for the Blizzard Challenge 2007", in Blizzard Challenge 2007.
- Blizzard Challenge 2007
- Yamagishi, J.¹ Zen, H.² Toda, T.³ Tokuda, K.⁴

3
- 85009139544
- Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
- T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis," in Proc. of ICASSP, vol. 5, pp. 2347-2350, 1999.
- (1999) Proc. of ICASSP , vol.5 , pp. 2347-2350
- Yoshimura, T.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

4
- 0028996993
- Speech parameter generation from HMM using dynamic features
- K. Tokuda, T. Kobayashi, and S. Imai, "Speech parameter generation from HMM using dynamic features," in Proc. of ICASSP, pp. 660-663, 1995.
- (1995) Proc. of ICASSP , pp. 660-663
- Tokuda, K.¹ Kobayashi, T.² Imai, S.³

5
- 33846429403
- Minimum generation error training for HMM-based speech synthesis
- Y.-J. Wu and R.H. Wang, "Minimum generation error training for HMM-based speech synthesis," in Proc. of ICASSP, vol. 1, pp. 889-892, 2006.
- (2006) Proc. of ICASSP , vol.1 , pp. 889-892
- Wu, Y.-J.¹ Wang, R.H.²

6
- 0001810975
- Line spectrum representation of linear predictive coefficients of speech signals
- F. Itakura, "Line spectrum representation of linear predictive coefficients of speech signals," in J. Acoust. Soc. Amer., vol. 57, p. 535(a), p. s35(A), 1975.
- (1975) J. Acoust. Soc. Amer , vol.57
- Itakura, F.¹

7
- 84867214032
- Minimum generation error training with direct log spectral distortion on LSPs for HMM-based speech synthesis
- Y.-J. Wu and K. Tokuda, "Minimum generation error training with direct log spectral distortion on LSPs for HMM-based speech synthesis," in Proc. of Interspeech, pp. 577-580, 2008.
- (2008) Proc. of Interspeech , pp. 577-580
- Wu, Y.-J.¹ Tokuda, K.²

8
- 0032673049
- Restructuring speech representations using pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
- H. Kawahara, I. Masuda-Katsuse and A. deCheveigne, "Restructuring speech representations using pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds," in Speech Communication, vol. 27, pp. 187-207, 1999.
- (1999) Speech Communication , vol.27 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² deCheveigne, A.³

9
- 85089837272
- Analytic generation of synthesis units by closed loop training for totally speaker driven text to speech system (TOS drive TTS)
- M. Akamine and T. Kagoshima, "Analytic generation of synthesis units by closed loop training for totally speaker driven text to speech system (TOS drive TTS)," in Proc. of ICSLP, 1998.
- (1998) Proc. of ICSLP
- Akamine, M.¹ Kagoshima, T.²

10
- 51449096059
- Statistical approach to vocal tract transfer function estimation based on factor analyzed trajectory HMM
- T. Toda and K. Tokuda, "Statistical approach to vocal tract transfer function estimation based on factor analyzed trajectory HMM," in Proc. of ICASSP, pp. 3925-3928, 2008.
- (2008) Proc. of ICASSP , pp. 3925-3928
- Toda, T.¹ Tokuda, K.²

11
- 0000920843
- A theory of adaptive pattern classifiers
- S. Amari, "A theory of adaptive pattern classifiers," IEEE Trans. Electron. Comput., vol. EC-16, no. 3, pp. 299-307, 1967.
- (1967) IEEE Trans. Electron. Comput , vol.EC-16 , Issue.3 , pp. 299-307
- Amari, S.¹

12
- 0032678076
- Hidden Markov models based on multi-space probability distribution for pitch pattern modeling
- K. Tokuda, T. Masuko, N. Miyazaki, and T. Kobayashi, "Hidden Markov models based on multi-space probability distribution for pitch pattern modeling," in Proc. of ICASSP, pp. 229-232, 1999.
- (1999) Proc. of ICASSP , pp. 229-232
- Tokuda, K.¹ Masuko, T.² Miyazaki, N.³ Kobayashi, T.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.