



2009, Pages 4013-4016

Minimum generation error training by using original spectrum as reference for log spectral distortion measure

Author keywords

HMM; Log spectral distortion; Minimum generation error; Speech synthesis

Indexed keywords

FFT SPECTRUM; HARMONIC FREQUENCY; HMM; HMM TRAINING; HMM-BASED SPEECH SYNTHESIS; LINE SPECTRAL PAIRS; LOG SPECTRAL DISTORTION; LOG SPECTRAL DISTORTIONS; MINIMUM GENERATION ERROR; REFERENCE SPECTRUM; SAMPLING STRATEGIES; SPECTRAL ENVELOPES; SPEECH WAVEFORMS; WEIGHTING FUNCTIONS

EID: 67650797362     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2009.4960508     Document Type: Conference Paper
Times cited: 8

References (12)
  • 1
    • 0029725605
    • Speech synthesis from HMMs using dynamic features
    • T. Masuko, K. Tokuda, T. Kobayashi, and S. Imai, "Speech synthesis from HMMs using dynamic features," in Proc. of ICASSP, pp. 389-392, 1996.
    • (1996) Proc. of ICASSP , pp. 389-392
    • Masuko, T.1    Tokuda, K.2    Kobayashi, T.3    Imai, S.4
  • 2
    • 53049106512
    • Speaker-independent HMM-based speech synthesis system - HTS-2007 system for the Blizzard Challenge 2007
    • J. Yamagishi, H. Zen, T. Toda, and K. Tokuda, "Speaker-independent HMM-based speech synthesis system - HTS-2007 system for the Blizzard Challenge 2007", in Blizzard Challenge 2007.
    • Blizzard Challenge 2007
    • Yamagishi, J.1    Zen, H.2    Toda, T.3    Tokuda, K.4
  • 3
    • 85009139544
    • Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
    • T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis," in Proc. of ICASSP, vol. 5, pp. 2347-2350, 1999.
    • (1999) Proc. of ICASSP , vol.5 , pp. 2347-2350
    • Yoshimura, T.1    Tokuda, K.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 4
    • 0028996993
    • Speech parameter generation from HMM using dynamic features
    • K. Tokuda, T. Kobayashi, and S. Imai, "Speech parameter generation from HMM using dynamic features," in Proc. of ICASSP, pp. 660-663, 1995.
    • (1995) Proc. of ICASSP , pp. 660-663
    • Tokuda, K.1    Kobayashi, T.2    Imai, S.3
  • 5
    • 33846429403
    • Minimum generation error training for HMM-based speech synthesis
    • Y.-J. Wu and R.H. Wang, "Minimum generation error training for HMM-based speech synthesis," in Proc. of ICASSP, vol. 1, pp. 889-892, 2006.
    • (2006) Proc. of ICASSP , vol.1 , pp. 889-892
    • Wu, Y.-J.1    Wang, R.H.2
  • 6
    • 0001810975
    • Line spectrum representation of linear predictive coefficients of speech signals
    • F. Itakura, "Line spectrum representation of linear predictive coefficients of speech signals," in J. Acoust. Soc. Amer., vol. 57, p. S35(A), 1975.
    • (1975) J. Acoust. Soc. Amer , vol.57
    • Itakura, F.1
  • 7
    • 84867214032
    • Minimum generation error training with direct log spectral distortion on LSPs for HMM-based speech synthesis
    • Y.-J. Wu and K. Tokuda, "Minimum generation error training with direct log spectral distortion on LSPs for HMM-based speech synthesis," in Proc. of Interspeech, pp. 577-580, 2008.
    • (2008) Proc. of Interspeech , pp. 577-580
    • Wu, Y.-J.1    Tokuda, K.2
  • 8
    • 0032673049
    • Restructuring speech representations using pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, "Restructuring speech representations using pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds," in Speech Communication, vol. 27, pp. 187-207, 1999.
    • (1999) Speech Communication , vol.27 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    de Cheveigné, A.3
  • 9
    • 85089837272
    • Analytic generation of synthesis units by closed loop training for totally speaker driven text to speech system (TOS drive TTS)
    • M. Akamine and T. Kagoshima, "Analytic generation of synthesis units by closed loop training for totally speaker driven text to speech system (TOS drive TTS)," in Proc. of ICSLP, 1998.
    • (1998) Proc. of ICSLP
    • Akamine, M.1    Kagoshima, T.2
  • 10
    • 51449096059
    • Statistical approach to vocal tract transfer function estimation based on factor analyzed trajectory HMM
    • T. Toda and K. Tokuda, "Statistical approach to vocal tract transfer function estimation based on factor analyzed trajectory HMM," in Proc. of ICASSP, pp. 3925-3928, 2008.
    • (2008) Proc. of ICASSP , pp. 3925-3928
    • Toda, T.1    Tokuda, K.2
  • 11
    • 0000920843
    • A theory of adaptive pattern classifiers
    • S. Amari, "A theory of adaptive pattern classifiers," IEEE Trans. Electron. Comput., vol. EC-16, no. 3, pp. 299-307, 1967.
    • (1967) IEEE Trans. Electron. Comput , vol.EC-16 , Issue.3 , pp. 299-307
    • Amari, S.1
  • 12
    • 0032678076
    • Hidden Markov models based on multi-space probability distribution for pitch pattern modeling
    • K. Tokuda, T. Masuko, N. Miyazaki, and T. Kobayashi, "Hidden Markov models based on multi-space probability distribution for pitch pattern modeling," in Proc. of ICASSP, pp. 229-232, 1999.
    • (1999) Proc. of ICASSP , pp. 229-232
    • Tokuda, K.1    Masuko, T.2    Miyazaki, N.3    Kobayashi, T.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.