메뉴 건너뛰기




Volumn , Issue , 2008, Pages 577-580

Minimum generation error training with direct log spectral distortion on LSPs for HMM-based speech synthesis

Author keywords

HMM; Line spectral pairs; Log spectral distortion; Minimum generation error; Speech synthesis

Indexed keywords

EUCLIDEAN DISTANCE; HMM; HMM TRAINING; HMM-BASED SPEECH SYNTHESIS; LINE SPECTRAL PAIRS; LOG SPECTRAL DISTORTIONS; MINIMUM GENERATION ERROR; SAMPLING STRATEGIES; SYNTHESIZED SPEECH;

EID: 84867214032     PISSN: None     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (12)

References (13)
  • 2
    • 85009139544 scopus 로고    scopus 로고
    • Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
    • T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis," in Proc. of ICASSP, 1999, vol. 5, pp. 2347-2350.
    • Proc. of ICASSP, 1999 , vol.5 , pp. 2347-2350
    • Yoshimura, T.1    Tokuda, K.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 3
    • 0028996993 scopus 로고    scopus 로고
    • Speech parameter generation from HMM using dynamic features
    • K. Tokuda, T. Kobayashi, and S. Imai, "Speech parameter generation from HMM using dynamic features," in Proc. of ICASSP, 1995, pp. 660-663.
    • Proc. of ICASSP, 1995 , pp. 660-663
    • Tokuda, K.1    Kobayashi, T.2    Imai, S.3
  • 4
    • 33745215669 scopus 로고    scopus 로고
    • An Overview of Nitech HMM-based Speech Synthesis System for Blizzard Challenge 2005
    • H. Zen, and T. Toda, "An Overview of Nitech HMM-based Speech Synthesis System for Blizzard Challenge 2005," in Proc. of Eurospeech, 2005, pp. 93-96, 2005.
    • (2005) Proc. of Eurospeech , vol.2005 , pp. 93-96
    • Zen, H.1    Toda, T.2
  • 6
    • 53049106512 scopus 로고    scopus 로고
    • Speaker-independent HMM-based speech synthesis system - HTS-2007 system for the Blizzard Challenge 2007
    • J. Yamagishi, H. Zen, T. Toda, and K. Tokuda, "Speaker-independent HMM-based speech synthesis system - HTS-2007 system for the Blizzard Challenge 2007", in Blizzard Challenge 2007.
    • (2007) Blizzard Challenge
    • Yamagishi, J.1    Zen, H.2    Toda, T.3    Tokuda, K.4
  • 7
    • 33846429403 scopus 로고    scopus 로고
    • Minimum generation error training for HMM-based speech synthesis
    • Y.-J. Wu and R.H. Wang, "Minimum generation error training for HMM-based speech synthesis," in Proc. of ICASSP, 2006, vol. 1, pp. 889-892.
    • Proc. of ICASSP, 2006 , vol.1 , pp. 889-892
    • Wu, Y.-J.1    Wang, R.H.2
  • 8
    • 0000920843 scopus 로고
    • A theory of adaptive pattern classifiers
    • S. Amari, "A theory of adaptive pattern classifiers," IEEE Trans. Electron. Comput., vol. EC-16, no. 3, pp. 299-307, 1967.
    • (1967) IEEE Trans. Electron. Comput. , vol.EC-16 , Issue.3 , pp. 299-307
    • Amari, S.1
  • 9
    • 34547517493 scopus 로고    scopus 로고
    • Full HMM training for minimizing generation error in synthesis
    • Y.-J. Wu, R.H. Wang, and F. Soong, "Full HMM training for minimizing generation error in synthesis," in Proc. of ICASSP, 2007, pp. 517-520.
    • Proc. of ICASSP, 2007 , pp. 517-520
    • Wu, Y.-J.1    Wang, R.H.2    Soong, F.3
  • 10
    • 0001810975 scopus 로고
    • Line spectrum representation of linear predictive coefficients of speech signals
    • p. s35(A)
    • F. Itakura, "Line spectrum representation of linear predictive coefficients of speech signals," in J. Acoust. Soc. Amer., 1975, vol. 57, p. 535(a), p. s35(A).
    • (1975) J. Acoust. Soc. Amer. , vol.57
    • Itakura, F.1
  • 12
    • 0032678076 scopus 로고    scopus 로고
    • Hidden markov models based on multi-space probability distribution for pitch pattern modeling
    • K. Tokuda, T. Masuko, N. Miyazaki, and T. Kobayashi, "Hidden markov models based on multi-space probability distribution for pitch pattern modeling," in Proc. of ICASSP, 1999, pp. 229-232.
    • Proc. of ICASSP, 1999 , pp. 229-232
    • Tokuda, K.1    Masuko, T.2    Miyazaki, N.3    Kobayashi, T.4
  • 13
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using pitch-adaptive time-frequency smoothing and an instanta-neous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse and A. deCheveigne, "Restructuring speech representations using pitch-adaptive time-frequency smoothing and an instanta-neous-frequency-based F0 extraction: possible role of a repetitive structure in sounds," in Speech Communication, vol. 27, pp. 187-207, 1999.
    • (1999) Speech Communication , vol.27 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    DeCheveigne, A.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.