2010, Pages 4614-4617

An autoencoder neural-network based low-dimensionality approach to excitation modeling for HMM-based text-to-speech

Author keywords

Autoencoders; Excitation modeling; Hidden Markov models; Neural networks; Speech synthesis

Indexed keywords

DIGITAL STORAGE; HIDDEN MARKOV MODELS; LEARNING SYSTEMS; NEURAL NETWORKS;

EID: 78049412607     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2010.5495546     Document Type: Conference Paper
Times cited: 16

References (10)
  • 1
    • 0011510419
    • Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
    • T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis," in Eurospeech, 1999, pp. 1223-1226.
    • (1999) Eurospeech, pp. 1223-1226
    • Yoshimura, T.; Tokuda, K.; Masuko, T.; Kobayashi, T.; Kitamura, T.
  • 2
    • 84856248349
    • A trainable excitation model for HMM-based speech synthesis
    • R. Maia, T. Toda, H. Zen, Y. Nankaku, and K. Tokuda, "A trainable excitation model for HMM-based speech synthesis," in Interspeech, 2007, pp. 1909-1912.
    • (2007) Interspeech, pp. 1909-1912
    • Maia, R.; Toda, T.; Zen, H.; Nankaku, Y.; Tokuda, K.
  • 3
    • 84883320815
    • Fundamentals and recent advances in HMM-based speech synthesis
    • K. Tokuda and H. Zen, "Fundamentals and recent advances in HMM-based speech synthesis," in Interspeech, 2009.
    • (2009) Interspeech
    • Tokuda, K.; Zen, H.
  • 4
    • 51449085178
    • On the state definition for a trainable excitation model in HMM-based speech synthesis
    • R. Maia, T. Toda, K. Tokuda, S. Sakai, and S. Nakamura, "On the state definition for a trainable excitation model in HMM-based speech synthesis," in ICASSP, 2008, pp. 3965-3968.
    • (2008) ICASSP, pp. 3965-3968
    • Maia, R.; Toda, T.; Tokuda, K.; Sakai, S.; Nakamura, S.
  • 6
    • 67650793794
    • Using a pitch-synchronous residual codebook for hybrid HMM/frame selection speech synthesis
    • T. Drugman, A. Moinet, T. Dutoit, and G. Wilfart, "Using a pitch-synchronous residual codebook for hybrid HMM/frame selection speech synthesis," in ICASSP, 2009, pp. 3793-3796.
    • (2009) ICASSP, pp. 3793-3796
    • Drugman, T.; Moinet, A.; Dutoit, T.; Wilfart, G.
  • 7
    • 85009096905
    • An automatic pitch marking method using wavelet transform
    • M. Sakamoto and T. Saito, "An automatic pitch marking method using wavelet transform," in ICSLP, 2000, vol. 3, pp. 650-653.
    • (2000) ICSLP, vol. 3, pp. 650-653
    • Sakamoto, M.; Saito, T.
  • 8
    • 33746600649
    • Reducing the dimensionality of data with neural networks
    • G. E. Hinton and R. R. Salakhutdinov, "Reducing the dimensionality of data with neural networks," Science, vol. 313, no. 5786, pp. 504-507, July 2006.
    • (2006) Science, vol. 313, no. 5786, pp. 504-507
    • Hinton, G.E.; Salakhutdinov, R.R.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.