Volume , Issue , 2014, Pages 2268-2272

Prosody contour prediction with long short-term memory, bi-directional, deep recurrent neural networks

Author keywords

Deep learning; Prosody prediction; Recurrent neural networks; Speech synthesis; Text to speech

Indexed keywords

FORECASTING; MEAN SQUARE ERROR; SPEECH COMMUNICATION; SPEECH SYNTHESIS

EID: 84910068142     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (89)

References (16)
  • 1
    • 0033708106
    • Speech parameter generation algorithms for HMM-based speech synthesis
    • K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, "Speech parameter generation algorithms for HMM-based speech synthesis," in ICASSP, 2000, pp. 1315-1318.
    • (2000) ICASSP , pp. 1315-1318
    • Tokuda, K.1    Yoshimura, T.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 2
    • 84890522099
    • F0 contour prediction with a deep belief network-Gaussian process hybrid model
    • R. Fernandez, R. Rendel, B. Ramabhadran, and R. Hoory, "F0 contour prediction with a Deep Belief Network-Gaussian Process hybrid model," in ICASSP, 2013, pp. 6885-6889.
    • (2013) ICASSP , pp. 6885-6889
    • Fernandez, R.1    Rendel, R.2    Ramabhadran, B.3    Hoory, R.4
  • 3
    • 84890490547
    • Statistical parametric speech synthesis using deep neural networks
    • H. Zen, A. Senior, and M. Schuster, "Statistical parametric speech synthesis using Deep Neural Networks," in ICASSP, 2013, pp. 7962-7966.
    • (2013) ICASSP , pp. 7962-7966
    • Zen, H.1    Senior, A.2    Schuster, M.3
  • 4
    • 84901237776
    • Modeling spectral envelopes using restricted Boltzmann machines and deep belief networks for statistical parametric speech synthesis
    • Z.-H. Ling, L. Deng, and D. Yu, "Modeling spectral envelopes using Restricted Boltzmann Machines and Deep Belief Networks for statistical parametric speech synthesis," IEEE Trans. Audio, Speech, and Lang. Proc., vol. 21, no. 10, pp. 2129-2139, 2013.
    • (2013) IEEE Trans. Audio, Speech, and Lang. Proc. , vol.21 , Issue.10 , pp. 2129-2139
    • Ling, Z.-H.1    Deng, L.2    Yu, D.3
  • 5
    • 84890527090
    • Multi-distribution deep belief networks for speech synthesis
    • S. Kang, X. Qian, and H. Meng, "Multi-distribution Deep Belief Networks for speech synthesis," in ICASSP, 2013, pp. 8012-8016.
    • (2013) ICASSP , pp. 8012-8016
    • Kang, S.1    Qian, X.2    Meng, H.3
  • 6
    • 84890545600
    • Multi-task learning in deep neural networks for improved phoneme recognition
    • M. L. Seltzer and J. Droppo, "Multi-task learning in Deep Neural Networks for improved phoneme recognition," in Proc. ICASSP, 2013, pp. 6965-6969.
    • (2013) Proc. ICASSP , pp. 6965-6969
    • Seltzer, M.L.1    Droppo, J.2
  • 7
    • 71249112130
    • Offline handwriting recognition with multidimensional recurrent neural networks
    • A. Graves and J. Schmidhuber, "Offline handwriting recognition with multidimensional Recurrent Neural Networks," in NIPS, 2009.
    • (2009) NIPS
    • Graves, A.1    Schmidhuber, J.2
  • 8
    • 84890543083
    • Speech recognition with deep recurrent neural networks
    • A. Graves, M. Abdel-rahman, and G. Hinton, "Speech recognition with Deep Recurrent Neural Networks," in ICASSP, 2013, pp. 6885-6889.
    • (2013) ICASSP , pp. 6885-6889
    • Graves, A.1    Abdel-Rahman, M.2    Hinton, G.3
  • 10
    • 0031573117
    • Long short-term memory
    • S. Hochreiter and J. Schmidhuber, "Long short-term memory," Neural Computation, vol. 9, no. 8, pp. 1735-1780, 1997.
    • (1997) Neural Computation , vol.9 , Issue.8 , pp. 1735-1780
    • Hochreiter, S.1    Schmidhuber, J.2
  • 11
    • 0034293152
    • Learning to forget: Continual prediction with LSTM
    • F. A. Gers, J. Schmidhuber, and F. Cummins, "Learning to forget: Continual prediction with LSTM," Neural Computation, vol. 12, no. 10, pp. 2451-2471, 2000.
    • (2000) Neural Computation , vol.12 , Issue.10 , pp. 2451-2471
    • Gers, F.A.1    Schmidhuber, J.2    Cummins, F.3
  • 13
    • 84943274699
    • A direct adaptive method for faster back propagation learning: The RPROP algorithm
    • M. Riedmiller and H. Braun, "A direct adaptive method for faster back propagation learning: The RPROP algorithm," in Proc. IEEE Intnl. Conf. on Neural Networks, 1993, pp. 586-591.
    • (1993) Proc. IEEE Intnl. Conf. on Neural Networks , pp. 586-591
    • Riedmiller, M.1    Braun, H.2
  • 15
    • 33745200051
    • Speech parameter generation algorithm considering global variance for HMM-based speech synthesis
    • T. Toda and K. Tokuda, "Speech parameter generation algorithm considering global variance for HMM-based speech synthesis," in Interspeech, 2005, pp. 2801-2804.
    • (2005) Interspeech , pp. 2801-2804
    • Toda, T.1    Tokuda, K.2
  • 16
    • 80051607565
    • CROWDMOS: An approach for crowd sourcing mean opinion score studies
    • F. Ribeiro, D. Florêncio, C. Zhang, and M. Seltzer, "CROWDMOS: An approach for crowd sourcing Mean Opinion Score studies," in ICASSP, 2011, pp. 2416-2419.
    • (2011) ICASSP , pp. 2416-2419
    • Ribeiro, F.1    Florêncio, D.2    Zhang, C.3    Seltzer, M.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.