Volume 2015-August, 2015, Pages 4450-4454

A deep recurrent approach for acoustic-to-articulatory inversion

Author keywords

layer-wise pre-training; long short-term memory (LSTM); mixture density network (MDN); recurrent neural network (RNN)

Indexed keywords

AUDIO SIGNAL PROCESSING; BRAIN; DEEP NEURAL NETWORKS; MEAN SQUARE ERROR; MIXTURES; SPEECH COMMUNICATION;

EID: 84946016986     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2015.7178812     Document Type: Conference Paper
Times cited: 83

References (25)
  • 3
    • 84890443373
    • Audiovisual synthesis of exaggerated speech for corrective feedback in computer-assisted pronunciation training
    • J.H. Zhao, H. Yuan, W.K. Leung, H. Meng, J. Liu, and S.H. Xia, Audiovisual synthesis of exaggerated speech for corrective feedback in computer-assisted pronunciation training, in Proc. ICASSP, 2013, pp. 8218-8222
    • (2013) Proc. ICASSP , pp. 8218-8222
    • Zhao, J.H.1    Yuan, H.2    Leung, W.K.3    Meng, H.4    Liu, J.5    Xia, S.H.6
  • 4
    • 0038359547
    • Modelling the uncertainty in recovering articulation from acoustics
    • K. Richmond, S. King, and P. Taylor, Modelling the uncertainty in recovering articulation from acoustics, Computer Speech & Language, vol. 17, no. 2, pp. 153-172, 2003
    • (2003) Computer Speech & Language , vol.17 , Issue.2 , pp. 153-172
    • Richmond, K.1    King, S.2    Taylor, P.3
  • 5
    • 67650153217
    • Acoustic-articulatory modeling with the trajectory HMM
    • L. Zhang and S. Renals, Acoustic-articulatory modeling with the trajectory HMM, IEEE Signal Processing Letters, vol. 15, pp. 245-248, 2008
    • (2008) IEEE Signal Processing Letters , vol.15 , pp. 245-248
    • Zhang, L.1    Renals, S.2
  • 6
    • 44949185845
    • A trajectory mixture density network for the acoustic-articulatory inversion mapping
    • K. Richmond, A trajectory mixture density network for the acoustic-articulatory inversion mapping, in Proc. INTERSPEECH, 2006, pp. 577-580
    • (2006) Proc. INTERSPEECH , pp. 577-580
    • Richmond, K.1
  • 8
    • 0033708106
    • Speech parameter generation algorithms for HMM-based speech synthesis
    • K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, Speech parameter generation algorithms for HMM-based speech synthesis, in Proc. ICASSP, 2000, pp. 1315-1318
    • (2000) Proc. ICASSP , pp. 1315-1318
    • Tokuda, K.1    Yoshimura, T.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 9
    • 84890527090
    • Multi-distribution deep belief network for speech synthesis
    • S. Kang, X. Qian, and H. Meng, Multi-distribution deep belief network for speech synthesis, in Proc. ICASSP, 2013, pp. 8012-8016
    • (2013) Proc. ICASSP , pp. 8012-8016
    • Kang, S.1    Qian, X.2    Meng, H.3
  • 10
    • 84910030421
    • Statistical parametric speech synthesis using weighted multi-distribution deep belief network
    • S. Kang and H. Meng, Statistical parametric speech synthesis using weighted multi-distribution deep belief network, in Proc. INTERSPEECH, 2014, pp. 1959-1963
    • (2014) Proc. INTERSPEECH , pp. 1959-1963
    • Kang, S.1    Meng, H.2
  • 11
    • 84890490547
    • Statistical parametric speech synthesis using deep neural networks
    • H. Zen, A. Senior, and M. Schuster, Statistical parametric speech synthesis using deep neural networks, in Proc. ICASSP, 2013, pp. 7962-7966
    • (2013) Proc. ICASSP , pp. 7962-7966
    • Zen, H.1    Senior, A.2    Schuster, M.3
  • 12
    • 84890447002
    • Modeling spectral envelopes using restricted Boltzmann machines for statistical parametric speech synthesis
    • Z. Ling, L. Deng, and D. Yu, Modeling spectral envelopes using restricted Boltzmann machines for statistical parametric speech synthesis, in Proc. ICASSP, 2013, pp. 7825-7829
    • (2013) Proc. ICASSP , pp. 7825-7829
    • Ling, Z.1    Deng, L.2    Yu, D.3
  • 13
    • 84910047819
    • TTS synthesis with bidirectional LSTM based recurrent neural networks
    • Y. Fan, Y. Qian, F. Xie, and F.K. Soong, TTS synthesis with bidirectional LSTM based recurrent neural networks, in Proc. INTERSPEECH, 2014, pp. 1964-1968
    • (2014) Proc. INTERSPEECH , pp. 1964-1968
    • Fan, Y.1    Qian, Y.2    Xie, F.3    Soong, F.K.4
  • 15
  • 21
    • 84946058872
    • A. Graves, http://sourceforge.net/projects/rnnl
    • Graves, A.1
  • 22
    • 84865778430
    • Announcing the electromagnetic articulography (day 1) subset of the mngu0 articulatory corpus
    • K. Richmond, P. Hoole, and S. King, Announcing the electromagnetic articulography (day 1) subset of the mngu0 articulatory corpus, in Proc. INTERSPEECH, 2009, pp. 1505-1508
    • (2009) Proc. INTERSPEECH , pp. 1505-1508
    • Richmond, K.1    Hoole, P.2    King, S.3
  • 24
  • 25
    • 79959822106
    • Adaptation of a tongue shape model by local feature transformations
    • C. Qin, M.A. Carreira-Perpiñán, and M. Farhadloo, Adaptation of a tongue shape model by local feature transformations, in Proc. INTERSPEECH, 2010, pp. 1596-1599
    • (2010) Proc. INTERSPEECH , pp. 1596-1599
    • Qin, C.1    Carreira-Perpiñán, M.A.2    Farhadloo, M.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.