Volume 2015-January, 2015, Pages 3249-3253

A study of the recurrent neural network encoder-decoder for large vocabulary speech recognition

Author keywords

Deep neural networks; Encoder decoder; End to end speech recognition; Recurrent neural networks

Indexed keywords

DECODING; HIDDEN MARKOV MODELS; MARKOV PROCESSES; RECURRENT NEURAL NETWORKS; SPEECH; SPEECH COMMUNICATION; VOCABULARY CONTROL;

EID: 84959173420     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 89

References (41)
  • 3
    • 0029308753
    • Neural networks for statistical recognition of continuous speech
    • N. Morgan and H. A. Bourlard, "Neural networks for statistical recognition of continuous speech," Proceedings of the IEEE, vol. 83, no. 5, pp. 742-772, 1995.
    • (1995) Proceedings of the IEEE , vol.83 , Issue.5 , pp. 742-772
    • Morgan, N.1    Bourlard, H.A.2
  • 6
    • 84865801985
    • Conversational speech transcription using context-dependent deep neural networks
    • F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks," in Interspeech, 2011, pp. 437-440.
    • (2011) Interspeech , pp. 437-440
    • Seide, F.1    Li, G.2    Yu, D.3
  • 7
    • 0000329355
    • A recurrent error propagation network speech recognition system
    • T. Robinson and F. Fallside, "A recurrent error propagation network speech recognition system," Computer Speech and Language, vol. 5, pp. 259-274, 1991.
    • (1991) Computer Speech and Language , vol.5 , pp. 259-274
    • Robinson, T.1    Fallside, F.2
  • 8
    • 0001592322
    • The use of recurrent networks in continuous speech recognition
    • C. Lee, K. Paliwal, and F. Soong, Eds. Kluwer Academic Publishers
    • T. Robinson, M. Hochberg, and S. Renals, "The use of recurrent networks in continuous speech recognition," in Automatic Speech and Speaker Recognition-Advanced Topics, C. Lee, K. Paliwal, and F. Soong, Eds. Kluwer Academic Publishers, 1996, pp. 233-258.
    • (1996) Automatic Speech and Speaker Recognition-Advanced Topics , pp. 233-258
    • Robinson, T.1    Hochberg, M.2    Renals, S.3
  • 14
    • 0024610919
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • L. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proceedings of the IEEE, vol. 77, no. 2, pp. 257-286, 1989.
    • (1989) Proceedings of the IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.1
  • 16
    • 0036460907
    • Weighted finite-state transducers in speech recognition
    • M. Mohri, F. Pereira, and M. Riley, "Weighted finite-state transducers in speech recognition," Computer Speech & Language, vol. 16, pp. 69-88, 2002.
    • (2002) Computer Speech & Language , vol.16 , pp. 69-88
    • Mohri, M.1    Pereira, F.2    Riley, M.3
  • 18
    • 84878379108
    • Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization
    • B. Kingsbury, T. N. Sainath, and H. Soltau, "Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization," in INTERSPEECH, 2012.
    • (2012) INTERSPEECH
    • Kingsbury, B.1    Sainath, T.N.2    Soltau, H.3
  • 22
    • 33745185781
    • Hidden conditional random fields for phone classification
    • A. Gunawardana, M. Mahajan, A. Acero, and J. C. Platt, "Hidden conditional random fields for phone classification," in INTERSPEECH, 2005, pp. 1117-1120.
    • (2005) INTERSPEECH , pp. 1117-1120
    • Gunawardana, A.1    Mahajan, M.2    Acero, A.3    Platt, J.C.4
  • 28
    • 85083953689
    • Neural machine translation by jointly learning to align and translate
    • D. Bahdanau, K. Cho, and Y. Bengio, "Neural machine translation by jointly learning to align and translate," in Proc. ICLR, 2015.
    • (2015) Proc. ICLR
    • Bahdanau, D.1    Cho, K.2    Bengio, Y.3
  • 31
    • 84936143793
    • Towards end-to-end speech recognition with recurrent neural networks
    • A. Graves and N. Jaitly, "Towards end-to-end speech recognition with recurrent neural networks," in Proc. ICML, 2014, pp. 1764-1772.
    • (2014) Proc. ICML , pp. 1764-1772
    • Graves, A.1    Jaitly, N.2
  • 36
    • 84887388950
    • An empirical study of learning rates in deep neural networks for speech recognition
    • A. Senior, G. Heigold, M. Ranzato, and K. Yang, "An empirical study of learning rates in deep neural networks for speech recognition," in Proc. ICASSP. IEEE, 2013, pp. 6724-6728.
    • (2013) Proc. ICASSP. IEEE , pp. 6724-6728
    • Senior, A.1    Heigold, G.2    Ranzato, M.3    Yang, K.4
  • 37
    • 0028392483
    • Learning long-term dependencies with gradient descent is difficult
    • Y. Bengio, P. Simard, and P. Frasconi, "Learning long-term dependencies with gradient descent is difficult," Neural Networks, IEEE Transactions on, vol. 5, no. 2, pp. 157-166, 1994.
    • (1994) Neural Networks, IEEE Transactions on , vol.5 , Issue.2 , pp. 157-166
    • Bengio, Y.1    Simard, P.2    Frasconi, P.3
  • 38
    • 0031573117
    • Long short-term memory
    • S. Hochreiter and J. Schmidhuber, "Long short-term memory," Neural Computation, vol. 9, no. 8, pp. 1735-1780, 1997.
    • (1997) Neural Computation , vol.9 , Issue.8 , pp. 1735-1780
    • Hochreiter, S.1    Schmidhuber, J.2
  • 39
    • 85016587886
    • SWITCHBOARD: Telephone speech corpus for research and development
    • J. J. Godfrey, E. C. Holliman, and J. McDaniel, "SWITCHBOARD: Telephone speech corpus for research and development," in Proc. ICASSP. IEEE, 1992, pp. 517-520.
    • (1992) Proc. ICASSP. IEEE , pp. 517-520
    • Godfrey, J.J.1    Holliman, E.C.2    McDaniel, J.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.