



Volume 5, 2014, Pages 3771-3779

Towards end-to-end speech recognition with recurrent neural networks

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; CHARACTER RECOGNITION; CLASSIFICATION (OF INFORMATION); COMPUTATIONAL LINGUISTICS; LEARNING SYSTEMS; NETWORK ARCHITECTURE; RECURRENT NEURAL NETWORKS; TRANSCRIPTION;

EID: 84919832465     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 786

References (22)
  • 2
    • 33745202406
    • Open vocabulary speech recognition with flat hybrid models
    • Bisani, Maximilian and Ney, Hermann. Open vocabulary speech recognition with flat hybrid models. In INTERSPEECH, pp. 725-728, 2005.
    • (2005) INTERSPEECH , pp. 725-728
    • Bisani, M.1    Ney, H.2
  • 4
    • 80054740693
    • A committee of neural networks for traffic sign classification
    • IEEE
    • Ciresan, Dan C, Meier, Ueli, Masci, Jonathan, and Schmidhuber, Jürgen. A committee of neural networks for traffic sign classification. In IJCNN, pp. 1918-1921. IEEE, 2011.
    • (2011) IJCNN , pp. 1918-1921
    • Ciresan, D.C.1    Meier, U.2    Masci, J.3    Schmidhuber, J.4
  • 5
    • 0019053271
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • August
    • Davis, S. and Mermelstein, P. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Transactions on Acoustics, Speech and Signal Processing, 28(4):357-366, August 1980.
    • (1980) IEEE Transactions on Acoustics, Speech and Signal Processing , vol.28 , Issue.4 , pp. 357-366
    • Davis, S.1    Mermelstein, P.2
  • 7
    • 85009227775
    • Recognition of out-of-vocabulary words with sub-lexical language models
    • Galescu, Lucian. Recognition of out-of-vocabulary words with sub-lexical language models. In INTERSPEECH, 2003.
    • (2003) INTERSPEECH
    • Galescu, L.1
  • 9
    • 27744588611
    • Framewise phoneme classification with bidirectional LSTM and other neural network architectures
    • June/July
    • Graves, A. and Schmidhuber, J. Framewise Phoneme Classification with Bidirectional LSTM and Other Neural Network Architectures. Neural Networks, 18(5-6):602-610, June/July 2005.
    • (2005) Neural Networks , vol.18 , Issue.5-6 , pp. 602-610
    • Graves, A.1    Schmidhuber, J.2
  • 10
    • 33749259827
    • Connectionist temporal classification: Labelling unsegmented sequence data with recurrent neural networks
    • Pittsburgh, USA
    • Graves, A., Fernandez, S., Gomez, F., and Schmidhuber, J. Connectionist Temporal Classification: Labelling Unsegmented Sequence Data with Recurrent Neural Networks. In ICML, Pittsburgh, USA, 2006.
    • (2006) ICML
    • Graves, A.1    Fernandez, S.2    Gomez, F.3    Schmidhuber, J.4
  • 11
    • 84890543083
    • Speech recognition with deep recurrent neural networks
    • Vancouver, Canada, May
    • Graves, A., Mohamed, A., and Hinton, G. Speech recognition with deep recurrent neural networks. In Proc ICASSP 2013, Vancouver, Canada, May 2013.
    • (2013) Proc ICASSP 2013
    • Graves, A.1    Mohamed, A.2    Hinton, G.3
  • 13
    • 33746600649
    • Reducing the dimensionality of data with neural networks
    • May
    • Hinton, G. E. and Salakhutdinov, R. R. Reducing the Dimensionality of Data with Neural Networks. Science, 313(5786):504-507, May 2006.
    • (2006) Science , vol.313 , Issue.5786 , pp. 504-507
    • Hinton, G.E.1    Salakhutdinov, R.R.2
  • 16
    • 80051609011
    • Learning a better representation of speech soundwaves using restricted Boltzmann machines
    • Jaitly, Navdeep and Hinton, Geoffrey E. Learning a better representation of speech soundwaves using restricted Boltzmann machines. In ICASSP, pp. 5884-5887, 2011.
    • (2011) ICASSP , pp. 5884-5887
    • Jaitly, N.1    Hinton, G.E.2
  • 17
    • 84878539964
    • Application of pretrained deep neural networks to large vocabulary speech recognition
    • Jaitly, Navdeep, Nguyen, Patrick, Senior, Andrew W, and Vanhoucke, Vincent. Application of pretrained deep neural networks to large vocabulary speech recognition. In INTERSPEECH, 2012.
    • (2012) INTERSPEECH
    • Jaitly, N.1    Nguyen, P.2    Senior, A.W.3    Vanhoucke, V.4
  • 19
    • 0031647824
    • A frequency warping approach to speaker normalization
    • Jan
    • Lee, Li and Rose, R. A frequency warping approach to speaker normalization. Speech and Audio Processing, IEEE Transactions on, 6(1):49-60, Jan 1998.
    • (1998) Speech and Audio Processing, IEEE Transactions on , vol.6 , Issue.1 , pp. 49-60
    • Lee, L.1    Rose, R.2
  • 20
    • 44949241322
    • Reinforcement learning of motor skills with policy gradients
    • Peters, J. and Schaal, S. Reinforcement learning of motor skills with policy gradients. In Neural Networks, number 4, pp. 682-697, 2008.
    • (2008) Neural Networks , Issue.4 , pp. 682-697
    • Peters, J.1    Schaal, S.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.