메뉴 건너뛰기




Volumn 2015-August, Issue , 2015, Pages 4280-4284

Learning acoustic frame labeling for speech recognition with recurrent neural networks

Author keywords

acoustic modeling; CTC; LSTM; RNN

Indexed keywords

AUDIO SIGNAL PROCESSING; SPEECH COMMUNICATION; SPEECH RECOGNITION; TELEPHONE SETS;

EID: 84946084790     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2015.7178778     Document Type: Conference Paper
Times cited : (229)

References (25)
  • 3
    • 84910046405 scopus 로고    scopus 로고
    • Long short-term memory recurrent neural network architectures for large scale acoustic modeling
    • H. Sak, A. Senior, and F. Beaufays, Long Short-Term Memory Recurrent Neural Network Architectures for Large Scale Acoustic Modeling, in INTERSPEECH 2014, 2014
    • (2014) INTERSPEECH 2014
    • Sak, H.1    Senior, A.2    Beaufays, F.3
  • 4
    • 84910072094 scopus 로고    scopus 로고
    • Sequence discriminative distributed training of long short-term memory recurrent neural networks
    • H. Sak, O. Vinyals, G. Heigold, A. Senior, E. McDermott, R. Monga, and M. Mao, Sequence discriminative distributed training of long short-term memory recurrent neural networks, in Interspeech, 2014
    • (2014) Interspeech
    • Sak, H.1    Vinyals, O.2    Heigold, G.3    Senior, A.4    McDermott, E.5    Monga, R.6    Mao, M.7
  • 8
    • 84865801985 scopus 로고    scopus 로고
    • Conversational speech transcription using context-dependent deep neural networks
    • F. Seide, G. Li, and D. Yu, Conversational speech transcription using context-dependent deep neural networks, in INTERSPEECH, 2011, pp. 437-440
    • (2011) INTERSPEECH , pp. 437-440
    • Seide, F.1    Li, G.2    Yu, D.3
  • 10
    • 84055222005 scopus 로고    scopus 로고
    • Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
    • Jan
    • G. E. Dahl, D. Yu, L. Deng, and A. Acero, Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition, IEEE Transactions on Audio, Speech &Language Processing, vol. 20, no. 1, pp. 30-42, Jan. 2012. [Online]. Available: http://dx.doi.org/10.1109/TASL.2011.2134090
    • (2012) IEEE Transactions on Audio, Speech &Language Processing , vol.20 , Issue.1 , pp. 30-42
    • Dahl, G.E.1    Yu, D.2    Deng, L.3    Acero, A.4
  • 11
    • 84878539964 scopus 로고    scopus 로고
    • Application of pretrained deep neural networks to large vocabulary speech recognition
    • N. Jaitly, P. Nguyen, A. Senior, and V. Vanhoucke, Application of pretrained deep neural networks to large vocabulary speech recognition, in INTERSPEECH, 2012
    • (2012) INTERSPEECH
    • Jaitly, N.1    Nguyen, P.2    Senior, A.3    Vanhoucke, V.4
  • 13
    • 0024610919 scopus 로고
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Feb
    • L. R. Rabiner, A tutorial on hidden Markov models and selected applications in speech recognition, Proceedings of the IEEE, vol. 77, no. 2, pp. 257-286, Feb. 1989
    • (1989) Proceedings of the IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.R.1
  • 16
    • 70349213445 scopus 로고    scopus 로고
    • Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling
    • Taipei, Taiwan, Apr
    • B. Kingsbury, Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling, in IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Taipei, Taiwan, Apr. 2009, pp. 3761-3764
    • (2009) IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) , pp. 3761-3764
    • Kingsbury, B.1
  • 17
    • 84878379108 scopus 로고    scopus 로고
    • Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization
    • B. Kingsbury, T. N. Sainath, and H. Soltau, Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization, in INTERSPEECH, 2012
    • (2012) INTERSPEECH
    • Kingsbury, B.1    Sainath, T.N.2    Soltau, H.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.