메뉴 건너뛰기




Volumn , Issue , 2016, Pages 78-83

Deep bi-directional recurrent networks over spectral windows

Author keywords

acoustic modeling; Deep learning; LSTM; Recurrent networks

Indexed keywords

LINGUISTICS; RANDOM PROCESSES; RECURRENT NEURAL NETWORKS; SPEECH TRANSMISSION; TRANSCRIPTION;

EID: 84964507635     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ASRU.2015.7404777     Document Type: Conference Paper
Times cited : (36)

References (22)
  • 2
    • 84865801985 scopus 로고    scopus 로고
    • Conversational speech transcription using context-dependent deep neural networks
    • Frank Seide, Gang Li, and Dong Yu, "Conversational speech transcription using context-dependent deep neural networks," in Interspeech 2011
    • (2011) Interspeech
    • Seide, F.1    Li, G.2    Yu, D.3
  • 3
    • 84890543852 scopus 로고    scopus 로고
    • Error back propagation for sequence training of context-dependent deep networks for conversational speech transcription
    • Hang Su, Gang Li, Dong Yu, and Frank Seide, "Error back propagation for sequence training of context-dependent deep networks for conversational speech transcription," in ICASSP 2013
    • (2013) ICASSP
    • Su, H.1    Li, G.2    Yu, D.3    Seide, F.4
  • 6
    • 84890543083 scopus 로고    scopus 로고
    • Speech recognition with deep recurrent neural networks
    • Alex Graves, Abdel rahman Mohamed, and Geoffrey Hinton, "Speech recognition with deep recurrent neural networks," in ICASSP 2013
    • (2013) ICASSP
    • Graves, A.1    Rahman Mohamed, A.2    Hinton, G.3
  • 7
    • 84962892645 scopus 로고    scopus 로고
    • Long short-term memory based recurrent neural network architectures for large vocabulary speech recognition
    • abs/1402.1128
    • Hasim Sak, Andrew W., and Francoise Beaufays, "Long short-term memory based recurrent neural network architectures for large vocabulary speech recognition," CoRR, vol. abs/1402.1128, 2014
    • (2014) CoRR
    • Sak, H.1    Andrew, W.2    Beaufays, F.3
  • 8
    • 70349227947 scopus 로고    scopus 로고
    • The application of hidden markov models in speech recognition
    • Mark Gales and Steve Young, "The application of hidden markov models in speech recognition," Found. Trends Signal Process., vol. 1, no. 3, 2007
    • (2007) Found. Trends Signal Process , vol.1 , Issue.3
    • Gales, M.1    Young, S.2
  • 10
    • 77956502334 scopus 로고    scopus 로고
    • Unsupervised feature learning for audio classification using convolutional deep belief networks
    • Honglak Lee, Peter Pham, Yan Largman, and Andrew Y. Ng, "Unsupervised feature learning for audio classification using convolutional deep belief networks," in Advances in Neural Information Processing Systems 22. 2009
    • (2009) Advances in Neural Information Processing Systems , pp. 22
    • Lee, H.1    Pham, P.2    Largman, Y.3    Ng, A.Y.4
  • 12
    • 84893701254 scopus 로고    scopus 로고
    • Hybrid speech recognition with deep bidirectional LSTM
    • Alex Graves, Navdeep Jaitly, and Abdel rahman Mohamed, "Hybrid speech recognition with deep bidirectional LSTM," in ASRU 2013
    • (2013) ASRU
    • Graves, A.1    Jaitly, N.2    Rahman Mohamed, A.3
  • 13
    • 0031268931 scopus 로고    scopus 로고
    • Bidirectional recurrent neural networks
    • Nov
    • M. Schuster and K.K. Paliwal, "Bidirectional recurrent neural networks," Trans. Sig. Proc., vol. 45, no. 11, pp. 2673-2681, Nov. 1997
    • (1997) Trans. Sig. Proc , vol.45 , Issue.11 , pp. 2673-2681
    • Schuster, M.1    Paliwal, K.K.2
  • 15
    • 84964537525 scopus 로고    scopus 로고
    • The IBM 2015 english conversational telephone speech recognition system
    • abs/1505.05899
    • George Saon, Hong-Kwang Jeff Kuo, Steven J. Rennie, and Michael Picheny, "The IBM 2015 english conversational telephone speech recognition system," CoRR, vol. abs/1505.05899, 2015
    • (2015) CoRR
    • Saon, G.1    Jeff Kuo, H.-K.2    Rennie, S.J.3    Picheny, M.4
  • 16
    • 0031573117 scopus 로고    scopus 로고
    • Long short-term memory
    • Sepp Hochreiter and Jurgen Schmidhuber, "Long short-term memory," Neural computation, vol. 9, no. 8, pp. 1735-1780, 1997
    • (1997) Neural Computation , vol.9 , Issue.8 , pp. 1735-1780
    • Hochreiter, S.1    Schmidhuber, J.2
  • 17
    • 84910069984 scopus 로고    scopus 로고
    • 1-bit stochastic gradient descent and its application to dataparallel distributed training of speech DNNs
    • Frank Seide, Hao Fu, Jasha Droppo, Gang Li, and Dong Yu, "1-bit stochastic gradient descent and its application to dataparallel distributed training of speech DNNs," in INTERSPEECH 2014
    • (2014) INTERSPEECH
    • Seide, F.1    Fu, H.2    Droppo, J.3    Li, G.4    Yu, D.5
  • 18
    • 84905269646 scopus 로고    scopus 로고
    • On parallelizability of stochastic gradient descent for speech DNNs
    • Frank Seide, Hao Fu, Jasha Droppo, Gang Li, and Dong Yu, "On parallelizability of stochastic gradient descent for speech DNNs," in ICASSP 2014
    • (2014) ICASSP
    • Seide, F.1    Fu, H.2    Droppo, J.3    Li, G.4    Yu, D.5
  • 19
    • 84959076031 scopus 로고    scopus 로고
    • Training deep bidirectional LSTM acoustic models for LVCSR by a contextsensitive-chunk BPTT approach
    • Kai Chen, Zhi-Jie Yan, and Qiang Huo, "Training deep bidirectional LSTM acoustic models for LVCSR by a contextsensitive-chunk BPTT approach," in interspeech 2015
    • (2015) Interspeech
    • Chen, K.1    Yan, Z.-J.2    Huo, Q.3
  • 20
    • 70349213445 scopus 로고    scopus 로고
    • Lattice-based optimization of sequence classication criteria for neural-network acoustic modeling
    • Brian Kingsbury, "Lattice-based optimization of sequence classication criteria for neural-network acoustic modeling," in icassp 2009
    • (2009) Icassp
    • Kingsbury, B.1
  • 21
    • 84906264325 scopus 로고    scopus 로고
    • Efficient estimation of maximum entropy language models with N-gram features: An SRILM extension
    • Tanel Alumae and Mikko Kurimo, "Efficient estimation of maximum entropy language models with N-gram features: An SRILM extension," in interspeech 2012
    • (2012) Interspeech
    • Alumae, T.1    Kurimo, M.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.