메뉴 건너뛰기




Volumn 2016-May, Issue , 2016, Pages 5755-5759

Highway long short-term memory RNNS for distant speech recognition

Author keywords

CNTK; Highway LSTM; LSTM; Sequence Training

Indexed keywords


EID: 84973358602     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2016.7472780     Document Type: Conference Paper
Times cited : (324)

References (26)
  • 1
    • 84055222005 scopus 로고    scopus 로고
    • Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
    • G. E. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition, " IEEE Transactions on Audio, Speech and Language Processing, vol. 20, no. 1, pp. 30-42, 2012
    • (2012) IEEE Transactions on Audio, Speech and Language Processing , vol.20 , Issue.1 , pp. 30-42
    • Dahl, G.E.1    Yu, D.2    Deng, L.3    Acero, A.4
  • 5
    • 84901999583 scopus 로고    scopus 로고
    • Convolutional neural networks for distant speech recognition
    • September
    • P. Swietojanski, A. Ghoshal, and S. Renals, "Convolutional neural networks for distant speech recognition, " Signal Processing Letters, IEEE, vol. 21, no. 9, pp. 1120-1124, September 2014
    • (2014) Signal Processing Letters, IEEE , vol.21 , Issue.9 , pp. 1120-1124
    • Swietojanski, P.1    Ghoshal, A.2    Renals, S.3
  • 9
    • 84893704659 scopus 로고    scopus 로고
    • Hybrid acoustic models for distant and multichannel large vocabulary speech recognition
    • P. Swietojanski, A. Ghoshal, and S. Renals, "Hybrid acoustic models for distant and multichannel large vocabulary speech recognition, " in ASRU, 2013
    • (2013) ASRU
    • Swietojanski, P.1    Ghoshal, A.2    Renals, S.3
  • 10
    • 85032750883 scopus 로고    scopus 로고
    • Microphone array processing for distant speech recognition: From closetalking microphones to far-field sensors
    • K. Kumatani, J. W. McDonough, and B. Raj, "Microphone array processing for distant speech recognition: From closetalking microphones to far-field sensors. " IEEE Signal Process. Mag., vol. 29, no. 6, pp. 127-140, 2012
    • (2012) IEEE Signal Process. Mag. , vol.29 , Issue.6 , pp. 127-140
    • Kumatani, K.1    McDonough, J.W.2    Raj, B.3
  • 12
    • 80051654520 scopus 로고    scopus 로고
    • Making the most from multiple microphones in meeting recognition
    • A. Stolcke, "Making the most from multiple microphones in meeting recognition, " in ICASSP, 2011
    • (2011) ICASSP
    • Stolcke, A.1
  • 13
    • 84959076031 scopus 로고    scopus 로고
    • Training deep bidirectional lstm acoustic model for lvcsr by a context-sensitive-chunk bptt approach
    • K. Chen, Z.-J. Yan, and Q. Huo, "Training deep bidirectional lstm acoustic model for lvcsr by a context-sensitive-chunk bptt approach, " in Interspeech, 2015
    • (2015) Interspeech
    • Chen, K.1    Yan, Z.-J.2    Huo, Q.3
  • 14
    • 84901999583 scopus 로고    scopus 로고
    • Convolutional neural networks for distant speech recognition
    • P. Swietojanski, A. Ghoshal, and S. Renals, "Convolutional neural networks for distant speech recognition, " IEEE Singal Processing Letters, vol. 21, no. 9, pp. 1120-1124, 2014
    • (2014) IEEE Singal Processing Letters , vol.21 , Issue.9 , pp. 1120-1124
    • Swietojanski, P.1    Ghoshal, A.2    Renals, S.3
  • 19
  • 20
    • 84905252022 scopus 로고    scopus 로고
    • Asynchronous stochastic optimization for sequence training of deep neural networks
    • G. Heigold, E. McDermott, V. Vanhoucke, A. Senior, and M. Bacchiani, "Asynchronous stochastic optimization for sequence training of deep neural networks, " in ICASSP, 2014
    • (2014) ICASSP
    • Heigold, G.1    McDermott, E.2    Vanhoucke, V.3    Senior, A.4    Bacchiani, M.5
  • 21
    • 35948981862 scopus 로고    scopus 로고
    • Unleashing the killer corpus: Experiences in creating the multi-everything ami meeting corpus
    • J. Carletta, "unleashing the killer corpus: experiences in creating the multi-everything ami meeting corpus, " Language Resources & Evaluation Journal, vol. 41, no. 2, pp. 181-190, 2007
    • (2007) Language Resources & Evaluation Journal , vol.41 , Issue.2 , pp. 181-190
    • Carletta, J.1
  • 22
    • 84903707061 scopus 로고    scopus 로고
    • Multiple dimension levenshtein edit distance calculations for evaluating asr systems during simultaneous speech
    • J. Fiscus, J. Ajot, N. Radde, and C. Laprun, "Multiple dimension levenshtein edit distance calculations for evaluating asr systems during simultaneous speech, " in LREC, 2006
    • (2006) LREC
    • Fiscus, J.1    Ajot, J.2    Radde, N.3    Laprun, C.4
  • 25
    • 0001609567 scopus 로고
    • An efficient gradient-based algorithm for online training of recurrent network trajectories
    • R. Williams and J. Peng, "An efficient gradient-based algorithm for online training of recurrent network trajectories, " Neural Computation, vol. 2, p. 490501, 1990
    • (1990) Neural Computation , vol.2 , pp. 490501
    • Williams, R.1    Peng, J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.