SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn 2016-May, Issue , 2016, Pages 5755-5759

Highway long short-term memory RNNS for distant speech recognition

(6) Zhang, Yu a Chen, Guoguo a Yu, Dong b Yaco, Kaisheng b Khudanpur, Sanjeev a Glass, James a

a Mrr CSAIL ^* (United States)

b MICROSOFT RESEARCH (United States)

Author keywords

CNTK; Highway LSTM; LSTM; Sequence Training

Indexed keywords

EID: 84973358602 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2016.7472780 Document Type: Conference Paper

Times cited : (324)

References (26)

1
- 84055222005
- Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
- G. E. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition, " IEEE Transactions on Audio, Speech and Language Processing, vol. 20, no. 1, pp. 30-42, 2012
- (2012) IEEE Transactions on Audio, Speech and Language Processing , vol.20 , Issue.1 , pp. 30-42
- Dahl, G.E.¹ Yu, D.² Deng, L.³ Acero, A.⁴

2
- 84865801985
- Conversational speech transcription using context-dependent deep neural networks
- F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks, " in Proc. Annual Conference of International Speech Communication Association (INTERSPEECH), 2011, pp. 437-440
- (2011) Proc. Annual Conference of International Speech Communication Association (INTERSPEECH) , pp. 437-440
- Seide, F.¹ Li, G.² Yu, D.³

3
- 85032751458
- Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
- G. Hinton, L. Deng, D. Yu, G. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. Sainath, and B. Kingsbury, "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups, " IEEE Signal Processing Magazine, no. 6, pp. 82-97, 2012
- (2012) IEEE Signal Processing Magazine , Issue.6 , pp. 82-97
- Hinton, G.¹ Deng, L.² Yu, D.³ Dahl, G.⁴ Mohamed, A.⁵ Jaitly, N.⁶ Senior, A.⁷ Vanhoucke, V.⁸ Nguyen, P.⁹ Sainath, T.¹⁰ Kingsbury, B.¹¹

4
- 84890492030
- An investigation of deep neural networks for noise robust speech recognition
- M. Seltzer, D. Yu, and Y. Q. Wang, "An investigation of deep neural networks for noise robust speech recognition, " in Proc. International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2013
- (2013) Proc. International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- Seltzer, M.¹ Yu, D.² Wang, Y.Q.³

5
- 84901999583
- Convolutional neural networks for distant speech recognition
- September
- P. Swietojanski, A. Ghoshal, and S. Renals, "Convolutional neural networks for distant speech recognition, " Signal Processing Letters, IEEE, vol. 21, no. 9, pp. 1120-1124, September 2014
- (2014) Signal Processing Letters, IEEE , vol.21 , Issue.9 , pp. 1120-1124
- Swietojanski, P.¹ Ghoshal, A.² Renals, S.³

6
- 84890543083
- Speech recognition with deep recurrent neural networks
- A. Graves, A. Mohamed, and G. Hinton, "Speech recognition with deep recurrent neural networks, " in Proc. International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2013
- (2013) Proc. International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- Graves, A.¹ Mohamed, A.² Hinton, G.³

7
- 84893701254
- Hybrid speech recognition with deep bidirectional LSTM
- A. Graves, N. Jaitly, and A. Mohamed, "Hybrid speech recognition with deep bidirectional LSTM, " in Proc. IEEEWorkshop on Automfatic Speech Recognition and Understanding (ASRU), 2013, pp. 273-278
- (2013) Proc. IEEEWorkshop on Automfatic Speech Recognition and Understanding (ASRU) , pp. 273-278
- Graves, A.¹ Jaitly, N.² Mohamed, A.³

8
- 84910046405
- Long short-term memory recurrent neural network architectures for large scale acoustic modeling
- H. Sak, A. Senior, and F. Beaufays, "Long short-term memory recurrent neural network architectures for large scale acoustic modeling, " in Fifteenth Annual Conference of the International Speech Communication Association, 2014
- (2014) Fifteenth Annual Conference of the International Speech Communication Association
- Sak, H.¹ Senior, A.² Beaufays, F.³

9
- 84893704659
- Hybrid acoustic models for distant and multichannel large vocabulary speech recognition
- P. Swietojanski, A. Ghoshal, and S. Renals, "Hybrid acoustic models for distant and multichannel large vocabulary speech recognition, " in ASRU, 2013
- (2013) ASRU
- Swietojanski, P.¹ Ghoshal, A.² Renals, S.³

10
- 85032750883
- Microphone array processing for distant speech recognition: From closetalking microphones to far-field sensors
- K. Kumatani, J. W. McDonough, and B. Raj, "Microphone array processing for distant speech recognition: From closetalking microphones to far-field sensors. " IEEE Signal Process. Mag., vol. 29, no. 6, pp. 127-140, 2012
- (2012) IEEE Signal Process. Mag. , vol.29 , Issue.6 , pp. 127-140
- Kumatani, K.¹ McDonough, J.W.² Raj, B.³

11
- 85008520364
- Transcribing meetings with the amida systems
- T. Hain, L. Burget, J. Dines, P. N. Garner, F. Grzl, A. E. Hannani, M. Huijbregts, M. Karafit, M. Lincoln, and V. Wan, "Transcribing meetings with the amida systems. " IEEE Transactions on Audio, Speech & Language Processing, vol. 20, no. 2, pp. 486-498, 2012
- (2012) IEEE Transactions on Audio, Speech & Language Processing , vol.20 , Issue.2 , pp. 486-498
- Hain, T.¹ Burget, L.² Dines, J.³ Garner, P.N.⁴ Grzl, F.⁵ Hannani, A.E.⁶ Huijbregts, M.⁷ Karafit, M.⁸ Lincoln, M.⁹ Wan, V.¹⁰

12
- 80051654520
- Making the most from multiple microphones in meeting recognition
- A. Stolcke, "Making the most from multiple microphones in meeting recognition, " in ICASSP, 2011
- (2011) ICASSP
- Stolcke, A.¹

13
- 84959076031
- Training deep bidirectional lstm acoustic model for lvcsr by a context-sensitive-chunk bptt approach
- K. Chen, Z.-J. Yan, and Q. Huo, "Training deep bidirectional lstm acoustic model for lvcsr by a context-sensitive-chunk bptt approach, " in Interspeech, 2015
- (2015) Interspeech
- Chen, K.¹ Yan, Z.-J.² Huo, Q.³

14
- 84901999583
- Convolutional neural networks for distant speech recognition
- P. Swietojanski, A. Ghoshal, and S. Renals, "Convolutional neural networks for distant speech recognition, " IEEE Singal Processing Letters, vol. 21, no. 9, pp. 1120-1124, 2014
- (2014) IEEE Singal Processing Letters , vol.21 , Issue.9 , pp. 1120-1124
- Swietojanski, P.¹ Ghoshal, A.² Renals, S.³

15
- 84928146953
- An introduction to computational networks and the computational network toolkit
- D. Yu, A. Eversole, M. Seltzer, K. Yao, B. Guenter, O. Kuchaiev, Y. Zhang, F. Seide, G. Chen, H. Wang, J. Droppo, A. Agarwal, C. Basoglu, M. Padmilac, A. Kamenev, V. Ivanov, S. Cyphers, H. Parthasarathi, B. Mitra, Z. Huang, G. Zweig, C. Rossbach, J. Currey, J. Gao, A. May, B. Peng, A. Stolcke, M. Slaney, and X. Huang, "An introduction to computational networks and the computational network toolkit, " Microsoft Technical Report, 2014
- (2014) Microsoft Technical Report
- Yu, D.¹ Eversole, A.² Seltzer, M.³ Yao, K.⁴ Guenter, B.⁵ Kuchaiev, O.⁶ Zhang, Y.⁷ Seide, F.⁸ Chen, G.⁹ Wang, H.¹⁰ Droppo, J.¹¹ Agarwal, A.¹² Basoglu, C.¹³ Padmilac, M.¹⁴ Kamenev, A.¹⁵ Ivanov, V.¹⁶ Cyphers, S.¹⁷ Parthasarathi, H.¹⁸ Mitra, B.¹⁹ Huang, Z.²⁰ more..

16
- 84965156812
- R. Srivastava, K. Greff, and J. Schmidhuber, "Highway networks, " 2015. [Online]. Available: http: //arxiv. org/abs/1505. 00387
- (2015) Highway Networks
- Srivastava, R.¹ Greff, K.² Schmidhuber, J.³

17
- 84973280333
- K. Yao, T. Cohn, K. Vylomova, K. Duh, and C. Dyer, "Depth-gated lstm, " 2015. [Online]. Available: http: //arxiv. org/abs/1508. 03790
- (2015) Depth-gated Lstm
- Yao, K.¹ Cohn, T.² Vylomova, K.³ Duh, K.⁴ Dyer, C.⁵

18
- 84965135019
- N. Kalchbrenner, I. Danihelka, and A. Graves, "Grid long short-term memory, " 2015. [Online]. Available: http: //arXiv. org/abs/1507. 01526
- (2015) Grid Long Short-term Memory
- Kalchbrenner, N.¹ Danihelka, I.² Graves, A.³

19
- 0031573117
- Long short-term memory
- S. Hochreiter and J. Schmidhuber, "Long short-term memory, " Neural Computation, vol. 9, no. 8, p. 17351438, 1997
- (1997) Neural Computation , vol.9 , Issue.8 , pp. 17351438
- Hochreiter, S.¹ Schmidhuber, J.²

20
- 84905252022
- Asynchronous stochastic optimization for sequence training of deep neural networks
- G. Heigold, E. McDermott, V. Vanhoucke, A. Senior, and M. Bacchiani, "Asynchronous stochastic optimization for sequence training of deep neural networks, " in ICASSP, 2014
- (2014) ICASSP
- Heigold, G.¹ McDermott, E.² Vanhoucke, V.³ Senior, A.⁴ Bacchiani, M.⁵

21
- 35948981862
- Unleashing the killer corpus: Experiences in creating the multi-everything ami meeting corpus
- J. Carletta, "unleashing the killer corpus: experiences in creating the multi-everything ami meeting corpus, " Language Resources & Evaluation Journal, vol. 41, no. 2, pp. 181-190, 2007
- (2007) Language Resources & Evaluation Journal , vol.41 , Issue.2 , pp. 181-190
- Carletta, J.¹

22
- 84903707061
- Multiple dimension levenshtein edit distance calculations for evaluating asr systems during simultaneous speech
- J. Fiscus, J. Ajot, N. Radde, and C. Laprun, "Multiple dimension levenshtein edit distance calculations for evaluating asr systems during simultaneous speech, " in LREC, 2006
- (2006) LREC
- Fiscus, J.¹ Ajot, J.² Radde, N.³ Laprun, C.⁴

23
- 84867600292
- The Kaldi speech recognition toolkit
- D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek, N. Goel, M. Hannemann, P. Motlícek, Y. Qian, P. Schwarz, J. Silovský, G. Stemmer, and K. Veselý, "The Kaldi speech recognition toolkit, " in ASRU, 2011
- (2011) ASRU
- Povey, D.¹ Ghoshal, A.² Boulianne, G.³ Burget, L.⁴ Glembek, O.⁵ Goel, N.⁶ Hannemann, M.⁷ Motĺcek, P.⁸ Qian, Y.⁹ Schwarz, P.¹⁰ Silovský, J.¹¹ Stemmer, G.¹² Veselý, K.¹³

24
- 84858976070
- Feature engineering in context-dependent deep neural networks for conversational speech transcription
- F. Seide, G. Li, X. Chen, and D. Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription, " in Proc. IEEE Workshop on Automfatic Speech Recognition and Understanding (ASRU), 2011, pp. 24-29
- (2011) Proc. IEEE Workshop on Automfatic Speech Recognition and Understanding (ASRU) , pp. 24-29
- Seide, F.¹ Li, G.² Chen, X.³ Yu, D.⁴

25
- 0001609567
- An efficient gradient-based algorithm for online training of recurrent network trajectories
- R. Williams and J. Peng, "An efficient gradient-based algorithm for online training of recurrent network trajectories, " Neural Computation, vol. 2, p. 490501, 1990
- (1990) Neural Computation , vol.2 , pp. 490501
- Williams, R.¹ Peng, J.²

26
- 84867720412
- G. E. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov, "Improving neural networks by preventing co-adaptation of feature detectors, " 2012. [Online]. Available: http: //arxiv. org/abs/1207. 0580
- (2012) Improving Neural Networks by Preventing Co-adaptation of Feature Detectors
- Hinton, G.E.¹ Srivastava, N.² Krizhevsky, A.³ Sutskever, I.⁴ Salakhutdinov, R.⁵

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.