-
1
-
-
84055222005
-
Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
-
G. E. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition, " IEEE Transactions on Audio, Speech and Language Processing, vol. 20, no. 1, pp. 30-42, 2012
-
(2012)
IEEE Transactions on Audio, Speech and Language Processing
, vol.20
, Issue.1
, pp. 30-42
-
-
Dahl, G.E.1
Yu, D.2
Deng, L.3
Acero, A.4
-
2
-
-
84865801985
-
Conversational speech transcription using context-dependent deep neural networks
-
F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks, " in Proc. Annual Conference of International Speech Communication Association (INTERSPEECH), 2011, pp. 437-440
-
(2011)
Proc. Annual Conference of International Speech Communication Association (INTERSPEECH)
, pp. 437-440
-
-
Seide, F.1
Li, G.2
Yu, D.3
-
3
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
-
G. Hinton, L. Deng, D. Yu, G. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. Sainath, and B. Kingsbury, "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups, " IEEE Signal Processing Magazine, no. 6, pp. 82-97, 2012
-
(2012)
IEEE Signal Processing Magazine
, Issue.6
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.4
Mohamed, A.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.10
Kingsbury, B.11
-
4
-
-
84890492030
-
An investigation of deep neural networks for noise robust speech recognition
-
M. Seltzer, D. Yu, and Y. Q. Wang, "An investigation of deep neural networks for noise robust speech recognition, " in Proc. International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2013
-
(2013)
Proc. International Conference on Acoustics, Speech and Signal Processing (ICASSP)
-
-
Seltzer, M.1
Yu, D.2
Wang, Y.Q.3
-
5
-
-
84901999583
-
Convolutional neural networks for distant speech recognition
-
September
-
P. Swietojanski, A. Ghoshal, and S. Renals, "Convolutional neural networks for distant speech recognition, " Signal Processing Letters, IEEE, vol. 21, no. 9, pp. 1120-1124, September 2014
-
(2014)
Signal Processing Letters, IEEE
, vol.21
, Issue.9
, pp. 1120-1124
-
-
Swietojanski, P.1
Ghoshal, A.2
Renals, S.3
-
6
-
-
84890543083
-
Speech recognition with deep recurrent neural networks
-
A. Graves, A. Mohamed, and G. Hinton, "Speech recognition with deep recurrent neural networks, " in Proc. International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2013
-
(2013)
Proc. International Conference on Acoustics, Speech and Signal Processing (ICASSP)
-
-
Graves, A.1
Mohamed, A.2
Hinton, G.3
-
7
-
-
84893701254
-
Hybrid speech recognition with deep bidirectional LSTM
-
A. Graves, N. Jaitly, and A. Mohamed, "Hybrid speech recognition with deep bidirectional LSTM, " in Proc. IEEEWorkshop on Automfatic Speech Recognition and Understanding (ASRU), 2013, pp. 273-278
-
(2013)
Proc. IEEEWorkshop on Automfatic Speech Recognition and Understanding (ASRU)
, pp. 273-278
-
-
Graves, A.1
Jaitly, N.2
Mohamed, A.3
-
9
-
-
84893704659
-
Hybrid acoustic models for distant and multichannel large vocabulary speech recognition
-
P. Swietojanski, A. Ghoshal, and S. Renals, "Hybrid acoustic models for distant and multichannel large vocabulary speech recognition, " in ASRU, 2013
-
(2013)
ASRU
-
-
Swietojanski, P.1
Ghoshal, A.2
Renals, S.3
-
10
-
-
85032750883
-
Microphone array processing for distant speech recognition: From closetalking microphones to far-field sensors
-
K. Kumatani, J. W. McDonough, and B. Raj, "Microphone array processing for distant speech recognition: From closetalking microphones to far-field sensors. " IEEE Signal Process. Mag., vol. 29, no. 6, pp. 127-140, 2012
-
(2012)
IEEE Signal Process. Mag.
, vol.29
, Issue.6
, pp. 127-140
-
-
Kumatani, K.1
McDonough, J.W.2
Raj, B.3
-
11
-
-
85008520364
-
Transcribing meetings with the amida systems
-
T. Hain, L. Burget, J. Dines, P. N. Garner, F. Grzl, A. E. Hannani, M. Huijbregts, M. Karafit, M. Lincoln, and V. Wan, "Transcribing meetings with the amida systems. " IEEE Transactions on Audio, Speech & Language Processing, vol. 20, no. 2, pp. 486-498, 2012
-
(2012)
IEEE Transactions on Audio, Speech & Language Processing
, vol.20
, Issue.2
, pp. 486-498
-
-
Hain, T.1
Burget, L.2
Dines, J.3
Garner, P.N.4
Grzl, F.5
Hannani, A.E.6
Huijbregts, M.7
Karafit, M.8
Lincoln, M.9
Wan, V.10
-
12
-
-
80051654520
-
Making the most from multiple microphones in meeting recognition
-
A. Stolcke, "Making the most from multiple microphones in meeting recognition, " in ICASSP, 2011
-
(2011)
ICASSP
-
-
Stolcke, A.1
-
13
-
-
84959076031
-
Training deep bidirectional lstm acoustic model for lvcsr by a context-sensitive-chunk bptt approach
-
K. Chen, Z.-J. Yan, and Q. Huo, "Training deep bidirectional lstm acoustic model for lvcsr by a context-sensitive-chunk bptt approach, " in Interspeech, 2015
-
(2015)
Interspeech
-
-
Chen, K.1
Yan, Z.-J.2
Huo, Q.3
-
14
-
-
84901999583
-
Convolutional neural networks for distant speech recognition
-
P. Swietojanski, A. Ghoshal, and S. Renals, "Convolutional neural networks for distant speech recognition, " IEEE Singal Processing Letters, vol. 21, no. 9, pp. 1120-1124, 2014
-
(2014)
IEEE Singal Processing Letters
, vol.21
, Issue.9
, pp. 1120-1124
-
-
Swietojanski, P.1
Ghoshal, A.2
Renals, S.3
-
15
-
-
84928146953
-
An introduction to computational networks and the computational network toolkit
-
D. Yu, A. Eversole, M. Seltzer, K. Yao, B. Guenter, O. Kuchaiev, Y. Zhang, F. Seide, G. Chen, H. Wang, J. Droppo, A. Agarwal, C. Basoglu, M. Padmilac, A. Kamenev, V. Ivanov, S. Cyphers, H. Parthasarathi, B. Mitra, Z. Huang, G. Zweig, C. Rossbach, J. Currey, J. Gao, A. May, B. Peng, A. Stolcke, M. Slaney, and X. Huang, "An introduction to computational networks and the computational network toolkit, " Microsoft Technical Report, 2014
-
(2014)
Microsoft Technical Report
-
-
Yu, D.1
Eversole, A.2
Seltzer, M.3
Yao, K.4
Guenter, B.5
Kuchaiev, O.6
Zhang, Y.7
Seide, F.8
Chen, G.9
Wang, H.10
Droppo, J.11
Agarwal, A.12
Basoglu, C.13
Padmilac, M.14
Kamenev, A.15
Ivanov, V.16
Cyphers, S.17
Parthasarathi, H.18
Mitra, B.19
Huang, Z.20
Zweig, G.21
Rossbach, C.22
Currey, J.23
Gao, J.24
May, A.25
Peng, B.26
Stolcke, A.27
Slaney, M.28
Huang, X.29
more..
-
17
-
-
84973280333
-
-
K. Yao, T. Cohn, K. Vylomova, K. Duh, and C. Dyer, "Depth-gated lstm, " 2015. [Online]. Available: http: //arxiv. org/abs/1508. 03790
-
(2015)
Depth-gated Lstm
-
-
Yao, K.1
Cohn, T.2
Vylomova, K.3
Duh, K.4
Dyer, C.5
-
19
-
-
0031573117
-
Long short-term memory
-
S. Hochreiter and J. Schmidhuber, "Long short-term memory, " Neural Computation, vol. 9, no. 8, p. 17351438, 1997
-
(1997)
Neural Computation
, vol.9
, Issue.8
, pp. 17351438
-
-
Hochreiter, S.1
Schmidhuber, J.2
-
20
-
-
84905252022
-
Asynchronous stochastic optimization for sequence training of deep neural networks
-
G. Heigold, E. McDermott, V. Vanhoucke, A. Senior, and M. Bacchiani, "Asynchronous stochastic optimization for sequence training of deep neural networks, " in ICASSP, 2014
-
(2014)
ICASSP
-
-
Heigold, G.1
McDermott, E.2
Vanhoucke, V.3
Senior, A.4
Bacchiani, M.5
-
21
-
-
35948981862
-
Unleashing the killer corpus: Experiences in creating the multi-everything ami meeting corpus
-
J. Carletta, "unleashing the killer corpus: experiences in creating the multi-everything ami meeting corpus, " Language Resources & Evaluation Journal, vol. 41, no. 2, pp. 181-190, 2007
-
(2007)
Language Resources & Evaluation Journal
, vol.41
, Issue.2
, pp. 181-190
-
-
Carletta, J.1
-
22
-
-
84903707061
-
Multiple dimension levenshtein edit distance calculations for evaluating asr systems during simultaneous speech
-
J. Fiscus, J. Ajot, N. Radde, and C. Laprun, "Multiple dimension levenshtein edit distance calculations for evaluating asr systems during simultaneous speech, " in LREC, 2006
-
(2006)
LREC
-
-
Fiscus, J.1
Ajot, J.2
Radde, N.3
Laprun, C.4
-
23
-
-
84867600292
-
The Kaldi speech recognition toolkit
-
D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek, N. Goel, M. Hannemann, P. Motlícek, Y. Qian, P. Schwarz, J. Silovský, G. Stemmer, and K. Veselý, "The Kaldi speech recognition toolkit, " in ASRU, 2011
-
(2011)
ASRU
-
-
Povey, D.1
Ghoshal, A.2
Boulianne, G.3
Burget, L.4
Glembek, O.5
Goel, N.6
Hannemann, M.7
Motĺcek, P.8
Qian, Y.9
Schwarz, P.10
Silovský, J.11
Stemmer, G.12
Veselý, K.13
-
24
-
-
84858976070
-
Feature engineering in context-dependent deep neural networks for conversational speech transcription
-
F. Seide, G. Li, X. Chen, and D. Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription, " in Proc. IEEE Workshop on Automfatic Speech Recognition and Understanding (ASRU), 2011, pp. 24-29
-
(2011)
Proc. IEEE Workshop on Automfatic Speech Recognition and Understanding (ASRU)
, pp. 24-29
-
-
Seide, F.1
Li, G.2
Chen, X.3
Yu, D.4
-
25
-
-
0001609567
-
An efficient gradient-based algorithm for online training of recurrent network trajectories
-
R. Williams and J. Peng, "An efficient gradient-based algorithm for online training of recurrent network trajectories, " Neural Computation, vol. 2, p. 490501, 1990
-
(1990)
Neural Computation
, vol.2
, pp. 490501
-
-
Williams, R.1
Peng, J.2
-
26
-
-
84867720412
-
-
G. E. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov, "Improving neural networks by preventing co-adaptation of feature detectors, " 2012. [Online]. Available: http: //arxiv. org/abs/1207. 0580
-
(2012)
Improving Neural Networks by Preventing Co-adaptation of Feature Detectors
-
-
Hinton, G.E.1
Srivastava, N.2
Krizhevsky, A.3
Sutskever, I.4
Salakhutdinov, R.5
|