-
2
-
-
84865801985
-
Conversational speech transcription using context-dependent deep neural networks
-
Frank Seide, Gang Li, and Dong Yu, "Conversational speech transcription using context-dependent deep neural networks," in Interspeech 2011
-
(2011)
Interspeech
-
-
Seide, F.1
Li, G.2
Yu, D.3
-
3
-
-
84890543852
-
Error back propagation for sequence training of context-dependent deep networks for conversational speech transcription
-
Hang Su, Gang Li, Dong Yu, and Frank Seide, "Error back propagation for sequence training of context-dependent deep networks for conversational speech transcription," in ICASSP 2013
-
(2013)
ICASSP
-
-
Su, H.1
Li, G.2
Yu, D.3
Seide, F.4
-
4
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition
-
Geoffrey Hinton, Li Deng, Dong Yu, George Dahl, Abdel rahman Mohamed, Navdeep Jaitly, Andrew Senior, Vincent Vanhoucke, Patrick Nguyen, Tara Sainath, and Brian Kingsbury, "Deep neural networks for acoustic modeling in speech recognition," Signal Processing Magazine, 2012
-
(2012)
Signal Processing Magazine
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.4
Rahman Mohamed, A.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.10
Kingsbury, B.11
-
6
-
-
84890543083
-
Speech recognition with deep recurrent neural networks
-
Alex Graves, Abdel rahman Mohamed, and Geoffrey Hinton, "Speech recognition with deep recurrent neural networks," in ICASSP 2013
-
(2013)
ICASSP
-
-
Graves, A.1
Rahman Mohamed, A.2
Hinton, G.3
-
7
-
-
84962892645
-
Long short-term memory based recurrent neural network architectures for large vocabulary speech recognition
-
abs/1402.1128
-
Hasim Sak, Andrew W., and Francoise Beaufays, "Long short-term memory based recurrent neural network architectures for large vocabulary speech recognition," CoRR, vol. abs/1402.1128, 2014
-
(2014)
CoRR
-
-
Sak, H.1
Andrew, W.2
Beaufays, F.3
-
8
-
-
70349227947
-
The application of hidden markov models in speech recognition
-
Mark Gales and Steve Young, "The application of hidden markov models in speech recognition," Found. Trends Signal Process., vol. 1, no. 3, 2007
-
(2007)
Found. Trends Signal Process
, vol.1
, Issue.3
-
-
Gales, M.1
Young, S.2
-
10
-
-
77956502334
-
Unsupervised feature learning for audio classification using convolutional deep belief networks
-
Honglak Lee, Peter Pham, Yan Largman, and Andrew Y. Ng, "Unsupervised feature learning for audio classification using convolutional deep belief networks," in Advances in Neural Information Processing Systems 22. 2009
-
(2009)
Advances in Neural Information Processing Systems
, pp. 22
-
-
Lee, H.1
Pham, P.2
Largman, Y.3
Ng, A.Y.4
-
11
-
-
84911473441
-
Convolutional neural networks for speech recognition
-
Oct
-
Ossama Abdel-Hamid, Abdel-Rahman Mohamed, Hui Jiang, Li Deng, Gerald Penn, and Dong Yu, "Convolutional neural networks for speech recognition," IEEE/ACM Trans. Audio, Speech and Lang. Proc., vol. 22, no. 10, pp. 1533-1545, Oct. 2014
-
(2014)
IEEE/ACM Trans. Audio, Speech and Lang. Proc
, vol.22
, Issue.10
, pp. 1533-1545
-
-
Ossama, A.-H.1
Mohamed, A.-R.2
Jiang, H.3
Deng, L.4
Penn, G.5
Yu, D.6
-
12
-
-
84893701254
-
Hybrid speech recognition with deep bidirectional LSTM
-
Alex Graves, Navdeep Jaitly, and Abdel rahman Mohamed, "Hybrid speech recognition with deep bidirectional LSTM," in ASRU 2013
-
(2013)
ASRU
-
-
Graves, A.1
Jaitly, N.2
Rahman Mohamed, A.3
-
13
-
-
0031268931
-
Bidirectional recurrent neural networks
-
Nov
-
M. Schuster and K.K. Paliwal, "Bidirectional recurrent neural networks," Trans. Sig. Proc., vol. 45, no. 11, pp. 2673-2681, Nov. 1997
-
(1997)
Trans. Sig. Proc
, vol.45
, Issue.11
, pp. 2673-2681
-
-
Schuster, M.1
Paliwal, K.K.2
-
14
-
-
84890545163
-
A deep convolutional neural network using heterogeneous pooling for trading acoustic invariance with phonetic confusion
-
May
-
Li Deng, Ossama Abdel-Hamid, and Dong Yu, "A deep convolutional neural network using heterogeneous pooling for trading acoustic invariance with phonetic confusion," in IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), May 2013
-
(2013)
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Deng, L.1
Ossama, A.-H.2
Yu, D.3
-
15
-
-
84964537525
-
The IBM 2015 english conversational telephone speech recognition system
-
abs/1505.05899
-
George Saon, Hong-Kwang Jeff Kuo, Steven J. Rennie, and Michael Picheny, "The IBM 2015 english conversational telephone speech recognition system," CoRR, vol. abs/1505.05899, 2015
-
(2015)
CoRR
-
-
Saon, G.1
Jeff Kuo, H.-K.2
Rennie, S.J.3
Picheny, M.4
-
16
-
-
0031573117
-
Long short-term memory
-
Sepp Hochreiter and Jurgen Schmidhuber, "Long short-term memory," Neural computation, vol. 9, no. 8, pp. 1735-1780, 1997
-
(1997)
Neural Computation
, vol.9
, Issue.8
, pp. 1735-1780
-
-
Hochreiter, S.1
Schmidhuber, J.2
-
17
-
-
84910069984
-
1-bit stochastic gradient descent and its application to dataparallel distributed training of speech DNNs
-
Frank Seide, Hao Fu, Jasha Droppo, Gang Li, and Dong Yu, "1-bit stochastic gradient descent and its application to dataparallel distributed training of speech DNNs," in INTERSPEECH 2014
-
(2014)
INTERSPEECH
-
-
Seide, F.1
Fu, H.2
Droppo, J.3
Li, G.4
Yu, D.5
-
18
-
-
84905269646
-
On parallelizability of stochastic gradient descent for speech DNNs
-
Frank Seide, Hao Fu, Jasha Droppo, Gang Li, and Dong Yu, "On parallelizability of stochastic gradient descent for speech DNNs," in ICASSP 2014
-
(2014)
ICASSP
-
-
Seide, F.1
Fu, H.2
Droppo, J.3
Li, G.4
Yu, D.5
-
19
-
-
84959076031
-
Training deep bidirectional LSTM acoustic models for LVCSR by a contextsensitive-chunk BPTT approach
-
Kai Chen, Zhi-Jie Yan, and Qiang Huo, "Training deep bidirectional LSTM acoustic models for LVCSR by a contextsensitive-chunk BPTT approach," in interspeech 2015
-
(2015)
Interspeech
-
-
Chen, K.1
Yan, Z.-J.2
Huo, Q.3
-
20
-
-
70349213445
-
Lattice-based optimization of sequence classication criteria for neural-network acoustic modeling
-
Brian Kingsbury, "Lattice-based optimization of sequence classication criteria for neural-network acoustic modeling," in icassp 2009
-
(2009)
Icassp
-
-
Kingsbury, B.1
-
21
-
-
84906264325
-
Efficient estimation of maximum entropy language models with N-gram features: An SRILM extension
-
Tanel Alumae and Mikko Kurimo, "Efficient estimation of maximum entropy language models with N-gram features: An SRILM extension," in interspeech 2012
-
(2012)
Interspeech
-
-
Alumae, T.1
Kurimo, M.2
-
22
-
-
37849007170
-
Web resources for language modeling in conversational speech recognition
-
Ivan Bulyko, Mari Ostendorf, Manhung Siu, Tim Ng, Andreas Stolcke, and O zgur C etin, "Web resources for language modeling in conversational speech recognition," ACM Transactions on Speech and Language Processing, vol. 5, no. 1, 2007
-
(2007)
ACM Transactions on Speech and Language Processing
, vol.5
, Issue.1
-
-
Bulyko, I.1
Ostendorf, M.2
Siu, M.3
Ng, T.4
Stolcke, A.5
Zgur Etin C, O.6
|