-
1
-
-
84858976070
-
Feature engineering in context-dependent deep neural networks for conversational speech transcription
-
F. Seide, G. Li, X. Chien, and D. Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription," in Proc. ASRU, 2011.
-
(2011)
Proc. ASRU
-
-
Seide, F.1
Li, G.2
Chien, X.3
Yu, D.4
-
2
-
-
84906214784
-
Exploring convolutional neural network structures and optimization techniques for speech recognition
-
O. Abdel-Hamid, L. Deng, and D. Yu, "Exploring convolutional neural network structures and optimization techniques for speech recognition." in INTERSPEECH, 2013, pp. 3366-3370.
-
(2013)
INTERSPEECH
, pp. 3366-3370
-
-
Abdel-Hamid, O.1
Deng, L.2
Yu, D.3
-
3
-
-
84890525984
-
Deep convolutional neural networks for lvcsr
-
T. N. Sainath, A.-r. Mohamed, B. Kingsbury, and B. Ramabhadran, "Deep convolutional neural networks for lvcsr," in Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. IEEE, 2013, pp. 8614-8618.
-
(2013)
Acoustics, Speech and Signal Processing (ICASSP 2013 IEEE International Conference On. IEEE
, pp. 8614-8618
-
-
Sainath, T.N.1
Mohamed, A.-R.2
Kingsbury, B.3
Ramabhadran, B.4
-
4
-
-
84910072497
-
Unfolded recurrent neural networks for speech recognition
-
G. Saon, H. Soltau, A. Emami, and M. Picheny, "Unfolded recurrent neural networks for speech recognition," in Fifteenth Annual Conference of the International Speech Communication Association, 2014.
-
(2014)
Fifteenth Annual Conference of the International Speech Communication Association
-
-
Saon, G.1
Soltau, H.2
Emami, A.3
Picheny, M.4
-
5
-
-
84959115289
-
A time delay neural network architecture for efficient modeling of long temporal contexts
-
V. Peddinti, D. Povey, and S. Khudanpur, "A time delay neural network architecture for efficient modeling of long temporal contexts," in Proceedings of INTERSPEECH, 2015.
-
(2015)
Proceedings of INTERSPEECH
-
-
Peddinti, V.1
Povey, D.2
Khudanpur, S.3
-
6
-
-
84928545733
-
-
arXiv preprint arXiv:1412.5567
-
A. Hannun, C. Case, J. Casper, B. Catanzaro, G. Diamos, E. Elsen, R. Prenger, S. Satheesh, S. Sengupta, A. Coates et al., "Deepspeech: Scaling up end-to-end speech recognition," arXiv preprint arXiv:1412.5567, 2014.
-
(2014)
Deepspeech: Scaling Up End-to-end Speech Recognition
-
-
Hannun, A.1
Case, C.2
Casper, J.3
Catanzaro, B.4
Diamos, G.5
Elsen, E.6
Prenger, R.7
Satheesh, S.8
Sengupta, S.9
Coates, A.10
-
7
-
-
84946084790
-
Learning acoustic frame labeling for speech recognition with recurrent neural networks
-
H. Sak, A. Senior, K. Rao, O. Irsoy, A. Graves, F. Beaufays, and J. Schalkwyk, "Learning acoustic frame labeling for speech recognition with recurrent neural networks," in Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on. IEEE, 2015, pp. 4280-4284.
-
(2015)
Acoustics, Speech and Signal Processing (ICASSP 2015 IEEE International Conference On. IEEE
, pp. 4280-4284
-
-
Sak, H.1
Senior, A.2
Rao, K.3
Irsoy, O.4
Graves, A.5
Beaufays, F.6
Schalkwyk, J.7
-
9
-
-
84964507635
-
Deep bi-directional recurrent networks over spectral windows
-
A.-r. Mohamed, F. Seide, D. Yu, J. Droppo, A. Stolcke, G. Zweig, and G. Penn, "Deep bi-directional recurrent networks over spectral windows," in Automatic Speech Recognition and Understanding (ASRU), 2015 IEEE Workshop on. IEEE, 2015.
-
(2015)
Automatic Speech Recognition and Understanding (ASRU 2015 IEEE Workshop On. IEEE
-
-
Mohamed, A.-R.1
Seide, F.2
Yu, D.3
Droppo, J.4
Stolcke, A.5
Zweig, G.6
Penn, G.7
-
10
-
-
84890543083
-
Speech recognition with deep recurrent neural networks
-
A. Graves, A.-r. Mohamed, and G. Hinton, "Speech recognition with deep recurrent neural networks," in Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. IEEE, 2013, pp. 6645-6649.
-
(2013)
Acoustics, Speech and Signal Processing (ICASSP 2013 IEEE International Conference On. IEEE
, pp. 6645-6649
-
-
Graves, A.1
Mohamed, A.-R.2
Hinton, G.3
-
11
-
-
84959129849
-
The IBM 2015 English conversational speech recognition system
-
G. Saon, H.-K. Kuo, S. Rennie, and M. Picheny, "The IBM 2015 English conversational speech recognition system," in Sixteenth Annual Conference of the International Speech Communication Association, 2015.
-
(2015)
Sixteenth Annual Conference of the International Speech Communication Association
-
-
Saon, G.1
Kuo, H.-K.2
Rennie, S.3
Picheny, M.4
-
12
-
-
84892421248
-
-
arXiv preprint arXiv:1302.4389
-
I. J. Goodfellow, D. Warde-Farley, M. Mirza, A. Courville, and Y. Bengio, "Maxout networks," arXiv preprint arXiv:1302.4389, 2013.
-
(2013)
Maxout Networks
-
-
Goodfellow, I.J.1
Warde-Farley, D.2
Mirza, M.3
Courville, A.4
Bengio, Y.5
-
13
-
-
84878379108
-
Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization
-
B. Kingsbury, T. Sainath, and H. Soltau, "Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization," in Proc. Interspeech, 2012.
-
(2012)
Proc. Interspeech
-
-
Kingsbury, B.1
Sainath, T.2
Soltau, H.3
-
14
-
-
84973324686
-
Very deep multilingual convolutional neural networks for lvcsr
-
T. Sercu, C. Puhrsch, B. Kingsbury, and Y. LeCun, "Very deep multilingual convolutional neural networks for lvcsr," Proc. ICASSP, 2016.
-
Proc. ICASSP, 2016
-
-
Sercu, T.1
Puhrsch, C.2
Kingsbury, B.3
LeCun, Y.4
-
16
-
-
84978755117
-
Very deep convolutional networks for large-scale image recognition
-
arXiv:1409.1556
-
K. Simonyan and A. Zisserman, "Very deep convolutional networks for large-scale image recognition," CoRR arXiv:1409.1556, 2014.
-
(2014)
CoRR
-
-
Simonyan, K.1
Zisserman, A.2
-
17
-
-
84905265980
-
Joint training of convolutional and non-convolutional neural networks
-
H. Soltau, G. Saon, and T. N. Sainath, "Joint training of convolutional and non-convolutional neural networks," to Proc. ICASSP, 2014.
-
(2014)
Proc. ICASSP
-
-
Soltau, H.1
Saon, G.2
Sainath, T.N.3
-
18
-
-
84990044091
-
Torch7: A matlab-like environment for machine learning
-
R. Collobert, K. Kavukcuoglu, and C. Farabet, "Torch7: A matlab-like environment for machine learning," in BigLearn, NIPS Workshop, no. EPFL-CONF-192376, 2011.
-
(2011)
BigLearn, NIPS Workshop, No. EPFL-CONF-192376
-
-
Collobert, R.1
Kavukcuoglu, K.2
Farabet, C.3
-
19
-
-
84890543852
-
Error back propagation for sequence training of context-dependent deep networks for conversational speech transcription
-
H. Su, G. Li, D. Yu, and F. Seide, "Error back propagation for sequence training of context-dependent deep networks for conversational speech transcription," Proc. ICASSP, 2013.
-
(2013)
Proc. ICASSP
-
-
Su, H.1
Li, G.2
Yu, D.3
Seide, F.4
-
20
-
-
0033329799
-
An empirical study of smoothing techniques for language modeling
-
S. F. Chen and J. Goodman, "An empirical study of smoothing techniques for language modeling," Computer Speech & Language, vol. 13, no. 4, pp. 359-393, 1999.
-
(1999)
Computer Speech & Language
, vol.13
, Issue.4
, pp. 359-393
-
-
Chen, S.F.1
Goodman, J.2
-
22
-
-
84863387613
-
Shrinking exponential language models
-
S. F. Chen, "Shrinking exponential language models," in Proc. NAACL-HLT, 2009, pp. 468-476.
-
(2009)
Proc. NAACL-HLT
, pp. 468-476
-
-
Chen, S.F.1
-
23
-
-
0142166851
-
A neural probabilistic language model
-
Y. Bengio, R. Ducharme, P. Vincent, and C. Jauvin, "A neural probabilistic language model," Journal of Machine Learning Research, vol. 3, pp. 1137-1155, 2003.
-
(2003)
Journal of Machine Learning Research
, vol.3
, pp. 1137-1155
-
-
Bengio, Y.1
Ducharme, R.2
Vincent, P.3
Jauvin, C.4
-
24
-
-
85055309630
-
-
Ph.D. dissertation, Johns Hopkins University, Baltimore, MD, USA
-
A. Emami, "A neural syntactic language model," Ph.D. dissertation, Johns Hopkins University, Baltimore, MD, USA, 2006.
-
(2006)
A Neural Syntactic Language Model
-
-
Emami, A.1
-
25
-
-
33847610331
-
Continuous space language models
-
H. Schwenk, "Continuous space language models," Computer Speech & Language, vol. 21, no. 3, pp. 492-518, 2007.
-
(2007)
Computer Speech & Language
, vol.21
, Issue.3
, pp. 492-518
-
-
Schwenk, H.1
-
26
-
-
44849092930
-
Empirical study of neural network language models for Arabic speech recognition
-
A. Emami and L. Mangu, "Empirical study of neural network language models for Arabic speech recognition," in Proc. ASRU, 2007, pp. 147-152.
-
(2007)
Proc. ASRU
, pp. 147-152
-
-
Emami, A.1
Mangu, L.2
-
27
-
-
84878422162
-
Large scale hierarchical neural network language models
-
H.-K. J. Kuo, E. Arisoy, A. Emami, and P. Vozila, "Large scale hierarchical neural network language models," in Proc. Interspeech, 2012.
-
(2012)
Proc. Interspeech
-
-
Kuo, H.-K.J.1
Arisoy, E.2
Emami, A.3
Vozila, P.4
|