-
1
-
-
84867605836
-
Applying convolutional neural network concepts to hybrid NN-HMM model for speech recognition
-
O. Abdel-Hamid, A. Mohamed, H. Jiang, and G. Penn, "Applying convolutional neural network concepts to hybrid NN-HMM model for speech recognition, " in Proc. ICASSP, 2012, pp. 4277 - 4280.
-
(2012)
Proc. ICASSP
, pp. 4277-4280
-
-
Abdel-Hamid, O.1
Mohamed, A.2
Jiang, H.3
Penn, G.4
-
2
-
-
84890545163
-
A deep convolutional neural network using heterogeneous pooling for trading acoustic invariance with phonetic confusion
-
L. Deng, O. Abdel-Hamid, and D. Yu, "A deep convolutional neural network using heterogeneous pooling for trading acoustic invariance with phonetic confusion, " in Proc. ICASSP, 2013, pp. 6669 - 6673.
-
(2013)
Proc. ICASSP
, pp. 6669-6673
-
-
Deng, L.1
Abdel-Hamid, O.2
Yu, D.3
-
3
-
-
84906214784
-
Exploring convolutional neural network structures and optimization techniques for speech recognition
-
O. Abdel-Hamid, L. Deng, and D. Yu, "Exploring convolutional neural network structures and optimization techniques for speech recognition, " in Proc. Interspeech, 2013, pp. 3366 - 3370.
-
(2013)
Proc. Interspeech
, pp. 3366-3370
-
-
Abdel-Hamid, O.1
Deng, L.2
Yu, D.3
-
4
-
-
84890525984
-
Deep convolutional neural networks for LVCSR
-
T. N. Sainath, A. Mohamed, B. Kingsbury, and B. Ramabhadran, "Deep convolutional neural networks for LVCSR, " in Proc. ICASSP, 2013, pp. 8614-8618.
-
(2013)
Proc. ICASSP
, pp. 8614-8618
-
-
Sainath, T.N.1
Mohamed, A.2
Kingsbury, B.3
Ramabhadran, B.4
-
5
-
-
84893654379
-
Improvements to deep convolutional neural networks for LVCSR
-
T. N. Sainath, B. Kingsbury, A. Mohamed, and B. Ramabhadran, et al. "Improvements to deep convolutional neural networks for LVCSR, " in Proc. ASRU, 2013, pp. 315-320.
-
(2013)
Proc. ASRU
, pp. 315-320
-
-
Sainath, T.N.1
Kingsbury, B.2
Mohamed, A.3
Ramabhadran, B.4
-
6
-
-
84905252069
-
Combining time- And frequency-domain convolution in convolutional neural network-based phone recognition
-
accepted, in print
-
L. Tóth, "Combining time- And frequency-domain convolution in convolutional neural network-based phone recognition, " in Proc. ICASSP. 2014, accepted, in print.
-
(2014)
Proc. ICASSP
-
-
Tóth, L.1
-
7
-
-
84858971297
-
Convolutive bottleneck network features for LVCSR
-
K. Veselý, M. Karafiát, and F. Grézl, "Convolutive bottleneck network features for LVCSR, " in Proc. ASRU, 2011, pp. 42 - 47.
-
(2011)
Proc. ASRU
, pp. 42-47
-
-
Veselý, K.1
Karafiát, M.2
Grézl, F.3
-
8
-
-
84906276981
-
Convolutional deep rectifier neural nets for phone recognition
-
L. Tóth, "Convolutional deep rectifier neural nets for phone recognition, " in Proc. Interspeech, 2013, pp. 1722-1726.
-
(2013)
Proc. Interspeech
, pp. 1722-1726
-
-
Tóth, L.1
-
9
-
-
84893710272
-
Maxout networks
-
I. J. Goodfellow, D. Warde-Farley, M. Mirza, A. Courville, and Y. Bengio, "Maxout networks, " in Proc. ICML, 2013, pp. 1319- 1327.
-
(2013)
Proc. ICML
, pp. 1319-1327
-
-
Goodfellow, I.J.1
Warde-Farley, D.2
Mirza, M.3
Courville, A.4
Bengio, Y.5
-
10
-
-
84893651518
-
Deep maxout neural networks for speech recognition
-
M. Cai, Y. Shi, and J. Liu, "Deep maxout neural networks for speech recognition, " in Proc. ASRU, 2013, pp. 291-296.
-
(2013)
Proc. ASRU
, pp. 291-296
-
-
Cai, M.1
Shi, Y.2
Liu, J.3
-
11
-
-
84893701756
-
Deep maxout networks for low-resource speech recognition
-
Y. Miao, F. Metze, and S. Rawat, "Deep maxout networks for low-resource speech recognition, " in Proc. ASRU, 2013, pp. 398- 403.
-
(2013)
Proc. ASRU
, pp. 398-403
-
-
Miao, Y.1
Metze, F.2
Rawat, S.3
-
12
-
-
84905270524
-
Investigation of maxout networks for speech recognition
-
accepted, in print
-
P. Swietojanski, J. Li, and J. T. Huang, "Investigation of maxout networks for speech recognition, " in Proc. ICASSP. 2014, accepted, in print.
-
(2014)
Proc. ICASSP
-
-
Swietojanski, P.1
Li, J.2
Huang, J.T.3
-
13
-
-
84905239342
-
Improving deep neural network acoustic models using generalized maxout networks
-
accepted, in print
-
X. Zhang, J. Trmal, D. Povey, and S. Khudanpur, "Improving deep neural network acoustic models using generalized maxout networks, " in Proc. ICASSP. 2014, accepted, in print.
-
(2014)
Proc. ICASSP
-
-
Zhang, X.1
Trmal, J.2
Povey, D.3
Khudanpur, S.4
-
14
-
-
84905252882
-
Stochastic pooling maxout networks for low-resource speech recognition
-
accepted, in print
-
M. Cai, Y. Shi, and J. Liu, "Stochastic pooling maxout networks for low-resource speech recognition, " in Proc. ICASSP. 2014, accepted, in print.
-
(2014)
Proc. ICASSP
-
-
Cai, M.1
Shi, Y.2
Liu, J.3
-
15
-
-
77955803591
-
Enhanced phone posteriors for improving speech recognition systems
-
H. Ketabdar and H. Bourlard, "Enhanced phone posteriors for improving speech recognition systems, " IEEE Trans. ASLP, vol. 18, no. 6, pp. 1094-1106, 2010.
-
(2010)
IEEE Trans. ASLP
, vol.18
, Issue.6
, pp. 1094-1106
-
-
Ketabdar, H.1
Bourlard, H.2
-
16
-
-
78049251448
-
Analysis of MLP based hierarchical phoneme posterior probability estimator
-
J. Pinto et al., "Analysis of MLP based hierarchical phoneme posterior probability estimator, " IEEE Trans. ASLP, vol. 19, no. 2, pp. 225-241, 2010.
-
(2010)
IEEE Trans. ASLP
, vol.19
, Issue.2
, pp. 225-241
-
-
Pinto, J.1
-
18
-
-
84890527827
-
Improving deep neural networks for LVCSR using rectified linear units and dropout
-
G. E. Dahl, T. N. Sainath, and G. E. Hinton, "Improving deep neural networks for LVCSR using rectified linear units and dropout, " in Proc. ICASSP, 2013, pp. 8609-8613.
-
(2013)
Proc. ICASSP
, pp. 8609-8613
-
-
Dahl, G.E.1
Sainath, T.N.2
Hinton, G.E.3
-
19
-
-
84890471125
-
On rectified linear units for speech processing
-
M. D. Zeiler, M. Ranzato, R. Monga, M. Mao, K. Yang, Q. V. Le, P. Nguyen, A. Senior, V. Vanhoucke, J. Dean, and G. E. Hinton, "On rectified linear units for speech processing, " in Proc. ICASSP, 2013, pp. 3517-3521.
-
(2013)
Proc. ICASSP
, pp. 3517-3521
-
-
Zeiler, M.D.1
Ranzato, M.2
Monga, R.3
Mao, M.4
Yang, K.5
Le, Q.V.6
Nguyen, P.7
Senior, A.8
Vanhoucke, V.9
Dean, J.10
Hinton, G.E.11
-
20
-
-
84890451371
-
Phone recognition with deep sparse rectifier neural networks
-
L. Tóth, "Phone recognition with deep sparse rectifier neural networks, " in Proc. ICASSP, 2013, pp. 6985-6989.
-
(2013)
Proc. ICASSP
, pp. 6985-6989
-
-
Tóth, L.1
-
21
-
-
84893676344
-
Rectifier nonlinearities improve neural network acoustic models
-
A. L. Maas, A. Y. Hannun, and A. Y. Ng, "Rectifier nonlinearities improve neural network acoustic models, " in Proc. ICML, 2013.
-
(2013)
Proc. ICML
-
-
Maas, A.L.1
Hannun, A.Y.2
Ng, A.Y.3
-
22
-
-
84055211743
-
Acoustic modeling using deep belief networks
-
A. Mohamed, G. E. Dahl, and G. Hinton, "Acoustic modeling using deep belief networks, " IEEE Trans. ASLP, vol. 20, no. 1, pp. 14-22, 2012.
-
(2012)
IEEE Trans. ASLP
, vol.20
, Issue.1
, pp. 14-22
-
-
Mohamed, A.1
Dahl, G.E.2
Hinton, G.3
-
23
-
-
84858976070
-
Feature engineering in context-dependent deep neural networks for conversational speech transcription
-
F. Seide, G. Li, L. Chen, and D. Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription, " in Proc. ASRU, 2011, pp. 24-29.
-
(2011)
Proc. ASRU
, pp. 24-29
-
-
Seide, F.1
Li, G.2
Chen, L.3
Yu, D.4
-
24
-
-
84890466217
-
Improving neural networks by preventing coadaptation of feature detectors
-
vol. abs/1207.0580
-
G.E. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov, "Improving neural networks by preventing coadaptation of feature detectors, " CoRR, vol. abs/1207.0580, 2012.
-
(2012)
CoRR
-
-
Hinton, G.E.1
Srivastava, N.2
Krizhevsky, A.3
Sutskever, I.4
Salakhutdinov, R.5
|