-
2
-
-
84858972572
-
Making deep belief networks effective for large vocabulary continuous speech recognition
-
T. N. Sainath, B. Kingsbury, B. Ramabhadran, P. Fousek, P. Novak, and A. Mohamed, "Making deep belief networks effective for large vocabulary continuous speech recognition," in Proc. Workshop on Automatic Speech Recognition and Understanding, pp. 30-35, 2011.
-
(2011)
Proc. Workshop on Automatic Speech Recognition and Understanding
, pp. 30-35
-
-
Sainath, T.N.1
Kingsbury, B.2
Ramabhadran, B.3
Fousek, P.4
Novak, P.5
Mohamed, A.6
-
3
-
-
84055222005
-
Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
-
G. E. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition," IEEE Trans. on Audio, Speech and Language Processing, vol. 20, no. 1, pp. 30-42, 2012.
-
(2012)
IEEE Trans. on Audio, Speech and Language Processing
, vol.20
, Issue.1
, pp. 30-42
-
-
Dahl, G.E.1
Yu, D.2
Deng, L.3
Acero, A.4
-
4
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
-
G. Hinton, L. Deng, D. Yu, G. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. Sainath, and B. Kingsbury, "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups," IEEE Signal Processing Magazine, vol. 29, no. 6, pp. 82-97, 2012.
-
(2012)
IEEE Signal Processing Magazine
, vol.29
, Issue.6
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.4
Mohamed, A.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.10
Kingsbury, B.11
-
5
-
-
84890491198
-
Recent advances in deep learning for speech research at Microsoft
-
L. Deng, J. Li, J.-T. Huang, et al., "Recent advances in deep learning for speech research at Microsoft," in Proc. ICASSP, 2013.
-
(2013)
Proc. ICASSP
-
-
Deng, L.1
Li, J.2
Huang, J.-T.3
-
6
-
-
84911473441
-
Convolutional neural networks for speech recognition
-
O. Abdel-Hamid, A. Mohamed, H. Jiang, L. Deng, G. Penn, and D. Yu, "Convolutional neural networks for speech recognition," IEEE Transactions on Audio, Speech, and Language Processing, vol. 22, no. 10, pp. 1533-1545, 2014.
-
(2014)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.22
, Issue.10
, pp. 1533-1545
-
-
Abdel-Hamid, O.1
Mohamed, A.2
Jiang, H.3
Deng, L.4
Penn, G.5
Yu, D.6
-
7
-
-
84890525984
-
Deep convolutional neural networks for LVCSR
-
T.N. Sainath, A. Mohamed, B. Kingsbury, and B. Ramabhadran, "Deep convolutional neural networks for LVCSR," in Proc IEEE ICASSP, 2013.
-
(2013)
Proc IEEE ICASSP
-
-
Sainath, T.N.1
Mohamed, A.2
Kingsbury, B.3
Ramabhadran, B.4
-
8
-
-
84893654379
-
Improvements to deep convolutional neural networks for LVCSR
-
T.N. Sainath, B. Kingsbury, A. Mohamed, G.E. Dahl, G. Saon, H. Soltau, T. Beran, A.Y. Aravkin, and B. Ramabhadran, "Improvements to deep convolutional neural networks for LVCSR," in Proc IEEE ASRU, 2013.
-
(2013)
Proc IEEE ASRU
-
-
Sainath, T.N.1
Kingsbury, B.2
Mohamed, A.3
Dahl, G.E.4
Saon, G.5
Soltau, H.6
Beran, T.7
Aravkin, A.Y.8
Ramabhadran, B.9
-
9
-
-
84910028405
-
Improving language-universal feature extraction with deep maxout and convolutional neural networks
-
Y. Miao and F. Metze, "Improving language-universal feature extraction with deep maxout and convolutional neural networks," in Proc. Interspeech, 2014.
-
(2014)
Proc. Interspeech
-
-
Miao, Y.1
Metze, F.2
-
11
-
-
85083953021
-
Feature learning in deep neural networks - studies on speech recognition tasks
-
D. Yu, M. Seltzer, J. Li, J.-T. Huang, and F. Seide, "Feature learning in deep neural networks - studies on speech recognition tasks," in Proc. ICLR, 2013.
-
(2013)
ICLR
-
-
Yu, D.1
Seltzer, M.2
Li, J.3
Huang, J.-T.4
Seide, F.5
-
12
-
-
84906257050
-
Neural network acoustic models for the DARPA RATS program
-
H. Soltau, H-K. Kuo, L. Mangu, G. Saon, T. Beran, "Neural network acoustic models for the DARPA RATS program," in Proc. Interspeech, 2013.
-
(2013)
Proc. Interspeech
-
-
Soltau, H.1
Kuo, H.-K.2
Mangu, L.3
Saon, G.4
Beran, T.5
-
13
-
-
85048545369
-
Measuring invariances in deep networks
-
I. Goodfellow, H. Lee, Q. Le, A. Saxe, A. Ng, "Measuring invariances in deep networks," in Proc. NIPS, 2009.
-
(2009)
Proc. NIPS
-
-
Goodfellow, I.1
Lee, H.2
Le, Q.3
Saxe, A.4
Ng, A.5
-
14
-
-
84906251664
-
Accurate and compact large vocabulary speech recognition on mobile devices
-
X. Lei, A. Senior, A. Gruenstein, and J. Sorensen, "Accurate and compact large vocabulary speech recognition on mobile devices," in Proc. Interspeech, 2013.
-
(2013)
Proc. Interspeech
-
-
Lei, X.1
Senior, A.2
Gruenstein, A.3
Sorensen, J.4
-
15
-
-
84897543523
-
Maxout networks
-
I. J. Goodfellow, D. Warde-Farley, M. Mirza, A. Courville, and Y. Bengio, "Maxout networks," in Proc. ICML, 2013.
-
(2013)
Proc. ICML
-
-
Goodfellow, I.J.1
Warde-Farley, D.2
Mirza, M.3
Courville, A.4
Bengio, Y.5
-
16
-
-
84890471125
-
On rectified linear units for speech processing
-
M.D. Zeiler, M. Ranzato, R. Monga, M. Mao, K. Yang, Q.V. Le, P. Nguyen, A. Senior, V. Vanhoucke, J. Dean, et al., "On rectified linear units for speech processing," in Proc. ICASSP, 2013.
-
(2013)
Proc. ICASSP
-
-
Zeiler, M.D.1
Ranzato, M.2
Monga, R.3
Mao, M.4
Yang, K.5
Le, Q.V.6
Nguyen, P.7
Senior, A.8
Vanhoucke, V.9
Dean, J.10
-
17
-
-
84905270524
-
Investigation of maxout networks for speech recognition
-
P. Swietojanski, J. Li, and J.-T. Huang, "Investigation of maxout networks for speech recognition," in Proc. ICASSP, 2014.
-
(2014)
Proc. ICASSP
-
-
Swietojanski, P.1
Li, J.2
Huang, J.-T.3
-
18
-
-
67651044226
-
Spectro-temporal analysis of speech using 2-d Gabor filters
-
T. Ezzat, J. Bouvrie, and T. Poggio, "Spectro-temporal analysis of speech using 2-d Gabor filters," in Proc. Interspeech, 2007.
-
(2007)
Proc. Interspeech
-
-
Ezzat, T.1
Bouvrie, J.2
Poggio, T.3
-
19
-
-
84910036228
-
Robust CNN-based speech recognition with Gabor filter kernels
-
S.-Y. Chang and N. Morgan, "Robust CNN-based speech recognition with Gabor filter kernels," in Proc. Interspeech, 2014.
-
(2014)
Proc. Interspeech
-
-
Chang, S.-Y.1
Morgan, N.2
-
21
-
-
84874415293
-
Microphone array processing for distant speech recognition: Towards real-world deployment
-
K. Kumatani, T. Arakawa, K. Yamamoto, J. McDonough, B. Raj, R. Singh, and I. Tashev, "Microphone array processing for distant speech recognition: Towards real-world deployment," in Proc. APSIPA Annual Summit and Conference, 2012.
-
(2012)
APSIPA Annual Summit and Conference
-
-
Kumatani, K.1
Arakawa, T.2
Yamamoto, K.3
McDonough, J.4
Raj, B.5
Singh, R.6
Tashev, I.7
-
22
-
-
84910035297
-
Learning small-size DNN with output-distribution-based criteria
-
J. Li, R. Zhao, J.-T. Huang, and Y. Gong, "Learning small-size DNN with output-distribution-based criteria," in Proc. Interspeech, 2014.
-
(2014)
Proc. Interspeech
-
-
Li, J.1
Zhao, R.2
Huang, J.-T.3
Gong, Y.4
-
23
-
-
84910069623
-
Convolutional deep maxout networks for phone recognition
-
L. Toth, "Convolutional deep maxout networks for phone recognition," in Proc. Interspeech, 2014.
-
(2014)
Proc. Interspeech
-
-
Toth, L.1
-
24
-
-
84910046405
-
Long short-term memory recurrent neural network architectures for large scale acoustic modeling
-
H. Sak, A. Senior, and F. Beaufays, "Long short-term memory recurrent neural network architectures for large scale acoustic modeling," in Interspeech, 2014, pp. 338-342.
-
(2014)
Interspeech
, pp. 338-342
-
-
Sak, H.1
Senior, A.2
Beaufays, F.3