-
1
-
-
85032751593
-
Research developments and directions in speech recognition and understanding
-
May
-
Baker, J., Deng, L., Glass, J., Khudanpur, S., Lee, C.-H., Morgan, N., and O'Shaughnessy, D. "Research developments and directions in speech recognition and understanding, " IEEE Sig. Proc. Mag., vol. 26, no. 3, May 2009, pp. 75-80.
-
(2009)
IEEE Sig. Proc. Mag.
, vol.26
, Issue.3
, pp. 75-80
-
-
Baker, J.1
Deng, L.2
Glass, J.3
Khudanpur, S.4
Lee, C.-H.5
Morgan, N.6
O'Shaughnessy, D.7
-
2
-
-
84879854889
-
Representation learning: A review and new perspectives
-
Bengio, Y., Courville, A., and Vincent, P. "Representation learning: A review and new perspectives, " IEEE Trans. PAMI, vol. 38, pp. 1798-1828, 2013.
-
(2013)
IEEE Trans. PAMI
, vol.38
, pp. 1798-1828
-
-
Bengio, Y.1
Courville, A.2
Vincent, P.3
-
3
-
-
84890543516
-
Advances in optimizing recurrent networks
-
Bengio, Y., Boulanger, N., and Pascanu, R. "Advances in optimizing recurrent networks, " Proc. ICASSP, 2013.
-
(2013)
Proc. ICASSP
-
-
Bengio, Y.1
Boulanger, N.2
Pascanu, R.3
-
4
-
-
0030196364
-
Stacked regression
-
Breiman, L. "Stacked regression, " Machine Learning, Vol. 24, pp. 49-64, 1996.
-
(1996)
Machine Learning
, vol.24
, pp. 49-64
-
-
Breiman, L.1
-
6
-
-
85083950550
-
A primal-dual method for training recurrent neural networks constrained by the echo-state property
-
April
-
Chen, J. and Deng, L. "A primal-dual method for training recurrent neural networks constrained by the echo-state property, " Proc. Int. Conf. Learning Representations, April, 2014.
-
(2014)
Proc. Int. Conf. Learning Representations
-
-
Chen, J.1
Deng, L.2
-
7
-
-
84055222005
-
Contextdependent, pre-trained deep neural networks for large vocabulary speech recognition
-
Dahl, G., Yu, D., Deng, L., and Acero, A. "Contextdependent, pre-trained deep neural networks for large vocabulary speech recognition, " IEEE Trans. Audio, Speech, & Language Proc., Vol. 20, pp. 30-42, 2012.
-
(2012)
IEEE Trans. Audio, Speech, & Language Proc.
, vol.20
, pp. 30-42
-
-
Dahl, G.1
Yu, D.2
Deng, L.3
Acero, A.4
-
8
-
-
84905280906
-
Sequence classification using the high-level features extracted from deep neural networks
-
Deng, L. and Chen, J. "Sequence classification using the high-level features extracted from deep neural networks, " Proc. ICASSP, 2014.
-
(2014)
Proc. ICASSP
-
-
Deng, L.1
Chen, J.2
-
9
-
-
84890545163
-
A deep convolutional neural network using heterogeneous pooling for trading acoustic invariance with phonetic confusion
-
Deng, L., Abdel-Hamid, O., and Yu, D. "A deep convolutional neural network using heterogeneous pooling for trading acoustic invariance with phonetic confusion, " Proc. ICASSP, 2013.
-
(2013)
Proc. ICASSP
-
-
Deng, L.1
Abdel-Hamid, O.2
Yu, D.3
-
10
-
-
84890491198
-
Recent advances in deep learning for speech research at Microsoft
-
Deng, L., Li, J., Huang, K., Yao, D. Yu, F. Seide, M. Seltzer, G. Zweig, X. He, J. Williams, Y. Gong, and A. Acero. "Recent advances in deep learning for speech research at Microsoft, " Proc. ICASSP, 2013.
-
(2013)
Proc. ICASSP
-
-
Yao1
Deng, L.2
Li, J.3
Huang, K.4
Yu, D.5
Seide, F.6
Seltzer, M.7
Zweig, G.8
He, X.9
Williams, J.10
Gong, Y.11
Acero, A.12
-
11
-
-
84890526837
-
New types of deep neural network learning for speech recognition and related applications: An overview
-
Deng, L., Hinton, G., and Kingsbury, B. "New types of deep neural network learning for speech recognition and related applications: An overview, " Proc. ICASSP, 2013.
-
(2013)
Proc. ICASSP
-
-
Deng, L.1
Hinton, G.2
Kingsbury, B.3
-
12
-
-
84890468916
-
Deep learning for speech recognition and related applications
-
Deng, L., Yu, D., and Hinton, G. "Deep Learning for Speech Recognition and Related Applications" NIPS Workshop, 2009.
-
(2009)
NIPS Workshop
-
-
Deng, L.1
Yu, D.2
Hinton, G.3
-
13
-
-
84867614591
-
Scalable stacking and learning for building deep architectures
-
Deng, L., Yu, D., and Platt, J. "Scalable stacking and learning for building deep architectures, " Proc. ICASSP, 2012.
-
(2012)
Proc. ICASSP
-
-
Deng, L.1
Yu, D.2
Platt, J.3
-
14
-
-
84890543083
-
Speech recognition with deep recurrent neural networks
-
Graves, A., Mohamed, A., and Hinton, G. "Speech recognition with deep recurrent neural networks, " Proc. ICASSP, 2013.
-
(2013)
Proc. ICASSP
-
-
Graves, A.1
Mohamed, A.2
Hinton, G.3
-
15
-
-
84893701254
-
Hybrid speech recognition with deep bidirectional LSTM
-
Graves, A., Jaitly, N., and Mohamed, A. "Hybrid speech recognition with deep bidirectional LSTM, " Proc. ASRU, 2013.
-
(2013)
Proc. ASRU
-
-
Graves, A.1
Jaitly, N.2
Mohamed, A.3
-
16
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition
-
Hinton, G., Deng, L., Yu, D., Dahl, G., Mohamed, A., Jaitly, N., Senior, A., Vanhoucke, V., Nguyen, P., Sainath, T., and Kingsbury, B., "Deep Neural Networks for Acoustic Modeling in Speech Recognition, " IEEE Signal Processing Magazine, vol. 29, no. 6, pp. 82-97, 2012.
-
(2012)
IEEE Signal Processing Magazine
, vol.29
, Issue.6
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.4
Mohamed, A.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.10
Kingsbury, B.11
-
17
-
-
33746600649
-
Reducing the dimensionality of data with neural networks
-
July
-
Hinton, G. and Salakhutdinov, R. "Reducing the dimensionality of data with neural networks, " Science, vol. 313. no. 5786, pp. 504 - 507, July 2006.
-
(2006)
Science
, vol.313
, Issue.5786
, pp. 504-507
-
-
Hinton, G.1
Salakhutdinov, R.2
-
18
-
-
84878539964
-
Application of pre-trained deep neural networks to large vocabulary speech recognition
-
Jaitly, N., Nguyen, P., and Vanhoucke, V. "Application of pre-trained deep neural networks to large vocabulary speech recognition, " Proc. Interspeech, 2012.
-
(2012)
Proc. Interspeech
-
-
Jaitly, N.1
Nguyen, P.2
Vanhoucke, V.3
-
19
-
-
84878379108
-
Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization
-
Kingsbury, B., Sainath, T., and Soltau, H. "Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization, " Proc. Interspeech, 2012.
-
(2012)
Proc. Interspeech
-
-
Kingsbury, B.1
Sainath, T.2
Soltau, H.3
-
20
-
-
84878409063
-
Recurrent neural networks for noise reduction in robust ASR
-
Maas, A., Le, Q., O'Neil, T., Vinyals, O., Nguyen, P., and Ng, P. "Recurrent neural networks for noise reduction in robust ASR, " Proc. Interspeech, 2012.
-
(2012)
Proc. Interspeech
-
-
Maas, A.1
Le, Q.2
O'Neil, T.3
Vinyals, O.4
Nguyen, P.5
Ng, P.6
-
21
-
-
80053451847
-
Learning recurrent neural networks with Hessian-free optimization
-
Martens, J. and Sutskever, I. "Learning recurrent neural networks with Hessian-free optimization, " Proc. ICML, 2011.
-
(2011)
Proc. ICML
-
-
Martens, J.1
Sutskever, I.2
-
22
-
-
84858966958
-
Strategies for training large scale neural network language models
-
Mikolov, T., Deoras, A., Povey, D., Burget, L., and Cernocky, J. "Strategies for training large scale neural network language models, " Proc. ASRU, 2011.
-
(2011)
Proc. ASRU
-
-
Mikolov, T.1
Deoras, A.2
Povey, D.3
Burget, L.4
Cernocky, J.5
-
23
-
-
79959829092
-
Recurrent neural network based language model
-
Mikolov, T., Karafiat, M., Burget, L., Cernocky, J., and Khudanpur, S. "Recurrent neural network based language model, " Proc. ICASSP, 2010, 1045-1048.
-
(2010)
Proc. ICASSP
, pp. 1045-1048
-
-
Mikolov, T.1
Karafiat, M.2
Burget, L.3
Cernocky, J.4
Khudanpur, S.5
-
24
-
-
84055211743
-
Acoustic modeling using deep belief networks
-
January
-
Mohamed, A., Dahl, G. and Hinton, G. "Acoustic modeling using deep belief networks", IEEE Trans. Audio, Speech, and Language Proc. Vol. 20., January 2012.
-
(2012)
IEEE Trans. Audio, Speech, and Language Proc.
, vol.20
-
-
Mohamed, A.1
Dahl, G.2
Hinton, G.3
-
25
-
-
79959840616
-
Investigation of fullsequence training of deep belief networks for speech recognition
-
Mohamed, A., Yu, D., and Deng, L. "Investigation of fullsequence training of deep belief networks for speech recognition, " Proc. Interspeech, 2010.
-
(2010)
Proc. Interspeech
-
-
Mohamed, A.1
Yu, D.2
Deng, L.3
-
26
-
-
84255177123
-
Deep and wide: Multiple layers in automatic speech recognition
-
January
-
Morgan, N. "Deep and wide: Multiple layers in automatic speech recognition, " IEEE Trans. Audio, Speech, and Language Processing, Vol. 20 (1), January 2012.
-
(2012)
IEEE Trans. Audio, Speech, and Language Processing
, vol.20
, Issue.1
-
-
Morgan, N.1
-
27
-
-
84897497795
-
On the difficulty of training recurrent neural networks
-
Pascanu, R., Mikolov, T., and Bengio, Y. "On the difficulty of training recurrent neural networks, " Proc. ICML, 2013.
-
(2013)
Proc. ICML
-
-
Pascanu, R.1
Mikolov, T.2
Bengio, Y.3
-
28
-
-
0028392167
-
An application of recurrent nets to phone probability estimation
-
Robinson, A. "An application of recurrent nets to phone probability estimation, " IEEE Trans. Neural Networks, Vol. 5, pp. 298-305, 1994.
-
(1994)
IEEE Trans. Neural Networks
, vol.5
, pp. 298-305
-
-
Robinson, A.1
-
29
-
-
84886829539
-
Optimization techniques to improve training speed of deep neural networks for large speech tasks
-
Nov
-
Sainath, T., Kingsbury, B., Soltau, H., and Ramabhadran, B. "Optimization Techniques to Improve Training Speed of Deep Neural Networks for Large Speech Tasks, " IEEE Trans. Audio, Speech, and Language Processing, vol.21, no.11, pp.2267-2276, Nov. 2013.
-
(2013)
IEEE Trans. Audio, Speech, and Language Processing
, vol.21
, Issue.11
, pp. 2267-2276
-
-
Sainath, T.1
Kingsbury, B.2
Soltau, H.3
Ramabhadran, B.4
-
30
-
-
84890525984
-
Convolutional neural networks for LVCSR
-
Sainath, T., Mohamed, A., Kingsbury, B., and Ramabhadran, B. "Convolutional neural networks for LVCSR, " Proc. ICASSP, 2013.
-
(2013)
Proc. ICASSP
-
-
Sainath, T.1
Mohamed, A.2
Kingsbury, B.3
Ramabhadran, B.4
-
31
-
-
84893654379
-
Improvements to deep convolutional neural networks for LVCSR
-
Sainath, T., Kingsbury, Mohamed, A., Dahl, G., Saon, G., Soltau, H., Beran, T., Aravkin, A., and B. Ramabhadran. "Improvements to deep convolutional neural networks for LVCSR, " Proc. ASRU, 2013.
-
(2013)
Proc. ASRU
-
-
Kingsbury1
Sainath, T.2
Mohamed, A.3
Dahl, G.4
Saon, G.5
Soltau, H.6
Beran, T.7
Aravkin, A.8
Ramabhadran, B.9
-
32
-
-
84858972572
-
Making deep belief networks effective for large vocabulary continuous speech recognition
-
Sainath, T., Kingsbury, B., Ramabhadran, B., Novak, P., and Mohamed, A. "Making deep belief networks effective for large vocabulary continuous speech recognition, " Proc. ASRU, 2011.
-
(2011)
Proc. ASRU
-
-
Sainath, T.1
Kingsbury, B.2
Ramabhadran, B.3
Novak, P.4
Mohamed, A.5
-
33
-
-
84865801985
-
Conversational speech transcription using context-dependent deep neural networks
-
Seide, F., Li, G., and Yu, D. "Conversational speech transcription using context-dependent deep neural networks, " Proc. Interspeech, 2011.
-
(2011)
Proc. Interspeech
-
-
Seide, F.1
Li, G.2
Yu, D.3
-
35
-
-
84886714036
-
Acoustic modeling with hierarchical reservoirs
-
Nov
-
Triefenbach, F., Jalalvand, A., Demuynck, K., Martens, J.- P. "Acoustic modeling with hierarchical reservoirs, " IEEE Trans. Audio, Speech, and Language Processing, vol.21, no.11, pp. 2439-2450, Nov. 2013.
-
(2013)
IEEE Trans. Audio, Speech, and Language Processing
, vol.21
, Issue.11
, pp. 2439-2450
-
-
Triefenbach, F.1
Jalalvand, A.2
Demuynck, K.3
Martens, J.-P.4
-
36
-
-
0024634603
-
Phoneme recognition using time-delay neural networks
-
Waibel, A., Hanazawa, T., Hinton, G., Shikano, K., and Lang, K. "Phoneme recognition using time-delay neural networks, " IEEE Trans. Acoust. Speech, and Signal Proc., vol. 37, pp. 328-339, 1989.
-
(1989)
IEEE Trans. Acoust. Speech, and Signal Proc.
, vol.37
, pp. 328-339
-
-
Waibel, A.1
Hanazawa, T.2
Hinton, G.3
Shikano, K.4
Lang, K.5
-
37
-
-
0026692226
-
Stacked generalization
-
Wolpert, D. "Stacked generalization, " Neural Networks, vol. 5, no. 2, pp. 241-259, 1992.
-
(1992)
Neural Networks
, vol.5
, Issue.2
, pp. 241-259
-
-
Wolpert, D.1
-
38
-
-
84904483474
-
Recurrent neural networks for language understanding
-
Yao, K., Zweig, G., Hwang, M., Shi, Y., and Yu, D. "Recurrent Neural Networks for Language Understanding, " Proc. Interspeech, 2013.
-
(2013)
Proc. Interspeech
-
-
Yao, K.1
Zweig, G.2
Hwang, M.3
Shi, Y.4
Yu, D.5
-
39
-
-
84871387302
-
The deep tensor neural network with applications to large vocabulary speech recognition
-
Yu, D., Deng, L., and Seide, F. "The deep tensor neural network with applications to large vocabulary speech recognition, " IEEE Trans. Audio, Speech, and Language Processing, vol. 21, no. 2, pp. 388-396, 2013.
-
(2013)
IEEE Trans. Audio, Speech, and Language Processing
, vol.21
, Issue.2
, pp. 388-396
-
-
Yu, D.1
Deng, L.2
Seide, F.3
-
40
-
-
84865713025
-
Roles of pre-training and fine-tuning in context-dependent DBN-HMMs for real-world speech recognition
-
Yu, D., Deng, L., and Dahl, G.E., "Roles of pre-training and fine-tuning in context-dependent DBN-HMMs for real-world speech recognition, " NIPS Workshop on Deep Learning and Unsupervised Feature Learning, 2010.
-
(2010)
NIPS Workshop on Deep Learning and Unsupervised Feature Learning
-
-
Yu, D.1
Deng, L.2
Dahl, G.E.3
|