-
1
-
-
84055222005
-
Context-dependentpre-trained deep neural networks for large-vocabulary speechrecognition
-
G. E. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependentpre-trained deep neural networks for large-vocabulary speechrecognition, " Audio, Speech, and Language Processing, IEEETransactions on, vol. 20, no. 1, pp. 30-42, 2012.
-
(2012)
Audio, Speech, and Language Processing, IEEETransactions on
, vol.20
, Issue.1
, pp. 30-42
-
-
Dahl, G.E.1
Yu, D.2
Deng, L.3
Acero, A.4
-
2
-
-
85032751458
-
Deepneural networks for acoustic modeling in speech recognition: Theshared views of four research groups
-
G. Hinton, L. Deng, D. Yu, G. E. Dahl, A.-r. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. N. Sainath et al., "Deepneural networks for acoustic modeling in speech recognition: Theshared views of four research groups, " Signal Processing Magazine, IEEE, vol. 29, no. 6, pp. 82-97, 2012.
-
(2012)
Signal Processing Magazine, IEEE
, vol.29
, Issue.6
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.E.4
Mohamed, A.-R.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.N.10
-
3
-
-
84893701756
-
Deep maxout networks for lowresourcespeech recognition
-
Y. Miao, F. Metze, and S. Rawat, "Deep maxout networks for lowresourcespeech recognition, " in Automatic Speech Recognitionand Understand ing (ASRU), 2013 IEEE Workshop on. IEEE, 2013, pp. 398-403.
-
(2013)
Automatic Speech Recognitionand Understand Ing (ASRU), 2013 IEEE Workshop On. IEEE
, pp. 398-403
-
-
Miao, Y.1
Metze, F.2
Rawat, S.3
-
4
-
-
84890525984
-
Deep convolutional neural networks for lvcsr
-
T. N. Sainath, A.-r. Mohamed, B. Kingsbury, and B. Ramabhadran, "Deep convolutional neural networks for lvcsr, " in Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE InternationalConference on. IEEE, 2013, pp. 8614-8618.
-
(2013)
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE InternationalConference On. IEEE
, pp. 8614-8618
-
-
Sainath, T.N.1
Mohamed, A.-R.2
Kingsbury, B.3
Ramabhadran, B.4
-
5
-
-
84911473441
-
Convolutional neural networks for speech recognition
-
O. Abdel-Hamid, A.-R. Mohamed, H. Jiang, L. Deng, G. Penn, and D. Yu, "Convolutional neural networks for speech recognition, "IEEE/ACM Transactions on Audio, Speech and LanguageProcessing (TASLP), vol. 22, no. 10, pp. 1533-1545, 2014.
-
(2014)
IEEE/ACM Transactions on Audio, Speech and LanguageProcessing (TASLP)
, vol.22
, Issue.10
, pp. 1533-1545
-
-
Abdel-Hamid, O.1
Mohamed, A.-R.2
Jiang, H.3
Deng, L.4
Penn, G.5
Yu, D.6
-
7
-
-
84905265980
-
Joint training of convolutionaland non-convolutional neural networks
-
H. Soltau, G. Saon, and T. N. Sainath, "Joint training of convolutionaland non-convolutional neural networks, " in Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE InternationalConference on. IEEE, 2014, pp. 5572-5576.
-
(2014)
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE InternationalConference On. IEEE
, pp. 5572-5576
-
-
Soltau, H.1
Saon, G.2
Sainath, T.N.3
-
8
-
-
84890543083
-
Speech recognitionwith deep recurrent neural networks
-
A. Graves, A.-R. Mohamed, and G. Hinton, "Speech recognitionwith deep recurrent neural networks, " in Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conferenceon. IEEE, 2013, pp. 6645-6649.
-
(2013)
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conferenceon. IEEE
, pp. 6645-6649
-
-
Graves, A.1
Mohamed, A.-R.2
Hinton, G.3
-
9
-
-
84878409063
-
Recurrent neural networks for noise reduction in robustasr
-
A. L. Maas, Q. V. Le, T. M. O'Neil, O. Vinyals, P. Nguyen, and A. Y. Ng, "Recurrent neural networks for noise reduction in robustasr, " in Thirteenth Annual Conference of the International SpeechCommunication Association (INTERSPEECH). ISCA, 2012.
-
(2012)
Thirteenth Annual Conference of the International SpeechCommunication Association (INTERSPEECH). ISCA
-
-
Maas, A.L.1
Le, Q.V.2
O'Neil, T.M.3
Vinyals, O.4
Nguyen, P.5
Ng, A.Y.6
-
10
-
-
0028392483
-
Learning long-term dependencieswith gradient descent is difficult
-
Y. Bengio, P. Simard, and P. Frasconi, "Learning long-term dependencieswith gradient descent is difficult, " Neural Networks, IEEE Transactions on, vol. 5, no. 2, pp. 157-166, 1994.
-
(1994)
Neural Networks, IEEE Transactions on
, vol.5
, Issue.2
, pp. 157-166
-
-
Bengio, Y.1
Simard, P.2
Frasconi, P.3
-
11
-
-
0031573117
-
Long short-term memory
-
S. Hochreiter and J. Schmidhuber, "Long short-term memory, "Neural computation, vol. 9, no. 8, pp. 1735-1780, 1997.
-
(1997)
Neural Computation
, vol.9
, Issue.8
, pp. 1735-1780
-
-
Hochreiter, S.1
Schmidhuber, J.2
-
12
-
-
84893701254
-
Hybrid speech recognitionwith deep bidirectional lstm
-
A. Graves, N. Jaitly, and A.-R. Mohamed, "Hybrid speech recognitionwith deep bidirectional lstm, " in Automatic Speech Recognitionand Understand ing (ASRU), 2013 IEEE Workshop on. IEEE, 2013, pp. 273-278.
-
(2013)
Automatic Speech Recognitionand Understand Ing (ASRU), 2013 IEEE Workshop On. IEEE
, pp. 273-278
-
-
Graves, A.1
Jaitly, N.2
Mohamed, A.-R.3
-
14
-
-
84910072094
-
Sequence discriminative distributedtraining of long short-term memory recurrent neural networks
-
H. Sak, O. Vinyals, G. Heigold, A. Senior, E. McDermott, R. Monga, and M. Mao, "Sequence discriminative distributedtraining of long short-term memory recurrent neural networks, "in Fifteenth Annual Conference of the International Speech CommunicationAssociation (INTERSPEECH). ISCA, 2014.
-
(2014)
Fifteenth Annual Conference of the International Speech CommunicationAssociation (INTERSPEECH). ISCA
-
-
Sak, H.1
Vinyals, O.2
Heigold, G.3
Senior, A.4
McDermott, E.5
Monga, R.6
Mao, M.7
-
16
-
-
85083953021
-
-
arXiv preprint arXiv: 1301. 3605
-
D. Yu, M. L. Seltzer, J. Li, J.-T. Huang, and F. Seide, "Featurelearning in deep neural networks-studies on speech recognitiontasks, " arXiv preprint arXiv: 1301. 3605, 2013.
-
(2013)
Featurelearning in Deep Neural Networks-studies on Speech Recognitiontasks
-
-
Yu, D.1
Seltzer, M.L.2
Li, J.3
Huang, J.-T.4
Seide, F.5
-
18
-
-
84890521103
-
Speaker adaptation of context dependent deep neuralnetworks
-
H. Liao, "Speaker adaptation of context dependent deep neuralnetworks, " in Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. IEEE, 2013, pp. 7947-7951.
-
(2013)
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference On. IEEE
, pp. 7947-7951
-
-
Liao, H.1
-
19
-
-
84890542079
-
Kl-divergence regularizeddeep neural network adaptation for improved large vocabularyspeech recognition
-
D. Yu, K. Yao, H. Su, G. Li, and F. Seide, "Kl-divergence regularizeddeep neural network adaptation for improved large vocabularyspeech recognition, " in Acoustics, Speech and SignalProcessing (ICASSP), 2013 IEEE International Conference on. IEEE, 2013, pp. 7893-7897.
-
(2013)
Acoustics, Speech and SignalProcessing (ICASSP), 2013 IEEE International Conference On. IEEE
, pp. 7893-7897
-
-
Yu, D.1
Yao, K.2
Su, H.3
Li, G.4
Seide, F.5
-
21
-
-
84874226579
-
Adaptationof context-dependent deep neural networks for automaticspeech recognition
-
K. Yao, D. Yu, F. Seide, H. Su, L. Deng, and Y. Gong, "Adaptationof context-dependent deep neural networks for automaticspeech recognition, " in 2012 IEEE Spoken Language TechnologyWorkshop (SLT). IEEE, 2012.
-
(2012)
2012 IEEE Spoken Language TechnologyWorkshop (SLT). IEEE
-
-
Yao, K.1
Yu, D.2
Seide, F.3
Su, H.4
Deng, L.5
Gong, Y.6
-
22
-
-
84858976070
-
Feature engineeringin context-dependent deep neural networks for conversationalspeech transcription
-
F. Seide, G. Li, X. Chen, and D. Yu, "Feature engineeringin context-dependent deep neural networks for conversationalspeech transcription, " in Automatic Speech Recognition and Understand ing(ASRU), 2011 IEEE Workshop on. IEEE, 2011, pp. 24-29.
-
(2011)
Automatic Speech Recognition and Understand Ing(ASRU), 2011 IEEE Workshop On. IEEE
, pp. 24-29
-
-
Seide, F.1
Li, G.2
Chen, X.3
Yu, D.4
-
23
-
-
84906241049
-
Improved featureprocessing for deep neural networks
-
S. P. Rath, D. Povey, K. Vesely, and J. Cernocky, "Improved featureprocessing for deep neural networks, " in Fourteenth AnnualConference of the International Speech Communication Association(INTERSPEECH). ISCA, 2013, pp. 109-113.
-
(2013)
Fourteenth AnnualConference of the International Speech Communication Association(INTERSPEECH). ISCA
, pp. 109-113
-
-
Rath, S.P.1
Povey, D.2
Vesely, K.3
Cernocky, J.4
-
24
-
-
84946046160
-
Regularizing dnn acousticmodels with Gaussian stochastic neurons
-
H. Zhang, Y. Miao, and F. Metze, "Regularizing dnn acousticmodels with Gaussian stochastic neurons, " in Acoustics, Speechand Signal Processing (ICASSP), 2015 IEEE International Conferenceon. IEEE, 2015, pp. 4964-4968.
-
(2015)
Acoustics, Speechand Signal Processing (ICASSP), 2015 IEEE International Conferenceon. IEEE
, pp. 4964-4968
-
-
Zhang, H.1
Miao, Y.2
Metze, F.3
-
25
-
-
84893691530
-
Speaker adaptationof neural network acoustic models using i-vectors
-
G. Saon, H. Soltau, D. Nahamoo, and M. Picheny, "Speaker adaptationof neural network acoustic models using i-vectors, " in AutomaticSpeech Recognition and Understand ing (ASRU), 2013IEEE Workshop on. IEEE, 2013, pp. 55-59.
-
(2013)
AutomaticSpeech Recognition and Understand Ing (ASRU), 2013IEEE Workshop On. IEEE
, pp. 55-59
-
-
Saon, G.1
Soltau, H.2
Nahamoo, D.3
Picheny, M.4
-
26
-
-
84905259138
-
Improving dnn speaker independencewith i-vector inputs
-
A. Senior and I. Lopez-Moreno, "Improving dnn speaker independencewith i-vector inputs, " in Acoustics, Speech and SignalProcessing (ICASSP), 2014 IEEE International Conference on. IEEE, 2014, pp. 225-229.
-
(2014)
Acoustics, Speech and SignalProcessing (ICASSP), 2014 IEEE International Conference On. IEEE
, pp. 225-229
-
-
Senior, A.1
Lopez-Moreno, I.2
-
28
-
-
84946685505
-
Improvements tospeaker adaptive training of deep neural networks
-
Y. Miao, L. Jiang, H. Zhang, and F. Metze, "Improvements tospeaker adaptive training of deep neural networks, " in 2014 IEEESpoken Language Technology Workshop (SLT). IEEE, 2014.
-
(2014)
2014 IEEESpoken Language Technology Workshop (SLT). IEEE
-
-
Miao, Y.1
Jiang, L.2
Zhang, H.3
Metze, F.4
-
29
-
-
0041965934
-
Learningprecise timing with lstm recurrent networks
-
F. A. Gers, N. N. Schraudolph, and J. Schmidhuber, "Learningprecise timing with lstm recurrent networks, " The Journal of MachineLearning Research, vol. 3, pp. 115-143, 2003.
-
(2003)
The Journal of MachineLearning Research
, vol.3
, pp. 115-143
-
-
Gers, F.A.1
Schraudolph, N.N.2
Schmidhuber, J.3
-
30
-
-
84858953642
-
The kaldi speech recognitiontoolkit
-
D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek, N. Goel, M. Hannemann, P. Motlcek, Y. Qian, P. Schwarz, J. Silovský, G. Stemmer, and K. Veselý, "The kaldi speech recognitiontoolkit, " in Automatic Speech Recognition and Understand ing(ASRU), 2011 IEEE Workshop on. IEEE, 2011, pp. 1-4.
-
(2011)
Automatic Speech Recognition and Understand Ing(ASRU), 2011 IEEE Workshop On. IEEE
, pp. 1-4
-
-
Povey, D.1
Ghoshal, A.2
Boulianne, G.3
Burget, L.4
Glembek, O.5
Goel, N.6
Hannemann, M.7
Motlcek, P.8
Qian, Y.9
Schwarz, P.10
Silovský, J.11
Stemmer, G.12
Veselý, K.13
-
31
-
-
0032050110
-
Maximum likelihood linear transformations forhmm-based speech recognition
-
M. J. Gales, "Maximum likelihood linear transformations forhmm-based speech recognition, " Computer speech & language, vol. 12, no. 2, pp. 75-98, 1998.
-
(1998)
Computer Speech & Language
, vol.12
, Issue.2
, pp. 75-98
-
-
Gales, M.J.1
-
32
-
-
33745805403
-
A fast learning algorithmfor deep belief nets
-
G. Hinton, S. Osindero, and Y.-W. Teh, "A fast learning algorithmfor deep belief nets, " Neural computation, vol. 18, no. 7, pp. 1527-1554, 2006.
-
(2006)
Neural Computation
, vol.18
, Issue.7
, pp. 1527-1554
-
-
Hinton, G.1
Osindero, S.2
Teh, Y.-W.3
-
34
-
-
84910068915
-
Combination of fst and cn search in spoken term detection
-
J. Chiu, Y. Wang, J. Trmal, D. Povey, G. Chen, and A. Rudnicky, "Combination of fst and cn search in spoken term detection, " inFifteenth Annual Conference of the International Speech CommunicationAssociation (INTERSPEECH). ISCA, 2014.
-
(2014)
Fifteenth Annual Conference of the International Speech CommunicationAssociation (INTERSPEECH). ISCA
-
-
Chiu, J.1
Wang, Y.2
Trmal, J.3
Povey, D.4
Chen, G.5
Rudnicky, A.6
|