-
2
-
-
84893701254
-
Hybrid speech recognition with deep bidirectional LSTM
-
A. Graves, N. Jaitly, and A. Mohamed, Hybrid speech recognition with deep bidirectional LSTM, in Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on. IEEE, 2013, pp. 273-278
-
(2013)
Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop On. IEEE
, pp. 273-278
-
-
Graves, A.1
Jaitly, N.2
Mohamed, A.3
-
3
-
-
84910046405
-
Long short-term memory recurrent neural network architectures for large scale acoustic modeling
-
H. Sak, A. Senior, and F. Beaufays, Long Short-Term Memory Recurrent Neural Network Architectures for Large Scale Acoustic Modeling, in INTERSPEECH 2014, 2014
-
(2014)
INTERSPEECH 2014
-
-
Sak, H.1
Senior, A.2
Beaufays, F.3
-
4
-
-
84910072094
-
Sequence discriminative distributed training of long short-term memory recurrent neural networks
-
H. Sak, O. Vinyals, G. Heigold, A. Senior, E. McDermott, R. Monga, and M. Mao, Sequence discriminative distributed training of long short-term memory recurrent neural networks, in Interspeech, 2014
-
(2014)
Interspeech
-
-
Sak, H.1
Vinyals, O.2
Heigold, G.3
Senior, A.4
McDermott, E.5
Monga, R.6
Mao, M.7
-
6
-
-
33749259827
-
Connectionist temporal classification: Labelling unsegmented sequence data with recurrent neural networks
-
A. Graves, S. Fernández, F. Gomez, and J. Schmidhuber, Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks, in Proceedings of the 23rd international conference on Machine learning. ACM, 2006, pp. 369-376
-
(2006)
Proceedings of the 23rd International Conference on Machine Learning. ACM
, pp. 369-376
-
-
Graves, A.1
Fernández, S.2
Gomez, F.3
Schmidhuber, J.4
-
8
-
-
84865801985
-
Conversational speech transcription using context-dependent deep neural networks
-
F. Seide, G. Li, and D. Yu, Conversational speech transcription using context-dependent deep neural networks, in INTERSPEECH, 2011, pp. 437-440
-
(2011)
INTERSPEECH
, pp. 437-440
-
-
Seide, F.1
Li, G.2
Yu, D.3
-
9
-
-
84905237729
-
Context-dependent pre-trained deep neural networks for large vocabulary speech recognition
-
G. Dahl, D. Yu, and L. Deng, Context-dependent pre-trained deep neural networks for large vocabulary speech recognition, in IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2011
-
(2011)
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Dahl, G.1
Yu, D.2
Deng, L.3
-
10
-
-
84055222005
-
Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
-
Jan
-
G. E. Dahl, D. Yu, L. Deng, and A. Acero, Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition, IEEE Transactions on Audio, Speech & Language Processing, vol. 20, no. 1, pp. 30-42, Jan. 2012. [Online]. Available: http://dx.doi.org/10.1109/TASL.2011.2134090
-
(2012)
IEEE Transactions on Audio, Speech & Language Processing
, vol.20
, Issue.1
, pp. 30-42
-
-
Dahl, G.E.1
Yu, D.2
Deng, L.3
Acero, A.4
-
11
-
-
84878539964
-
Application of pretrained deep neural networks to large vocabulary speech recognition
-
N. Jaitly, P. Nguyen, A. Senior, and V. Vanhoucke, Application of pretrained deep neural networks to large vocabulary speech recognition, in INTERSPEECH, 2012
-
(2012)
INTERSPEECH
-
-
Jaitly, N.1
Nguyen, P.2
Senior, A.3
Vanhoucke, V.4
-
12
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
-
G. Hinton, L. Deng, D. Yu, G. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. Sainath, and B. Kingsbury, Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups, IEEE Signal Process. Mag., vol. 29, no. 6, pp. 82-97, 2012
-
(2012)
IEEE Signal Process. Mag
, vol.29
, Issue.6
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.4
Mohamed, A.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.10
Kingsbury, B.11
-
13
-
-
0024610919
-
A tutorial on hidden Markov models and selected applications in speech recognition
-
Feb
-
L. R. Rabiner, A tutorial on hidden Markov models and selected applications in speech recognition, Proceedings of the IEEE, vol. 77, no. 2, pp. 257-286, Feb. 1989
-
(1989)
Proceedings of the IEEE
, vol.77
, Issue.2
, pp. 257-286
-
-
Rabiner, L.R.1
-
14
-
-
0003459132
-
-
Ph.D. dissertation, McGill University, Montreal, Canada
-
Y. Normandin, Hidden Markov models, maximum mutual information, and the speech recognition problem, Ph.D. dissertation, McGill University, Montreal, Canada, 1991
-
(1991)
Hidden Markov Models, Maximum Mutual Information, and the Speech Recognition Problem
-
-
Normandin, Y.1
-
16
-
-
70349213445
-
Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling
-
Taipei, Taiwan, Apr
-
B. Kingsbury, Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling, in IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Taipei, Taiwan, Apr. 2009, pp. 3761-3764
-
(2009)
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
, pp. 3761-3764
-
-
Kingsbury, B.1
-
17
-
-
84878379108
-
Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization
-
B. Kingsbury, T. N. Sainath, and H. Soltau, Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization, in INTERSPEECH, 2012
-
(2012)
INTERSPEECH
-
-
Kingsbury, B.1
Sainath, T.N.2
Soltau, H.3
-
18
-
-
84890543852
-
Error back propagation for sequence training of context-dependent deep networks for conversational speech transcription
-
H. Su, G. Li, D. Yu, and F. Seide, Error back propagation for sequence training of context-dependent deep networks for conversational speech transcription, in IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2013, pp. 6664-6668
-
(2013)
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
, pp. 6664-6668
-
-
Su, H.1
Li, G.2
Yu, D.3
Seide, F.4
-
19
-
-
84906274730
-
Sequence-discriminative training of deep neural networks
-
K. Veselý, A. Ghoshal, L. Burget, and D. Povey, Sequence-discriminative training of deep neural networks, in INTERSPEECH, 2013
-
(2013)
INTERSPEECH
-
-
Veselý, K.1
Ghoshal, A.2
Burget, L.3
Povey, D.4
-
20
-
-
80051640064
-
-
Ph.D. dissertation, RWTH Aachen University, Aachen, Germany, Jun
-
G. Heigold, A log-linear discriminative modeling framework for speech recognition, Ph.D. dissertation, RWTH Aachen University, Aachen, Germany, Jun. 2010
-
(2010)
A Log-linear Discriminative Modeling Framework for Speech Recognition
-
-
Heigold, G.1
-
21
-
-
0031268931
-
Bidirectional recurrent neural networks
-
M. Schuster and K. K. Paliwal, Bidirectional recurrent neural networks, Signal Processing, IEEE Transactions on, vol. 45, no. 11, pp. 2673-2681, 1997
-
(1997)
Signal Processing, IEEE Transactions on
, vol.45
, Issue.11
, pp. 2673-2681
-
-
Schuster, M.1
Paliwal, K.K.2
-
22
-
-
77949404053
-
From speech to letters using a novel neural network architecture for grapheme based ASR
-
F. Eyben, M. Wollmer, B. Schuller, and A. Graves, From speech to letters using a novel neural network architecture for grapheme based ASR, in Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on. IEEE, 2009, pp. 376-380
-
(2009)
Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop On. IEEE
, pp. 376-380
-
-
Eyben, F.1
Wollmer, M.2
Schuller, B.3
Graves, A.4
-
23
-
-
84867135575
-
Building high-level features using large scale unsupervised learning
-
Q. Le, M. Ranzato, R. Monga, M. Devin, K. Chen, G. Corrado, J. Dean, and A. Ng, Building high-level features using large scale unsupervised learning, in International Conference on Machine Learning, 2012, pp. 81-88
-
(2012)
International Conference on Machine Learning
, pp. 81-88
-
-
Le, Q.1
Ranzato, M.2
Monga, R.3
Devin, M.4
Chen, K.5
Corrado, G.6
Dean, J.7
Ng, A.8
-
24
-
-
84877760312
-
Large scale distributed deep networks
-
J. Dean, G. Corrado, R. Monga, K. Chen, M. Devin, Q. Le, M. Mao, M. Ranzato, A. Senior, P. Tucker, K. Yang, and A. Ng, Large scale distributed deep networks, in Advances in Neural Information Processing Systems (NIPS), 2012
-
(2012)
Advances in Neural Information Processing Systems (NIPS)
-
-
Dean, J.1
Corrado, G.2
Monga, R.3
Chen, K.4
Devin, M.5
Le, Q.6
Mao, M.7
Ranzato, M.8
Senior, A.9
Tucker, P.10
Yang, K.11
Ng, A.12
-
25
-
-
84890539009
-
Multilingual acoustic models using distributed deep neural networks
-
Vancouver, Canada, Apr
-
G. Heigold, V. Vanhoucke, A. Senior, P. Nguyen, M. Ranzato, M. Devin, and J. Dean, Multilingual acoustic models using distributed deep neural networks, in IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, Vancouver, Canada, Apr. 2013
-
(2013)
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
, vol.1
-
-
Heigold, G.1
Vanhoucke, V.2
Senior, A.3
Nguyen, P.4
Ranzato, M.5
Devin, M.6
Dean, J.7