-
1
-
-
84055222005
-
Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
-
G. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition," Audio, Speech, and Language Processing, IEEE Transactions on, vol. 20, no. 1, pp. 30-42, Jan. 2012.
-
(2012)
Audio, Speech, and Language Processing, IEEE Transactions on
, vol.20
, Issue.1
, pp. 30-42
-
-
Dahl, G.1
Yu, D.2
Deng, L.3
Acero, A.4
-
2
-
-
84865801985
-
Conversational speech transcription using context-dependent deep neural networks
-
F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks," in Interspeech 2011, 2011.
-
(2011)
Interspeech 2011
-
-
Seide, F.1
Li, G.2
Yu, D.3
-
3
-
-
84858972572
-
Making deep belief networks effective for large vocabulary continuous speech recognition
-
T. N. Sainath, B. Kingsbury, B. Ramabhadran, P. Fousek, P. Novak, and A. Mohamed, "Making deep belief networks effective for large vocabulary continuous speech recognition," in ASRU, 2011.
-
(2011)
ASRU
-
-
Sainath, T.N.1
Kingsbury, B.2
Ramabhadran, B.3
Fousek, P.4
Novak, P.5
Mohamed, A.6
-
4
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
-
G. Hinton, L. Deng, D. Yu, G. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. Sainath, and B. Kingsbury, "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups," Signal Processing Magazine, IEEE, vol. 29, no. 6, pp. 82-97, Nov. 2012.
-
(2012)
Signal Processing Magazine
, vol.29
, Issue.6
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.4
Mohamed, A.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.10
Kingsbury, B.11
-
5
-
-
84871387302
-
The deep tensor neural network with applications to large vocabulary speech recognition
-
D. Yu, L. Deng, and F. Seide, "The deep tensor neural network with applications to large vocabulary speech recognition," IEEE Trans. Audio, Speech, and Language Proc., vol. 21, no. 2, pp. 388-396, 2013.
-
(2013)
IEEE Trans. Audio, Speech, and Language Proc.
, vol.21
, Issue.2
, pp. 388-396
-
-
Yu, D.1
Deng, L.2
Seide, F.3
-
6
-
-
84890545163
-
A deep convolutional neural network using heterogeneous pooling for trading acoustic invariance with phonetic confusion
-
L. Deng, O. Abdel-Hamid, and D. Yu, "A deep convolutional neural network using heterogeneous pooling for trading acoustic invariance with phonetic confusion," in Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on, May 2013.
-
(2013)
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
-
-
Deng, L.1
Abdel-Hamid, O.2
Yu, D.3
-
7
-
-
84890491198
-
Recent advances in deep learning for speech research at Microsoft
-
L. Deng, J. Li, J.-T. Huang, K. Yao, D. Yu, F. Seide, M. Seltzer, G. Zweig, X. He, J. Williams, Y. Gong, and A. Acero, "Recent advances in deep learning for speech research at Microsoft," in Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on, May 2013.
-
(2013)
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
-
-
Deng, L.1
Li, J.2
Huang, J.-T.3
Yao, K.4
Yu, D.5
Seide, F.6
Seltzer, M.7
Zweig, G.8
He, X.9
Williams, J.10
Gong, Y.11
Acero, A.12
-
8
-
-
84873303660
-
Speech recognition using long-span temporal patterns in a deep network model
-
S. M. Siniscalchi, D. Yu, L. Deng, and C.-H. Lee, "Speech recognition using long-span temporal patterns in a deep network model," IEEE Signal Processing Letters, March 2013.
-
(2013)
IEEE Signal Processing Letters
-
-
Siniscalchi, S.M.1
Yu, D.2
Deng, L.3
Lee, C.H.4
-
9
-
-
0030245363
-
From HMM's to segment models: A unified view of stochastic modeling for speech recognition
-
M. Ostendorf, V. Digalakis, and O. Kimball, "From HMM's to segment models: A unified view of stochastic modeling for speech recognition," Speech and Audio Processing, IEEE Transactions on, vol. 4, no. 5, pp. 360-378, 1996.
-
(1996)
Speech and Audio Processing, IEEE Transactions on
, vol.4
, Issue.5
, pp. 360-378
-
-
Ostendorf, M.1
Digalakis, V.2
Kimball, O.3
-
10
-
-
0026854213
-
A generalized hidden Markov model with state-conditioned trend functions of time for the speech signal
-
L. Deng, "A generalized hidden Markov model with state-conditioned trend functions of time for the speech signal," Signal Processing, vol. 27, no. 1, pp. 65-78, 1992.
-
(1992)
Signal Processing
, vol.27
, Issue.1
, pp. 65-78
-
-
Deng, L.1
-
11
-
-
0028516022
-
Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states
-
L. Deng, M. Aksmanovic, X. Sun, and C. Wu, "Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states," Speech and Audio Processing, IEEE Transactions on, vol. 2, no. 4, pp. 507-520, Oct. 1994.
-
(1994)
Speech and Audio Processing, IEEE Transactions on
, vol.2
, Issue.4
, pp. 507-520
-
-
Deng, L.1
Aksmanovic, M.2
Sun, X.3
Wu, C.4
-
12
-
-
77949370075
-
A segmental CRF approach to large vocabulary continuous speech recognition
-
G. Zweig and P. Nguyen, "A segmental CRF approach to large vocabulary continuous speech recognition," in Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on, Dec. 13-17, 2009, pp. 152-157.
-
(2009)
Automatic Speech Recognition & Understanding, 2009
, pp. 152-157
-
-
Zweig, G.1
Nguyen, P.2
-
13
-
-
80051659716
-
Speech recognition with segmental conditional random fields: A summary of the JHU CLSP 2010 summer workshop
-
G. Zweig, P. Nguyen, D. Van-Compernolle, K. Demuynck, L. Atlas, P. Clark, G. Sell, M. Wang, F. Sha, H. Hermansky, D. Karakos, A. Jansen, S. Thomas, G. Sivaram, S. Bowman, and J. Kao, "Speech recognition with segmental conditional random fields: A summary of the JHU CLSP 2010 summer workshop," in Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on, May 2011, pp. 5044-5047.
-
(2011)
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
, pp. 5044-5047
-
-
Zweig, G.1
Nguyen, P.2
Van-Compernolle, D.3
Demuynck, K.4
Atlas, L.5
Clark, P.6
Sell, G.7
Wang, M.8
Sha, F.9
Hermansky, H.10
Karakos, D.11
Jansen, A.12
Thomas, S.13
Sivaram, G.14
Bowman, S.15
Kao, J.16
-
14
-
-
84055211743
-
Acoustic modeling using deep belief networks
-
A. Mohamed, G. Dahl, and G. Hinton, "Acoustic modeling using deep belief networks," Audio, Speech, and Language Processing, IEEE Transactions on, vol. 20, no. 1, pp. 14-22, Jan. 2012.
-
(2012)
Audio, Speech, and Language Processing, IEEE Transactions on
, vol.20
, Issue.1
, pp. 14-22
-
-
Mohamed, A.1
Dahl, G.2
Hinton, G.3
-
15
-
-
84867605836
-
Applying convolutional neural networks concepts to hybrid NN-HMM model for speech recognition
-
O. Abdel-Hamid, A. Mohamed, H. Jiang, and G. Penn, "Applying convolutional neural networks concepts to hybrid NN-HMM model for speech recognition," in Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on, March 2012, pp. 4277-4280.
-
(2012)
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
, pp. 4277-4280
-
-
Abdel-Hamid, O.1
Mohamed, A.2
Jiang, H.3
Penn, G.4
-
16
-
-
84867598637
-
Classification and recognition with direct segment models
-
G. Zweig, "Classification and recognition with direct segment models," in Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on, March 2012, pp. 4161-4164.
-
(2012)
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
, pp. 4161-4164
-
-
Zweig, G.1
-
17
-
-
84906230797
-
Continuous speech recognition using segmental neural nets
-
S. Austin, G. Zavaliagkos, J. Makhoul, and R. Schwartz, "Continuous speech recognition using segmental neural nets," in Neural Networks, 1992. IJCNN., International Joint Conference on, vol. 2, Jun. 1992, pp. 314-319.
-
(1992)
Neural Networks, 1992
, vol.2
, pp. 314-319
-
-
Austin, S.1
Zavaliagkos, G.2
Makhoul, J.3
Schwartz, R.4
-
18
-
-
84878565391
-
Efficient segmental conditional random fields for one-pass phone recognition
-
Y. He and E. Fosler-Lussier, "Efficient segmental conditional random fields for one-pass phone recognition," in Interspeech 2012, 2012.
-
(2012)
Interspeech 2012
-
-
He, Y.1
Fosler-Lussier, E.2
-
20
-
-
79959840616
-
Investigation of full-sequence training of deep belief networks for speech recognition
-
A.-R. Mohamed, D. Yu, and L. Deng, "Investigation of full-sequence training of deep belief networks for speech recognition," in Interspeech, 2010, pp. 2846-2849.
-
(2010)
Interspeech
, pp. 2846-2849
-
-
Mohamed, A.-R.1
Yu, D.2
Deng, L.3
-
21
-
-
79959828814
-
Deep-structured hidden conditional random fields for phonetic recognition
-
D. Yu and L. Deng, "Deep-structured hidden conditional random fields for phonetic recognition," in Interspeech, 2010.
-
(2010)
Interspeech
-
-
Yu, D.1
Deng, L.2