-
1
-
-
84055222005
-
Context-dependent pre-trained deep neural networks for large vocabulary speech recognition
-
G. E. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large vocabulary speech recognition," IEEE Trans. on Audio, Speech, and Language Processing, vol. 20, no. 1, pp. 30-42, 2012.
-
(2012)
IEEE Trans. on Audio, Speech, and Language Processing
, vol.20
, Issue.1
, pp. 30-42
-
-
Dahl, G.E.1
Yu, D.2
Deng, L.3
Acero, A.4
-
3
-
-
84865801985
-
Conversational speech transcription using context-dependent deep neural networks
-
F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks," in Proc. Interspeech'11, pp. 437-440, 2011.
-
(2011)
Proc. Interspeech'11
, pp. 437-440
-
-
Seide, F.1
Li, G.2
Yu, D.3
-
4
-
-
44049108531
-
Automated directory assistance system-From theory to practice
-
D. Yu, Y.-C. Ju, Y.-Y. Wang, G. Zweig, and A. Acero, "Automated Directory Assistance System-from Theory to Practice", in Proc. Interspeech'07, pp. 2709-2712, 2007.
-
(2007)
Proc. Interspeech'07
, pp. 2709-2712
-
-
Yu, D.1
Ju, Y.-C.2
Wang, Y.-Y.3
Zweig, G.4
Acero, A.5
-
5
-
-
84055211743
-
Acoustic modeling using deep belief networks
-
A. Mohamed, G. Dahl, and G. Hinton, "Acoustic modeling using deep belief networks," IEEE Trans. on Audio, Speech, and Language Processing, vol. 20, no. 1, pp. 14-22, 2012.
-
(2012)
IEEE Trans. on Audio, Speech, and Language Processing
, vol.20
, Issue.1
, pp. 14-22
-
-
Mohamed, A.1
Dahl, G.2
Hinton, G.3
-
6
-
-
84878539964
-
Application of pretrained deep neural networks to large vocabulary speech recognition
-
N. Jaitly, P. Nguyen, and V. Vanhoucke, "Application of pretrained deep neural networks to large vocabulary speech recognition", in Proc. Interspeech'12, 2012.
-
(2012)
Proc. Interspeech'12
-
-
Jaitly, N.1
Nguyen, P.2
Vanhoucke, V.3
-
7
-
-
84858972572
-
Making deep belief networks effective for large vocabulary continuous speech recognition
-
T. N. Sainath, B. Kingsbury, B. Ramabhadran, P. Fousek, P. Novak, and A.-r. Mohamed, "Making deep belief networks effective for large vocabulary continuous speech recognition", in Proc. ASRU'11, pp. 30-35, 2011.
-
(2011)
Proc. ASRU'11
, pp. 30-35
-
-
Sainath, T.N.1
Kingsbury, B.2
Ramabhadran, B.3
Fousek, P.4
Novak, P.5
Mohamed, A.-R.6
-
8
-
-
84878379108
-
Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization
-
B. Kingsbury, T. N. Sainath, and H. Soltau, "Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization," in Proc. Interspeech'12, 2012.
-
(2012)
Proc. Interspeech'12
-
-
Kingsbury, B.1
Sainath, T.N.2
Soltau, H.3
-
9
-
-
84890543852
-
Error back propagation for sequence training of context-dependent deep networks for conversational speech transcription
-
H. Su, G. Li, D. Yu, and F. Seide, "Error back propagation for sequence training of context-dependent deep networks for conversational speech transcription", in Proc. ICASSP, 2013.
-
(2013)
Proc. ICASSP
-
-
Su, H.1
Li, G.2
Yu, D.3
Seide, F.4
-
10
-
-
84890492030
-
An investigation of deep neural networks for noise robust speech recognition
-
M. Seltzer, D. Yu, and Y. Wang, "An investigation of deep neural networks for noise robust speech recognition", in Proc. ICASSP, 2013.
-
(2013)
Proc. ICASSP
-
-
Seltzer, M.1
Yu, D.2
Wang, Y.3
-
11
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition
-
G. Hinton, L. Deng, D. Yu, G. Dahl, A.-R. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. Sainath, and B. Kingsbury, "Deep neural networks for acoustic modeling in speech recognition," IEEE Signal Processing Magazine, 2012.
-
(2012)
IEEE Signal Processing Magazine
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.4
Mohamed, A.-R.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.10
Kingsbury, B.11
-
12
-
-
84937880519
-
Connectionist speaker normalization and adaptation
-
V. Abrash, H. Franco, A. Sankar, and M. Cohen, "Connectionist speaker normalization and adaptation," in Proc. EUROSPEECH'95, pp. 2183-2186, 1995.
-
(1995)
Proc. EUROSPEECH'95
, pp. 2183-2186
-
-
Abrash, V.1
Franco, H.2
Sankar, A.3
Cohen, M.4
-
13
-
-
84937854847
-
Speaker-adaptation for hybrid HMM-ANN continuous speech recognition system
-
J. Neto, L. Almeida, M. Hochberg, C. Martins, L. Nunes, S. Renals, and T. Robinson, "Speaker-adaptation for hybrid HMM-ANN continuous speech recognition system," in Proc. EUROSPEECH'95, pp. 2171-2174, 1995.
-
(1995)
Proc. EUROSPEECH'95
, pp. 2171-2174
-
-
Neto, J.1
Almeida, L.2
Hochberg, M.3
Martins, C.4
Nunes, L.5
Renals, S.6
Robinson, T.7
-
14
-
-
0343476363
-
Hybrid HMM-NN modeling of stationary-transitional units for continuous speech recognition
-
D. Albesano, R. Gemello, and F. Mana, "Hybrid HMM-NN modeling of stationary-transitional units for continuous speech recognition", in Proc. NIPS'97, pp. 1112-1115, 1997.
-
(1997)
Proc. NIPS'97
, pp. 1112-1115
-
-
Albesano, D.1
Gemello, R.2
Mana, F.3
-
15
-
-
79959849500
-
Comparison of discriminative input and output transformations for speaker adaptation in the Hybrid NN/HMM systems
-
B. Li and K. C. Sim, "Comparison of discriminative input and output transformations for speaker adaptation in the Hybrid NN/HMM systems", in Proc. Interspeech'10, pp. 526-529, 2010.
-
(2010)
Proc. Interspeech'10
, pp. 526-529
-
-
Li, B.1
Sim, K.C.2
-
16
-
-
34548012893
-
Linear hidden transformations for adaptation of hybrid ANN/HMM models
-
R. Gemello, F. Mana, S. Scanzio, P. Laface, and R. De Mori, "Linear hidden transformations for adaptation of hybrid ANN/HMM models", Speech Communication, vol. 49, no. 10, pp. 827-835, 2007.
-
(2007)
Speech Communication
, vol.49
, Issue.10
, pp. 827-835
-
-
Gemello, R.1
Mana, F.2
Scanzio, S.3
Laface, P.4
De Mori, R.5
-
17
-
-
84865740155
-
Improving LVCSR system combination using neural network language model cross adaptation
-
X. Liu, M. J. F. Gales, and P. C. Woodland, "Improving LVCSR system combination using neural network language model cross adaptation," in Proc. Interspeech'11, pp. 2857-2860, 2011.
-
(2011)
Proc. Interspeech'11
, pp. 2857-2860
-
-
Liu, X.1
Gales, M.J.F.2
Woodland, P.C.3
-
18
-
-
84878535870
-
A initial attempt on task-specific adaptation for deep neural network-based large vocabulary continuous speech recognition
-
Y. Xiao, Z. Zhang, S. Cai, J. Pan, and Y. Yan, "A initial attempt on task-specific adaptation for deep neural network-based large vocabulary continuous speech recognition", in Proc. Interspeech'12, 2012.
-
(2012)
Proc. Interspeech'12
-
-
Xiao, Y.1
Zhang, Z.2
Cai, S.3
Pan, J.4
Yan, Y.5
-
19
-
-
78049310851
-
Adaptation of a feedforward artificial neural network using a linear transform
-
J. Trmal, J. Zelinka, and L. Müller, "Adaptation of a feedforward artificial neural network using a linear transform," Text, Speech and Dialogue. Springer Berlin/Heidelberg, pp. 423-430, 2010.
-
(2010)
Text, Speech and Dialogue. Springer Berlin/Heidelberg
, pp. 423-430
-
-
Trmal, J.1
Zelinka, J.2
Müller, L.3
-
20
-
-
84874226579
-
Adaptation of context-dependent deep neural networks for automatic speech recognition
-
K. Yao, D. Yu, F. Seide, H. Su, L. Deng, and Y. Gong, "Adaptation of context-dependent deep neural networks for automatic speech recognition", in Proc. SLT'12, 2012.
-
(2012)
Proc. SLT'12
-
-
Yao, K.1
Yu, D.2
Seide, F.3
Su, H.4
Deng, L.5
Gong, Y.6
-
21
-
-
33646794050
-
Two-stage speaker adaptation of hybrid tied-posterior acoustic models
-
J. Stadermann and G. Rigoll, "Two-stage speaker adaptation of hybrid tied-posterior acoustic models," in Proc. ICASSP'05, vol. I, pp. 997-1000, 2005.
-
(2005)
Proc. ICASSP'05
, vol.1
, pp. 997-1000
-
-
Stadermann, J.1
Rigoll, G.2
-
22
-
-
40649088651
-
Adaptation of artificial neural networks avoiding catastrophic forgetting
-
D. Albesano, R. Gemello, P. Laface, F. Mana, and S. Scanzio, "Adaptation of artificial neural networks avoiding catastrophic forgetting," in Proc. Int. Jnt. Conference on Neural Networks 2006, pp. 2863-2870, 2006.
-
(2006)
Proc. Int. Jnt. Conference on Neural Networks 2006
, pp. 2863-2870
-
-
Albesano, D.1
Gemello, R.2
Laface, P.3
Mana, F.4
Scanzio, S.5
-
23
-
-
33947635130
-
Regularized adaptation of discriminative classifiers
-
X. Li and J. Bilmes, "Regularized adaptation of discriminative classifiers," in Proc. ICASSP'06, 2006.
-
(2006)
Proc. ICASSP'06
-
-
Li, X.1
Bilmes, J.2
-
24
-
-
0033677005
-
Fast speaker adaptation of artificial neural networks for automatic speech recognition
-
S. Dupont and L. Cheboub, "Fast speaker adaptation of artificial neural networks for automatic speech recognition", in Proc. ICASSP'00, vol.3, pp. 1795-1798, 2000.
-
(2000)
Proc. ICASSP'00
, vol.3
, pp. 1795-1798
-
-
Dupont, S.1
Cheboub, L.2
-
25
-
-
84858976070
-
Feature engineering in context-dependent deep neural networks for conversational speech transcription
-
F. Seide, G. Li, X. Chen, and D. Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription," in Proc. ASRU'11, 2011.
-
(2011)
Proc. ASRU'11
-
-
Seide, F.1
Li, G.2
Chen, X.3
Yu, D.4
-
27
-
-
84871387302
-
The deep tensor neural network with applications to large vocabulary speech recognition
-
D. Yu, L. Deng, and F. Seide, "The deep tensor neural network with applications to large vocabulary speech recognition", IEEE Trans. on Audio, Speech, and Language Processing, 2013.
-
(2013)
IEEE Trans. on Audio, Speech, and Language Processing
-
-
Yu, D.1
Deng, L.2
Seide, F.3
-
29
-
-
68549140008
-
A novel framework and training algorithm for variable-parameter hidden Markov models
-
D. Yu, L. Deng, Y. Gong, and A. Acero, "A novel framework and training algorithm for variable-parameter hidden Markov models", IEEE Trans. on Audio, Speech, and Language Processing, vol. 17, no. 7, pp. 1348-1360, 2009.
-
(2009)
IEEE Trans. on Audio, Speech, and Language Processing
, vol.17
, Issue.7
, pp. 1348-1360
-
-
Yu, D.1
Deng, L.2
Gong, Y.3
Acero, A.4
-
30
-
-
79959853780
-
On speaker adaptive training of artificial neural networks
-
J. Trmal, J. Zelinka, and L. Müller, "On speaker adaptive training of artificial neural networks", in Proc. Interspeech'10, pp. 554-557, 2010.
-
(2010)
Proc. Interspeech'10
, pp. 554-557
-
-
Trmal, J.1
Zelinka, J.2
Müller, L.3