-
1
-
-
84055222005
-
Context-dependent pre-trained deep neural networks for large vocabulary speech recognition
-
G. E. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large vocabulary speech recognition," IEEE Trans. on Audio, Speech, and Language Processing, vol. 20, no. 1, pp. 30-42, 2012.
-
(2012)
IEEE Trans. on Audio, Speech, and Language Processing
, vol.20
, Issue.1
, pp. 30-42
-
-
Dahl, G.E.1
Yu, D.2
Deng, L.3
Acero, A.4
-
2
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition
-
G. Hinton, L. Deng, D. Yu, G. Dahl, A.-R. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. Sainath, and B. Kingsbury, "Deep neural networks for acoustic modeling in speech recognition," IEEE Signal Processing Magazine, 2012.
-
(2012)
IEEE Signal Processing Magazine
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.4
Mohamed, A.-R.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.10
Kingsbury, B.11
-
3
-
-
0004056285
-
-
Prentice Hall
-
X. Huang, A. Acero, and H.-W. Hon, Spoken Language Processing: A Guide to Theory, Algorithm, and System Development, Prentice Hall, 2001.
-
(2001)
Spoken Language Processing: A Guide to Theory, Algorithm, and System Development
-
-
Huang, X.1
Acero, A.2
Hon, H.-W.3
-
4
-
-
79959849500
-
Comparison of discriminative input and output transformations for speaker adaptation in the hybrid NN/HMM systems
-
B. Li and K. C. Sim, "Comparison of discriminative input and output transformations for speaker adaptation in the hybrid NN/HMM systems," in INTERSPEECH, 2010, pp. 526-529.
-
(2010)
Interspeech
, pp. 526-529
-
-
Li, B.1
Sim, K.C.2
-
5
-
-
33947703156
-
Adaptation of hybrid ANN/HMM models using linear hidden transformations and conservative training
-
R. Gemello, F. Mana, S. Scanzio, P. Laface, and R. De Mori, "Adaptation of hybrid ANN/HMM models using linear hidden transformations and conservative training," in ICASSP, 2006, pp. 1189-1192.
-
(2006)
ICASSP
, pp. 1189-1192
-
-
Gemello, R.1
Mana, F.2
Scanzio, S.3
Laface, P.4
De Mori, R.5
-
6
-
-
84878606732
-
Hermitian based hidden activation functions for adaptation of hybrid HMM/ANN models
-
S. M. Siniscalchi, J. Li, and C.-H. Lee, "Hermitian based hidden activation functions for adaptation of hybrid HMM/ANN models," in INTERSPEECH, 2012.
-
(2012)
Interspeech
-
-
Siniscalchi, S.M.1
Li, J.2
Lee, C.-H.3
-
7
-
-
84937854847
-
Speaker-adaptation for hybrid HMM-ANN continuous speech recognition system
-
J. Neto et al., "Speaker-adaptation for hybrid HMM-ANN continuous speech recognition system," in EUROSPEECH, 1995.
-
(1995)
EUROSPEECH
-
-
Neto, J.1
-
9
-
-
84858976070
-
Feature engineering in context-dependent deep neural networks for conversational speech transcription
-
F. Seide, G. Li, X. Chen, and D. Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription," in ASRU, 2011.
-
(2011)
ASRU
-
-
Seide, F.1
Li, G.2
Chen, X.3
Yu, D.4
-
10
-
-
84858953286
-
Vocal tract length normalization for LVCSR
-
Carnegie Mellon University
-
P. Zhan et al., "Vocal tract length normalization for LVCSR," in Tech. Rep. CMU-LTI-97-150. Carnegie Mellon University, 1997.
-
(1997)
Tech. Rep. CMU-LTI-97-150
-
-
Zhan, P.1
-
11
-
-
0032050110
-
Maximum likelihood linear transformations for HMM-based speech recognition
-
M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Computer Speech and Language, vol. 12, pp. 75-98, 1998.
-
(1998)
Computer Speech and Language
, vol.12
, pp. 75-98
-
-
Gales, M.J.F.1
-
12
-
-
84937880519
-
Connectionist speaker normalization and adaptation
-
V. Abrash, H. Franco, A. Sankar, and M. Cohen, "Connectionist speaker normalization and adaptation," in EUROSPEECH, 1995.
-
(1995)
EUROSPEECH
-
-
Abrash, V.1
Franco, H.2
Sankar, A.3
Cohen, M.4
-
13
-
-
84055211743
-
Acoustic modeling using deep belief networks
-
A. Mohamed, G. Dahl, and G. Hinton, "Acoustic modeling using deep belief networks," IEEE Trans. on Audio, Speech, and Language Processing, vol. 20, no. 1, pp. 14-22, 2012.
-
(2012)
IEEE Trans. on Audio, Speech, and Language Processing
, vol.20
, Issue.1
, pp. 14-22
-
-
Mohamed, A.1
Dahl, G.2
Hinton, G.3
-
14
-
-
0013344078
-
Training products of experts by minimizing contrastive divergence
-
G. E. Hinton, "Training products of experts by minimizing contrastive divergence," Neural Computation, vol. 14, pp. 1771-1800, 2002.
-
(2002)
Neural Computation
, vol.14
, pp. 1771-1800
-
-
Hinton, G.E.1
-
15
-
-
85008035419
-
Equivalence of generative and log-linear models
-
G. Heigold, H. Ney, P. Lehnen, T. Gass, and R. Schluter, "Equivalence of generative and log-linear models," IEEE Trans. on Audio, Speech, and Language Processing, vol. 19, no. 5, pp. 1138-1148, 2011.
-
(2011)
IEEE Trans. on Audio, Speech, and Language Processing
, vol.19
, Issue.5
, pp. 1138-1148
-
-
Heigold, G.1
Ney, H.2
Lehnen, P.3
Gass, T.4
Schluter, R.5
-
16
-
-
33947635130
-
Regularized adaptation of discriminative classifiers
-
X. Li and J. Bilmes, "Regularized adaptation of discriminative classifiers," in ICASSP, 2006.
-
(2006)
ICASSP
-
-
Li, X.1
Bilmes, J.2
-
17
-
-
33646777278
-
A generalization of linear discriminant analysis in maximum likelihood framework
-
Johns Hopkins University, Aug
-
N. Kumar and A. G. Andreou, "A generalization of linear discriminant analysis in maximum likelihood framework," in Tech. Rep. JHU-CLSP Technical Report. Johns Hopkins University, Aug 1996, vol. 16.
-
(1996)
Tech. Rep. JHU-CLSP Technical Report
, pp. 16
-
-
Kumar, N.1
Andreou, A.G.2