-
2
-
-
0033709098
-
Tandem connectionist feature extraction for conventional HMM systems
-
H. Hermansky, D. P. W. Ellis, and S. Sharma, "Tandem connectionist feature extraction for conventional HMM systems, " in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2000, vol. 3, pp. 1635-1638.
-
(2000)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process
, vol.3
, pp. 1635-1638
-
-
Hermansky, H.1
Ellis, D.P.W.2
Sharma, S.3
-
3
-
-
85009097225
-
On using MLP features in LVCSR
-
Q. Zhu, B. Chen, N. Morgan, and A. Stolcke, "On using MLP features in LVCSR, " in Proc. Interspeech, 2004, pp. 921-924.
-
(2004)
Proc. Interspeech
, pp. 921-924
-
-
Zhu, Q.1
Chen, B.2
Morgan, N.3
Stolcke, A.4
-
4
-
-
84055211743
-
Acoustic modeling using deep belief networks
-
Jan
-
A. Mohamed, G. Dahl, and G. Hinton, "Acoustic modeling using deep belief networks, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 20, no. 1, pp. 14 -22, Jan. 2012.
-
(2012)
Audio, Speech, and Language Processing, IEEE Transactions on
, vol.20
, Issue.1
, pp. 14-22
-
-
Mohamed, A.1
Dahl, G.2
Hinton, G.3
-
5
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
-
IEEE
-
G. Hinton, L. Deng, D. Yu, G. E. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, and T. N. Sainath, "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups, " Signal Processing Magazine, IEEE, vol. 29, no. 6, p. 8297, 2012.
-
(2012)
Signal Processing Magazine
, vol.29
, Issue.6
, pp. 8297
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.E.4
Mohamed, A.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.N.10
-
6
-
-
84890492030
-
An investigation of deep neural networks for noise robust speech recognition
-
M. Seltzer, D. Yu and Y. Wang, "An investigation of deep neural networks for noise robust speech recognition" in Proc. ICASSP, pp. 7398-7402, 2013.
-
(2013)
Proc. ICASSP
, pp. 7398-7402
-
-
Seltzer, M.1
Yu, D.2
Wang, Y.3
-
7
-
-
84906214784
-
Exploring convolutional neural network structures and optimization for speech recognition
-
O. Abdel-Hamid, L. Deng, and D. Yu. "Exploring convolutional neural network structures and optimization for speech recognition, " Proc. Interspeech, 2013.
-
(2013)
Proc. Interspeech
-
-
Abdel-Hamid, O.1
Deng, L.2
Yu, D.3
-
9
-
-
84858971297
-
Convolutive bottleneck network features for LVCSR
-
IEEE
-
K. Vesely, M. Karafiat, and Frantisek Grezl, "Convolutive bottleneck network features for LVCSR, " in Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on. IEEE, 2011 pp. 42-47.
-
(2011)
Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
, pp. 42-47
-
-
Vesely, K.1
Karafiat, M.2
Grezl, F.3
-
10
-
-
84867602181
-
Easy does it: Robust spectro-temporal many-stream asr without fine tuning streams
-
S.V Ravuri and N. Morgan "Easy does it: robust spectro-temporal many-stream asr without fine tuning streams", Proc. ICSASP 2012, pp. 4309-4312.
-
(2012)
Proc. ICSASP
, pp. 4309-4312
-
-
Ravuri, S.V.1
Morgan, N.2
-
11
-
-
70349223037
-
An auditory-based feature for robust speech recognition
-
Sep 2009
-
Y. Shao, Z. Jin, D. Wang and S. Srinivasan "An auditory-based feature for robust speech recognition" Proc. Interspeech 2009, Sep 2009, pp. 4625-4628.
-
(2009)
Proc. Interspeech
, pp. 4625-4628
-
-
Shao, Y.1
Jin, Z.2
Wang, D.3
Srinivasan, S.4
-
12
-
-
70349194599
-
Noise adaptive training using a vector Taylor series approach for noise robust automatic speech recognition
-
O. Kalinli, M.L. Seltzer, and A. Acero, "Noise adaptive training using a vector Taylor series approach for noise robust automatic speech recognition, " in Proc. ICASSP, 2009, pp. 3825-3828.
-
(2009)
Proc. ICASSP
, pp. 3825-3828
-
-
Kalinli, O.1
Seltzer, M.L.2
Acero, A.3
-
13
-
-
84867611164
-
Factor analysis based VTS discriminative adaptive training
-
IEEE
-
F. Flego and M. J. F. Gales, "Factor Analysis Based VTS Discriminative Adaptive Training" Proc. ICASSP. IEEE, 2012, pp. 4669-4672.
-
(2012)
Proc. ICASSP
, pp. 4669-4672
-
-
Flego, F.1
Gales, M.J.F.2
-
14
-
-
84867589420
-
Normalized amplitude modulation features for large vocabulary noise-robust speech recognition
-
March
-
H. Franco, M. Graciarena, and A. Mandal, "Normalized amplitude modulation features for large vocabulary noise-robust speech recognition", Proc. ICASSP 2012, March 2012, pp. 4117-4120.
-
(2012)
Proc. ICASSP 2012
, pp. 4117-4120
-
-
Franco, H.1
Graciarena, M.2
Mandal, A.3
-
16
-
-
78049398950
-
Feature extraction for robust speech recognition based on maximizing the sharpness of the power distribution and on power flooring
-
C. Kim and R. M. Stern, "Feature extraction for robust speech recognition based on maximizing the sharpness of the power distribution and on power flooring", in Proc. ICASSP, pp. 4574-4577, 2010.
-
(2010)
Proc. ICASSP
, pp. 4574-4577
-
-
Kim, C.1
Stern, R.M.2
-
17
-
-
85009227802
-
Localized spectro-temporal features for automatic speech recognition
-
Sep
-
M. Kleinschmidt, "Localized spectro-temporal features for automatic speech recognition, " in Proc. of Eurospeech, 2003, Sep 2003, pp. 2573-2576.
-
(2003)
Proc. of Eurospeech, 2003
, pp. 2573-2576
-
-
Kleinschmidt, M.1
-
18
-
-
0032658253
-
Temporal patterns (TRAPs) in ASR of noisy speech
-
March
-
H. Hermansky and S. Sharma, "Temporal patterns (TRAPs) in ASR of noisy speech, " Proc. ICASSP 1999, March 1999, pp. 289-292 vol. 1.
-
(1999)
Proc. ICASSP 1999
, vol.1
, pp. 289-292
-
-
Hermansky, H.1
Sharma, S.2
-
19
-
-
33646799825
-
A neural network for learning long-term temporal features for speech recognition
-
March
-
B.Y. Chen, Q. Zhu, and N. Morgan, "A Neural Network for Learning Long-Term Temporal Features for Speech Recognition, " Proc. ICASSP 2005, March 2005, pp. 945-948.
-
(2005)
Proc. ICASSP 2005
, pp. 945-948
-
-
Chen, B.Y.1
Zhu, Q.2
Morgan, N.3
-
20
-
-
84906221944
-
Informative spectro-temporal bottleneck features for noise-robust speech recognition
-
S.Y. Chang, N. Morgan "Informative spectro-temporal bottleneck features for noise-robust speech recognition", Proc. Interspeech 2013.
-
(2013)
Proc. Interspeech
-
-
Chang, S.Y.1
Morgan, N.2
-
21
-
-
84890543873
-
Investigating deep neural network based transforms of robust audio features for LVCSR
-
E. Bocchieri and D. Dimitriadis "Investigating deep neural network based transforms of robust audio features for LVCSR" in Proc. ICASSP, pp. 6709-6713, 2013.
-
(2013)
Proc. ICASSP
, pp. 6709-6713
-
-
Bocchieri, E.1
Dimitriadis, D.2
-
22
-
-
0032203257
-
Gradient-based learning applied to document recognition
-
Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient-based learning applied to document recognition, " Proceedings of the IEEE, 86(11), 2278-2324, 1998.
-
(1998)
Proceedings of the IEEE
, vol.86
, Issue.11
, pp. 2278-2324
-
-
Lecun, Y.1
Bottou, L.2
Bengio, Y.3
Haffner, P.4
-
23
-
-
67651242353
-
Performance analysis of the Aurora large vocabulary baseline system
-
Vienna, Austria
-
N. Parihar, J. Picone, D. Pearce, H.G. Hirsch, "Performance analysis of the Aurora large vocabulary baseline system, " Proceedings of the European Signal Processing Conference, Vienna, Austria, 2004.
-
(2004)
Proceedings of the European Signal Processing Conference
-
-
Parihar, N.1
Picone, J.2
Pearce, D.3
Hirsch, H.G.4
-
24
-
-
84910089405
-
-
"Renoiser web page, " http://labrosa.ee.columbia.edu/projects/renoiser/create-wsj.html.
-
Renoiser Web Page
-
-
-
26
-
-
78650474133
-
A practical guide to training restricted Boltzmann machines
-
University of Toronto
-
G. Hinton, "A practical guide to training restricted Boltzmann machines, " Tech. Rep. UTML TR 2010-003, University of Toronto, 2010.
-
(2010)
Tech. Rep. UTML TR 2010-003
-
-
Hinton, G.1
|