-
1
-
-
79959828814
-
Deep-structured hidden conditional random fields for phonetic recognition
-
D. Yu and L. Deng, "Deep-structured hidden conditional random fields for phonetic recognition," in Proc. IN-TERSPEECH, 2010, pp. 2986-2989.
-
(2010)
Proc. IN-TERSPEECH
, pp. 2986-2989
-
-
Yu, D.1
Deng, L.2
-
2
-
-
84055222005
-
Contextdependent pre-trained deep neural networks for large vocabulary speech recognition
-
G. Dahl, D. Yu, L. Deng, and A. Acero, "Contextdependent pre-trained deep neural networks for large vocabulary speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 1, pp. 30-42, 2012.
-
(2012)
IEEE Trans. Audio, Speech, Lang. Process
, vol.20
, Issue.1
, pp. 30-42
-
-
Dahl, G.1
Yu, D.2
Deng, L.3
Acero, A.4
-
3
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition
-
G. Hinton, L. Deng, D. Yu, G. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. Sainath, et al., "Deep neural networks for acoustic modeling in speech recognition," IEEE Signal Process. Mag., vol. 29, no. 11, pp. 2-17, 2012.
-
(2012)
IEEE Signal Process. Mag.
, vol.29
, Issue.11
, pp. 2-17
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.4
Mohamed, A.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.10
-
5
-
-
28244470718
-
The time dimension for scene analysis
-
D. L. Wang, "The time dimension for scene analysis," IEEE Trans. Neural Netw., vol. 16, no. 6, pp. 1401-1426, 2005.
-
(2005)
IEEE Trans. Neural Netw.
, vol.16
, Issue.6
, pp. 1401-1426
-
-
Wang, D.L.1
-
7
-
-
84877762231
-
Exploring monaural features for classification-based speech segregation
-
Y. X. Wang, K. Han, and D. L. Wang, "Exploring monaural features for classification-based speech segregation," IEEE Trans. Audio, Speech, Lang. Process., vol. 1, no. 99, pp. 1-10, 2012.
-
(2012)
IEEE Trans. Audio, Speech, Lang. Process
, vol.1
, Issue.99
, pp. 1-10
-
-
Wang, Y.X.1
Han, K.2
Wang, D.L.3
-
9
-
-
84875678689
-
Towards scaling up classification-based speech separation
-
Y. X. Wang and D. L. Wang, "Towards scaling up classification-based speech separation," IEEE Trans. Audio, Speech, Lang. Process., vol. PP, no. 99, pp. 1-23, 2013.
-
(2013)
IEEE Trans. Audio, Speech, Lang. Process
, vol.PP
, Issue.99
, pp. 1-23
-
-
Wang, Y.X.1
Wang, D.L.2
-
10
-
-
67650137747
-
Discriminative weight training for a statistical model-based voice activity detection
-
S. I. Kang, Q. H. Jo, and J. H. Chang, "Discriminative weight training for a statistical model-based voice activity detection," IEEE Signal Process. Lett., vol. 15, pp. 170-173, 2008.
-
(2008)
IEEE Signal Process. Lett.
, vol.15
, pp. 170-173
-
-
Kang, S.I.1
Jo, Q.H.2
Chang, J.H.3
-
11
-
-
77950091897
-
Voice activity detection based on statistical models and machine learning approaches
-
J. W. Shin, J. H. Chang, and N. S. Kim, "Voice activity detection based on statistical models and machine learning approaches," Computer Speech & Language, vol. 24, no. 3, pp. 515-530, 2010.
-
(2010)
Computer Speech & Language
, vol.24
, Issue.3
, pp. 515-530
-
-
Shin, J.W.1
Chang, J.H.2
Kim, N.S.3
-
12
-
-
77956289831
-
Discriminative training for multiple observation likelihood ratio based voice activity detection
-
T. Yu and J. H. L. Hansen, "Discriminative training for multiple observation likelihood ratio based voice activity detection," IEEE Signal Process. Lett., vol. 17, no. 11, pp. 897-900, 2010.
-
(2010)
IEEE Signal Process. Lett.
, vol.17
, Issue.11
, pp. 897-900
-
-
Yu, T.1
Hansen, J.H.L.2
-
13
-
-
79952611095
-
Maximum margin clustering based statistical VAD with multiple observation compound feature
-
J. Wu and X. L. Zhang, "Maximum margin clustering based statistical VAD with multiple observation compound feature," IEEE Signal Process. Lett., vol. 18, no. 5, pp. 283-286, 2011.
-
(2011)
IEEE Signal Process. Lett.
, vol.18
, Issue.5
, pp. 283-286
-
-
Wu, J.1
Zhang, X.L.2
-
14
-
-
79959756010
-
Efficient multiple kernel support vector machine based voice activity detection
-
J. Wu and X. L. Zhang, "Efficient multiple kernel support vector machine based voice activity detection," IEEE Signal Process. Lett., vol. 18, no. 8, pp. 466-499, 2011.
-
(2011)
IEEE Signal Process. Lett.
, vol.18
, Issue.8
, pp. 466-499
-
-
Wu, J.1
Zhang, X.L.2
-
15
-
-
84890504386
-
Linearithmic time sparse and convex maximum margin clustering
-
X. L. Zhang and J. Wu, "Linearithmic time sparse and convex maximum margin clustering," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 1, no. 99, pp. 1-24, 2012.
-
(2012)
IEEE Trans. Syst., Man, Cybern. B, Cybern.
, vol.1
, Issue.99
, pp. 1-24
-
-
Zhang, X.L.1
Wu, J.2
-
16
-
-
85008579584
-
Multiple acoustic model-based discriminative likelihood ratio weighting for voice activity detection
-
Y. Suh and H. Kim, "Multiple acoustic model-based discriminative likelihood ratio weighting for voice activity detection," IEEE Signal Process. Lett., vol. 19, no. 8, pp. 507-510, 2012.
-
(2012)
IEEE Signal Process. Lett.
, vol.19
, Issue.8
, pp. 507-510
-
-
Suh, Y.1
Kim, H.2
-
17
-
-
84872300403
-
Deep belief networks based voice activity detection
-
X. L. Zhang and J. Wu, "Deep belief networks based voice activity detection," IEEE Trans. Audio, Speech, Lang. Process., vol. 21, no. 4, pp. 3371-3408, 2013.
-
(2013)
IEEE Trans. Audio, Speech, Lang. Process
, vol.21
, Issue.4
, pp. 3371-3408
-
-
Zhang, X.L.1
Wu, J.2
-
18
-
-
33746600649
-
Reducing the dimensionality of data with neural networks
-
G.E. Hinton and R.R. Salakhutdinov, "Reducing the dimensionality of data with neural networks," Science, vol. 313, no. 5786, pp. 504-507, 2006.
-
(2006)
Science
, vol.313
, Issue.5786
, pp. 504-507
-
-
Hinton, G.E.1
Salakhutdinov, R.R.2
-
20
-
-
56449089103
-
Extracting and composing robust features with denoising auto encoders
-
P. Vincent, H. Larochelle, Y. Bengio, and P. A. Manzagol, "Extracting and composing robust features with denoising autoencoders," in Proc. 25th Int. Conf. Mach. Learn., 2008, pp. 1096-1103.
-
(2008)
Proc. 25th Int. Conf. Mach. Learn.
, pp. 1096-1103
-
-
Vincent, P.1
Larochelle, H.2
Bengio, Y.3
Manzagol, P.A.4
-
21
-
-
79551480483
-
Stacked denoising auto encoders: Learning useful representations in a deep network with a local denoising criterion
-
P. Vincent, H. Larochelle, I. Lajoie, Y. Bengio, and P. A. Manzagol, "Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion," J. Mach. Learn. Res., vol. 11, pp. 3371-3408, 2010.
-
(2010)
J. Mach. Learn. Res.
, vol.11
, pp. 3371-3408
-
-
Vincent, P.1
Larochelle, H.2
Lajoie, I.3
Bengio, Y.4
Manzagol, P.A.5
-
22
-
-
0021645331
-
Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator
-
Y. Ephraim and D. Malah, "Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator," IEEE Trans. Acoustic, Speech, Signal Process., vol. 32, no. 6, pp. 1109-1121, 1984.
-
(1984)
IEEE Trans. Acoustic, Speech, Signal Process
, vol.32
, Issue.6
, pp. 1109-1121
-
-
Ephraim, Y.1
Malah, D.2
-
23
-
-
0032762471
-
A statistical model based voice activity detection
-
J. Sohn, N. S. Kim, and W. Sung, "A statistical modelbased voice activity detection," IEEE Signal Process. Lett., vol. 6, no. 1, pp. 1-3, 1999.
-
(1999)
IEEE Signal Process. Lett.
, vol.6
, Issue.1
, pp. 1-3
-
-
Sohn, J.1
Kim, N.S.2
Sung, W.3
-
24
-
-
0041360463
-
Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging
-
Israel Cohen, "Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging," IEEE Trans. Speech, Audio Process., vol. 11, no. 5, pp. 466-475, 2003.
-
(2003)
IEEE Trans. Speech, Audio Process
, vol.11
, Issue.5
, pp. 466-475
-
-
Cohen, I.1
-
25
-
-
23344452899
-
Statistical voice activity detection using a multiple observation likelihood ratio test
-
J. Ramírez, J. C. Segura, C. Benítez, L. García, and A. Rubio, "Statistical voice activity detection using a multiple observation likelihood ratio test," IEEE Signal Process. Lett., vol. 12, no. 10, pp. 689-692, 2005.
-
(2005)
IEEE Signal Process. Lett.
, vol.12
, Issue.10
, pp. 689-692
-
-
Ramírez, J.1
Segura, J.C.2
Benítez, C.3
García, L.4
Rubio, A.5
-
26
-
-
77956547440
-
Simple and efficient multiple kernel learning by group lasso
-
Z. Xu, R. Jin, H. Yang, I. King, and M. R. Lyu, "Simple and efficient multiple kernel learning by group lasso," in Proc. 27th Int. Conf. Mach. Learn., 2010, pp. 1175-1182.
-
(2010)
Proc. 27th Int. Conf. Mach. Learn.
, pp. 1175-1182
-
-
Xu, Z.1
Jin, R.2
Yang, H.3
King, I.4
Lyu, M.R.5
|