-
1
-
-
84891583985
-
-
John Wiley &Sons, West Sussex, UK
-
T. Virtanen, B. Raj, and R. Singh, Eds., Techniques for Noise Robustness in Automatic Speech Recognition, John Wiley &Sons, West Sussex, UK, 2012
-
(2012)
Techniques for Noise Robustness in Automatic Speech Recognition
-
-
Virtanen, T.1
Raj, B.2
Singh, R.3
-
2
-
-
0028517164
-
RASTA processing of speech
-
H. Hermansky and N. Morgan, "RASTA processing of speech," IEEE Transactions on Speech and Audio Processing, vol. 2, no. 4, pp. 578-589, 1994
-
(1994)
IEEE Transactions on Speech and Audio Processing
, vol.2
, Issue.4
, pp. 578-589
-
-
Hermansky, H.1
Morgan, N.2
-
4
-
-
0032050110
-
Maximum likelihood linear transformations for HMM-based speech recognition
-
M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Computer speech and language, vol. 12, no. 2, pp. 75-98, 1998
-
(1998)
Computer Speech and Language
, vol.12
, Issue.2
, pp. 75-98
-
-
Gales, M.J.F.1
-
5
-
-
0029725301
-
A vector taylor series approach for environment-independent speech recognition
-
P. J. Moreno, B. Raj, and R. M. Stern, "A vector taylor series approach for environment-independent speech recognition," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, 1996, pp. 733-736
-
(1996)
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
, pp. 733-736
-
-
Moreno, P.J.1
Raj, B.2
Stern, R.M.3
-
6
-
-
62249130045
-
A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions
-
J. Li, L. Deng, D. Yu, Y. Gong, and A. Acero, "A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions," Computer, Speech, and Language, vol. 23, pp. 389-405, 2009
-
(2009)
Computer, Speech, and Language
, vol.23
, pp. 389-405
-
-
Li, J.1
Deng, L.2
Yu, D.3
Gong, Y.4
Acero, A.5
-
7
-
-
85032752225
-
Missing-feature approaches in speech recognition
-
B. Raj and R. Stern, "Missing-feature approaches in speech recognition," IEEE Signal Processing Magazine, vol. 22, no. 5, pp. 101-116, 2005
-
(2005)
IEEE Signal Processing Magazine
, vol.22
, Issue.5
, pp. 101-116
-
-
Raj, B.1
Stern, R.2
-
9
-
-
84867584623
-
Improvements to VTS feature enhancement
-
J. Droppo, L. Deng, and A. Acero, "Improvements to VTS feature enhancement," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, 2012, pp. 4677-4680
-
(2012)
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
, pp. 4677-4680
-
-
Droppo, J.1
Deng, L.2
Acero, A.3
-
10
-
-
82255178542
-
-
Wiley/ IEEE Press, Hoboken, NJ
-
D. L. Wang and G. J. Brown, Eds., Computational Auditory Scene Analysis: Principles, Algorithms, and Applications, Wiley/ IEEE Press, Hoboken, NJ, 2006
-
(2006)
Computational Auditory Scene Analysis: Principles, Algorithms, and Applications
-
-
Wang, D.L.1
Brown, G.J.2
-
11
-
-
84892233308
-
On ideal binary masks as the computational goal of auditory scene analysis
-
P. Divenyi, Ed.Kluwer Academic, Boston, MA
-
D. L.Wang, "On ideal binary masks as the computational goal of auditory scene analysis," in Speech Separation by Humans and Machines, P. Divenyi, Ed., pp. 181-197. Kluwer Academic, Boston, MA, 2005
-
(2005)
Speech Separation by Humans and Machines
, pp. 181-197
-
-
Wang, D.L.1
-
12
-
-
84877594942
-
-
Tech. Rep. OSU-CISRC-7/11-TR21, Department of Computer Science and Engineering, The Ohio State University, Columbus, Ohio, USA
-
W. Hartmann, A. Narayanan, E. Fosler-Lussier, and D. L. Wang, "Nothing doing: Re-evaluating missing feature ASR," Tech. Rep. OSU-CISRC-7/11-TR21, Department of Computer Science and Engineering, The Ohio State University, Columbus, Ohio, USA, 2011, Available: ftp://ftp.cse.ohiostate. edu/pub/tech-report/2011
-
(2011)
Nothing Doing: Re-evaluating Missing Feature ASR
-
-
Hartmann, W.1
Narayanan, A.2
Fosler-Lussier, E.3
Wang, D.L.4
-
13
-
-
0142026377
-
Speech segregation based on sound localization
-
N. Roman, D. L. Wang, and G. J. Brown, "Speech segregation based on sound localization," Journal of Acoustical Society of America, vol. 114, no. 4, pp. 2236-2252, 2003
-
(2003)
Journal of Acoustical Society of America
, vol.114
, Issue.4
, pp. 2236-2252
-
-
Roman, N.1
Wang, D.L.2
Brown, G.J.3
-
14
-
-
4644317224
-
A Bayesian classifer for spectrographic mask estimation for missing feature speech recognition
-
M. L. Seltzer, B. Raj, and R. M. Stern, "A Bayesian classifer for spectrographic mask estimation for missing feature speech recognition," Speech Communication, vol. 43, no. 4, pp. 379-393, 2004
-
(2004)
Speech Communication
, vol.43
, Issue.4
, pp. 379-393
-
-
Seltzer, M.L.1
Raj, B.2
Stern, R.M.3
-
15
-
-
33750311718
-
Binary and ratio time-frequency masks for robust speech recognition
-
S. Srinivasan, N. Roman, and D. L. Wang, "Binary and ratio time-frequency masks for robust speech recognition," Speech Communication, vol. 48, pp. 1486-1501, 2006
-
(2006)
Speech Communication
, vol.48
, pp. 1486-1501
-
-
Srinivasan, S.1
Roman, N.2
Wang, D.L.3
-
16
-
-
85009063707
-
Soft decisions in missing data techniques for robust automatic speech recognition
-
J. Barker, L. Josifovski, M. P. Cooke, and P. D. Green, "Soft decisions in missing data techniques for robust automatic speech recognition," in Proceedings of the International Conference on Spoken Language Processing, Beijing, China, 2000, pp. 373-376
-
(2000)
Proceedings of the International Conference on Spoken Language Processing, Beijing, China
, pp. 373-376
-
-
Barker, J.1
Josifovski, L.2
Cooke, M.P.3
Green, P.D.4
-
17
-
-
84867596016
-
A novel approach to soft-mask estimation and log-spectral enhancement for robust speech recognition
-
J. van Hout and A. Alwan, "A novel approach to soft-mask estimation and log-spectral enhancement for robust speech recognition," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, 2012, pp. 4105-4108
-
(2012)
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing
, pp. 4105-4108
-
-
Van Hout, J.1
Alwan, A.2
-
18
-
-
0038712550
-
SNR estimation based on amplitude modulation analysis with applications to noise suppression
-
J. Tchorz and B. Kollmeier, "SNR estimation based on amplitude modulation analysis with applications to noise suppression," IEEE Transactions on Audio, Speech, and Signal Processing, vol. 11, pp. 184-192, 2003
-
(2003)
IEEE Transactions on Audio, Speech, and Signal Processing
, vol.11
, pp. 184-192
-
-
Tchorz, J.1
Kollmeier, B.2
-
19
-
-
64649103540
-
Speech intelligibility in background noise with ideal binary time-frequency masking
-
D. L.Wang, U. Kjems, M. S. Pedersen, J. B. Boldt, and T. Lunner, "Speech intelligibility in background noise with ideal binary time-frequency masking," Journal of Acoustical Society of America, vol. 125, pp. 2336-2347, 2009
-
(2009)
Journal of Acoustical Society of America
, vol.125
, pp. 2336-2347
-
-
Wang, D.L.1
Kjems, U.2
Pedersen, M.S.3
Boldt, J.B.4
Lunner, T.5
-
20
-
-
84870477511
-
Exploring monaural features for classification-based speech segregation
-
Y. Wang, K. Han, and D. Wang, "Exploring monaural features for classification-based speech segregation," IEEE Transactions on Audio, Speech, and Language Processing, vol. 21, pp. 270-279, 2013
-
(2013)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.21
, pp. 270-279
-
-
Wang, Y.1
Han, K.2
Wang, D.3
-
21
-
-
84875678689
-
Towards scaling up classificationbased speech separation
-
in press
-
Y. Wang and D. Wang, "Towards scaling up classificationbased speech separation," IEEE Transactions on Audio, Speech, and Language Processing, 2013, in press
-
(2013)
IEEE Transactions on Audio, Speech, and Language Processing
-
-
Wang, Y.1
Wang, D.2
-
22
-
-
33745805403
-
A fast learning algorithm for deep belief nets
-
G.E. Hinton, S. Osindero, and Y.W. Teh, "A fast learning algorithm for deep belief nets," Neural computation, vol. 18, no. 7, pp. 1527-1554, 2006
-
(2006)
Neural Computation
, vol.18
, Issue.7
, pp. 1527-1554
-
-
Hinton, G.E.1
Osindero, S.2
Teh, Y.W.3
-
24
-
-
0003548585
-
-
J. S. Garofolo, L. F. Lamel, W. M. Fisher, J. G. Fiscus, D. S. Pallett, and N. L. Dahlgren, "DARPA TIMIT acoustic phonetic continuous speech corpus," 1993 [Online]. Available: http://www.ldc.upenn.edu/Catalog/ LDC93S1.html
-
(1993)
DARPA TIMIT Acoustic Phonetic Continuous Speech Corpus
-
-
Garofolo, J.S.1
Lamel, L.F.2
Fisher, W.M.3
Fiscus, J.G.4
Pallett, D.S.5
Dahlgren, N.L.6
-
25
-
-
0003822743
-
-
Cambridge University Publishing Department
-
S. Young, G. Evermann, T. Hain, D. Kershaw, G. Moore, J. Odell, D. Ollason, D. Povey, V. Valtchev, and P. Woodland, The HTK Book, Cambridge University Publishing Department, 2002, [Online]. Available: http://htk.eng.cam.ac.uk.
-
(2002)
The HTK Book
-
-
Young, S.1
Evermann, G.2
Hain, T.3
Kershaw, D.4
Moore, G.5
Odell, J.6
Ollason, D.7
Povey, D.8
Valtchev, V.9
Woodland, P.10
|