-
1
-
-
84893622444
-
The REVERB challenge: A common evaluation framework for dereverberation and recognition of reverberant speech
-
K. Kinoshita, M. Delcroix, T. Yoshioka, T. Nakatani, E. Habets, R. Haeb-Umbach, V. Leutnant, A. Sehr, W. Kellermann, R. Maas, S. Gannot and B. Raj, "The REVERB Challenge: A Common Evaluation Framework for Dereverberation and Recognition of Reverberant Speech," Proc. of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2013.
-
(2013)
Proc. of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)
-
-
Kinoshita, K.1
Delcroix, M.2
Yoshioka, T.3
Nakatani, T.4
Habets, E.5
Haeb-Umbach, R.6
Leutnant, V.7
Sehr, A.8
Kellermann, W.9
Maas, R.10
Gannot, S.11
Raj, B.12
-
3
-
-
0028478507
-
Combined acoustic echo cancellation, dereverberation and noise reduction: A two microphone approach
-
R. Martin and P. Vary, "Combined Acoustic Echo Cancellation, Dereverberation and Noise Reduction: A Two Microphone Approach," Journal of Annales des Télécommunications, Vol. 49, Iss. 7-8, pp. 429-438, 1994.
-
(1994)
Journal of Annales des Télécommunications
, vol.49
, Issue.7-8
, pp. 429-438
-
-
Martin, R.1
Vary, P.2
-
4
-
-
84964511330
-
Single channel blind dereverberation based on auto-correlation functions of frame-wise time sequences of frequency components
-
K. Ohta and M. Yanagida, "Single Channel Blind Dereverberation Based on Auto-Correlation Functions of Frame-Wise Time Sequences of Frequency Components," Proc. of IWAENC, pp. 1-4, 2006.
-
(2006)
Proc. of IWAENC
, pp. 1-4
-
-
Ohta, K.1
Yanagida, M.2
-
5
-
-
33745761716
-
A two-stage algorithm for one-microphone reverberant speech enhancement
-
M. Wu and D. L. Wang, "A Two-Stage Algorithm for One-Microphone Reverberant Speech Enhancement," IEEE Trans. Aud. Speech & Lang. Process, Vol. 14, No. 3, pp. 774-784, 2006.
-
(2006)
IEEE Trans. Aud. Speech & Lang. Process
, vol.14
, Issue.3
, pp. 774-784
-
-
Wu, M.1
Wang, D.L.2
-
6
-
-
4544336156
-
Robust automatic speech recognition in reverberant environments by model selection
-
L. Couvreur and C. Couvreur, "Robust Automatic Speech Recognition in Reverberant Environments by Model Selection," Proc. of HSC, pp. 147-150, 2001.
-
(2001)
Proc. of HSC
, pp. 147-150
-
-
Couvreur, L.1
Couvreur, C.2
-
7
-
-
34547517494
-
A new concept for feature-domain dereverberation for robust distant-talking ASR
-
A. Sehr and W. Kellermann, "A New Concept for Feature-Domain Dereverberation for Robust Distant-Talking ASR," Proc. of ICASSP, pp. 369-372, 2007.
-
(2007)
Proc. of ICASSP
, pp. 369-372
-
-
Sehr, A.1
Kellermann, W.2
-
8
-
-
70350450398
-
Static and dynamic variance compensation for recognition of reverberant speech with dereverberation preprocessing
-
M. Delcroix and S. Watanabe, "Static and Dynamic Variance Compensation for Recognition of Reverberant Speech with Dereverberation Preprocessing," IEEE Trans. on Aud. Speech & Lang. Process, Vol. 17, No. 2, pp. 324-334, 2009.
-
(2009)
IEEE Trans. on Aud. Speech & Lang. Process
, vol.17
, Issue.2
, pp. 324-334
-
-
Delcroix, M.1
Watanabe, S.2
-
9
-
-
84928158251
-
Use of multiple front-ends and i-vector-based speaker adaptation for robust speech recognition
-
Md. J. Alam, V. Gupta, P. Kenny and P. Dumouchel, "Use of Multiple Front-Ends and I-Vector-Based Speaker Adaptation for Robust Speech Recognition," in Proc. of REVERB Challenge, 2014.
-
(2014)
Proc. of REVERB Challenge
-
-
Alam, M.J.1
Gupta, V.2
Kenny, P.3
Dumouchel, P.4
-
10
-
-
84933559263
-
Linear prediction-based dereverberation with advanced speech enhancement and recognition technologies for the REVERB challenge
-
M. Delcroix, T. Yoshioka, A. Ogawa, Y. Kubo, M. Fujimoto, I. Nobutaka, K. Kinoshita, M. Espi, T. Hori and T. Nakatani, "Linear prediction-based dereverberation with advanced speech enhancement and recognition technologies for the REVERB challenge," in Proc. of REVERB Challenge, 2014.
-
(2014)
Proc. of REVERB Challenge
-
-
Delcroix, M.1
Yoshioka, T.2
Ogawa, A.3
Kubo, Y.4
Fujimoto, M.5
Nobutaka, I.6
Kinoshita, K.7
Espi, M.8
Hori, T.9
Nakatani, T.10
-
11
-
-
84928158249
-
Robust features and system fusion for reverberation-robust speech recognition
-
V. Mitra, W. Wang, Y. Lei, A. Kathol, G. Sivaraman and C. Espy-Wilson, "Robust features and system fusion for reverberation-robust speech recognition," in Proc. of REVERB Challenge, 2014.
-
(2014)
Proc. of REVERB Challenge
-
-
Mitra, V.1
Wang, W.2
Lei, Y.3
Kathol, A.4
Sivaraman, G.5
Espy-Wilson, C.6
-
12
-
-
84055211743
-
Acoustic modeling using deep belief networks
-
A. Mohamed, G. E. Dahl and G. Hinton, "Acoustic modeling using deep belief networks," IEEE Trans. on ASLP, Vol. 20, No. 1, pp. 14-22, 2012.
-
(2012)
IEEE Trans. on ASLP
, vol.20
, Issue.1
, pp. 14-22
-
-
Mohamed, A.1
Dahl, G.E.2
Hinton, G.3
-
14
-
-
84910075252
-
Evaluating robust features on deep neural networks for speech recognition in noisy and channel mismatched conditions
-
V. Mitra, W. Wang, H. Franco, Y. Lei, C. Bartels and M. Graciarena, "Evaluating robust features on Deep Neural Networks for speech recognition in noisy and channel mismatched conditions," in Proc. of Interspeech, 2014.
-
(2014)
Proc. of Interspeech
-
-
Mitra, V.1
Wang, W.2
Franco, H.3
Lei, Y.4
Bartels, C.5
Graciarena, M.6
-
15
-
-
84893691530
-
Speaker adaptation of neural network acoustic models using i-vectors
-
G. Saon, H. Soltau, D. Nahamoo and M. Picheny, "Speaker Adaptation of Neural Network Acoustic Models using I-vectors," in Proc. ASRU, 2013.
-
(2013)
Proc. ASRU
-
-
Saon, G.1
Soltau, H.2
Nahamoo, D.3
Picheny, M.4
-
16
-
-
79951609039
-
Front-end factor analysis for speaker verification
-
N. Dehak, P. Kenny, R. Dehak, P. Dumouchel and P. Ouellet, "Front-end factor analysis for speaker verification," IEEE Trans. on Speech and Audio Processing, Vol. 19, pp. 788-798, 2011.
-
(2011)
IEEE Trans. on Speech and Audio Processing
, vol.19
, pp. 788-798
-
-
Dehak, N.1
Kenny, P.2
Dehak, R.3
Dumouchel, P.4
Ouellet, P.5
-
17
-
-
0028996854
-
WSJCAM0: A British English speech corpus for large vocabulary continuous speech recognition
-
T. Robinson, J. Fransen, D. Pye, J. Foote and S. Renals, "WSJCAM0: A British English Speech Corpus for Large Vocabulary Continuous Speech Recognition," Proc. ICASSP, pp. 81-84, 1995.
-
(1995)
Proc. ICASSP
, pp. 81-84
-
-
Robinson, T.1
Fransen, J.2
Pye, D.3
Foote, J.4
Renals, S.5
-
19
-
-
84906260861
-
Damped oscillator cepstral coefficients for robust speech recognition
-
V. Mitra, H. Franco and M. Graciarena, "Damped Oscillator Cepstral Coefficients for Robust Speech Recognition," Proc. of Interspeech, pp. 886-890, 2013.
-
(2013)
Proc. of Interspeech
, pp. 886-890
-
-
Mitra, V.1
Franco, H.2
Graciarena, M.3
-
20
-
-
84867589420
-
Normalized amplitude modulation features for large vocabulary noise-robust speech recognition
-
V. Mitra, H. Franco, M. Graciarena and A. Mandal, "Normalized Amplitude Modulation Features for Large Vocabulary Noise-Robust Speech Recognition," Proc. of ICASSP, pp. 4117-4120, 2012.
-
(2012)
Proc. of ICASSP
, pp. 4117-4120
-
-
Mitra, V.1
Franco, H.2
Graciarena, M.3
Mandal, A.4
-
21
-
-
0027676955
-
Energy separation in signal modulations with application to speech analysis
-
P. Maragos, J. Kaiser and T. Quatieri, "Energy Separation in Signal Modulations with Application to Speech Analysis," IEEE Trans. Signal Processing, Vol. 41, pp. 3024-3051, 1993.
-
(1993)
IEEE Trans. Signal Processing
, vol.41
, pp. 3024-3051
-
-
Maragos, P.1
Kaiser, J.2
Quatieri, T.3
-
22
-
-
84905269267
-
Medium duration modulation cepstral feature for robust speech recognition
-
V. Mitra, H. Franco, M. Graciarena and D. Vergyri, "Medium duration modulation cepstral feature for robust speech recognition," Proc. of ICASSP, Florence, 2014.
-
(2014)
Proc. of ICASSP
-
-
Mitra, V.1
Franco, H.2
Graciarena, M.3
Vergyri, D.4
-
23
-
-
84906246749
-
Modulation features for noise robust speaker identification
-
V. Mitra, M. McLaren, H. Franco, M. Graciarena and N. Scheffer, "Modulation Features for Noise Robust Speaker Identification," Proc. of Interspeech, pp. 3703-3707, 2013.
-
(2013)
Proc. of Interspeech
, pp. 3703-3707
-
-
Mitra, V.1
McLaren, M.2
Franco, H.3
Graciarena, M.4
Scheffer, N.5
-
24
-
-
0019075685
-
Some observations on oral air flow during phonation
-
H. Teager, "Some Observations on Oral Air Flow During Phonation," in IEEE Trans. ASSP, pp. 599-601, 1980.
-
(1980)
IEEE Trans. ASSP
, pp. 599-601
-
-
Teager, H.1
-
25
-
-
84928164944
-
The design for the Wall Street Journal-based CSR corpus
-
D. B. Paul and J. M. Baker, "The Design for the Wall Street Journal-based CSR Corpus," Proc. of HLT, pp 3
-
Proc. of HLT
, pp. 3
-
-
Paul, D.B.1
Baker, J.M.2
-
26
-
-
0030638031
-
A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (ROVER)
-
J. G. Fiscus, "A Post-Processing System to Yield Reduced Word Error Rates: Recognizer Output Voting Error Reduction (ROVER)," Proc. of ASRU, pp. 347-354, 1997.
-
(1997)
Proc. of ASRU
, pp. 347-354
-
-
Fiscus, J.G.1
-
27
-
-
84867605836
-
Applying convolutional neural networks concepts to hybrid NN-HMM model for speech recognition
-
O. Abdel-Hamid, A. Mohamed, H. Jiang and G. Penn, "Applying convolutional neural networks concepts to hybrid NN-HMM model for speech recognition," Proc. of ICASSP, pp. 4277-4280, 2012.
-
(2012)
Proc. of ICASSP
, pp. 4277-4280
-
-
Abdel-Hamid, O.1
Mohamed, A.2
Jiang, H.3
Penn, G.4