-
1
-
-
84890492030
-
An investigation of deep neural networks for noise robust speech recognition
-
Vancouver, Canada
-
M.L. Seltzer, D. Yu, and Y. Wang, "An investigation of deep neural networks for noise robust speech recognition, " in Proc. of ICASSP, Vancouver, Canada, 2013, pp. 7398-7402.
-
(2013)
Proc. of ICASSP
, pp. 7398-7402
-
-
Seltzer, M.L.1
Yu, D.2
Wang, Y.3
-
2
-
-
84906237188
-
Reverberant speech recognition based on denoising autoencoder
-
Lyon, France
-
T. Ishii, H. Komiyama, T. Shinozaki, Y. Horiuchi, and S. Kuroiwa, "Reverberant speech recognition based on denoising autoencoder, " in Proc. of INTERSPEECH, Lyon, France, 2013, pp. 3512-3516.
-
(2013)
Proc. of INTERSPEECH
, pp. 3512-3516
-
-
Ishii, T.1
Komiyama, H.2
Shinozaki, T.3
Horiuchi, Y.4
Kuroiwa, S.5
-
3
-
-
84900537286
-
The Munich feature enhancement approach to the 2013 CHiME Challenge using BLSTM recurrent neural networks
-
Vancouver, Canada
-
F. Weninger, J. Geiger, M. Wollmer, B. Schuller, and G. Rigoll, "The Munich feature enhancement approach to the 2013 CHiME Challenge using BLSTM recurrent neural networks, " in Proc. The 2nd CHiME Workshop, Vancouver, Canada, 2013, pp. 86-90.
-
(2013)
Proc. The 2nd CHiME Workshop
, pp. 86-90
-
-
Weninger, F.1
Geiger, J.2
Wollmer, M.3
Schuller, B.4
Rigoll, G.5
-
4
-
-
84883396653
-
Noise robust ASR in reverberated multisource environments applying convolutive NMF and Long Short-Term Memory
-
M. Wollmer, F. Weninger, J. Geiger, B. Schuller, and G. Rigoll, "Noise robust ASR in reverberated multisource environments applying convolutive NMF and Long Short-Term Memory, " Computer Speech and Language, Special Issue on Speech Separation and Recognition in Multisource Environments, vol. 27, no. 3, pp. 780-797, 2013.
-
(2013)
Computer Speech and Language, Special Issue on Speech Separation and Recognition in Multisource Environments
, vol.27
, Issue.3
, pp. 780-797
-
-
Wollmer, M.1
Weninger, F.2
Geiger, J.3
Schuller, B.4
Rigoll, G.5
-
5
-
-
0242609086
-
Blind estimation of reverberation time
-
R. Ratnam, D.L. Jones, B.C. Wheeler, W.D. O'Brien, Jr, C.R. Lansing, and A.S. Feng, "Blind estimation of reverberation time, " The Journal of the Acoustical Society of America, vol. 114, no. 5, pp. 2877-2892, 2003.
-
(2003)
The Journal of the Acoustical Society of America
, vol.114
, Issue.5
, pp. 2877-2892
-
-
Ratnam, R.1
Jones, D.L.2
Wheeler, B.C.3
O'brien Jr., W.D.4
Lansing, C.R.5
Feng, A.S.6
-
6
-
-
77955671150
-
Model-based dereverberation in the Logmelspec domain for robust distant-talking speech recognition
-
Dallas, USA
-
A. Sehr, R. Maas, and W. Kellermann, "Model-based dereverberation in the Logmelspec domain for robust distant-talking speech recognition, " in Proc. of ICASSP, Dallas, USA, 2010, pp. 4298-4301.
-
(2010)
Proc. of ICASSP
, pp. 4298-4301
-
-
Sehr, A.1
Maas, R.2
Kellermann, W.3
-
7
-
-
80051616058
-
Frame-wise HMM adaptation using state-dependent reverberation estimates
-
Prague, Czech Republic
-
A. Sehr, R. Maas, and W. Kellermann, "Frame-wise HMM adaptation using state-dependent reverberation estimates, " in IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), Prague, Czech Republic, 2011, pp. 5484-5487.
-
(2011)
IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP)
, pp. 5484-5487
-
-
Sehr, A.1
Maas, R.2
Kellermann, W.3
-
8
-
-
79961153040
-
Model-based approaches to handling additive noise in reverberant environments
-
Edinburgh, UK
-
M.J.F. Gales and Y.Q. Wang, "Model-based approaches to handling additive noise in reverberant environments, " in Proc. IEEE Workshop on Hands-free Speech Communication and Microphone Arrays, Edinburgh, UK, 2011, pp. 121-126.
-
(2011)
Proc. IEEE Workshop on Hands-free Speech Communication and Microphone Arrays
, pp. 121-126
-
-
Gales, M.J.F.1
Wang, Y.Q.2
-
9
-
-
79957856980
-
A basis representation of constrained MLLR transforms for robust adaptation
-
D. Povey and K. Yao, "A basis representation of constrained MLLR transforms for robust adaptation, " Computer Speech and Language, vol. 26, pp. 35-51, 2012.
-
(2012)
Computer Speech and Language
, vol.26
, pp. 35-51
-
-
Povey, D.1
Yao, K.2
-
10
-
-
70350450398
-
Static and dynamic variance compensation for recognition of reverberant speech with dereverberation pre-processing
-
M. Delcroix, T. Nakatani, and S.Watanabe, "Static and dynamic variance compensation for recognition of reverberant speech with dereverberation pre-processing, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 17, no. 2, pp. 324-334, 2009.
-
(2009)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.17
, Issue.2
, pp. 324-334
-
-
Delcroix, M.1
Nakatani, T.2
Watanabe, S.3
-
11
-
-
56449089103
-
Extracting and composing robust features with denoising autoencoders
-
Helsinki, Finland
-
P. Vincent, H. Larochelle, Y. Bengio, and P. Manzagol, "Extracting and composing robust features with denoising autoencoders, " in Proc. of ICML, Helsinki, Finland, 2008, pp. 1096-1103.
-
(2008)
Proc. of ICML
, pp. 1096-1103
-
-
Vincent, P.1
Larochelle, H.2
Bengio, Y.3
Manzagol, P.4
-
12
-
-
0031573117
-
Long short-term memory
-
S. Hochreiter and J. Schmidhuber, "Long short-term memory, " Neural Computation, vol. 9, no. 8, pp. 1735-1780, 1997.
-
(1997)
Neural Computation
, vol.9
, Issue.8
, pp. 1735-1780
-
-
Hochreiter, S.1
Schmidhuber, J.2
-
13
-
-
0034293152
-
Learning to forget: Continual prediction with LSTM
-
F. Gers, J. Schmidhuber, and F. Cummins, "Learning to forget: Continual prediction with LSTM, " Neural Computation, vol. 12, no. 10, pp. 2451-2471, 2000.
-
(2000)
Neural Computation
, vol.12
, Issue.10
, pp. 2451-2471
-
-
Gers, F.1
Schmidhuber, J.2
Cummins, F.3
-
14
-
-
84893622444
-
The REVERB Challenge: A common evaluation framework for dereverberation and recognition of reverberant speech
-
New Paltz, NY, USA, to appear
-
K. Kinoshita, M. Delcroix, T. Yoshioka, T. Nakatani, E. Habets, R. Haeb-Umbach, V. Leutnant, A. Sehr, W. Kellermann, R. Maas, S. Gannot, and B. Raj, "The REVERB Challenge: A common evaluation framework for dereverberation and recognition of reverberant speech, " in Proc. of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, NY, USA, 2013, to appear.
-
(2013)
Proc. of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)
-
-
Kinoshita, K.1
Delcroix, M.2
Yoshioka, T.3
Nakatani, T.4
Habets, E.5
Haeb-Umbach, R.6
Leutnant, V.7
Sehr, A.8
Kellermann, W.9
Maas, R.10
Gannot, S.11
Raj, B.12
-
15
-
-
85132941272
-
Speech dereverberation using statistical reverberation models
-
P.A. Naylor and N.D. Gaubitch, Eds. Springer
-
E. Habets, "Speech dereverberation using statistical reverberation models, " in Speech Dereverberation, P.A. Naylor and N.D. Gaubitch, Eds., pp. 57-93. Springer, 2010.
-
(2010)
Speech Dereverberation
, pp. 57-93
-
-
Habets, E.1
-
16
-
-
84962920708
-
Evaluating long-term spectral subtraction for reverberant ASR
-
Madonna di Campiglio, ItalyIEEE
-
D. Gelbart and N. Morgan, "Evaluating long-term spectral subtraction for reverberant ASR, " in Proc. of ASRU, Madonna di Campiglio, Italy, 2001, pp. 103-106, IEEE.
-
(2001)
Proc. of ASRU
, pp. 103-106
-
-
Gelbart, D.1
Morgan, N.2
-
17
-
-
65249167097
-
Suppression of late reverberation effect on speech signal using long-term multiplestep linear prediction
-
K. Kinoshita, M. Delcroix, T. Nakatani, and M. Miyoshi, "Suppression of late reverberation effect on speech signal using long-term multiplestep linear prediction, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 17, no. 4, pp. 534-545, 2009.
-
(2009)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.17
, Issue.4
, pp. 534-545
-
-
Kinoshita, K.1
Delcroix, M.2
Nakatani, T.3
Miyoshi, M.4
-
18
-
-
84906279378
-
Speech enhancement with weighted denoising auto-encoder
-
Lyon, France
-
B.Y. Xia and C.C. Bao, "Speech enhancement with weighted denoising auto-encoder, " in Proc. of INTERSPEECH, Lyon, France, 2013, pp. 436-440.
-
(2013)
Proc. of INTERSPEECH
, pp. 436-440
-
-
Xia, B.Y.1
Bao, C.C.2
-
19
-
-
84900542109
-
Recurrent neural network feature enhancement: The 2nd CHiME challenge
-
IEEE. Vancouver, Canada, June
-
A.L. Maas, T.M. O'Neil, A.Y. Hannun, and A.Y. Ng, "Recurrent neural network feature enhancement: The 2nd CHiME challenge, " in Proc. The 2nd CHiME Workshop, Vancouver, Canada, June 2013, pp. 79-80, IEEE.
-
(2013)
Proc. The 2nd CHiME Workshop
, pp. 79-80
-
-
Maas, A.L.1
O'neil, T.M.2
Hannun, A.Y.3
Ng, A.Y.4
-
20
-
-
84877253028
-
Dereverberation method with reverberation time estimation using floored ratio of spectral subtraction
-
Y. Tachioka, T. Hanazawa, and T. Iwasaki, "Dereverberation method with reverberation time estimation using floored ratio of spectral subtraction, " Acoustical Science and Technology, vol. 34, no. 3, pp. 212-215, 2013.
-
(2013)
Acoustical Science and Technology
, vol.34
, Issue.3
, pp. 212-215
-
-
Tachioka, Y.1
Hanazawa, T.2
Iwasaki, T.3
-
21
-
-
0018455310
-
Suppression of acoustic noise in speech using spectral subtraction
-
S.F. Boll, "Suppression of acoustic noise in speech using spectral subtraction, " IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 27, no. 2, pp. 113-120, 1979.
-
(1979)
IEEE Transactions on Acoustics, Speech and Signal Processing
, vol.27
, Issue.2
, pp. 113-120
-
-
Boll, S.F.1
-
22
-
-
84890543083
-
Speech recognition with deep recurrent neural networks
-
Vancouver, Canada, May, IEEE
-
A. Graves, A. Mohamed, and G. Hinton, "Speech recognition with deep recurrent neural networks, " in Proc. of ICASSP, Vancouver, Canada, May 2013, pp. 6645-6649, IEEE.
-
(2013)
Proc. of ICASSP
, pp. 6645-6649
-
-
Graves, A.1
Mohamed, A.2
Hinton, G.3
-
23
-
-
84865791631
-
Speech-based non-prototypical affect recognition for childrobot interaction in reverberated environments
-
Florence, Italy
-
M.Wollmer, F.Weninger, S. Steidl, A. Batliner, and B. Schuller, "Speech-based non-prototypical affect recognition for childrobot interaction in reverberated environments, " in Proc. of INTERSPEECH, Florence, Italy, 2011, pp. 3113-3116.
-
(2011)
Proc. of INTERSPEECH
, pp. 3113-3116
-
-
Wollmer, M.1
Weninger, F.2
Steidl, S.3
Batliner, A.4
Schuller, B.5
-
24
-
-
0028996854
-
WSJCAM0: A British English speech corpus for large vocabulary continuous speech recognition
-
Detroit, MI, USA
-
T. Robinson, J. Fransen, D. Pye, J. Foote, and S. Renals, "WSJCAM0: A British English speech corpus for large vocabulary continuous speech recognition, " in Proc. of ICASSP, Detroit, MI, USA, 1995, pp. 81-84.
-
(1995)
Proc. of ICASSP
, pp. 81-84
-
-
Robinson, T.1
Fransen, J.2
Pye, D.3
Foote, J.4
Renals, S.5
-
25
-
-
84858953642
-
The Kaldi speech recognition toolkit
-
Big Island, HI, USA
-
D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek, N. Goel, M. Hannemann, P. Motlícek, Y. Qian, P. Schwarz, et al., "The Kaldi speech recognition toolkit, " in Proc. of ASRU, Big Island, HI, USA, 2011.
-
(2011)
Proc. of ASRU
-
-
Povey, D.1
Ghoshal, A.2
Boulianne, G.3
Burget, L.4
Glembek, O.5
Goel, N.6
Hannemann, M.7
Motlícek, P.8
Qian, Y.9
Schwarz, P.10
-
26
-
-
60749097551
-
-
Cambridge University Engineering Department, Cambridge, UK
-
S.J. Young, G. Evermann, M.J.F. Gales, D. Kershaw, G. Moore, J.J. Odell, D.G. Ollason, D. Povey, V. Valtchev, and P.C. Woodland, The HTK book version 3.4, Cambridge University Engineering Department, Cambridge, UK, 2006.
-
(2006)
The HTK Book Version 3.4
-
-
Young, S.J.1
Evermann, G.2
Gales, M.J.F.3
Kershaw, D.4
Moore, G.5
Odell, J.J.6
Ollason, D.G.7
Povey, D.8
Valtchev, V.9
Woodland, P.C.10
-
27
-
-
0032638856
-
Semi-tied covariance matrices for hidden Markov models
-
M. Gales, "Semi-tied covariance matrices for hidden Markov models, " IEEE Transactions on Speech and Audio Processing, vol. 7, pp. 272-281, 1999.
-
(1999)
IEEE Transactions on Speech and Audio Processing
, vol.7
, pp. 272-281
-
-
Gales, M.1
-
28
-
-
84890503970
-
Effectiveness of discriminative training and feature transformation for reverberated and noisy speech
-
Vancouver, Canada
-
Y. Tachioka, S. Watanabe, and J.R. Hershey, "Effectiveness of discriminative training and feature transformation for reverberated and noisy speech, " in Proc. of ICASSP, Vancouver, Canada, 2013, pp. 6935-6939
-
(2013)
Proc. of ICASSP
, pp. 6935-6939
-
-
Tachioka, Y.1
Watanabe, S.2
Hershey, J.R.3
|