-
1
-
-
84964515036
-
The automatic speech recognition in reverberant environments (ASpIRE) challenge
-
M Harper, "The Automatic Speech recognition In Reverberant Environments (ASpIRE) Challenge," in Automatic Speech Recognition and Understanding (ASRU), 2015, pp. 1-6
-
(2015)
Automatic Speech Recognition and Understanding (ASRU)
, pp. 1-6
-
-
Harper, M.1
-
2
-
-
0036299273
-
Pitch determination and voice quality analysis using subharmonic-to-harmonic ratio
-
Xuejing Sun, "Pitch determination and voice quality analysis using subharmonic-to-harmonic ratio," in Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on. IEEE, 2002, vol. 1, pp. I-333
-
(2002)
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference On. IEEE
, vol.1
, pp. I-333
-
-
Sun, X.1
-
3
-
-
84865734075
-
Joint robust voicing detection and pitch estimation based on residual harmonics
-
Thomas Drugman and Abeer Alwan, "Joint robust voicing detection and pitch estimation based on residual harmonics.," in Interspeech, 2011, pp. 1973-1976
-
(2011)
Interspeech
, pp. 1973-1976
-
-
Drugman, T.1
Alwan, A.2
-
4
-
-
0034857681
-
Speech dereverberation via maximum-kurtosis subband adaptive filtering
-
IEEE
-
B.W. Gillespie, H.S. Malvar, and D.A.F. Florencio, "Speech dereverberation via maximum-kurtosis subband adaptive filtering," in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). 2001, vol. 6, pp. 3701-3704, IEEE
-
(2001)
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
, vol.6
, pp. 3701-3704
-
-
Gillespie, B.W.1
Malvar, H.S.2
Florencio, D.A.F.3
-
5
-
-
84890521103
-
Speaker adaptation of context dependent deep neural networks
-
Hank Liao, "Speaker adaptation of context dependent deep neural networks," in Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. IEEE, 2013, pp. 7947-7951
-
(2013)
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference On. IEEE
, pp. 7947-7951
-
-
Liao, H.1
-
6
-
-
84893650076
-
Semi-supervised training of deep neural networks
-
Karel Vesely, Mirko Hannemann, and Lukas Burget, "Semi-supervised training of deep neural networks," in Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on. IEEE, 2013, pp. 267-272
-
(2013)
Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop On. IEEE
, pp. 267-272
-
-
Vesely, K.1
Hannemann, M.2
Burget, L.3
-
7
-
-
78049409757
-
Discriminative training based on an integrated view of mpe and MMI in margin and error space
-
Erik McDermott, Shinji Watanabe, and Atsushi Nakamura, "Discriminative training based on an integrated view of mpe and MMI in margin and error space," in Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on. IEEE, 2010, pp. 4894-4897
-
(2010)
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference On. IEEE
, pp. 4894-4897
-
-
McDermott, E.1
Watanabe, S.2
Nakamura, A.3
-
8
-
-
84906274730
-
Sequence-discriminative training of deep neural networks
-
Karel Vesely, Arnab Ghoshal, Lukás Burget, and Daniel Povey, "Sequence-discriminative training of deep neural networks.," in INTERSPEECH, 2013, pp. 2345-2349
-
(2013)
INTERSPEECH
, pp. 2345-2349
-
-
Vesely, K.1
Ghoshal, A.2
Burget, L.3
Povey, D.4
-
10
-
-
84959118000
-
The fisher corpus: A resource for the next generations of speech-to-text
-
Christopher Cieri, David Miller, and Kevin Walker, "The fisher corpus a resource for the next generations of speech-to-text.," in LREC, 2004, vol. 4, pp. 69-71
-
(2004)
LREC
, vol.4
, pp. 69-71
-
-
Cieri, C.1
Miller, D.2
Walker, K.3
-
12
-
-
33745185408
-
Extended advanced front end algorithm description, Version 1.1
-
Tech. Rep. ES
-
A Sorin and T Ramabadran, "Extended advanced front end algorithm description, Version 1.1," ETSI STQ Aurora DSR Working Group, Tech. Rep. ES, vol. 202, pp. 212, 2003
-
(2003)
ETSI STQ Aurora DSR Working Group
, vol.202
, pp. 212
-
-
Sorin, A.1
Ramabadran, T.2
-
13
-
-
0016990291
-
The generalized correlation method for estimation of time delay
-
Charles H Knapp and G Clifford Carter, "The generalized correlation method for estimation of time delay," Acoustics, Speech and Signal Processing, IEEE Transactions on, vol. 24, no. 4, pp. 320-327, 1976
-
(1976)
Acoustics, Speech and Signal Processing, IEEE Transactions on
, vol.24
, Issue.4
, pp. 320-327
-
-
Knapp, C.H.1
Clifford Carter, G.2
-
14
-
-
4644306223
-
Time delay estimation in the presence of correlated noise and reverberation
-
Yong Rui and Dinei Florencio, "Time delay estimation in the presence of correlated noise and reverberation," in Acoustics, Speech, and Signal Processing, 2004. Proceedings.( ICASSP'04). IEEE International Conference on. IEEE, 2004, vol. 2, pp. ii-133
-
(2004)
Acoustics, Speech, and Signal Processing, 2004. Proceedings.( ICASSP'04). IEEE International Conference On. IEEE
, vol.2
, pp. ii-133
-
-
Rui, Y.1
Florencio, D.2
-
15
-
-
51449085960
-
Why does phat work well in lownoise, reverberative environments?
-
Cha Zhang, Dinei Florencio, and Zhengyou Zhang, "Why does phat work well in lownoise, reverberative environments?," in Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on. IEEE, 2008, pp. 2565-2568
-
(2008)
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference On. IEEE
, pp. 2565-2568
-
-
Zhang, C.1
Florencio, D.2
Zhang, Z.3
-
16
-
-
50449086237
-
Acoustic beamforming for speaker diarization of meetings
-
Xavier Anguera, Chuck Wooters, and Javier Hernando, "Acoustic beamforming for speaker diarization of meetings," Audio, Speech, and Language Processing, IEEE Transactions on, vol. 15, no. 7, pp. 2011-2022, 2007
-
(2007)
Audio, Speech, and Language Processing, IEEE Transactions on
, vol.15
, Issue.7
, pp. 2011-2022
-
-
Anguera, X.1
Wooters, C.2
Hernando, J.3
-
17
-
-
84906235016
-
The kaldi speech recognition toolkit
-
IEEE
-
Daniel Povey, Arnab Ghoshal, Gilles Boulianne, Lukáš Burget, Ondej Glembek, Nagendra Goel, Mirko Hannemann, Petr Motlíček, Yanmin Qian, Petr Schwarz, Jan Silovsḱy, Georg Stemmer, and Karel Veseĺy, "The kaldi speech recognition toolkit," in IEEE workshop on automatic speech recognition and understanding (ASRU). 2011, IEEE
-
(2011)
IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)
-
-
Povey, D.1
Ghoshal, A.2
Boulianne, G.3
Burget, L.4
Glembek, O.5
Goel, N.6
Hannemann, M.7
Motlíček, P.8
Qian, Y.9
Schwarz, P.10
Silovsḱy, J.11
Stemmer, G.12
Veseĺy, K.13
|