-
1
-
-
0033903480
-
Robust voice activity detection algorithm for estimating noise spectrum
-
K. Woo, T. Yang, K. Park, and C. Lee, "Robust Voice Activity Detection Algorithm for Estimating Noise Spectrum," IEEE Electronics Letters, 2000.
-
(2000)
IEEE Electronics Letters
-
-
Woo, K.1
Yang, T.2
Park, K.3
Lee, C.4
-
2
-
-
85026719883
-
Robust energy normalization using speech/non-speech discriminator for German connected digit recognition
-
R. Chengalvarayan, "Robust Energy Normalization using Speech/Non-speech Discriminator for German Connected Digit Recognition," in ISCA Eurospeech, 1999.
-
(1999)
ISCA Eurospeech
-
-
Chengalvarayan, R.1
-
3
-
-
79851495972
-
A silence compression scheme for G.729 optimized for terminals conforming to Recommendation V.70
-
ITU-T, "A Silence Compression Scheme for G.729 Optimized for Terminals Conforming to Recommendation V.70," in Recommendation G.729-Annex B, 1996.
-
(1996)
Recommendation G.729-Annex B
-
-
-
4
-
-
84905230151
-
Robust voice activity detection using higher-order statistics in the LPC residual domain
-
E. Nemer, R. Goubran, and S. Mahmoud, "Robust Voice Activity Detection using Higher-order Statistics in the LPC Residual Domain," IEEE Electronics Letters, 2000.
-
(2000)
IEEE Electronics Letters
-
-
Nemer, E.1
Goubran, R.2
Mahmoud, S.3
-
5
-
-
84905248283
-
The segmentation of multichannel meeting recording for automatic speech recognition
-
J. Dines, J. Vepa, and T. Hain, "The Segmentation of Multichannel Meeting Recording for Automatic Speech Recognition," ISCA ICSLP, 2006.
-
(2006)
ISCA ICSLP
-
-
Dines, J.1
Vepa, J.2
Hain, T.3
-
6
-
-
17344389852
-
Robust speech recognition in noisy environments: The 2001 IBM SPINE evaluation system
-
B. Kingsbury, G. Saon, L. Mangu, M. Padmanabhan, and R. Sarikaya, "Robust Speech Recognition in Noisy Environments: The 2001 IBM SPINE Evaluation System," in IEEE ICASSP, 2002.
-
(2002)
IEEE ICASSP
-
-
Kingsbury, B.1
Saon, G.2
Mangu, L.3
Padmanabhan, M.4
Sarikaya, R.5
-
9
-
-
33646064275
-
Multi-resolution RASTA filtering for TANDEM-based ASR
-
H. Hermansky and P. Fousek, "Multi-resolution RASTA Filtering for TANDEM-based ASR," in ISCA Interspeech, 2005.
-
(2005)
ISCA Interspeech
-
-
Hermansky, H.1
Fousek, P.2
-
10
-
-
84905248277
-
Multi-layer perceptron based speech activity detection for speaker verification
-
S. Ganapathy, P. Rajan, and H. Hermansky, "Multi-layer Perceptron based Speech Activity Detection for Speaker Verification," IEEE WASPAA, 2011.
-
(2011)
IEEE WASPAA
-
-
Ganapathy, S.1
Rajan, P.2
Hermansky, H.3
-
11
-
-
34047272330
-
Discrimination of speech from non-speech based on multiscale spectrotemporal modulations
-
N. Mesgarani, M. Slaney, and S. Shamma, "Discrimination of Speech from Non-speech based on Multiscale Spectrotemporal Modulations," IEEE Transactions on Audio, Speech, and Language Processing, 2006.
-
(2006)
IEEE Transactions on Audio, Speech, and Language Processing
-
-
Mesgarani, N.1
Slaney, M.2
Shamma, S.3
-
12
-
-
0032203257
-
Gradient based learning applied to document recognition
-
Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient based Learning applied to Document Recognition," Proceedings of the IEEE, 1998.
-
(1998)
Proceedings of the IEEE
-
-
LeCun, Y.1
Bottou, L.2
Bengio, Y.3
Haffner, P.4
-
13
-
-
84879123473
-
The RATS radio traffic collection system
-
K. Walker and S. Strassel, "The RATS Radio Traffic Collection System," in ISCA Odyssey, 2012.
-
(2012)
ISCA Odyssey
-
-
Walker, K.1
Strassel, S.2
-
14
-
-
84878535284
-
Developing a speech activity detection system for the DARPA RATS program
-
T. Ng et al., "Developing a Speech Activity Detection System for the DARPA RATS Program," in ISCA Interspeech, 2012.
-
(2012)
ISCA Interspeech
-
-
Ng, T.1
-
15
-
-
84878590831
-
Acoustic and data-driven features for robust speech activity detection
-
S. Thomas et al., "Acoustic and Data-driven Features for Robust Speech Activity Detection," in ISCA Interspeech, 2012.
-
(2012)
ISCA Interspeech
-
-
Thomas, S.1
-
16
-
-
84906222432
-
The IBM speech activity detection system for the DARPA RATS program
-
G. Saon et al., "The IBM Speech Activity Detection System for the DARPA RATS Program," in ISCA Interspeech, 2013.
-
(2013)
ISCA Interspeech
-
-
Saon, G.1
-
17
-
-
84906277631
-
Multi-band long-term signal variability features for robust voice activity detection
-
A. Tsiartas et al., "Multi-band Long-term Signal Variability Features for Robust Voice Activity Detection," in ISCA Interspeech, 2013.
-
(2013)
ISCA Interspeech
-
-
Tsiartas, A.1
-
18
-
-
84906248945
-
All for one: Feature combination for highly channel-degraded speech activity detection
-
M. Graciarena et al., "All for One: Feature Combination for Highly Channel-degraded Speech Activity Detection," in ISCA Interspeech, 2013.
-
(2013)
ISCA Interspeech
-
-
Graciarena, M.1
-
19
-
-
77954761139
-
Learning methods for generic object recognition with invariance to pose and lighting
-
Y. LeCun, F. Huang, and L. Bottou, "Learning Methods for Generic Object Recognition with Invariance to Pose and Lighting," in IEEE CVPR, 2004.
-
(2004)
IEEE CVPR
-
-
LeCun, Y.1
Huang, F.2
Bottou, L.3
-
20
-
-
84867605836
-
Applying convolutional neural network concepts to hybrid NN-HMM model for speech recognition
-
O. Abdel-Hamid, A. Mohamed, H. Jiang, and G. Penn, "Applying Convolutional Neural Network Concepts to Hybrid NN-HMM Model for Speech Recognition," in IEEE ICASSP, 2012.
-
(2012)
IEEE ICASSP
-
-
Abdel-Hamid, O.1
Mohamed, A.2
Jiang, H.3
Penn, G.4
-
21
-
-
84890525984
-
Deep convolutional neural networks for LVCSR
-
T. Sainath, A. Mohamed, B. Kingsbury, and B. Ramabhadran, "Deep Convolutional Neural Networks for LVCSR," in IEEE ICASSP, 2013.
-
(2013)
IEEE ICASSP
-
-
Sainath, T.1
Mohamed, A.2
Kingsbury, B.3
Ramabhadran, B.4
-
22
-
-
84906257050
-
Neural network acoustic models for the DARPA RATS program
-
H. Soltau, H.K. Kuo, L. Mangu, G. Saon, and T. Beran, "Neural Network Acoustic Models for the DARPA RATS Program," in ISCA Interspeech, 2013.
-
(2013)
ISCA Interspeech
-
-
Soltau, H.1
Kuo, H.K.2
Mangu, L.3
Saon, G.4
Beran, T.5
-
23
-
-
84937880519
-
Connectionist speaker normalization and adaptation
-
V. Abrash, H. Franco, A. Sankar, and M. Cohen, "Connectionist Speaker Normalization and Adaptation," in ISCA Eurospeech, 1995.
-
(1995)
ISCA Eurospeech
-
-
Abrash, V.1
Franco, H.2
Sankar, A.3
Cohen, M.4
-
24
-
-
34548012893
-
Linear hidden transformations for adaptation of hybrid ANN/HMM models
-
R. Gemello, F. Mana, S. Scanzio, P. Laface, and R. De Mori, "Linear Hidden Transformations for Adaptation of Hybrid ANN/HMM Models," Speech Communication, 2007.
-
(2007)
Speech Communication
-
-
Gemello, R.1
Mana, F.2
Scanzio, S.3
Laface, P.4
De Mori, R.5
-
25
-
-
84890478625
-
Adaptation of context-dependent deep neural networks for automatic speech recognition
-
K. Yao, D. Yu, F. Seide, H. Su, L. Deng, and Y. Gong, "Adaptation of Context-dependent Deep Neural Networks for Automatic Speech Recognition," in IEEE SLT, 2012.
-
(2012)
IEEE SLT
-
-
Yao, K.1
Yu, D.2
Seide, F.3
Su, H.4
Deng, L.5
Gong, Y.6
-
26
-
-
84906225505
-
Rapid and effective speaker adaptation of convolutional neural network based models for speech recognition
-
O. Abdel-Hamid and H. Jiang, "Rapid and Effective Speaker Adaptation of Convolutional Neural Network based Models for Speech Recognition," in ISCA Interspeech, 2013.
-
(2013)
ISCA Interspeech
-
-
Abdel-Hamid, O.1
Jiang, H.2