SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn 2015-August, Issue , 2015, Pages 4500-4504

Improvements to the IBM speech activity detection system for the DARPA RATS program

(4) Thomas, Samuel a Saon, George a Van Segbroeck, Maarten b Narayanan, Shrikanth S b

a IBM T J WATSON RESEARCH CENTER (United States)

b UNIVERSITY OF SOUTHERN CALIFORNIA (United States)

Author keywords

acoustic features; deep neural networks; robust speech recognition; Speech activity detection

Indexed keywords

AUDIO SIGNAL PROCESSING; DEEP NEURAL NETWORKS; PETROLEUM RESERVOIR EVALUATION; RATS; SPEECH; SPEECH COMMUNICATION;

ACOUSTIC FEATURES; EQUAL ERROR RATE; PHASE 2; PROGRAM EVALUATION; ROBUST SPEECH RECOGNITION; SPEECH ACTIVITY DETECTIONS; THIRD PHASE; TIME-FREQUENCY REPRESENTATIONS;

SPEECH RECOGNITION;

EID: 84946073523 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2015.7178822 Document Type: Conference Paper

Times cited : (58)

References (23)

1
- 84879123473
- The RATS radio traffic collection system
- K.Walker and S. Strassel, The RATS Radio Traffic Collection System, in ISCA Odyssey, 2012
- (2012) ISCA Odyssey
- Walker, K.¹ Strassel, S.²

2
- 84878535284
- Developing a speech activity detection system for the DARPA RATS program
- T. Ng et al., Developing a Speech Activity Detection system for the DARPA RATS Program, in ISCA Interspeech, 2012
- (2012) ISCA Interspeech
- Ng, T.¹

3
- 84878590831
- Acoustic and data-driven features for robust speech activity detection
- S. Thomas et al., Acoustic and Data-driven Features for Robust Speech Activity Detection, in ISCA Interspeech, 2012
- (2012) ISCA Interspeech
- Thomas, S.¹

4
- 84906222432
- The IBM speech activity detection system for the DARPA RATS program
- G. Saon et al., The IBM Speech Activity Detection System for the DARPA RATS Program, in ISCA Interspeech, 2013
- (2013) ISCA Interspeech
- Saon, G.¹

5
- 84906277631
- Multi-band long-term signal variability features for robust voice activity detection
- A. Tsiartas et al., Multi-band Long-term Signal Variability Features for Robust Voice Activity Detection, in ISCA Interspeech, 2013
- (2013) ISCA Interspeech
- Tsiartas, A.¹

6
- 84906248945
- All for one: Feature combination for highly channel-degraded speech activity detection
- M. Graciarena et al., All for One: Feature Combination for Highly Channel-degraded Speech Activity Detection, in ISCA Interspeech, 2013
- (2013) ISCA Interspeech
- Graciarena, M.¹

7
- 84873315510
- Unsupervised speech activity detection using voicing measures and perceptual spectral flux
- S.O. Sadjadi and J.H. Hansen, Unsupervised Speech Activity Detection using Voicing Measures and Perceptual Spectral Flux, IEEE Signal Processing Letters, 2013
- (2013) IEEE Signal Processing Letters
- Sadjadi, S.O.¹ Hansen, J.H.²

8
- 84910088867
- Improving the speech activity detection for the DARPA RATS phase-3 evaluation
- J. Ma, Improving the Speech Activity Detection for the DARPA RATS Phase-3 Evaluation, in ISCA Interspeech, 2014
- (2014) ISCA Interspeech
- Ma, J.¹

9
- 84946097042
- The DARPA RATS phase 3 evaluation
- H. Goldberg and D. Longfellow, The DARPA RATS Phase 3 Evaluation, in DARPA RATS PI Meeting, 2014
- (2014) DARPA RATS PI Meeting
- Goldberg, H.¹ Longfellow, D.²

10
- 42549139762
- MVA processing of speech features
- C.-P. Chen and J. Bilmes, MVA Processing of Speech Features, IEEE Transactions on Audio, Speech, and Language Processing, 2007
- (2007) IEEE Transactions on Audio, Speech, and Language Processing
- Chen, C.-P.¹ Bilmes, J.²

11
- 0036214787
- YIN, a fundamental frequency estimator for speech and music
- A. de Cheveigne and H. Kawahara, YIN, a Fundamental Frequency Estimator for Speech and Music, The Journal of the Acoustical Society of America, 2002
- (2002) The Journal of the Acoustical Society of America
- De Cheveigne, A.¹ Kawahara, H.²

12
- 84890474252
- Phoneme recognition using spectral envelope and modulation frequency features
- S. Thomas, S. Ganapathy, and H. Hermansky, Phoneme Recognition using Spectral Envelope and Modulation Frequency Features, in IEEE ICASSP, 2009
- (2009) IEEE ICASSP
- Thomas, S.¹ Ganapathy, S.² Hermansky, H.³

13
- 0033004349
- Model-based approach to envelope and positive instantaneous frequency estimation of signals with speech applications
- A. Kumerasan and A. Rao, Model-based Approach to Envelope and Positive Instantaneous Frequency Estimation of Signals with Speech Applications, in The Journal of the Acoustical Society of America, 1999
- (1999) The Journal of the Acoustical Society of America
- Kumerasan, A.¹ Rao, A.²

14
- 23744508888
- Multiresolution spectrotemporal analysis of complex sounds
- T. Chi, P. Ru, and S. Shamma, Multiresolution Spectrotemporal Analysis of Complex Sounds, in The Journal of the Acoustical Society of America, 2005
- (2005) The Journal of the Acoustical Society of America
- Chi, T.¹ Ru, P.² Shamma, S.³

15
- 0032203257
- Gradient based learning applied to document recognition
- Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, Gradient based Learning applied to Document Recognition, Proceedings of the IEEE, 1998
- (1998) Proceedings of the IEEE
- LeCun, Y.¹ Bottou, L.² Bengio, Y.³ Haffner, P.⁴

16
- 84946096950
- Joint training of convolutional and non-convolutional nueral networks
- H. Soltau, G. Saon, and T.N. Sainath, Joint training of convolutional and non-convolutional nueral networks, in IEEE ICASSP, 2014
- (2014) IEEE ICASSP
- Soltau, H.¹ Saon, G.² Sainath, T.N.³

17
- 0003913694
- An efficient implementation of the patterson-holdsworth auditory filterbank
- Tech. Rep
- M. Slaney et al., An Efficient Implementation of the Patterson-Holdsworth Auditory Filterbank, Apple Computer, Perception Group, Tech. Rep, 1993
- (1993) Apple Computer, Perception Group
- Slaney, M.¹

18
- 0035209688
- A human nonlinear cochlear filterbank
- E.A. Lopez-Poveda and R. Meddis, A Human Nonlinear Cochlear Filterbank, The Journal of the Acoustical Society of America, 2001
- (2001) The Journal of the Acoustical Society of America
- Lopez-Poveda, E.A.¹ Meddis, R.²

19
- 84946045145
- An auditorybased feature for robust speech recognition
- Y. Shao, Z. Jin, D.L. Wang, and S. Srinivasan, An Auditorybased Feature for Robust Speech Recognition, in IEEE ICASSP, 2009
- (2009) IEEE ICASSP
- Shao, Y.¹ Jin, Z.² Wang, D.L.³ Srinivasan, S.⁴

20
- 84946044944
- Robust speaker identification using auditory features and computational auditory scene analysis
- Y. Shao and D.L. Wang, Robust Speaker Identification using Auditory Features and Computational Auditory Scene Analysis, in IEEE ICASSP, 2008
- (2008) IEEE ICASSP
- Shao, Y.¹ Wang, D.L.²

21
- 84946093754
- Speaker verification using simplified and supervised i-vector modeling
- M. Li, A. Tsiartas, M.V. Segbroeck, and S. Narayanan, Speaker Verification using Simplified and Supervised i-vector Modeling, in IEEE ICASSP, 2013
- (2013) IEEE ICASSP
- Li, M.¹ Tsiartas, A.² Segbroeck, M.V.³ Narayanan, S.⁴

22
- 84910070752
- UBM fused total variability modeling for language identification
- M.V. Segbroeck, R. Travadi, and S. Narayanan, UBM Fused Total Variability Modeling for Language Identification, in ISCA Interspeech, 2014
- (2014) ISCA Interspeech
- Segbroeck, M.V.¹ Travadi, R.² Narayanan, S.³

23
- 84906257050
- Neural network acoustic models for the DARPA RATS program
- H. Soltau, H.K. Kuo, L. Mangu, G. Saon, and T. Beran, Neural Network Acoustic Models for the DARPA RATS Program, in ISCA Interspeech, 2013
- (2013) ISCA Interspeech
- Soltau, H.¹ Kuo, H.K.² Mangu, L.³ Saon, G.⁴ Beran, T.⁵

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.