Volume 2015-August, 2015, Pages 4500-4504

Improvements to the IBM speech activity detection system for the DARPA RATS program

Author keywords

acoustic features; deep neural networks; robust speech recognition; speech activity detection

Indexed keywords

AUDIO SIGNAL PROCESSING; DEEP NEURAL NETWORKS; PETROLEUM RESERVOIR EVALUATION; RATS; SPEECH; SPEECH COMMUNICATION

EID: 84946073523     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2015.7178822     Document Type: Conference Paper
Times cited: 58

References (23)
  • 1
    • 84879123473
    • The RATS radio traffic collection system
    • K. Walker and S. Strassel, The RATS Radio Traffic Collection System, in ISCA Odyssey, 2012
    • (2012) ISCA Odyssey
    • Walker, K.1    Strassel, S.2
  • 2
    • 84878535284
    • Developing a speech activity detection system for the DARPA RATS program
    • T. Ng et al., Developing a Speech Activity Detection system for the DARPA RATS Program, in ISCA Interspeech, 2012
    • (2012) ISCA Interspeech
    • Ng, T.1
  • 3
    • 84878590831
    • Acoustic and data-driven features for robust speech activity detection
    • S. Thomas et al., Acoustic and Data-driven Features for Robust Speech Activity Detection, in ISCA Interspeech, 2012
    • (2012) ISCA Interspeech
    • Thomas, S.1
  • 4
    • 84906222432
    • The IBM speech activity detection system for the DARPA RATS program
    • G. Saon et al., The IBM Speech Activity Detection System for the DARPA RATS Program, in ISCA Interspeech, 2013
    • (2013) ISCA Interspeech
    • Saon, G.1
  • 5
    • 84906277631
    • Multi-band long-term signal variability features for robust voice activity detection
    • A. Tsiartas et al., Multi-band Long-term Signal Variability Features for Robust Voice Activity Detection, in ISCA Interspeech, 2013
    • (2013) ISCA Interspeech
    • Tsiartas, A.1
  • 6
    • 84906248945
    • All for one: Feature combination for highly channel-degraded speech activity detection
    • M. Graciarena et al., All for One: Feature Combination for Highly Channel-degraded Speech Activity Detection, in ISCA Interspeech, 2013
    • (2013) ISCA Interspeech
    • Graciarena, M.1
  • 7
    • 84873315510
    • Unsupervised speech activity detection using voicing measures and perceptual spectral flux
    • S.O. Sadjadi and J.H. Hansen, Unsupervised Speech Activity Detection using Voicing Measures and Perceptual Spectral Flux, IEEE Signal Processing Letters, 2013
    • (2013) IEEE Signal Processing Letters
    • Sadjadi, S.O.1    Hansen, J.H.2
  • 8
    • 84910088867
    • Improving the speech activity detection for the DARPA RATS phase-3 evaluation
    • J. Ma, Improving the Speech Activity Detection for the DARPA RATS Phase-3 Evaluation, in ISCA Interspeech, 2014
    • (2014) ISCA Interspeech
    • Ma, J.1
  • 12
    • 84890474252
    • Phoneme recognition using spectral envelope and modulation frequency features
    • S. Thomas, S. Ganapathy, and H. Hermansky, Phoneme Recognition using Spectral Envelope and Modulation Frequency Features, in IEEE ICASSP, 2009
    • (2009) IEEE ICASSP
    • Thomas, S.1    Ganapathy, S.2    Hermansky, H.3
  • 13
    • 0033004349
    • Model-based approach to envelope and positive instantaneous frequency estimation of signals with speech applications
    • A. Kumaresan and A. Rao, Model-based Approach to Envelope and Positive Instantaneous Frequency Estimation of Signals with Speech Applications, in The Journal of the Acoustical Society of America, 1999
    • (1999) The Journal of the Acoustical Society of America
    • Kumaresan, A.1    Rao, A.2
  • 16
    • 84946096950
    • Joint training of convolutional and non-convolutional neural networks
    • H. Soltau, G. Saon, and T.N. Sainath, Joint training of convolutional and non-convolutional neural networks, in IEEE ICASSP, 2014
    • (2014) IEEE ICASSP
    • Soltau, H.1    Saon, G.2    Sainath, T.N.3
  • 17
    • 0003913694
    • An efficient implementation of the Patterson-Holdsworth auditory filterbank
    • M. Slaney et al., An Efficient Implementation of the Patterson-Holdsworth Auditory Filterbank, Apple Computer, Perception Group, Tech. Rep, 1993
    • (1993) Apple Computer, Perception Group
    • Slaney, M.1
  • 19
  • 20
    • 84946044944
    • Robust speaker identification using auditory features and computational auditory scene analysis
    • Y. Shao and D.L. Wang, Robust Speaker Identification using Auditory Features and Computational Auditory Scene Analysis, in IEEE ICASSP, 2008
    • (2008) IEEE ICASSP
    • Shao, Y.1    Wang, D.L.2
  • 21
    • 84946093754
    • Speaker verification using simplified and supervised i-vector modeling
    • M. Li, A. Tsiartas, M.V. Segbroeck, and S. Narayanan, Speaker Verification using Simplified and Supervised i-vector Modeling, in IEEE ICASSP, 2013
    • (2013) IEEE ICASSP
    • Li, M.1    Tsiartas, A.2    Segbroeck, M.V.3    Narayanan, S.4
  • 22
    • 84910070752
    • UBM fused total variability modeling for language identification
    • M.V. Segbroeck, R. Travadi, and S. Narayanan, UBM Fused Total Variability Modeling for Language Identification, in ISCA Interspeech, 2014
    • (2014) ISCA Interspeech
    • Segbroeck, M.V.1    Travadi, R.2    Narayanan, S.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.