메뉴 건너뛰기




Volumn , Issue , 2014, Pages 2519-2523

Analyzing convolutional neural networks for speech activity detection in mismatched acoustic conditions

Author keywords

Convolutional neural networks; Neural network adaptation; Speech activity detection

Indexed keywords

NEURAL NETWORKS; SPEECH RECOGNITION;

EID: 84905248050     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2014.6854054     Document Type: Conference Paper
Times cited : (103)

References (26)
  • 1
    • 0033903480 scopus 로고    scopus 로고
    • Robust voice activity detection algorithm for estimating noise spectrum
    • K. Woo, T. Yang, K. Park, and C. Lee, "Robust Voice Activity Detection Algorithm for Estimating Noise Spectrum," IEEE Electronics Letters, 2000.
    • (2000) IEEE Electronics Letters
    • Woo, K.1    Yang, T.2    Park, K.3    Lee, C.4
  • 2
    • 85026719883 scopus 로고    scopus 로고
    • Robust energy normalization using speech/non-speech discriminator for german connected digit recognition
    • R. Chengalvarayan, "Robust Energy Normalization using Speech/Non-speech Discriminator for German Connected Digit Recognition," in ISCA Eurospeech, 1999.
    • (1999) ISCA Eurospeech
    • Chengalvarayan, R.1
  • 3
    • 79851495972 scopus 로고    scopus 로고
    • A silence compression scheme for g.729 optimized for terminals conforming to recommendation v.70
    • Itu-T ITU-T, "A Silence Compression Scheme for G.729 Optimized for Terminals Conforming to Recommendation V.70," in Recommendation G.729-Annex B, 1996.
    • (1996) Recommendation G.729-Annex B
  • 4
    • 84905230151 scopus 로고    scopus 로고
    • Robust voice activity detection using higher-order statistics in the lpc residual domain
    • E. Nemer, R. Goubran, and S. Mahmoud, "Robust Voice Activity Detection using Higher-order Statistics in the LPC Residual Domain," IEEE Electronics Letters, 2000.
    • (2000) IEEE Electronics Letters
    • Nemer, E.1    Goubran, R.2    Mahmoud, S.3
  • 5
    • 84905248283 scopus 로고    scopus 로고
    • The segmentation of multichannel meeting recording for automatic speech recognition
    • J. Dines, J. Vepa, and T. Hain, "The Segmentation of Multichannel Meeting Recording for Automatic Speech Recognition," ISCA ICSLP, 2006.
    • (2006) ISCA ICSLP
    • Dines, J.1    Vepa, J.2    Hain, T.3
  • 6
    • 17344389852 scopus 로고    scopus 로고
    • Robust speech recognition in noisy environments: The 2001 ibm spine evaluation system
    • B. Kingsbury, G. Saon, L. Mangu, M. Padmanabhan, and R. Sarikaya, "Robust Speech Recognition in Noisy Environments: The 2001 IBM SPINE Evaluation System," ISCA ICASSP, 2002.
    • (2002) ISCA ICASSP
    • Kingsbury, B.1    Saon, G.2    Mangu, L.3    Padmanabhan, M.4    Sarikaya, R.5
  • 9
    • 33646064275 scopus 로고    scopus 로고
    • Multi-resolution rasta filtering for tandem-based asr
    • H. Hermansky and P. Fousek, "Multi-resolution RASTA Filtering for TANDEM-based ASR," in ISCA Interspeech, 2005.
    • (2005) ISCA Interspeech
    • Hermansky, H.1    Fousek, P.2
  • 10
    • 84905248277 scopus 로고    scopus 로고
    • Multi-layer perceptron based speech activity detection for speaker verification
    • S. Ganapathy, P. Rajan, and H. Hermansky, "Multi-layer Perceptron based Speech Activity Detection for Speaker Verification," IEEE WASPAA, 2011.
    • (2011) IEEE WASPAA
    • Ganapathy, S.1    Rajan, P.2    Hermansky, H.3
  • 13
    • 84879123473 scopus 로고    scopus 로고
    • The rats radio traffic collection system
    • K.Walker and S. Strassel, "The RATS Radio Traffic Collection System," in ISCA Odyssey, 2012.
    • (2012) ISCA Odyssey
    • Walker, K.1    Strassel, S.2
  • 14
    • 84878535284 scopus 로고    scopus 로고
    • Developing a speech activity detection system for the darpa rats program
    • T. Ng et al., "Developing a Speech Activity Detection system for the DARPA RATS Program," in ISCA Interspeech, 2012.
    • (2012) ISCA Interspeech
    • Ng, T.1
  • 15
    • 84878590831 scopus 로고    scopus 로고
    • Acoustic and data-driven features for robust speech activity detection
    • S. Thomas et al., "Acoustic and Data-driven Features for Robust Speech Activity Detection," in ISCA Interspeech, 2012.
    • (2012) ISCA Interspeech
    • Thomas, S.1
  • 16
    • 84906222432 scopus 로고    scopus 로고
    • The ibm speech activity detection system for the darpa rats program
    • G. Saon et al., "The IBM Speech Activity Detection System for the DARPA RATS Program," in ISCA Interspeech, 2013.
    • (2013) ISCA Interspeech
    • Saon, G.1
  • 17
    • 84906277631 scopus 로고    scopus 로고
    • Multi-band long-term signal variability features for robust voice activity detection
    • A. Tsiartas et al., "Multi-band Long-term Signal Variability Features for Robust Voice Activity Detection," in ISCA Interspeech, 2013.
    • (2013) ISCA Interspeech
    • Tsiartas, A.1
  • 18
    • 84906248945 scopus 로고    scopus 로고
    • All for one: Feature combination for highly channel-degraded speech activity detection
    • M. Graciarena et al., "All for One: Feature Combination for Highly Channel-degraded Speech Activity Detection," in ISCA Interspeech, 2013.
    • (2013) ISCA Interspeech
    • Graciarena, M.1
  • 19
    • 77954761139 scopus 로고    scopus 로고
    • Learning methods for generic object recognition with invariance to pose and lighting
    • Y. Lecun, F. Huang, and L. Bottou, "Learning Methods for Generic Object Recognition with Invariance to Pose and Lighting," in IEEE CVPR, 2004.
    • (2004) IEEE CVPR
    • Lecun, Y.1    Huang, F.2    Bottou, L.3
  • 20
    • 84867605836 scopus 로고    scopus 로고
    • Applying convolutional neural network concepts to hybrid nnhmmmodel for speech recognition
    • O. Abdel-Hamid, A. Mohamed, H. Jiang, and G. Penn, "Applying Convolutional Neural Network concepts to Hybrid NNHMMmodel for Speech Recognition," in IEEE ICASSP, 2012.
    • (2012) IEEE ICASSP
    • Abdel-Hamid, O.1    Mohamed, A.2    Jiang, H.3    Penn, G.4
  • 25
    • 84890478625 scopus 로고    scopus 로고
    • Adaptation of context-dependent deep neural networks for automatic speech recognition
    • K. Yao, D. Yu, F. Seide, H. Su, L.i Deng, and Y. Gong, "Adaptation of Context-dependent Deep Neural Networks for Automatic Speech Recognition," in IEEE SLT, 2012.
    • (2012) IEEE SLT
    • Yao, K.1    Yu, D.2    Seide, F.3    Su, H.4    Deng, L.I.5    Gong, Y.6
  • 26
    • 84906225505 scopus 로고    scopus 로고
    • Rapid and effective speaker adaptation of convolutional neural network basedmodels for speech recognition
    • O. Abdel-Hamid and H. Jiang, "Rapid and Effective Speaker Adaptation of Convolutional Neural Network basedModels for Speech Recognition," in ISCA Interspeech, 2013.
    • (2013) ISCA Interspeech
    • Abdel-Hamid, O.1    Jiang, H.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.