메뉴 건너뛰기




Volumn , Issue , 2009, Pages 2549-2553

A simple but efficient real-time voice activity detection algorithm

Author keywords

[No Author keywords available]

Indexed keywords

APPLICATION AREA; AUDIO PROCESSING; FRONT-END PROCESSING; NOISE CONDITIONS; ONLINE PROCESSING; PARAMETER-TUNING; PROCESSING METHOD; REAL APPLICATIONS; REAL-TIME VOICE; SPECTRAL FLATNESS; SPEECH CORPORA; VOICE ACTIVITY DETECTION; VOICE ACTIVITY DETECTORS;

EID: 78049393940     PISSN: 22195491     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (88)

References (18)
  • 1
    • 0031238211 scopus 로고    scopus 로고
    • ITU-T recommendation G.729 annex B: A silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applications
    • A. Benyassine, E. Shlomot, H. Y. Su, D. Massaloux, C. Lamblin and J. P. Petit, "ITU-T Recommendation G.729 Annex B: a silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applications," IEEE Communications Magazine 35, pp. 64-73, 1997.
    • (1997) IEEE Communications Magazine , vol.35 , pp. 64-73
    • Benyassine, A.1    Shlomot, E.2    Su, H.Y.3    Massaloux, D.4    Lamblin, C.5    Petit, J.P.6
  • 2
    • 0024620890 scopus 로고
    • A robust algorithm for accurate end pointing of speech
    • M. H. Savoji, "A robust algorithm for accurate end pointing of speech," Speech Communication, pp. 45-60, 1989.
    • (1989) Speech Communication , pp. 45-60
    • Savoji, M.H.1
  • 3
    • 17344389852 scopus 로고    scopus 로고
    • Robust speech recognition in noisy environments: The 2001 IBM SPINE evaluation system
    • B. Kingsbury, G. Saon, L. Mangu, M. Padmanabhan and R. Sarikaya, "Robust speech recognition in noisy environments: The 2001 IBM SPINE evaluation system," Proc. ICASSP, 1, pp. 53-56, 2002.
    • (2002) Proc. ICASSP , vol.1 , pp. 53-56
    • Kingsbury, B.1    Saon, G.2    Mangu, L.3    Padmanabhan, M.4    Sarikaya, R.5
  • 5
    • 11144286121 scopus 로고    scopus 로고
    • The spectral autocorrelation peak valley ratio (SAPVR) - A usable speech measure employed as a co-channel detection system
    • R. E. Yantorno, K. L. Krishnamachari and J. M. Lovekin, "The spectral autocorrelation peak valley ratio (SAPVR) - A usable speech measure employed as a co-channel detection system," Proc. IEEE Int. Workshop Intell. Signal Process. 2001.
    • (2001) Proc. IEEE Int. Workshop Intell. Signal Process
    • Yantorno, R.E.1    Krishnamachari, K.L.2    Lovekin, J.M.3
  • 6
    • 0036476655 scopus 로고    scopus 로고
    • Speech pause detection for noise spectrum estimation by tracking power envelope dynamics
    • M. Marzinzik and B. Kollmeier, "Speech pause detection for noise spectrum estimation by tracking power envelope dynamics," IEEE Trans. Speech Audio Process, 10, pp. 109-118, 2002.
    • (2002) IEEE Trans. Speech Audio Process , vol.10 , pp. 109-118
    • Marzinzik, M.1    Kollmeier, B.2
  • 7
    • 0442317754 scopus 로고    scopus 로고
    • ETSI ES 202 050 V 1.1.3
    • ETSI standard document, ETSI ES 202 050 V 1.1.3., 2003.
    • (2003) ETSI Standard Document
  • 8
    • 27644475276 scopus 로고    scopus 로고
    • An improved voice activity detection using higher order statistics
    • K. Li, N. S. Swamy and M. O. Ahmad, "An improved voice activity detection using higher order statistics," IEEE Trans. Speech Audio Process., 13, pp. 965-974, 2005.
    • (2005) IEEE Trans. Speech Audio Process , vol.13 , pp. 965-974
    • Li, K.1    Swamy, N.S.2    Ahmad, M.O.3
  • 9
    • 0033693061 scopus 로고    scopus 로고
    • Speech/non-speech classification using multiple features for robust endpoint detection
    • W. H. Shin, "Speech/non-speech classification using multiple features for robust endpoint detection," ICASSP, 2000.
    • (2000) ICASSP
    • Shin, W.H.1
  • 10
    • 0034275540 scopus 로고    scopus 로고
    • Word boundary detection with mel scale frequency bank in noisy environment
    • G. D. Wuand and C. T. Lin, "Word boundary detection with mel scale frequency bank in noisy environment," IEEE Trans. Speechand Audio Processing, 2000.
    • (2000) IEEE Trans. Speechand Audio Processing
    • Wuand, G.D.1    Lin, C.T.2
  • 11
    • 85009070560 scopus 로고    scopus 로고
    • Noise robust real world spoken dialogue system using GMM based rejection of unintended inputs
    • A. Lee, K. Nakamura, R. Nisimura, H. Saruwatari and K. Shikano, "Noise robust real world spoken dialogue system using GMM based rejection of unintended inputs," Interspeech, pp. 173-176, 2004.
    • (2004) Interspeech , pp. 173-176
    • Lee, A.1    Nakamura, K.2    Nisimura, R.3    Saruwatari, H.4    Shikano, K.5
  • 12
    • 0032762471 scopus 로고    scopus 로고
    • A statistical model-based voice activity detection
    • J. Sohn, N. S. Kim and W. Sung, "A statistical model-based voice activity detection," IEEE Signal Process. Lett., pp. 1-3, 1999.
    • (1999) IEEE Signal Process. Lett. , pp. 1-3
    • Sohn, J.1    Kim, N.S.2    Sung, W.3
  • 13
    • 84887027253 scopus 로고    scopus 로고
    • Minimum mean squared error A posteriori estimation of high variance vehicular noise
    • Istanbul, Turkey, June
    • B. Lee and M. Hasegawa-Johnson, "Minimum Mean Squared Error A Posteriori Estimation of High Variance Vehicular Noise," in Proc. Biennial on DSP for In-Vehicle and Mobile Systems, Istanbul, Turkey, June 2007.
    • (2007) Proc. Biennial on DSP for In-vehicle and Mobile Systems
    • Lee, B.1    Hasegawa-Johnson, M.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.