메뉴 건너뛰기




Volumn , Issue , 2013, Pages 7229-7233

A practical, self-adaptive voice activity detector for speaker verification with noisy telephone and microphone data

Author keywords

speaker verification; Voice activity detection

Indexed keywords

LIKELIHOOD RATIOS; MEL-FREQUENCY CEPSTRAL COEFFICIENTS; NOISY CONDITIONS; OPEN SOURCE IMPLEMENTATION; ROBUST SPEAKER VERIFICATION; SPEAKER VERIFICATION; VOICE ACTIVITY DETECTION; VOICE ACTIVITY DETECTORS;

EID: 84890449972     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2013.6639066     Document Type: Conference Paper
Times cited : (114)

References (27)
  • 1
    • 0031238211 scopus 로고    scopus 로고
    • ITU-T recommendation g729 annex b: A silence compression scheme for use with g729 optimized for v.70 digital simultaneous voice and data applications
    • A. Benyassine, E. Schlomot, and H.Y. Su, "ITU-T recommendation g729 annex b: A silence compression scheme for use with g729 optimized for v.70 digital simultaneous voice and data applications," IEEE Communications Magazine, vol. 35, pp. 64-73, 1997.
    • (1997) IEEE Communications Magazine , vol.35 , pp. 64-73
    • Benyassine, A.1    Schlomot, E.2    Su, H.Y.3
  • 2
    • 26844492797 scopus 로고    scopus 로고
    • Speech processing, transmission and quality aspects (STQ); Distributed speech recognition; Advanced front-end feature extraction algorithm; Compression algorithms
    • Etsi
    • ETSI, "Speech processing, transmission and quality aspects (STQ); distributed speech recognition; advanced front-end feature extraction algorithm; compression algorithms," ETSI ES 201 108 Recommendation, 2002.
    • (2002) ETSI ES 201 108 Recommendation
  • 3
    • 0032762471 scopus 로고    scopus 로고
    • A statistical model-based voice activity detection
    • J. Sohn, N.S. Kim, and W. Sung, "A statistical model-based voice activity detection," IEEE Signal Processing Letters, vol. 6, pp. 1-3, 1999.
    • (1999) IEEE Signal Processing Letters , vol.6 , pp. 1-3
    • Sohn, J.1    Kim, N.S.2    Sung, W.3
  • 4
    • 70350125882 scopus 로고    scopus 로고
    • An overview of text-independent speaker recognition: From features to supervectors
    • January
    • T. Kinnunen and H. Li, "An overview of text-independent speaker recognition: from features to supervectors," Speech Communication, vol. 52, no. 1, pp. 12-40, January 2010.
    • (2010) Speech Communication , vol.52 , Issue.1 , pp. 12-40
    • Kinnunen, T.1    Li, H.2
  • 9
    • 84886695822 scopus 로고    scopus 로고
    • A simple and effective speech activity detection algorithm for telephone and microphone speech
    • Atlanta, US, December
    • M. McLaren and D. van Leeuwen, "A simple and effective speech activity detection algorithm for telephone and microphone speech," in Proc. NIST SRE 2011 workshop, Atlanta, US, December 2011.
    • (2011) Proc. NIST SRE 2011 Workshop
    • McLaren, M.1    Van Leeuwen, D.2
  • 10
    • 84865791238 scopus 로고    scopus 로고
    • Comparison of voice activity detectors for interview speech in NIST speaker recognition evaluation
    • Florence, Italy, August
    • H.B. Yu and M.W. Mak, "Comparison of voice activity detectors for interview speech in NIST speaker recognition evaluation," in Proc. Interspeech 2011, Florence, Italy, August 2011.
    • (2011) Proc. Interspeech 2011
    • Yu, H.B.1    Mak, M.W.2
  • 12
    • 79960665866 scopus 로고    scopus 로고
    • The delta-phase spectrum with application to voice activity detection and speaker recognition
    • September
    • I. McCowan, D. Dean, M. McLaren, R. Vogt, and S. Sridharan, "The delta-phase spectrum with application to voice activity detection and speaker recognition," IEEE Trans. Audio, Speech and Language Processing, vol. 19, no. 7, pp. 2026-2038, September 2012.
    • (2012) IEEE Trans. Audio, Speech and Language Processing , vol.19 , Issue.7 , pp. 2026-2038
    • McCowan, I.1    Dean, D.2    McLaren, M.3    Vogt, R.4    Sridharan, S.5
  • 17
    • 47749147613 scopus 로고    scopus 로고
    • Filtering the unknown: Speech activity detection in heterogeneous video collections
    • Antwerp, Belgium
    • M. Huijbregts, C.Wooters, and R. Ordelman, "Filtering the unknown: Speech activity detection in heterogeneous video collections," in Proc. Interspeech 2007, Antwerp, Belgium, 2007, pp. 2925-2928.
    • (2007) Proc. Interspeech 2007 , pp. 2925-2928
    • Huijbregts, M.1    Wooters, C.2    Ordelman, R.3
  • 18
    • 70450161112 scopus 로고    scopus 로고
    • Speaker diarization for meeting room audio
    • Brighton, UK
    • H. Sun, T. L. Nwe, B. Ma, and H. Li, "Speaker diarization for meeting room audio," in Proc. Interspeech 2009, Brighton, UK, 2009, pp. 900-903.
    • (2009) Proc. Interspeech 2009 , pp. 900-903
    • Sun, H.1    Nwe, T.L.2    Ma, B.3    Li, H.4
  • 21
    • 84898068800 scopus 로고    scopus 로고
    • I4U submission to NIST SRE 2012: A large-scale collaborative effort for noise-robust speaker verification
    • R. Saeidi with 30 coauthors, "I4U submission to NIST SRE 2012: A large-scale collaborative effort for noise-robust speaker verification," in Submitted to Interspeech 2013, 2013.
    • (2013) Submitted to Interspeech 2013
    • Saeidi, R.1
  • 22
    • 0018320733 scopus 로고
    • Enhancement of speech corrupted by acoustic noise
    • M. Berouti, R. Schwartz, and J. Makhoul, "Enhancement of speech corrupted by acoustic noise," in Proc. ICASSP 1979, 1979, vol. 4, pp. 208-211.
    • (1979) Proc. ICASSP 1979 , vol.4 , pp. 208-211
    • Berouti, M.1    Schwartz, R.2    Makhoul, J.3
  • 23
    • 84857498666 scopus 로고    scopus 로고
    • Unbiased MMSE-based noise power estimation with low complexity and low tracking delay
    • T. Gerkmann and R.C. Hendriks, "Unbiased MMSE-based noise power estimation with low complexity and low tracking delay," IEEE Trans. Audio, Speech and Language Processing, vol. 20, pp. 1383-1393, 2012.
    • (2012) IEEE Trans. Audio, Speech and Language Processing , vol.20 , pp. 1383-1393
    • Gerkmann, T.1    Hendriks, R.C.2
  • 24
    • 0035396555 scopus 로고    scopus 로고
    • Noise power spectral density estimation based on optimal smoothing and minimum statistics
    • July
    • R. Martin, "Noise power spectral density estimation based on optimal smoothing and minimum statistics," IEEE Trans. on Speech and Audio Processing, vol. 9, no. 5, pp. 504-512, July 2001.
    • (2001) IEEE Trans. on Speech and Audio Processing , vol.9 , Issue.5 , pp. 504-512
    • Martin, R.1
  • 27
    • 84865733857 scopus 로고    scopus 로고
    • Analysis of ivector length normalization in speaker recognition systems
    • Florence, Italy, August
    • D. Garcia-Romero and C. Y. Espy-Wilson, "Analysis of ivector length normalization in speaker recognition systems," in Proc. Interspeech 2011, Florence, Italy, August 2011, pp. 249-252.
    • (2011) Proc. Interspeech 2011 , pp. 249-252
    • Garcia-Romero, D.1    Espy-Wilson, C.Y.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.