2011, Pages 321-324

Multi-layer perceptron based speech activity detection for speaker verification

Author keywords

Frequency Domain Linear Prediction (FDLP); Speaker Verification; Speech Activity Detection

Indexed keywords

AUTOREGRESSIVE MODELLING; CEPSTRAL MEAN SUBTRACTION; CRITICAL BANDS; ENVELOPE ESTIMATION; EQUAL ERROR RATE; FREQUENCY DOMAINS; LINEAR PREDICTION; MINIMUM MEAN SQUARES; MULTI LAYER PERCEPTRON; NOISY ENVIRONMENT; NOISY VERSIONS; POSTERIOR PROBABILITY; REVERBERANT CONDITION; SPEAKER RECOGNITION; SPEAKER VERIFICATION; SPECTRAL FEATURE; SPEECH ACTIVITY; SPEECH ACTIVITY DETECTION; SPEECH FEATURES; SPEECH SIGNALS; SUB-BANDS; TEMPORAL ENVELOPES; TEMPORAL SEGMENTS;

EID: 83455246037     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ASPAA.2011.6082323     Document Type: Conference Paper
Times cited : (9)

References (15)
  • 1
    • 0017742776
    • Voiced-unvoiced-silence detection using the Itakura LPC distance measure
    • L. R. Rabiner and M. R. Sambur, "Voiced-unvoiced-silence detection using the Itakura LPC distance measure," Proc. ICASSP, pp. 323-326, 1977.
    • (1977) Proc. ICASSP , pp. 323-326
    • Rabiner, L.R.1    Sambur, M.R.2
  • 2
    • 0032762471
    • A statistical model-based voice activity detection
    • J. Sohn, N. S. Kim, and W. Sung, "A statistical model-based voice activity detection," IEEE Signal Process. Letters, Vol. 6 (1), pp. 1-3, 1999.
    • (1999) IEEE Signal Process. Letters , vol.6 , Issue.1 , pp. 1-3
    • Sohn, J.1    Kim, N.S.2    Sung, W.3
  • 3
    • 33646805703
    • The 2004 MIT Lincoln Laboratory speaker recognition system
    • D. Reynolds et al., "The 2004 MIT Lincoln Laboratory speaker recognition system," Proc. ICASSP, pp. 177-180, 2005.
    • (2005) Proc. ICASSP , pp. 177-180
    • Reynolds, D.1
  • 4
    • 0028996871
    • Noise estimation techniques for robust speech recognition
    • H. G. Hirsch and C. Ehrlicher, "Noise estimation techniques for robust speech recognition," Proc. ICASSP, pp. 153-156, 1995.
    • (1995) Proc. ICASSP , pp. 153-156
    • Hirsch, H.G.1    Ehrlicher, C.2
  • 5
    • 34047272330
    • Discrimination of speech from non-speech based on multi scale spectrotemporal modulations
    • N. Mesgarani, M. Slaney, and S. A. Shamma, "Discrimination of speech from non-speech based on multi scale spectrotemporal modulations," IEEE Trans. Audio, Speech and Language Process., Vol. 14 (3), pp. 920-930, 2006.
    • (2006) IEEE Trans. Audio, Speech and Language Process. , vol.14 , Issue.3 , pp. 920-930
    • Mesgarani, N.1    Slaney, M.2    Shamma, S.A.3
  • 7
    • 79952171347
    • Temporal envelope compensation for robust phoneme recognition using modulation spectrum
    • S. Ganapathy, S. Thomas, and H. Hermansky, "Temporal envelope compensation for robust phoneme recognition using modulation spectrum," Jnl. Acoust. Soc. of America, Vol. 128 (6), pp. 3769-3780, 2010.
    • (2010) Jnl. Acoust. Soc. of America , vol.128 , Issue.6 , pp. 3769-3780
    • Ganapathy, S.1    Thomas, S.2    Hermansky, H.3
  • 8
    • 36248966385
    • Autoregressive modelling of temporal envelopes
    • M. Athineos and D. P. W. Ellis, "Autoregressive modelling of temporal envelopes," IEEE Trans. Signal Proc., Vol. 55 (11), pp. 5237-5245, 2007.
    • (2007) IEEE Trans. Signal Proc. , vol.55 , Issue.11 , pp. 5237-5245
    • Athineos, M.1    Ellis, D.P.W.2
  • 9
    • 0021645331
    • Speech enhancement using a minimum mean square error short-time spectral amplitude estimator
    • Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean square error short-time spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Process., Vol. ASSP-32, pp. 1109-1121, 1984.
    • (1984) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-32 , pp. 1109-1121
    • Ephraim, Y.1    Malah, D.2
  • 10
    • 84865733857
    • Analysis of i-vector Length Normalization in Speaker Recognition Systems
    • D. Romero and C. Y. Espy-Wilson, "Analysis of i-vector Length Normalization in Speaker Recognition Systems," Proc. Interspeech, 2011.
    • (2011) Proc. Interspeech
    • Romero, D.1    Espy-Wilson, C.Y.2
  • 12
    • 84855201474
    • The NIST 2008 Evaluation Plan
    • The NIST 2008 Evaluation Plan, available online (http://www.itl.nist.gov/iad/mig/tests/sre/2008/sre08-evalplan-release4.pdf)
    • The NIST 2008 Evaluation Plan
  • 13
    • 33745533302
    • The Development of AMI System for Transcription of Speech in Meetings
    • T. Hain et al., "The Development of AMI System for Transcription of Speech in Meetings," Proc. MLMI, pp. 344-356, 2005.
    • (2005) Proc. MLMI , pp. 344-356
    • Hain, T.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.