SCOPUS 정보 검색 플랫폼

IEEE Workshop on Applications of Signal Processing to Audio and Acoustics

Volumn , Issue , 2011, Pages 321-324

Multi-layer perceptron based speech activity detection for speaker verification

(3) Ganapathy, Sriram a Rajan, Padmanabhan b Hermansky, Hynek a

a JOHNS HOPKINS UNIVERSITY (United States)

b INDIAN INSTITUTE OF TECHNOLOGY MADRAS (India)

Author keywords

Frequency Domain Linear Prediction (FDLP); Speaker Verification; Speech Activity Detection

Indexed keywords

AUTOREGRESSIVE MODELLING; CEPSTRAL MEAN SUBTRACTION; CRITICAL BANDS; ENVELOPE ESTIMATION; EQUAL ERROR RATE; FREQUENCY DOMAINS; LINEAR PREDICTION; MINIMUM MEAN SQUARES; MULTI LAYER PERCEPTRON; NOISY ENVIRONMENT; NOISY VERSIONS; POSTERIOR PROBABILITY; REVERBERANT CONDITION; SPEAKER RECOGNITION; SPEAKER VERIFICATION; SPECTRAL FEATURE; SPEECH ACTIVITY; SPEECH ACTIVITY DETECTION; SPEECH FEATURES; SPEECH SIGNALS; SUB-BANDS; TEMPORAL ENVELOPES; TEMPORAL SEGMENTS;

AUDIO ACOUSTICS; AUDIO SIGNAL PROCESSING; FREQUENCY DOMAIN ANALYSIS; SIGNAL DETECTION;

SPEECH RECOGNITION;

EID: 83455246037 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ASPAA.2011.6082323 Document Type: Conference Paper

Times cited : (9)

References (15)

1
- 0017742776
- Voiced-unvoiced-silence detection using the Itakura LPC distance measure
- L. R. Rabiner and M. R. Sambur, "Voiced-unvoiced-silence detection using the Itakura LPC distance measure," Proc. ICASSP, pp. 323-326, 1977.
- (1977) Proc. ICASSP , pp. 323-326
- Rabiner, L.R.¹ Sambur, M.R.²

2
- 0032762471
- A statistical model-based voice activity detection
- J. Sohn, N. S. Kim, and W. Sung, "A statistical model-based voice activity detection," IEEE Signal Process. Letters, Vol. 6 (1), pp. 1-3, 1999.
- (1999) IEEE Signal Process. Letters , vol.6 , Issue.1 , pp. 1-3
- Sohn, J.¹ Kim, N.S.² Sung, W.³

3
- 33646805703
- The 2004 MIT Lincoln laboratory speaker recognition system
- D. Reynolds et al. " The 2004 MIT Lincoln laboratory speaker recognition system", Proc. ICASSP, pp. 177-180, 2005.
- (2005) Proc. ICASSP , pp. 177-180
- Reynolds, D.¹

4
- 0028996871
- Noise estimation techniques for robust speech recognition
- H. G. Hirsch and C. Ehrlicher, "Noise estimation techniques for robust speech recognition," Proc. ICASSP, pp. 153-156, 1995.
- (1995) Proc. ICASSP , pp. 153-156
- Hirsch, H.G.¹ Ehrlicher, C.²

5
- 34047272330
- Discrimination of speech from non-speech based on multi scale spectrotemporal modulations
- N. Mesgarani, M. Slaney, and S. A. Shamma, " Discrimination of speech from non-speech based on multi scale spectrotemporal modulations," IEEE Trans. Audio, Speech and Language Process., Vol. 14(3), pp. 920-930, 2006.
- (2006) IEEE Trans. Audio, Speech and Language Process. , vol.14 , Issue.3 , pp. 920-930
- Mesgarani, N.¹ Slaney, M.² Shamma, S.A.³

6
- 0003573244
- Kluwer Academic Publishers
- H. Boulard and N. Morgan, Connectionist Speech Recognition - A Hybrid Approach, Kluwer Academic Publishers, 1994.
- (1994) Connectionist Speech Recognition - A Hybrid Approach
- Boulard, H.¹ Morgan, N.²

7
- 79952171347
- Temporal envelope compensation for robust phoneme recognition using modulation spectrum
- S. Ganapathy, S. Thomas and H. Hermansky, " Temporal envelope compensation for robust phoneme recognition using modulation spectrum", Jnl. Acoust. Soc. of America, Vol. 128 (6), pp. 3769-3780, 2010.
- (2010) Jnl. Acoust. Soc. of America , vol.128 , Issue.6 , pp. 3769-3780
- Ganapathy, S.¹ Thomas, S.² Hermansky, H.³

8
- 36248966385
- Autoregressive modelling of temporal envelopes
- M. Athineos and D.P.W. Ellis, "Autoregressive modelling of temporal envelopes", IEEE Trans. Signal Proc., Vol. 55 (11), pp. 5237-5245, 2007.
- (2007) IEEE Trans. Signal Proc. , vol.55 , Issue.11 , pp. 5237-5245
- Athineos, M.¹ Ellis, D.P.W.²

9
- 0021645331
- "Speech enhancement using a minimum mean square error short-time spectral amplitude estimator
- Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean square error short-time spectral amplitude estimator,"IEEE Trans. Acoust., Speech, Signal Process., Vol. ASSP- 32, pp. 1109-1121, 1984.
- (1984) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP- 32 , pp. 1109-1121
- Ephraim, Y.¹ Malah, D.²

10
- 84865733857
- Analysis of i-vector Length Normalization in Speaker Recognition Systems
- D. Romero and c.Y. Espy-Wilson, "Analysis of i-vector Length Normalization in Speaker Recognition Systems", Proc. Interspeech, 2011.
- (2011) Proc. Interspeech
- Romero, D.¹ Espy-Wilson, C.Y.²

11
- 0141699847
- "ETSI ES 202 050 v1.1.1 STQ; Distributed speech recognition; Advanced front-end feature extraction algorithm; Compression algorithms", 2002.
- (2002) ETSI ES 202 050 V1.1.1 STQ; Distributed Speech Recognition; Advanced Front-end Feature Extraction Algorithm; Compression Algorithms

12
- 84855201474
- available online
- The NIST 2008 Evaluation Plan, available online (http://www.itl.nist.gov/ iad/mig/tests/ sre/2008/sre08-evalplan-release4.pdf)
- The NIST 2008 Evaluation Plan

13
- 33745533302
- The Development of AMI System for Transcription of Speech in Meetings
- T. Hain et al., " The Development of AMI System for Transcription of Speech in Meetings", Proc. MLMI, pp. 344-356, 2005.
- (2005) Proc. MLMI , pp. 344-356
- Hain, T.¹

14
- 79952169792
- R. Dhillon, S. Bhagat, R. Carvey, and E. Shriberg, The ICSI Meeting Recorder Project, http://www . icsi. berkeley. edu/Speech/mr, 2002.
- (2002) The ICSI Meeting Recorder Project
- Dhillon, R.¹ Bhagat, S.² Carvey, R.³ Shriberg, E.⁴

15
- 80051618525
- Feature Normalization for Speaker Verification in Room Reverberation
- S. Ganapathy, J. Pelecanos and M.K. Omar, " Feature Normalization for Speaker Verification in Room Reverberation", Proc. ICASSP, Prague, 2011.
- Proc. ICASSP, Prague, 2011
- Ganapathy, S.¹ Pelecanos, J.² Omar, M.K.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.