SCOPUS Information Retrieval Platform

Odyssey 2012 - Speaker and Language Recognition Workshop

Volume , Issue , 2012, Pages 229-235

Feature extraction using 2-D autoregressive models for speaker recognition

Authors (3): Ganapathy, Sriram (a); Thomas, Samuel (a); Hermansky, Hynek (a, b)

a Johns Hopkins University (United States)

b Johns Hopkins University (United States)

Author keywords

[No Author keywords available]

Indexed keywords

EXTRACTION; FEATURE EXTRACTION; FREQUENCY DOMAIN ANALYSIS; LARGE DATASET;

2-D AUTOREGRESSIVE MODELS; AUTO REGRESSIVE MODELS; CEPSTRAL COEFFICIENTS; ROBUST FEATURE EXTRACTIONS; SPEAKER RECOGNITION EVALUATIONS; SPEAKER RECOGNITION SYSTEM; SPEAKER VERIFICATION SYSTEM; TWO DIMENSIONAL (2 D);

SPEECH RECOGNITION;

EID: 85073263765
PISSN: None
EISSN: None
Source Type: Conference Proceeding
DOI: None
Document Type: Conference Paper

Times cited : (24)

References (27)

1
- 63249107289
- Robust speaker recognition in noisy conditions
- Ming, J., Hazen, T.J., Glass, J.R. and Reynolds, D.A., “Robust Speaker Recognition in Noisy Conditions”, IEEE Trans. on Audio Speech Lang. Proc., Vol. 15 (5), 2007, pp. 1711-1723.
- (2007) IEEE Trans. on Audio Speech Lang. Proc. , vol.15 , Issue.5 , pp. 1711-1723
- Ming, J.¹ Hazen, T.J.² Glass, J.R.³ Reynolds, D.A.⁴

2
- 0018455310
- Suppression of acoustic noise in speech using spectral subtraction
- Boll, S.F., “Suppression of acoustic noise in speech using spectral subtraction”, IEEE Trans. Acoust. Speech Signal Process., Vol. 27 (2), Apr. 1979, pp. 113-120.
- (1979) IEEE Trans. Acoust. Speech Signal Process. , vol.27 , Issue.2 , pp. 113-120
- Boll, S.F.¹

3
- 0141699847
- “ETSI ES 202 050 v1.1.1 STQ; Distributed speech recognition; Advanced front-end feature extraction algorithm; Compression algorithms”, 2002.
- (2002) ETSI ES 202 050 V1.1.1 STQ; Distributed Speech Recognition; Advanced Front-End Feature Extraction Algorithm; Compression Algorithms

4
- 0030671924
- Missing data techniques for robust speech recognition
- Cooke, M., Morris, A., Green, P., “Missing data techniques for robust speech recognition”, Proc. ICASSP, 1997, pp. 863-866.
- (1997) Proc. ICASSP , pp. 863-866
- Cooke, M.¹ Morris, A.² Green, P.³

5
- 85073258179
- Feature warping for robust speaker verification
- Pelecanos, J. and Sridharan, S., “Feature warping for robust speaker verification”, Proc. Speaker Odyssey 2001 Speaker Recognition Workshop, Greece, pp. 213-218, 2001.
- (2001) Proc. Speaker Odyssey 2001 Speaker Recognition Workshop , pp. 213-218
- Pelecanos, J.¹ Sridharan, S.²

6
- 0028517164
- RASTA processing of speech
- Hermansky, H. and Morgan, N., “RASTA processing of speech,” IEEE Trans. on Speech and Audio Process., Vol. 2, pp. 578-589, 1994.
- (1994) IEEE Trans. On Speech and Audio Process. , vol.2 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

7
- 84881675408
- Cepstral channel normalization techniques for HMM-based speaker verification
- Rosenberg, A.E., Lee, C. and Soong, F.K., “Cepstral Channel Normalization Techniques for HMM-Based Speaker Verification,” in Proc. ICSLP, pp. 1835-1838, 1994.
- (1994) Proc. ICSLP , pp. 1835-1838
- Rosenberg, A.E.¹ Lee, C.² Soong, F.K.³

8
- 0019053271
- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
- Davis, S. and Mermelstein, P., “Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences”, IEEE Trans. Acoust. Speech Signal Process., Vol. 28 (4), Aug. 1980, pp. 357-366.
- (1980) IEEE Trans. Acoust. Speech Signal Process. , vol.28 , Issue.4 , pp. 357-366
- Davis, S.¹ Mermelstein, P.²

9
- 0015112070
- Speech analysis and synthesis by linear prediction of the speech wave
- Atal, B.S. and Hanauer, S.L., “Speech Analysis and Synthesis by Linear Prediction of the Speech Wave”, J. Acoust. Soc. Am., Vol. 50 (2B), 1971, pp. 637-655.
- (1971) J. Acoust. Soc. Am. , vol.50 , Issue.2B , pp. 637-655
- Atal, B.S.¹ Hanauer, S.L.²

10
- 0016495091
- Linear prediction: A tutorial review
- Makhoul, J., “Linear Prediction: A Tutorial Review”, in Proc. of the IEEE, Vol. 63 (4), pp. 561-580, 1975.
- (1975) Proc. of the IEEE , vol.63 , Issue.4 , pp. 561-580
- Makhoul, J.¹

11
- 0025041264
- Perceptual linear predictive (PLP) analysis of speech
- Hermansky, H., “Perceptual Linear Predictive (PLP) Analysis of Speech,” J. Acoust. Soc. Am., vol. 87, pp. 1738-1752, 1990.
- (1990) J. Acoust. Soc. Am. , vol.87 , pp. 1738-1752
- Hermansky, H.¹

12
- 80051618525
- Feature normalization for speaker verification in room reverberation
- Ganapathy, S., Pelecanos, J. and Omar, M.K., “Feature Normalization for Speaker Verification in Room Reverberation”, Proc. ICASSP, 2011, pp. 4836-4839.
- (2011) Proc. ICASSP , pp. 4836-4839
- Ganapathy, S.¹ Pelecanos, J.² Omar, M.K.³

13
- 27144453376
- PLP2 Autoregressive modeling of auditory-like 2-D spectro-temporal patterns
- Athineos, M., Hermansky, H. and Ellis, D., “PLP2 Autoregressive modeling of auditory-like 2-D spectro-temporal patterns”, Proc. ISCA Tutorial Research Workshop Statistical and Perceptual Audio Processing SAPA04, pp. 37-42, 2004.
- (2004) Proc. ISCA Tutorial Research Workshop Statistical and Perceptual Audio Processing SAPA04 , pp. 37-42
- Athineos, M.¹ Hermansky, H.² Ellis, D.³

14
- 0033004349
- Model-based approach to envelope and positive instantaneous frequency estimation of signals with speech applications
- Kumaresan, R. and Rao, A., “Model-based approach to envelope and positive instantaneous frequency estimation of signals with speech applications,” Journal of Acoustical Society of America, Vol. 105, no. 3, pp. 1912-1924, Mar. 1999.
- (1999) Journal of Acoustical Society of America , vol.105 , Issue.3 , pp. 1912-1924
- Kumaresan, R.¹ Rao, A.²

15
- 36248966385
- Autoregressive modelling of temporal envelopes
- Athineos, M. and Ellis, D., “Autoregressive modelling of temporal envelopes,” IEEE Trans. Signal Proc., Vol. 55, pp. 5237-5245, 2007.
- (2007) IEEE Trans. Signal Proc. , vol.55 , pp. 5237-5245
- Athineos, M.¹ Ellis, D.²

16
- 0016067897
- Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
- Atal, B.S., “Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification”, J. Acoust. Soc. Am., Vol. 55 (6), 1974, pp. 1304-1312.
- (1974) J. Acoust. Soc. Am. , vol.55 , Issue.6 , pp. 1304-1312
- Atal, B.S.¹

17
- 84906248150
- “National Institute of Standards and Technology (NIST),” speech group website, http://www.nist.gov/speech, 2010.
- (2010) Speech Group

18
- 79951609039
- Front-end factor analysis for speaker verification
- Dehak, N., Kenny, P., Dehak, R., Dumouchel, P and Ouellet, P., “Front-End Factor Analysis for Speaker Verification”, IEEE Transactions on Audio, Speech and Language Processing, Vol. 19(4), pp. 788-798, 2011.
- (2011) IEEE Transactions on Audio, Speech and Language Processing , vol.19 , Issue.4 , pp. 788-798
- Dehak, N.¹ Kenny, P.² Dehak, R.³ Dumouchel, P.⁴ Ouellet, P.⁵

19
- 84865733857
- Analysis of i-vector Length Normalization in Speaker Recognition Systems
- Garcia-Romero, D. and Espy-Wilson, C.Y., “Analysis of i-vector Length Normalization in Speaker Recognition Systems”, Proc. Interspeech, 2011.
- (2011) Proc. Interspeech
- Garcia-Romero, D.¹ Espy-Wilson, C.Y.²

20
- 84906266623
- “IARPA BEST Speaker Recognition Challenge 2011”, http://www.nist.gov/itl/iad/mig/best.cfm, 2011
- (2011) IARPA BEST Speaker Recognition Challenge 2011

21
- 0032634932
- Computing the discrete-time analytic signal via FFT
- Marple, L.S., “Computing the Discrete-Time Analytic Signal via FFT”, IEEE Trans. on Acoust., Speech and Sig. Proc., Vol. 47, pp. 2600-2603, 1999.
- (1999) IEEE Trans. On Acoust., Speech and Sig. Proc. , vol.47 , pp. 2600-2603
- Marple, L.S.¹

22
- 67650107416
- Recognition of reverberant speech using frequency domain linear prediction
- Thomas, S., Ganapathy, S. and Hermansky, H., “Recognition of Reverberant Speech Using Frequency Domain Linear Prediction,” IEEE Signal Proc. Letters, Vol. 15, Dec. 2008, pp. 681-684.
- (2008) IEEE Signal Proc. Letters , vol.15 , pp. 681-684
- Thomas, S.¹ Ganapathy, S.² Hermansky, H.³

23
- 0028429724
- Symmetric convolution and the discrete sine and cosine transforms
- Martucci, S.A., “Symmetric convolution and the discrete sine and cosine transforms”, IEEE Trans. Signal Proc., Vol. 42 (5), 1994, pp. 1038-1051.
- (1994) IEEE Trans. Signal Proc. , vol.42 , Issue.5 , pp. 1038-1051
- Martucci, S.A.¹

24
- 0029355999
- Speaker Identification and Verification using Gaussian Mixture Speaker Models
- Reynolds, D., “Speaker Identification and Verification using Gaussian Mixture Speaker Models,” Speech Comm. Vol. 17, Aug. 1995, pp. 91-108.
- (1995) Speech Comm , vol.17 , pp. 91-108
- Reynolds, D.¹

25
- 79551573428
- Hirsch, H.G., “FaNT: Filtering and Noise Adding Tool”, http://dnt.kr.hsnr.de/download.html.
- FaNT: Filtering and Noise Adding Tool
- Hirsch, H.G.¹

26
- 70450144093
- Ph. D. Thesis, University of California, Berkeley
- Gelbart, D., “Ensemble Feature Selection for Multi-Stream Automatic Speech Recognition”, Ph.D. Thesis, University of California, Berkeley, 2008.
- (2008) Ensemble Feature Selection for Multi-Stream Automatic Speech Recognition
- Gelbart, D.¹

27
- 83455246037
- Multi-layer Perceptron Based Speech Activity Detection for Speaker Verification
- Ganapathy, S., Rajan, P. and Hermansky, H., “Multi-layer Perceptron Based Speech Activity Detection for Speaker Verification”, IEEE Workshop on Application of Signal Proc. to Audio and Acoustics, 2011, pp. 321-324.
- (2011) IEEE Workshop on Application of Signal Proc. To Audio and Acoustics , pp. 321-324
- Ganapathy, S.¹ Rajan, P.² Hermansky, H.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.