SCOPUS 정보 검색 플랫폼

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volumn , Issue , 2013, Pages 3656-3660

Impact of noise reduction and spectrum estimation on noise robust speaker identification

(3) Godin, Keith W a Sadjadi, Seyed Omid a Hansen, John H L a

a UNIVERSITY OF TEXAS AT DALLAS (United States)

Author keywords

Mismatched condition; Noise robustness; Robust features; Speaker identification; Speech enhancement

Indexed keywords

ALGORITHMS; NOISE ABATEMENT; SPECTRUM ANALYSIS; SPEECH ENHANCEMENT; SPEECH RECOGNITION;

MISMATCHED CONDITIONS; NOISE REDUCTION ALGORITHMS; NOISE ROBUSTNESS; PREDICTION-BASED TECHNIQUES; ROBUST FEATURES; SPEAKER IDENTIFICATION; SPEECH ENHANCEMENT ALGORITHM; TIME DOMAIN WINDOWING;

TIME DOMAIN ANALYSIS;

EID: 84905279854 PISSN: 2308457X EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (14)

References (41)

1
- 0029269512
- A comparative study of robust linear predictive analysis methods with applications to speaker identification
- R. P. Ramachandran, M. S. Zilovic, and R. J. Mammone, "A comparative study of robust linear predictive analysis methods with applications to speaker identification, " IEEE Trans. Speech Audio Process., vol. 3, pp. 117-125, 1995.
- (1995) IEEE Trans. Speech Audio Process. , vol.3 , pp. 117-125
- Ramachandran, R.P.¹ Zilovic, M.S.² Mammone, R.J.³

2
- 0030371776
- Overview of speech enhancement techniques for automatic speaker recognition
- J. Ortega-Garcia and J. Gonzalez-Rodriguez, "Overview of speech enhancement techniques for automatic speaker recognition, " in Proc. ICSLP, 1996, pp. 929-932.
- (1996) Proc. ICSLP , pp. 929-932
- Ortega-Garcia, J.¹ Gonzalez-Rodriguez, J.²

3
- 0031619912
- Speaker verification in noisy environments with combined spectral subtraction and missing feature theory
- A. Drygajlo and M. El-Maliki, "Speaker verification in noisy environments with combined spectral subtraction and missing feature theory, " in Proc. IEEE ICASSP, 1998, pp. 121-124.
- (1998) Proc. IEEE ICASSP , pp. 121-124
- Drygajlo, A.¹ El-Maliki, M.²

4
- 0032075135
- Speaker identification based on the use of robust cepstral features obtained from pole-zero transfer functions
- M. S. Zilovic, R. P. Ramachandran, and R. J. Mammone, "Speaker identification based on the use of robust cepstral features obtained from pole-zero transfer functions, " IEEE Trans. Speech Audio Process., vol. 6, pp. 260-267, 1998.
- (1998) IEEE Trans. Speech Audio Process. , vol.6 , pp. 260-267
- Zilovic, M.S.¹ Ramachandran, R.P.² Mammone, R.J.³

5
- 4143087399
- Text independent speaker verification for real fast-varying noisy environments
- T. Ganchev, I. Potamitis, N. Fakotakis, and G. Kokkinakis, "Textindependent speaker verification for real fast-varying noisy environments, " Int. J. Speech Technol., vol. 7, pp. 281-292, 2004.
- (2004) Int. J. Speech Technol. , vol.7 , pp. 281-292
- Ganchev, T.¹ Potamitis, I.² Fakotakis, N.³ Kokkinakis, G.⁴

6
- 22544476848
- Combination of autocorrelation-based features and projection measure technique for speaker identification
- K.-H. Yuo, T.-H. Hwang, and H.-C. Wang, "Combination of autocorrelation-based features and projection measure technique for speaker identification, " IEEE Trans. Speech Audio Process., vol. 13, pp. 565-574, 2005.
- (2005) IEEE Trans. Speech Audio Process. , vol.13 , pp. 565-574
- Yuo, K.-H.¹ Hwang, T.-H.² Wang, H.-C.³

7
- 34547532413
- Acoustic model enhancement: An adaptation technique for speaker verification under noisy environments
- A. Moreno-Daniel, J. Nolazco-Flores, T. Wada, and B.-H. Juang, "Acoustic model enhancement: An adaptation technique for speaker verification under noisy environments, " in Proc. IEEE ICASSP, 2007, pp. 289-292.
- (2007) Proc. IEEE ICASSP , pp. 289-292
- Moreno-Daniel, A.¹ Nolazco-Flores, J.² Wada, T.³ Juang, B.-H.⁴

8
- 34547499683
- Incorporating auditory feature uncertainties in robust speaker identification
- Y. Shao, S. Srinivasan, and D. Wang, "Incorporating auditory feature uncertainties in robust speaker identification, " in Proc. IEEE ICASSP, 2007, pp. 277-280.
- (2007) Proc. IEEE ICASSP , pp. 277-280
- Shao, Y.¹ Srinivasan, S.² Wang, D.³

9
- 57349117784
- Auditory sparse representation for robust speaker recognition based on tensor structure
- Q. Wu and L. Zhang, "Auditory sparse representation for robust speaker recognition based on tensor structure, " EURASIP J. Audio Speech and Music Process., vol. 2008, pp. 1-9, 2008.
- (2008) EURASIP J. Audio Speech and Music Process , vol.2008 , pp. 1-9
- Wu, Q.¹ Zhang, L.²

10
- 79959826333
- What else is new than the hamming window? Robust mfccs for speaker recognition via multitapering
- T. Kinnunen, R. Saeidi, J. Sandberg, and M. Hansson-Sandsten, "What else is new than the hamming window? robust mfccs for speaker recognition via multitapering, " in Proc. INTERSPEECH, 2010, pp. 2734-2737.
- (2010) Proc. INTERSPEECH , pp. 2734-2737
- Kinnunen, T.¹ Saeidi, R.² Sandberg, J.³ Hansson-Sandsten, M.⁴

11
- 78049408631
- Robust speaker identification using an auditory-based feature
- Q. Li and Y. Huang, "Robust speaker identification using an auditory-based feature, " in Proc. IEEE ICASSP, 2010, pp. 4514-4517.
- (2010) Proc. IEEE ICASSP , pp. 4514-4517
- Li, Q.¹ Huang, Y.²

12
- 79959832654
- Extended weighted linear prediction (XLP) analysis of speech and its application to speaker verification in adverse conditions
- J. Pohjalainen, R. Saeidi, T. Kinnunen, and P. Alku, "Extended weighted linear prediction (XLP) analysis of speech and its application to speaker verification in adverse conditions, " in Proc. INTERSPEECH, 2010, pp. 1477-1480.
- (2010) Proc. INTERSPEECH , pp. 1477-1480
- Pohjalainen, J.¹ Saeidi, R.² Kinnunen, T.³ Alku, P.⁴

13
- 79959839465
- Assessment of single-channel speech enhancement techniques for speaker identification under mismatched conditions
- S. O. Sadjadi and J. H. L. Hansen, "Assessment of single-channel speech enhancement techniques for speaker identification under mismatched conditions, " in Proc. INTERSPEECH, 2010, pp. 2138-2141.
- (2010) Proc. INTERSPEECH , pp. 2138-2141
- Sadjadi, S.O.¹ Hansen, J.H.L.²

14
- 77952192470
- Temporally weighted linear prediction features for tackling additive noise in speaker verification
- R. Saeidi, J. Pohjalainen, T. Kinnunen, and P. Alku, "Temporally weighted linear prediction features for tackling additive noise in speaker verification, " IEEE Signal Process. Lett., vol. 17, pp. 599-602, 2010.
- (2010) IEEE Signal Process. Lett. , vol.17 , pp. 599-602
- Saeidi, R.¹ Pohjalainen, J.² Kinnunen, T.³ Alku, P.⁴

15
- 84906227935
- Ph.D. dissertation, Drexel Univ. June
- C. wa Maina, "Approximate Bayesian inference for robust speech processing, " Ph.D. dissertation, Drexel Univ., June 2011.
- (2011) Approximate Bayesian Inference for Robust Speech Processing
- Maina, C.W.¹

16
- 85073225719
- On the use of asymmetric-shaped tapers for speaker verification using i-vectors
- M. J. Alam, P. Kenny, and D. O'Shaughnessy, "On the use of asymmetric-shaped tapers for speaker verification using i-vectors, " in Proc. Odyssey 2012, 2012.
- (2012) Proc. Odyssey 2012
- Alam, M.J.¹ Kenny, P.² O'Shaughnessy, D.³

17
- 84863799485
- Spectro-temporal modulation energy based mask for robust speaker identification
- T.-S. Chi, T.-H. Lin, and C.-C. Hsu, "Spectro-temporal modulation energy based mask for robust speaker identification, " J. Acoust. Soc. Am., vol. 131, pp. EL368-EL374, 2012.
- (2012) J. Acoust. Soc. Am. , vol.131
- Chi, T.-S.¹ Lin, T.-H.² Hsu, C.-C.³

18
- 84890540529
- Feature extraction using 2-D autoregressive models for speaker recognition
- S. Ganapathy, S. Thomas, and H. Hermansky, "Feature extraction using 2-D autoregressive models for speaker recognition, " in Proc. Odyssey 2012, 2012.
- (2012) Proc. Odyssey 2012
- Ganapathy, S.¹ Thomas, S.² Hermansky, H.³

19
- 85073231677
- Regularization of all-pole models for speaker verification under additive noise
- C. Hanilci, T. Kinnunen, R. Saeidi, J. Pohjalainen, P. Alku, and F. Ertas, "Regularization of all-pole models for speaker verification under additive noise, " in Proc. Odyssey 2012, 2012.
- (2012) Proc. Odyssey 2012
- Hanilci, C.¹ Kinnunen, T.² Saeidi, R.³ Pohjalainen, J.⁴ Alku, P.⁵ Ertas, F.⁶

20
- 84867590081
- Comparing spectrum estimators in speaker verification under additive noise degradation
- C. Hanilci, T. Kinnunen, R. Saeidi, J. Pohjalainen, P. Alku, F. Ertas, J. Sandberg, and M. Hansson-Sandsten, "Comparing spectrum estimators in speaker verification under additive noise degradation, " in Proc. IEEE ICASSP, 2012, pp. 4769-4772.
- (2012) Proc. IEEE ICASSP , pp. 4769-4772
- Hanilci, C.¹ Kinnunen, T.² Saeidi, R.³ Pohjalainen, J.⁴ Alku, P.⁵ Ertas, F.⁶ Sandberg, J.⁷ Hansson-Sandsten, M.⁸

21
- 84860850285
- Low-variance multitaper MFCC features: A case study in robust speaker verification
- T. Kinnunen, R. Saeidi, F. Sedlak, K. A. Lee, J. Sandberg, M. Hansson-Sandsten, and H. Li, "Low-variance multitaper MFCC features: A case study in robust speaker verification, " IEEE Trans. Audio Speech Lang. Process., vol. 20, pp. 1990-2001, 2012.
- (2012) IEEE Trans. Audio Speech Lang. Process. , vol.20 , pp. 1990-2001
- Kinnunen, T.¹ Saeidi, R.² Sedlak, F.³ Lee, K.A.⁴ Sandberg, J.⁵ Hansson-Sandsten, M.⁶ Li, H.⁷

22
- 0029209272
- Robust text-independent speaker identification using Gaussian mixture speaker models
- Jan
- D. A. Reynolds and R. C. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models, " IEEE Trans. Speech Audio Process., vol. 3, pp. 72-83, Jan. 1995.
- (1995) IEEE Trans. Speech Audio Process. , vol.3 , pp. 72-83
- Reynolds, D.A.¹ Rose, R.C.²

23
- 70350454918
- Analysis and compensation of Lombard speech across noise type and levels with application to in-set/out-of-set speaker recognition
- J. H. L. Hansen and V. Varadarajan, "Analysis and compensation of Lombard speech across noise type and levels with application to in-set/out-of-set speaker recognition, " IEEE Trans. Audio Speech Lang. Process., vol. 17, pp. 366-378, 2009.
- (2009) IEEE Trans. Audio Speech Lang. Process. , vol.17 , pp. 366-378
- Hansen, J.H.L.¹ Varadarajan, V.²

24
- 80051625480
- The NIST year 2010 speaker recognition evaluation plan
- NIST
- NIST, "The NIST year 2010 speaker recognition evaluation plan, " Natl. Inst. of Standards and Tech. (NIST), Natl. Inst. of Standards and Tech. (NIST), Tech. Rep., 2010.
- (2010) Natl. Inst. of Standards and Tech. (NIST), Natl. Inst. of Standards and Tech. (NIST), Tech. Rep.

25
- 84906223012
- Online
- MATLAB code for Filtering and Noise Adding. [Online]. Available: http://www.utdallas.edu/-sadjadi/AddNoisePSO.m.
- MATLAB Code for Filtering and Noise Adding

26
- 80051623374
- A channel-blind system for speaker verification
- N. Dehak, Z. N. Karam, D. A. Reynolds, R. Dehak, W. M. Campbell, and J. R. Glass, "A channel-blind system for speaker verification, " in Proc. IEEE ICASSP, 2011, pp. 4536-4539.
- (2011) Proc. IEEE ICASSP , pp. 4536-4539
- Dehak, N.¹ Karam, Z.N.² Reynolds, D.A.³ Dehak, R.⁴ Campbell, W.M.⁵ Glass, J.R.⁶

27
- 84873315510
- Unsupervised speech activity detection using voicing measures and perceptual spectral flux
- S. O. Sadjadi and J. H. L. Hansen, "Unsupervised speech activity detection using voicing measures and perceptual spectral flux, " IEEE Signal Process. Lett., vol. 20, pp. 197-200, 2013.
- (2013) IEEE Signal Process. Lett. , vol.20 , pp. 197-200
- Sadjadi, S.O.¹ Hansen, J.H.L.²

28
- 0016939145
- Automatic recognition of speakers from their voices
- Apr
- B. S. Atal, "Automatic recognition of speakers from their voices, " Proc. of the IEEE, vol. 64, pp. 460-475, Apr. 1976.
- (1976) Proc. of the IEEE , vol.64 , pp. 460-475
- Atal, B.S.¹

29
- 0003424145
- IEEE Press, Piscataway, NJ
- J. R. Deller, J. H. L. Hansen, and J. G. Proakis, Discrete-Time Processing of Speech Signals. IEEE Press, Piscataway, NJ, 2000, p. pg. 442.
- (2000) Discrete-Time Processing of Speech Signals , pp. 442
- Deller, J.R.¹ Hansen, J.H.L.² Proakis, J.G.³

30
- 0030677489
- Minimum variance distortionless response (MVDR) modeling of voiced speech
- M. N. Murthi and B. D. Rao, "Minimum variance distortionless response (MVDR) modeling of voiced speech, " in Proc. IEEE ICASSP, 1997, pp. 1687-1690.
- (1997) Proc. IEEE ICASSP , pp. 1687-1690
- Murthi, M.N.¹ Rao, B.D.²

31
- 37649022051
- A new perceptually motivated MVDR-based acoustic front-end (PMVDR) for robust automatic speech recognition
- U. H. Yapanel and J. H. L. Hansen, "A new perceptually motivated MVDR-based acoustic front-end (PMVDR) for robust automatic speech recognition, " Speech Commun., vol. 50, pp. 142-152, 2008.
- (2008) Speech Commun. , vol.50 , pp. 142-152
- Yapanel, U.H.¹ Hansen, J.H.L.²

32
- 80051656878
- Survey and evaluation of acoustic features for speaker recognition
- A. Lawson, P. Vabishchevich, M. Huggins, P. Ardis, B. Battles, and A. Stauffer, "Survey and evaluation of acoustic features for speaker recognition, " in Proc. IEEE ICASSP, 2011, pp. 5444-5447.
- (2011) Proc. IEEE ICASSP , pp. 5444-5447
- Lawson, A.¹ Vabishchevich, P.² Huggins, M.³ Ardis, P.⁴ Battles, B.⁵ Stauffer, A.⁶

33
- 84906274288
- Online
- S. Ganapathy. [Online]. Available: http://old-site.clsp.jhu.edu/-sriram/ research/fdlp/featextract.tar.gz.
- Ganapathy, S.¹

34
- 80051641505
- Hilbert envelope based features for robust speaker identification under reverberant mismatched conditions
- S. O. Sadjadi and J. H. L. Hansen, "Hilbert envelope based features for robust speaker identification under reverberant mismatched conditions, " in Proc. IEEE ICASSP, 2011, pp. 5448-5451.
- (2011) Proc. IEEE ICASSP , pp. 5448-5451
- Sadjadi, S.O.¹ Hansen, J.H.L.²

35
- 77249096360
- Multitaper estimation of frequency-warped cepstra with application to speaker verification
- J. Sandberg, M. Hansson-Sandsten, T. Kinnunen, R. Saeidi, P. Flandrin, and P. Borgnat, "Multitaper estimation of frequency-warped cepstra with application to speaker verification, " IEEE Signal Process. Lett., vol. 17, pp. 343-346, 2010.
- (2010) IEEE Signal Process. Lett. , vol.17 , pp. 343-346
- Sandberg, J.¹ Hansson-Sandsten, M.² Kinnunen, T.³ Saeidi, R.⁴ Flandrin, P.⁵ Borgnat, P.⁶

36
- 84906274595
- Online
- MATLAB code for Multi-taper Spectrum Estimation. [Online]. Available: http://cs.joensuu.fi/pages/tkinnu/multitaper/multitaperspectrumfunctions.zip.
- MATLAB Code for Multi-taper Spectrum Estimation

37
- 0018455310
- Suppression of acoustic noise in speech using spectral subtraction
- S. Boll, "Suppression of acoustic noise in speech using spectral subtraction, " IEEE Trans. Acoust. Speech Signal Process., vol. 27, pp. 113-120, 1979.
- (1979) IEEE Trans. Acoust. Speech Signal Process. , vol.27 , pp. 113-120
- Boll, S.¹

38
- 0029726517
- Speech enhancement based on a priori signal to noise estimation
- P. Scalart and J. V. Filho, "Speech enhancement based on a priori signal to noise estimation, " in Proc. IEEE ICASSP, 1996, pp. 629-632.
- (1996) Proc. IEEE ICASSP , pp. 629-632
- Scalart, P.¹ Filho, J.V.²

39
- 0021892216
- Speech enhancement using a minimum mean-square error log-spectral amplitude estimator
- Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error log-spectral amplitude estimator, " IEEE Trans. Acoust. Speech Signal Process., vol. 33, pp. 443-445, 1985.
- (1985) IEEE Trans. Acoust. Speech Signal Process. , vol.33 , pp. 443-445
- Ephraim, Y.¹ Malah, D.²

40
- 0035396555
- Noise power spectral density estimation based on optimal smoothing and minimum statistics
- R. Martin, "Noise power spectral density estimation based on optimal smoothing and minimum statistics, " IEEE Trans. Speech Audio Process., vol. 9, pp. 504-512, 2001.
- (2001) IEEE Trans. Speech Audio Process. , vol.9 , pp. 504-512
- Martin, R.¹

41
- 48349113750
- Online
- M. Brookes et al. VOICEBOX: Speech procesing toolbox for MATLAB. [Online]. Available: http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html.
- VOICEBOX: Speech Procesing Toolbox for MATLAB
- Brookes, M.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.