메뉴 건너뛰기




Volumn , Issue , 2013, Pages 3656-3660

Impact of noise reduction and spectrum estimation on noise robust speaker identification

Author keywords

Mismatched condition; Noise robustness; Robust features; Speaker identification; Speech enhancement

Indexed keywords

ALGORITHMS; NOISE ABATEMENT; SPECTRUM ANALYSIS; SPEECH ENHANCEMENT; SPEECH RECOGNITION;

EID: 84905279854     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (14)

References (41)
  • 1
    • 0029269512 scopus 로고
    • A comparative study of robust linear predictive analysis methods with applications to speaker identification
    • R. P. Ramachandran, M. S. Zilovic, and R. J. Mammone, "A comparative study of robust linear predictive analysis methods with applications to speaker identification, " IEEE Trans. Speech Audio Process., vol. 3, pp. 117-125, 1995.
    • (1995) IEEE Trans. Speech Audio Process. , vol.3 , pp. 117-125
    • Ramachandran, R.P.1    Zilovic, M.S.2    Mammone, R.J.3
  • 2
    • 0030371776 scopus 로고    scopus 로고
    • Overview of speech enhancement techniques for automatic speaker recognition
    • J. Ortega-Garcia and J. Gonzalez-Rodriguez, "Overview of speech enhancement techniques for automatic speaker recognition, " in Proc. ICSLP, 1996, pp. 929-932.
    • (1996) Proc. ICSLP , pp. 929-932
    • Ortega-Garcia, J.1    Gonzalez-Rodriguez, J.2
  • 3
    • 0031619912 scopus 로고    scopus 로고
    • Speaker verification in noisy environments with combined spectral subtraction and missing feature theory
    • A. Drygajlo and M. El-Maliki, "Speaker verification in noisy environments with combined spectral subtraction and missing feature theory, " in Proc. IEEE ICASSP, 1998, pp. 121-124.
    • (1998) Proc. IEEE ICASSP , pp. 121-124
    • Drygajlo, A.1    El-Maliki, M.2
  • 4
    • 0032075135 scopus 로고    scopus 로고
    • Speaker identification based on the use of robust cepstral features obtained from pole-zero transfer functions
    • M. S. Zilovic, R. P. Ramachandran, and R. J. Mammone, "Speaker identification based on the use of robust cepstral features obtained from pole-zero transfer functions, " IEEE Trans. Speech Audio Process., vol. 6, pp. 260-267, 1998.
    • (1998) IEEE Trans. Speech Audio Process. , vol.6 , pp. 260-267
    • Zilovic, M.S.1    Ramachandran, R.P.2    Mammone, R.J.3
  • 5
    • 4143087399 scopus 로고    scopus 로고
    • Text independent speaker verification for real fast-varying noisy environments
    • T. Ganchev, I. Potamitis, N. Fakotakis, and G. Kokkinakis, "Textindependent speaker verification for real fast-varying noisy environments, " Int. J. Speech Technol., vol. 7, pp. 281-292, 2004.
    • (2004) Int. J. Speech Technol. , vol.7 , pp. 281-292
    • Ganchev, T.1    Potamitis, I.2    Fakotakis, N.3    Kokkinakis, G.4
  • 6
    • 22544476848 scopus 로고    scopus 로고
    • Combination of autocorrelation-based features and projection measure technique for speaker identification
    • K.-H. Yuo, T.-H. Hwang, and H.-C. Wang, "Combination of autocorrelation-based features and projection measure technique for speaker identification, " IEEE Trans. Speech Audio Process., vol. 13, pp. 565-574, 2005.
    • (2005) IEEE Trans. Speech Audio Process. , vol.13 , pp. 565-574
    • Yuo, K.-H.1    Hwang, T.-H.2    Wang, H.-C.3
  • 7
    • 34547532413 scopus 로고    scopus 로고
    • Acoustic model enhancement: An adaptation technique for speaker verification under noisy environments
    • A. Moreno-Daniel, J. Nolazco-Flores, T. Wada, and B.-H. Juang, "Acoustic model enhancement: An adaptation technique for speaker verification under noisy environments, " in Proc. IEEE ICASSP, 2007, pp. 289-292.
    • (2007) Proc. IEEE ICASSP , pp. 289-292
    • Moreno-Daniel, A.1    Nolazco-Flores, J.2    Wada, T.3    Juang, B.-H.4
  • 8
    • 34547499683 scopus 로고    scopus 로고
    • Incorporating auditory feature uncertainties in robust speaker identification
    • Y. Shao, S. Srinivasan, and D. Wang, "Incorporating auditory feature uncertainties in robust speaker identification, " in Proc. IEEE ICASSP, 2007, pp. 277-280.
    • (2007) Proc. IEEE ICASSP , pp. 277-280
    • Shao, Y.1    Srinivasan, S.2    Wang, D.3
  • 9
    • 57349117784 scopus 로고    scopus 로고
    • Auditory sparse representation for robust speaker recognition based on tensor structure
    • Q. Wu and L. Zhang, "Auditory sparse representation for robust speaker recognition based on tensor structure, " EURASIP J. Audio Speech and Music Process., vol. 2008, pp. 1-9, 2008.
    • (2008) EURASIP J. Audio Speech and Music Process , vol.2008 , pp. 1-9
    • Wu, Q.1    Zhang, L.2
  • 10
    • 79959826333 scopus 로고    scopus 로고
    • What else is new than the hamming window? Robust mfccs for speaker recognition via multitapering
    • T. Kinnunen, R. Saeidi, J. Sandberg, and M. Hansson-Sandsten, "What else is new than the hamming window? robust mfccs for speaker recognition via multitapering, " in Proc. INTERSPEECH, 2010, pp. 2734-2737.
    • (2010) Proc. INTERSPEECH , pp. 2734-2737
    • Kinnunen, T.1    Saeidi, R.2    Sandberg, J.3    Hansson-Sandsten, M.4
  • 11
    • 78049408631 scopus 로고    scopus 로고
    • Robust speaker identification using an auditory-based feature
    • Q. Li and Y. Huang, "Robust speaker identification using an auditory-based feature, " in Proc. IEEE ICASSP, 2010, pp. 4514-4517.
    • (2010) Proc. IEEE ICASSP , pp. 4514-4517
    • Li, Q.1    Huang, Y.2
  • 12
    • 79959832654 scopus 로고    scopus 로고
    • Extended weighted linear prediction (XLP) analysis of speech and its application to speaker verification in adverse conditions
    • J. Pohjalainen, R. Saeidi, T. Kinnunen, and P. Alku, "Extended weighted linear prediction (XLP) analysis of speech and its application to speaker verification in adverse conditions, " in Proc. INTERSPEECH, 2010, pp. 1477-1480.
    • (2010) Proc. INTERSPEECH , pp. 1477-1480
    • Pohjalainen, J.1    Saeidi, R.2    Kinnunen, T.3    Alku, P.4
  • 13
    • 79959839465 scopus 로고    scopus 로고
    • Assessment of single-channel speech enhancement techniques for speaker identification under mismatched conditions
    • S. O. Sadjadi and J. H. L. Hansen, "Assessment of single-channel speech enhancement techniques for speaker identification under mismatched conditions, " in Proc. INTERSPEECH, 2010, pp. 2138-2141.
    • (2010) Proc. INTERSPEECH , pp. 2138-2141
    • Sadjadi, S.O.1    Hansen, J.H.L.2
  • 14
    • 77952192470 scopus 로고    scopus 로고
    • Temporally weighted linear prediction features for tackling additive noise in speaker verification
    • R. Saeidi, J. Pohjalainen, T. Kinnunen, and P. Alku, "Temporally weighted linear prediction features for tackling additive noise in speaker verification, " IEEE Signal Process. Lett., vol. 17, pp. 599-602, 2010.
    • (2010) IEEE Signal Process. Lett. , vol.17 , pp. 599-602
    • Saeidi, R.1    Pohjalainen, J.2    Kinnunen, T.3    Alku, P.4
  • 16
    • 85073225719 scopus 로고    scopus 로고
    • On the use of asymmetric-shaped tapers for speaker verification using i-vectors
    • M. J. Alam, P. Kenny, and D. O'Shaughnessy, "On the use of asymmetric-shaped tapers for speaker verification using i-vectors, " in Proc. Odyssey 2012, 2012.
    • (2012) Proc. Odyssey 2012
    • Alam, M.J.1    Kenny, P.2    O'Shaughnessy, D.3
  • 17
    • 84863799485 scopus 로고    scopus 로고
    • Spectro-temporal modulation energy based mask for robust speaker identification
    • T.-S. Chi, T.-H. Lin, and C.-C. Hsu, "Spectro-temporal modulation energy based mask for robust speaker identification, " J. Acoust. Soc. Am., vol. 131, pp. EL368-EL374, 2012.
    • (2012) J. Acoust. Soc. Am. , vol.131
    • Chi, T.-S.1    Lin, T.-H.2    Hsu, C.-C.3
  • 18
    • 84890540529 scopus 로고    scopus 로고
    • Feature extraction using 2-D autoregressive models for speaker recognition
    • S. Ganapathy, S. Thomas, and H. Hermansky, "Feature extraction using 2-D autoregressive models for speaker recognition, " in Proc. Odyssey 2012, 2012.
    • (2012) Proc. Odyssey 2012
    • Ganapathy, S.1    Thomas, S.2    Hermansky, H.3
  • 22
    • 0029209272 scopus 로고
    • Robust text-independent speaker identification using Gaussian mixture speaker models
    • Jan
    • D. A. Reynolds and R. C. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models, " IEEE Trans. Speech Audio Process., vol. 3, pp. 72-83, Jan. 1995.
    • (1995) IEEE Trans. Speech Audio Process. , vol.3 , pp. 72-83
    • Reynolds, D.A.1    Rose, R.C.2
  • 23
    • 70350454918 scopus 로고    scopus 로고
    • Analysis and compensation of Lombard speech across noise type and levels with application to in-set/out-of-set speaker recognition
    • J. H. L. Hansen and V. Varadarajan, "Analysis and compensation of Lombard speech across noise type and levels with application to in-set/out-of-set speaker recognition, " IEEE Trans. Audio Speech Lang. Process., vol. 17, pp. 366-378, 2009.
    • (2009) IEEE Trans. Audio Speech Lang. Process. , vol.17 , pp. 366-378
    • Hansen, J.H.L.1    Varadarajan, V.2
  • 27
    • 84873315510 scopus 로고    scopus 로고
    • Unsupervised speech activity detection using voicing measures and perceptual spectral flux
    • S. O. Sadjadi and J. H. L. Hansen, "Unsupervised speech activity detection using voicing measures and perceptual spectral flux, " IEEE Signal Process. Lett., vol. 20, pp. 197-200, 2013.
    • (2013) IEEE Signal Process. Lett. , vol.20 , pp. 197-200
    • Sadjadi, S.O.1    Hansen, J.H.L.2
  • 28
    • 0016939145 scopus 로고
    • Automatic recognition of speakers from their voices
    • Apr
    • B. S. Atal, "Automatic recognition of speakers from their voices, " Proc. of the IEEE, vol. 64, pp. 460-475, Apr. 1976.
    • (1976) Proc. of the IEEE , vol.64 , pp. 460-475
    • Atal, B.S.1
  • 30
    • 0030677489 scopus 로고    scopus 로고
    • Minimum variance distortionless response (MVDR) modeling of voiced speech
    • M. N. Murthi and B. D. Rao, "Minimum variance distortionless response (MVDR) modeling of voiced speech, " in Proc. IEEE ICASSP, 1997, pp. 1687-1690.
    • (1997) Proc. IEEE ICASSP , pp. 1687-1690
    • Murthi, M.N.1    Rao, B.D.2
  • 31
    • 37649022051 scopus 로고    scopus 로고
    • A new perceptually motivated MVDR-based acoustic front-end (PMVDR) for robust automatic speech recognition
    • U. H. Yapanel and J. H. L. Hansen, "A new perceptually motivated MVDR-based acoustic front-end (PMVDR) for robust automatic speech recognition, " Speech Commun., vol. 50, pp. 142-152, 2008.
    • (2008) Speech Commun. , vol.50 , pp. 142-152
    • Yapanel, U.H.1    Hansen, J.H.L.2
  • 33
    • 84906274288 scopus 로고    scopus 로고
    • Online
    • S. Ganapathy. [Online]. Available: http://old-site.clsp.jhu.edu/-sriram/ research/fdlp/featextract.tar.gz.
    • Ganapathy, S.1
  • 34
    • 80051641505 scopus 로고    scopus 로고
    • Hilbert envelope based features for robust speaker identification under reverberant mismatched conditions
    • S. O. Sadjadi and J. H. L. Hansen, "Hilbert envelope based features for robust speaker identification under reverberant mismatched conditions, " in Proc. IEEE ICASSP, 2011, pp. 5448-5451.
    • (2011) Proc. IEEE ICASSP , pp. 5448-5451
    • Sadjadi, S.O.1    Hansen, J.H.L.2
  • 37
    • 0018455310 scopus 로고
    • Suppression of acoustic noise in speech using spectral subtraction
    • S. Boll, "Suppression of acoustic noise in speech using spectral subtraction, " IEEE Trans. Acoust. Speech Signal Process., vol. 27, pp. 113-120, 1979.
    • (1979) IEEE Trans. Acoust. Speech Signal Process. , vol.27 , pp. 113-120
    • Boll, S.1
  • 38
    • 0029726517 scopus 로고    scopus 로고
    • Speech enhancement based on a priori signal to noise estimation
    • P. Scalart and J. V. Filho, "Speech enhancement based on a priori signal to noise estimation, " in Proc. IEEE ICASSP, 1996, pp. 629-632.
    • (1996) Proc. IEEE ICASSP , pp. 629-632
    • Scalart, P.1    Filho, J.V.2
  • 39
    • 0021892216 scopus 로고
    • Speech enhancement using a minimum mean-square error log-spectral amplitude estimator
    • Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error log-spectral amplitude estimator, " IEEE Trans. Acoust. Speech Signal Process., vol. 33, pp. 443-445, 1985.
    • (1985) IEEE Trans. Acoust. Speech Signal Process. , vol.33 , pp. 443-445
    • Ephraim, Y.1    Malah, D.2
  • 40
    • 0035396555 scopus 로고    scopus 로고
    • Noise power spectral density estimation based on optimal smoothing and minimum statistics
    • R. Martin, "Noise power spectral density estimation based on optimal smoothing and minimum statistics, " IEEE Trans. Speech Audio Process., vol. 9, pp. 504-512, 2001.
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , pp. 504-512
    • Martin, R.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.