Volume , Issue , 2010, Pages 2138-2141

Assessment of single-channel speech enhancement techniques for speaker identification under mismatched conditions

Author keywords

Feature extraction; Gammatone filterbank; Hilbert envelope; Speaker identification; Speech enhancement

Indexed keywords

AUTOMOTIVE INDUSTRY; FEATURE EXTRACTION; FILTER BANKS; LOUDSPEAKERS; MICROPHONES; SIGNAL TO NOISE RATIO; SPEECH COMMUNICATION; SPEECH ENHANCEMENT; SPEECH RECOGNITION;

EID: 79959839465     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (42)

References (23)
  • 1
    • 0026135903
    • Constrained iterative speech enhancement with application to speech recognition
    • J.H.L. Hansen and M. Clements, "Constrained iterative speech enhancement with application to speech recognition," IEEE TSP, vol. 39, no. 4, pp. 795-805.
    • IEEE TSP , vol.39 , Issue.4 , pp. 795-805
    • Hansen, J.H.L.1    Clements, M.2
  • 2
    • 0030371776
    • Overview of speech enhancement techniques for automatic speaker recognition
    • Philadelphia, PA, Oct.
    • J. Ortega-García and J. González-Rodríguez, "Overview of speech enhancement techniques for automatic speaker recognition," in Proc. ICSLP'96, Philadelphia, PA, Oct. 1996, pp. 929-932.
    • (1996) Proc. ICSLP'96 , pp. 929-932
    • Ortega-García, J.1    González-Rodríguez, J.2
  • 3
    • 0031619912
    • Speaker verification in noisy environments with combined spectral subtraction and missing feature theory
    • Seattle, WA, May
    • A. Drygajlo and M. El-Maliki, "Speaker verification in noisy environments with combined spectral subtraction and missing feature theory," in Proc. IEEE ICASSP'98, Seattle, WA, May 1998, vol. 1, pp. 121-124.
    • (1998) Proc. IEEE ICASSP'98 , vol.1 , pp. 121-124
    • Drygajlo, A.1    El-Maliki, M.2
  • 4
    • 0028518091
    • Microphone arrays and speaker identification
    • Oct.
    • Q. Lin, E. Jan, and J. Flanagan, "Microphone arrays and speaker identification," IEEE TSAP, vol. 2, no. 4, pp. 622-629, Oct. 1994.
    • (1994) IEEE TSAP , vol.2 , Issue.4 , pp. 622-629
    • Lin, Q.1    Jan, E.2    Flanagan, J.3
  • 5
    • 0030711164
    • Providing single and multi-channel acoustical robustness to speaker identification systems
    • Munich, Germany, Apr.
    • J. Ortega-García and J. González-Rodríguez, "Providing single and multi-channel acoustical robustness to speaker identification systems," in Proc. IEEE ICASSP'97, Munich, Germany, Apr. 1997, vol. 2, pp. 1107-1110.
    • (1997) Proc. IEEE ICASSP'97 , vol.2 , pp. 1107-1110
    • Ortega-García, J.1    González-Rodríguez, J.2
  • 6
    • 0028420014
    • Integrated models of signal and background with application to speaker identification in noise
    • Apr.
    • R.C. Rose, E.M. Hofstetter, and D.A. Reynolds, "Integrated models of signal and background with application to speaker identification in noise," IEEE TSAP, vol. 2, no. 2, pp. 245-257, Apr. 1994.
    • (1994) IEEE TSAP , vol.2 , Issue.2 , pp. 245-257
    • Rose, R.C.1    Hofstetter, E.M.2    Reynolds, D.A.3
  • 7
    • 0030125219
    • Speaker recognition using HMM composition in noisy environments
    • DOI 10.1006/csla.1996.0007
    • T. Matsui, T. Kanno, and S. Furui, "Speaker recognition using HMM composition in noisy environments," Comput. Speech Lang., vol. 10, no. 2, pp. 107-116, 1996. (Pubitemid 126346924)
    • (1996) Computer Speech and Language , vol.10 , Issue.2 , pp. 107-116
    • Matsui, T.1    Kanno, T.2    Furui, S.3
  • 8
    • 34547499683
    • Incorporating auditory feature uncertainties in robust speaker identification
    • Honolulu, HI, Apr.
    • Y. Shao, S. Srinivasan, and D.L. Wang, "Incorporating auditory feature uncertainties in robust speaker identification," in Proc. IEEE ICASSP'07, Honolulu, HI, Apr. 2007, vol. IV, pp. 277-280.
    • (2007) Proc. IEEE ICASSP'07 , vol.4 , pp. 277-280
    • Shao, Y.1    Srinivasan, S.2    Wang, D.L.3
  • 9
    • 63249107289
    • Robust speaker recognition in noisy conditions
    • Jul.
    • J. Ming, T.J. Hazen, J.R. Glass, and D.A. Reynolds, "Robust speaker recognition in noisy conditions," IEEE TASLP, vol. 15, no. 5, pp. 1711-1723, Jul. 2007.
    • (2007) IEEE TASLP , vol.15 , Issue.5 , pp. 1711-1723
    • Ming, J.1    Hazen, T.J.2    Glass, J.R.3    Reynolds, D.A.4
  • 10
    • 44949154590
    • Gammatone auditory filterbank and independent component analysis for speaker identification
    • Pittsburgh, PA, Sept.
    • Y. Zhang and W.H. Abdulla, "Gammatone auditory filterbank and independent component analysis for speaker identification," in Proc. INTERSPEECH'06, Pittsburgh, PA, Sept. 2006, pp. 2098-2101.
    • (2006) Proc. INTERSPEECH'06 , pp. 2098-2101
    • Zhang, Y.1    Abdulla, W.H.2
  • 11
    • 70449360175
    • Modulation spectral features for robust far-field speaker identification
    • Jan.
    • T.H. Falk and W.-Y. Chan, "Modulation spectral features for robust far-field speaker identification," IEEE TASLP, vol. 18, no. 1, pp. 90-100, Jan. 2010.
    • (2010) IEEE TASLP , vol.18 , Issue.1 , pp. 90-100
    • Falk, T.H.1    Chan, W.-Y.2
  • 12
    • 0029355999
    • Speaker identification and verification using Gaussian mixture speaker models
    • Aug.
    • D. A. Reynolds, "Speaker identification and verification using Gaussian mixture speaker models," Speech Commun., vol. 17, pp. 91-108, Aug. 1995.
    • (1995) Speech Commun. , vol.17 , pp. 91-108
    • Reynolds, D.A.1
  • 13
    • 0028996937
    • Testing with the YOHO CD-ROM voice verification corpus
    • Detroit, MI, May
    • J.P. Campbell, "Testing with the YOHO CD-ROM voice verification corpus," in Proc. IEEE ICASSP'95, Detroit, MI, May 1995, pp. 341-344.
    • (1995) Proc. IEEE ICASSP'95 , pp. 341-344
    • Campbell, J.P.1
  • 14
    • 0027623210
    • Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems
    • Jul.
    • A. Varga and H.J.M. Steeneken, "Assessment for automatic speech recognition: II. NOISEX-92: a database and an experiment to study the effect of additive noise on speech recognition systems," Speech Commun., vol. 12, no. 3, pp. 247-251, Jul. 1993.
    • (1993) Speech Commun. , vol.12 , Issue.3 , pp. 247-251
    • Varga, A.1    Steeneken, H.J.M.2
  • 15
    • 0018320733
    • Enhancement of speech corrupted by acoustic noise
    • M. Berouti, R. Schwartz, and J. Makhoul, "Enhancement of speech corrupted by acoustic noise," in Proc. IEEE ICASSP'79, Washington, DC, Apr. 1979, pp. 208-211. (Pubitemid 9454996)
    • (1979) Proc. IEEE ICASSP'79 , pp. 208-211
    • Berouti, M.1    Schwartz, R.2    Makhoul, J.3
  • 16
    • 0021645331
    • Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator
    • Dec.
    • Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator," IEEE Trans. ASSP, vol. ASSP-32, no. 6, pp. 1109-1121, Dec. 1984.
    • (1984) IEEE Trans. ASSP , vol.ASSP-32 , Issue.6 , pp. 1109-1121
    • Ephraim, Y.1    Malah, D.2
  • 17
    • 0029726517
    • Speech enhancement based on a priori signal to noise estimation
    • Atlanta, GA, May
    • P. Scalart and J. Vieira-Filho, "Speech enhancement based on a priori signal to noise estimation," in Proc. IEEE ICASSP'96, Atlanta, GA, May 1996, pp. 629-632.
    • (1996) Proc. IEEE ICASSP'96 , pp. 629-632
    • Scalart, P.1    Vieira-Filho, J.2
  • 18
    • 0347337999
    • Incorporating the human hearing properties in the signal subspace approach for speech enhancement
    • Nov.
    • F. Jabloun and B. Champagne, "Incorporating the human hearing properties in the signal subspace approach for speech enhancement," IEEE Trans. SAP, vol. 11, no. 6, pp. 700-708, Nov. 2003.
    • (2003) IEEE Trans. SAP , vol.11 , Issue.6 , pp. 700-708
    • Jabloun, F.1    Champagne, B.2
  • 19
    • 0032762471
    • A statistical model-based voice activity detection
    • Jan.
    • J. Sohn, N. Kim, and W. Sung, "A statistical model-based voice activity detection," IEEE Signal Process. Lett., vol. 6, no. 1, pp. 1-3, Jan. 1999.
    • (1999) IEEE Signal Process. Lett. , vol.6 , Issue.1 , pp. 1-3
    • Sohn, J.1    Kim, N.2    Sung, W.3
  • 20
    • 34447092407
    • Subjective comparison and evaluation of speech enhancement algorithms
    • DOI 10.1016/j.specom.2006.12.006, PII S0167639306001920
    • Y. Hu and P.C. Loizou, "Subjective comparison and evaluation of speech enhancement algorithms," Speech Commun., vol. 49, pp. 588-601, 2007. (Pubitemid 47031352)
    • (2007) Speech Communication , vol.49 , Issue.7-8 , pp. 588-601
    • Hu, Y.1    Loizou, P.C.2
  • 21
    • 0000460671
    • Complex sounds and auditory images
    • Y. Cazals, L. Demany, and K. Horner Eds. Oxford: Pergamon Press
    • R.D. Patterson et al., "Complex sounds and auditory images," in Auditory Physiology and Perception, Y. Cazals, L. Demany, and K. Horner Eds. Oxford: Pergamon Press, 1992, pp. 429-446.
    • (1992) Auditory Physiology and Perception , pp. 429-446
    • Patterson, R.D.1
  • 22
    • 11244310452
    • Objective measurement of active speech level
    • ITU-T P.56 Mar.
    • ITU-T P.56, "Objective measurement of active speech level," ITU-T Recommendation, p. 56, Mar. 1993.
    • (1993) ITU-T Recommendation , pp. 56


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.