메뉴 건너뛰기




Volumn 19, Issue 1, 2011, Pages 196-205

Robust speaker recognition using denoised vocal source and vocal tract features

Author keywords

Robust parameter estimation; source tract features; speaker recognition; spectral subtraction

Indexed keywords

ADDITIVE WHITE GAUSSIAN NOISE; ENHANCEMENT TECHNIQUES; EQUAL ERROR RATE; FEATURE ESTIMATION; GAUSSIAN MIXTURE MODEL; IDENTIFICATION ERROR RATE; LOW SIGNAL-TO-NOISE RATIO; NEW APPROACHES; NOISY ENVIRONMENT; NOISY SPEECH; ROBUST PARAMETER ESTIMATION; ROBUST RECOGNITION; ROBUST SPEAKER RECOGNITION; SIMULATION RESULT; SOURCE-TRACT FEATURES; SPEAKER RECOGNITION; SPECIFIC COMPONENT; SPECTRAL SUBTRACTIONS; VOCAL-TRACTS;

EID: 77957744636     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2010.2045800     Document Type: Article
Times cited : (50)

References (30)
  • 2
    • 33947384963 scopus 로고    scopus 로고
    • Audio-visual biometrics
    • Nov
    • P. S. Aleksic and A. K. Katsaggelos, "Audio-visual biometrics", Proc. IEEE, vol. 94, no. 11, pp. 2025-2044, Nov. 2006.
    • (2006) Proc. IEEE , vol.94 , Issue.11 , pp. 2025-2044
    • Aleksic, P.S.1    Katsaggelos, A.K.2
  • 3
    • 77957726104 scopus 로고    scopus 로고
    • Personalize mobile access by speaker authentication
    • D. D. Zhang, Ed. New York: Springer
    • K. Chen, "Personalize mobile access by speaker authentication", in Biometric Solutions: For Authentication in an E-World, D. D. Zhang, Ed. New York: Springer, 2002.
    • (2002) Biometric Solutions: For Authentication in an E-World
    • Chen, K.1
  • 4
    • 0031233424 scopus 로고    scopus 로고
    • Speaker recognition: A tutorial
    • Sep
    • J. P. Campbell, "Speaker recognition: A tutorial", Proc. IEEE, vol. 85, no. 9, pp. 1437-1462, Sep. 1997.
    • (1997) Proc. IEEE , vol.85 , Issue.9 , pp. 1437-1462
    • Campbell, J.P.1
  • 5
    • 0029355999 scopus 로고
    • Speaker identification and verification using gaussian mixture speaker models
    • D. A. Reynolds, "Speaker identification and verification using gaussian mixture speaker models", SpeechCommun., vol. 17, pp. 91-108, 1995.
    • (1995) SpeechCommun. , vol.17 , pp. 91-108
    • Reynolds, D.A.1
  • 6
    • 85075924869 scopus 로고    scopus 로고
    • Comparison of background normalization methods for text-independent speaker verification
    • D. A. Reynolds, "Comparison of background normalization methods for text-independent speaker verification", in Proc. Eurospeech, 1997, vol. 2, pp. 963-966.
    • (1997) Proc. Eurospeech , vol.2 , pp. 963-966
    • Reynolds, D.A.1
  • 7
    • 64449086223 scopus 로고    scopus 로고
    • Discrimination power of vocal source and vocal tract related features for speaker segmentation
    • Aug
    • W. N. Chan, N. Zheng, and T. Lee, "Discrimination power of vocal source and vocal tract related features for speaker segmentation", IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 6, pp. 1884-1892, Aug. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.6 , pp. 1884-1892
    • Chan, W.N.1    Zheng, N.2    Lee, T.3
  • 9
    • 0031224204 scopus 로고    scopus 로고
    • A study of harmonic features for speaker recognition
    • B. Imperl, Z. Kacic, and B. Horvat, "A study of harmonic features for speaker recognition", Speech Commun., vol. 22, no. 4, pp. 385-402, 1997.
    • (1997) Speech Commun. , vol.22 , Issue.4 , pp. 385-402
    • Imperl, B.1    Kacic, Z.2    Horvat, B.3
  • 10
    • 0032595183 scopus 로고    scopus 로고
    • Modeling of the glottal flow derivative waveform with application to speaker identification
    • Sep
    • M. D. Plumpe, T. F. Quatieri, and D. A. Reynolds, "Modeling of the glottal flow derivative waveform with application to speaker identification", IEEE Trans. Speech Audio Process, vol. 17, no. 5, pp. 569-586, Sep. 1999.
    • (1999) IEEE Trans. Speech Audio Process , vol.17 , Issue.5 , pp. 569-586
    • Plumpe, M.D.1    Quatieri, T.F.2    Reynolds, D.A.3
  • 11
    • 0029356550 scopus 로고
    • Usefulness of the LPC-residue in text-independent speaker verification
    • P. Thevenaz and H. Hugli, "Usefulness of the LPC-residue in text-independent speaker verification", SpeechCommun., vol. 17, no. 1-2, pp. 145-157, 1995.
    • (1995) SpeechCommun. , vol.17 , Issue.1-2 , pp. 145-157
    • Thevenaz, P.1    Hugli, H.2
  • 12
    • 4544352735 scopus 로고    scopus 로고
    • Improvement of speaker recognition by combining residual and prosodic features with acoustic features
    • S.-H. Chen and H.-C. Wang, "Improvement of speaker recognition by combining residual and prosodic features with acoustic features", in Proc. ICASSP, 2004, pp. 93-96.
    • (2004) Proc. ICASSP , pp. 93-96
    • Chen, S.-H.1    Wang, H.-C.2
  • 13
    • 33748443739 scopus 로고    scopus 로고
    • Extraction of speaker-specific excitation information from linear prediction of speech
    • S. R. Prasanna, C. S. Gupta, and B. Yegnanarayana, "Extraction of speaker-specific excitation information from linear prediction of speech", Speech Commun., vol. 48, pp. 1243-1261, 2006.
    • (2006) Speech Commun. , vol.48 , pp. 1243-1261
    • Prasanna, S.R.1    Gupta, C.S.2    Yegnanarayana, B.3
  • 14
    • 30444446629 scopus 로고    scopus 로고
    • Combining evidence from residual phase and MFCC features for speaker recognition
    • Jan
    • K. S. Murty and B. Yegnanarayana, "Combining evidence from residual phase and MFCC features for speaker recognition", IEEE Signal Process. Lett., vol. 13, no. 1, pp. 52-55, Jan. 2006.
    • (2006) IEEE Signal Process. Lett. , vol.13 , Issue.1 , pp. 52-55
    • Murty, K.S.1    Yegnanarayana, B.2
  • 15
    • 54549099008 scopus 로고    scopus 로고
    • Investigation on LP-residual representations for speaker identification
    • M. Chetouani, M. Faundez-Zanuy, B. Gas, and J. L. Zarader, "Investigation on LP-residual representations for speaker identification", Pattern Recognition, vol. 42, pp. 487-494, 2009.
    • (2009) Pattern Recognition , vol.42 , pp. 487-494
    • Chetouani, M.1    Faundez-Zanuy, M.2    Gas, B.3    Zarader, J.L.4
  • 16
    • 85009083818 scopus 로고    scopus 로고
    • Time frequency analysis of vocal source signal for speaker recognition
    • N. Zheng, P. C. Ching, and T. Lee, "Time frequency analysis of vocal source signal for speaker recognition", in Proc. ICSLP, 2004, pp. 2333-2336.
    • (2004) Proc. ICSLP , pp. 2333-2336
    • Zheng, N.1    Ching, P.C.2    Lee, T.3
  • 17
    • 33947583290 scopus 로고    scopus 로고
    • Integration of complementary acoustic features for speaker recognition
    • Mar
    • N. Zheng, T. Lee, and P. C. Ching, "Integration of complementary acoustic features for speaker recognition", IEEE Signal Process. Lett., vol. 14, no. 3, pp. 181-184, Mar. 2007.
    • (2007) IEEE Signal Process. Lett. , vol.14 , Issue.3 , pp. 181-184
    • Zheng, N.1    Lee, T.2    Ching, P.C.3
  • 18
    • 0018455310 scopus 로고
    • Suppression of acoustic noise in speech using spectral subtraction
    • Apr
    • S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction", IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-27, no. 2, pp. 113-120, Apr. 1979.
    • (1979) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-27 , Issue.2 , pp. 113-120
    • Boll, S.F.1
  • 19
    • 33646772812 scopus 로고    scopus 로고
    • An evaluation of VTS and IMM for speaker verification in noise
    • S. S. Stan, T. Fingscheidt, and C. Beaugeant, "An evaluation of VTS and IMM for speaker verification in noise", in Proc. Eurospeech, 2003, pp. 1669-1672.
    • (2003) Proc. Eurospeech , pp. 1669-1672
    • Stan, S.S.1    Fingscheidt, T.2    Beaugeant, C.3
  • 20
    • 85135375893 scopus 로고
    • HMM recognition in noise using parallel model combination
    • M. J. F. Gales and S. Young, "HMM recognition in noise using parallel model combination", in Proc. Eurospeech, 1993, pp. 837-840.
    • (1993) Proc. Eurospeech , pp. 837-840
    • Gales, M.J.F.1    Young, S.2
  • 22
    • 0031619912 scopus 로고    scopus 로고
    • Speaker verification in noisy environment with combined spectral subtraction and missing data theory
    • A. Drygajlo and M. El-Maliki, "Speaker verification in noisy environment with combined spectral subtraction and missing data theory", in Proc. ICASSP, 1998, pp. 121-124.
    • (1998) Proc. ICASSP , pp. 121-124
    • Drygajlo, A.1    El-Maliki, M.2
  • 23
    • 0001455934 scopus 로고
    • A robust algorithm for pitch tracking (RAPT)
    • W. B. Kleijn and K. K. Paliwal, Eds. Amsterdam, The Netherlands: Elsevier
    • D. Talkin, "A robust algorithm for pitch tracking (RAPT)", in Speech Coding and Synthesis, W. B. Kleijn and K. K. Paliwal, Eds. Amsterdam, The Netherlands: Elsevier, 1995.
    • (1995) Speech Coding and Synthesis
    • Talkin, D.1
  • 25
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Aug
    • S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences", IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-28, no. 4, pp. 357-366, Aug. 1980.
    • (1980) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-28 , Issue.4 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 27
    • 64549157066 scopus 로고    scopus 로고
    • CU2C: A dual-condition cantonese speech database for speaker recognition applications
    • N. Zheng, C. Qin, T. Lee, and P. C. Ching, "CU2C: A dual-condition Cantonese speech database for speaker recognition applications", in Proc. Oriental-COCOSDA, 2005, pp. 67-72.
    • (2005) Proc. Oriental-COCOSDA , pp. 67-72
    • Zheng, N.1    Qin, C.2    Lee, T.3    Ching, P.C.4
  • 28
    • 0027623210 scopus 로고
    • Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems
    • A. Varga and H. Steeneken, "Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems", Speech Commun., vol. 12, pp. 247-251, 1993.
    • (1993) Speech Commun. , vol.12 , pp. 247-251
    • Varga, A.1    Steeneken, H.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.