SCOPUS Information Search Platform

IEEE Transactions on Audio, Speech and Language Processing

Volume 19, Issue 1, 2011, Pages 196-205

Robust speaker recognition using denoised vocal source and vocal tract features

(4) Wang, Ning (a); Ching, P. C. (a); Zheng, Nengheng (b); Lee, Tan (a)

a CHINESE UNIVERSITY OF HONG KONG (Hong Kong)

b SHENZHEN UNIVERSITY (China)

Author keywords

Robust parameter estimation; source tract features; speaker recognition; spectral subtraction

Indexed keywords

ADDITIVE WHITE GAUSSIAN NOISE; ENHANCEMENT TECHNIQUES; EQUAL ERROR RATE; FEATURE ESTIMATION; GAUSSIAN MIXTURE MODEL; IDENTIFICATION ERROR RATE; LOW SIGNAL-TO-NOISE RATIO; NEW APPROACHES; NOISY ENVIRONMENT; NOISY SPEECH; ROBUST PARAMETER ESTIMATION; ROBUST RECOGNITION; ROBUST SPEAKER RECOGNITION; SIMULATION RESULT; SOURCE-TRACT FEATURES; SPEAKER RECOGNITION; SPECIFIC COMPONENT; SPECTRAL SUBTRACTIONS; VOCAL-TRACTS;

FEATURE EXTRACTION; PARAMETER ESTIMATION; SIGNAL TO NOISE RATIO; SPEECH ENHANCEMENT; WHITE NOISE;

SPEECH RECOGNITION;

EID: 77957744636 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2010.2045800 Document Type: Article

Times cited: (50)

References (30)

1
- 0022794148
- Speaker recognition
- Oct
- D. O'Shaughnessy, "Speaker recognition", IEEE Acoust., Speech, Signal Process. Mag., vol. 3, no. 4, pp. 4-17, Oct. 1986.
- (1986) IEEE Acoust., Speech, Signal Process. Mag. , vol.3 , Issue.4 , pp. 4-17
- O'Shaughnessy, D.¹

2
- 33947384963
- Audio-visual biometrics
- Nov
- P. S. Aleksic and A. K. Katsaggelos, "Audio-visual biometrics", Proc. IEEE, vol. 94, no. 11, pp. 2025-2044, Nov. 2006.
- (2006) Proc. IEEE , vol.94 , Issue.11 , pp. 2025-2044
- Aleksic, P.S.¹ Katsaggelos, A.K.²

3
- 77957726104
- Personalize mobile access by speaker authentication
- D. D. Zhang, Ed. New York: Springer
- K. Chen, "Personalize mobile access by speaker authentication", in Biometric Solutions: For Authentication in an E-World, D. D. Zhang, Ed. New York: Springer, 2002.
- (2002) Biometric Solutions: For Authentication in an E-World
- Chen, K.¹

4
- 0031233424
- Speaker recognition: A tutorial
- Sep
- J. P. Campbell, "Speaker recognition: A tutorial", Proc. IEEE, vol. 85, no. 9, pp. 1437-1462, Sep. 1997.
- (1997) Proc. IEEE , vol.85 , Issue.9 , pp. 1437-1462
- Campbell, J.P.¹

5
- 0029355999
- Speaker identification and verification using Gaussian mixture speaker models
- D. A. Reynolds, "Speaker identification and verification using Gaussian mixture speaker models", Speech Commun., vol. 17, pp. 91-108, 1995.
- (1995) Speech Commun. , vol.17 , pp. 91-108
- Reynolds, D.A.¹

6
- 85075924869
- Comparison of background normalization methods for text-independent speaker verification
- D. A. Reynolds, "Comparison of background normalization methods for text-independent speaker verification", in Proc. Eurospeech, 1997, vol. 2, pp. 963-966.
- (1997) Proc. Eurospeech , vol.2 , pp. 963-966
- Reynolds, D.A.¹

7
- 64449086223
- Discrimination power of vocal source and vocal tract related features for speaker segmentation
- Aug
- W. N. Chan, N. Zheng, and T. Lee, "Discrimination power of vocal source and vocal tract related features for speaker segmentation", IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 6, pp. 1884-1892, Aug. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.6 , pp. 1884-1892
- Chan, W.N.¹ Zheng, N.² Lee, T.³

8
- 0004244302
- Englewood Cliffs, NJ: Prentice-Hall
- L. R. Rabiner and B. H. Juang, Fundamentals of Speech Recognition. Englewood Cliffs, NJ: Prentice-Hall, 1993.
- (1993) Fundamentals of Speech Recognition
- Rabiner, L.R.¹ Juang, B.H.²

9
- 0031224204
- A study of harmonic features for speaker recognition
- B. Imperl, Z. Kacic, and B. Horvat, "A study of harmonic features for speaker recognition", Speech Commun., vol. 22, no. 4, pp. 385-402, 1997.
- (1997) Speech Commun. , vol.22 , Issue.4 , pp. 385-402
- Imperl, B.¹ Kacic, Z.² Horvat, B.³

10
- 0032595183
- Modeling of the glottal flow derivative waveform with application to speaker identification
- Sep
- M. D. Plumpe, T. F. Quatieri, and D. A. Reynolds, "Modeling of the glottal flow derivative waveform with application to speaker identification", IEEE Trans. Speech Audio Process., vol. 7, no. 5, pp. 569-586, Sep. 1999.
- (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.5 , pp. 569-586
- Plumpe, M.D.¹ Quatieri, T.F.² Reynolds, D.A.³

11
- 0029356550
- Usefulness of the LPC-residue in text-independent speaker verification
- P. Thevenaz and H. Hugli, "Usefulness of the LPC-residue in text-independent speaker verification", Speech Commun., vol. 17, no. 1-2, pp. 145-157, 1995.
- (1995) Speech Commun. , vol.17 , Issue.1-2 , pp. 145-157
- Thevenaz, P.¹ Hugli, H.²

12
- 4544352735
- Improvement of speaker recognition by combining residual and prosodic features with acoustic features
- S.-H. Chen and H.-C. Wang, "Improvement of speaker recognition by combining residual and prosodic features with acoustic features", in Proc. ICASSP, 2004, pp. 93-96.
- (2004) Proc. ICASSP , pp. 93-96
- Chen, S.-H.¹ Wang, H.-C.²

13
- 33748443739
- Extraction of speaker-specific excitation information from linear prediction of speech
- S. R. Prasanna, C. S. Gupta, and B. Yegnanarayana, "Extraction of speaker-specific excitation information from linear prediction of speech", Speech Commun., vol. 48, pp. 1243-1261, 2006.
- (2006) Speech Commun. , vol.48 , pp. 1243-1261
- Prasanna, S.R.¹ Gupta, C.S.² Yegnanarayana, B.³

14
- 30444446629
- Combining evidence from residual phase and MFCC features for speaker recognition
- Jan
- K. S. Murty and B. Yegnanarayana, "Combining evidence from residual phase and MFCC features for speaker recognition", IEEE Signal Process. Lett., vol. 13, no. 1, pp. 52-55, Jan. 2006.
- (2006) IEEE Signal Process. Lett. , vol.13 , Issue.1 , pp. 52-55
- Murty, K.S.¹ Yegnanarayana, B.²

15
- 54549099008
- Investigation on LP-residual representations for speaker identification
- M. Chetouani, M. Faundez-Zanuy, B. Gas, and J. L. Zarader, "Investigation on LP-residual representations for speaker identification", Pattern Recognition, vol. 42, pp. 487-494, 2009.
- (2009) Pattern Recognition , vol.42 , pp. 487-494
- Chetouani, M.¹ Faundez-Zanuy, M.² Gas, B.³ Zarader, J.L.⁴

16
- 85009083818
- Time frequency analysis of vocal source signal for speaker recognition
- N. Zheng, P. C. Ching, and T. Lee, "Time frequency analysis of vocal source signal for speaker recognition", in Proc. ICSLP, 2004, pp. 2333-2336.
- (2004) Proc. ICSLP , pp. 2333-2336
- Zheng, N.¹ Ching, P.C.² Lee, T.³

17
- 33947583290
- Integration of complementary acoustic features for speaker recognition
- Mar
- N. Zheng, T. Lee, and P. C. Ching, "Integration of complementary acoustic features for speaker recognition", IEEE Signal Process. Lett., vol. 14, no. 3, pp. 181-184, Mar. 2007.
- (2007) IEEE Signal Process. Lett. , vol.14 , Issue.3 , pp. 181-184
- Zheng, N.¹ Lee, T.² Ching, P.C.³

18
- 0018455310
- Suppression of acoustic noise in speech using spectral subtraction
- Apr
- S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction", IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-27, no. 2, pp. 113-120, Apr. 1979.
- (1979) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-27 , Issue.2 , pp. 113-120
- Boll, S.F.¹

19
- 33646772812
- An evaluation of VTS and IMM for speaker verification in noise
- S. S. Stan, T. Fingscheidt, and C. Beaugeant, "An evaluation of VTS and IMM for speaker verification in noise", in Proc. Eurospeech, 2003, pp. 1669-1672.
- (2003) Proc. Eurospeech , pp. 1669-1672
- Stan, S.S.¹ Fingscheidt, T.² Beaugeant, C.³

20
- 85135375893
- HMM recognition in noise using parallel model combination
- M. J. F. Gales and S. Young, "HMM recognition in noise using parallel model combination", in Proc. Eurospeech, 1993, pp. 837-840.
- (1993) Proc. Eurospeech , pp. 837-840
- Gales, M.J.F.¹ Young, S.²

21
- 0347899510
- α-Jacobian environmental adaptation
- C. Cerisara, L. Rigazio, and J.-C. Junqua, "α-Jacobian environmental adaptation", Speech Commun., vol. 42, pp. 25-41, 2004.
- (2004) Speech Commun. , vol.42 , pp. 25-41
- Cerisara, C.¹ Rigazio, L.² Junqua, J.-C.³

22
- 0031619912
- Speaker verification in noisy environment with combined spectral subtraction and missing data theory
- A. Drygajlo and M. El-Maliki, "Speaker verification in noisy environment with combined spectral subtraction and missing data theory", in Proc. ICASSP, 1998, pp. 121-124.
- (1998) Proc. ICASSP , pp. 121-124
- Drygajlo, A.¹ El-Maliki, M.²

23
- 0001455934
- A robust algorithm for pitch tracking (RAPT)
- W. B. Kleijn and K. K. Paliwal, Eds. Amsterdam, The Netherlands: Elsevier
- D. Talkin, "A robust algorithm for pitch tracking (RAPT)", in Speech Coding and Synthesis, W. B. Kleijn and K. K. Paliwal, Eds. Amsterdam, The Netherlands: Elsevier, 1995.
- (1995) Speech Coding and Synthesis
- Talkin, D.¹

24
- 0003833285
- Philadelphia, PA: SIAM
- I. Daubechies, Ten Lectures on Wavelets. Philadelphia, PA: SIAM, 1992.
- (1992) Ten Lectures on Wavelets
- Daubechies, I.¹

25
- 0019053271
- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
- Aug
- S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences", IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-28, no. 4, pp. 357-366, Aug. 1980.
- (1980) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-28 , Issue.4 , pp. 357-366
- Davis, S.B.¹ Mermelstein, P.²

26
- 0003956816
- Englewood Cliffs, NJ: Prentice-Hall
- S. M. Kay, Modern Spectral Estimation: Theory and Application. Englewood Cliffs, NJ: Prentice-Hall, 1988.
- (1988) Modern Spectral Estimation: Theory and Application
- Kay, S.M.¹

27
- 64549157066
- CU2C: A dual-condition Cantonese speech database for speaker recognition applications
- N. Zheng, C. Qin, T. Lee, and P. C. Ching, "CU2C: A dual-condition Cantonese speech database for speaker recognition applications", in Proc. Oriental-COCOSDA, 2005, pp. 67-72.
- (2005) Proc. Oriental-COCOSDA , pp. 67-72
- Zheng, N.¹ Qin, C.² Lee, T.³ Ching, P.C.⁴

28
- 0027623210
- Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems
- A. Varga and H. Steeneken, "Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems", Speech Commun., vol. 12, pp. 247-251, 1993.
- (1993) Speech Commun. , vol.12 , pp. 247-251
- Varga, A.¹ Steeneken, H.²

29
- 51449086024
- Fusion of heterogeneous speaker recognition systems in the STBU submission for the NIST speaker recognition evaluation 2006
- Sep
- N. Brümmer, L. Burget, J. Černocký, O. Glembek, F. Grézl, M. Karafiát, D. Leeuwen, P. Matějka, P. Schwartz, and A. Strasheim, "Fusion of heterogeneous speaker recognition systems in the STBU submission for the NIST speaker recognition evaluation 2006", IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 7, pp. 2072-2084, Sep. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.7 , pp. 2072-2084
- Brümmer, N.¹ Burget, L.² Černocký, J.³ Glembek, O.⁴ Grézl, F.⁵ Karafiát, M.⁶ Leeuwen, D.⁷ Matějka, P.⁸ Schwartz, P.⁹ Strasheim, A.¹⁰

30
- 58349106697
- A study of inter-speaker variability in speaker verification
- Jul
- P. Kenny, P. Ouellet, N. Dehak, V. Gupta, and P. Dumouchel, "A study of inter-speaker variability in speaker verification", IEEE Trans. Audio, Speech, Lang. Process., vol. 16, no. 5, pp. 980-988, Jul. 2008.
- (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.5 , pp. 980-988
- Kenny, P.¹ Ouellet, P.² Dehak, N.³ Gupta, V.⁴ Dumouchel, P.⁵

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.