메뉴 건너뛰기




Volumn , Issue , 2011, Pages 1215-1226

Overview of front-end features for robust speaker recognition

Author keywords

[No Author keywords available]

Indexed keywords

AUTOMATIC SPEAKER RECOGNITION; HIGH-LEVEL FEATURES; LOW-LEVEL FEATURES; ROBUST SPEAKER RECOGNITION; SPEAKER RECOGNITION;

EID: 84866876280     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (18)

References (72)
  • 1
    • 0031223555 scopus 로고    scopus 로고
    • Recent advances in speaker recognition
    • S. Furui, "Recent Advances in Speaker Recognition, " Pattern Recognition Letters, 18, (1997), 859-872.
    • (1997) Pattern Recognition Letters , vol.18 , pp. 859-872
    • Furui, S.1
  • 2
    • 0031233424 scopus 로고    scopus 로고
    • Speaker recognition: A tutorial
    • J. P. Campbell, "Speaker Recognition: A Tutorial, " Proceedings of the IEEE, 85, 9, (1997), 1437-1462.
    • (1997) Proceedings of the IEEE , vol.85 , Issue.9 , pp. 1437-1462
    • Campbell, J.P.1
  • 5
    • 0036293830 scopus 로고    scopus 로고
    • An overview of automatic speaker recognition technology
    • (Orlando, Florida)
    • D.A. Reynolds, "An Overview of Automatic Speaker Recognition Technology, " In Proceedings of ICASSP '2002 (Orlando, Florida, 2002), pp. 4072-4075.
    • (2002) Proceedings of ICASSP '2002 , pp. 4072-4075
    • Reynolds, D.A.1
  • 6
    • 70350125882 scopus 로고    scopus 로고
    • An overview of text-independent speaker recognition: From features to supervectors
    • T. Kinnunen and H. Li, "An Overview of Text-Independent Speaker Recognition: From Features to Supervectors, " Speech Communication 2010, 52 (2010), 12-42.
    • (2010) Speech Communication 2010 , vol.52 , pp. 12-42
    • Kinnunen, T.1    Li, H.2
  • 7
    • 84953683778 scopus 로고
    • Efficient acoustic parameters for speaker recognition
    • J. J. Wolf, "Efficient Acoustic Parameters for Speaker Recognition, " JASA, 51, 6, (1972), 2044-2056.
    • (1972) JASA , vol.51 , Issue.6 , pp. 2044-2056
    • Wolf, J.J.1
  • 9
    • 85009210545 scopus 로고    scopus 로고
    • On the combination of speech and speaker recognition
    • Geneva, Switzerland, September
    • M. BenZeghiba, and H. Bourland, "On the combination of speech and speaker recognition, " In Proc. Eurospeech, Geneva, Switzerland, September 2003, pp. 1361-1364.
    • (2003) Proc. Eurospeech , pp. 1361-1364
    • Benzeghiba, M.1    Bourland, H.2
  • 10
    • 85009242711 scopus 로고    scopus 로고
    • Combining speaker and speech recognition systems
    • Denver, Colorado, USA, September
    • L. Heck, and D. Genoud, "Combining speaker and speech recognition systems, " In Proc. ICSLP, Denver, Colorado, USA, September 2002, pp. 1369-1372.
    • (2002) Proc. ICSLP , pp. 1369-1372
    • Heck, L.1    Genoud, D.2
  • 13
    • 85009291564 scopus 로고    scopus 로고
    • ASR dependent techniques for speaker identification
    • Denver, Colorado, USA, September
    • A. Park, and T. Hazen, "ASR dependent techniques for speaker identification, " In Proc. ICSLP, Denver, Colorado, USA, September 2002, pp. 1337-1340.
    • (2002) Proc. ICSLP , pp. 1337-1340
    • Park, A.1    Hazen, T.2
  • 15
    • 0019555090 scopus 로고
    • Cepstral analysis technique for automatic speaker verification
    • April
    • S. Furui, "Cepstral analysis technique for automatic speaker verification, " IEEE Transactions on Acoustics, Speech and Signal Processing 29, 2 (April 1981), 254-272.
    • (1981) IEEE Transactions on Acoustics, Speech and Signal Processing , vol.29 , Issue.2 , pp. 254-272
    • Furui, S.1
  • 16
    • 0033884858 scopus 로고    scopus 로고
    • Speaker verification using adapted Gaussian mixture models
    • January
    • D. Reynolds, T. Quatieri, and R. Dunn, "Speaker verification using adapted gaussian mixture models, " Digital Signal Processing 10, 1, January 2000, 19-41.
    • (2000) Digital Signal Processing , vol.10 , Issue.1 , pp. 19-41
    • Reynolds, D.1    Quatieri, T.2    Dunn, R.3
  • 17
    • 0029209272 scopus 로고
    • Robust text-independent speaker identification using Gaussian mixture speaker models
    • January
    • D. Reynolds, and R. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models, " IEEE Trans. on Speech and Audio Processing 3, January 1995, 72-83.
    • (1995) IEEE Trans. on Speech and Audio Processing , vol.3 , pp. 72-83
    • Reynolds, D.1    Rose, R.2
  • 18
    • 0024906979 scopus 로고
    • Speaker verification over long distance telephone lines
    • Glasgow, May
    • J. Naik, L. Netsch, and G. Doddington, "Speaker verification over long distance telephone lines, " In Proc. ICASSP, Glasgow, May 1989, pp. 524-527.
    • (1989) Proc. ICASSP , pp. 524-527
    • Naik, J.1    Netsch, L.2    Doddington, G.3
  • 19
    • 33746432558 scopus 로고    scopus 로고
    • User-customized password speaker verification using multiple reference and background models
    • September
    • M. BenZeghiba, and H. Bourland, "User-customized password speaker verification using multiple reference and background models, " Speech Communication 48, 9, September 2006, pp. 1200-1213.
    • (2006) Speech Communication , vol.48 , Issue.9 , pp. 1200-1213
    • Benzeghiba, M.1    Bourland, H.2
  • 20
    • 0033746018 scopus 로고    scopus 로고
    • Robustness to telephone handset distortion in speaker recognition by discriminative feature design
    • June
    • L. Heck, Y. Konig, M. Sonmez, and M. Weintraub, "Robustness to telephone handset distortion in speaker recognition by discriminative feature design, " Speech Communication 31, June 2000, 181-192.
    • (2000) Speech Communication , vol.31 , pp. 181-192
    • Heck, L.1    Konig, Y.2    Sonmez, M.3    Weintraub, M.4
  • 21
    • 0035989168 scopus 로고    scopus 로고
    • AANN: An alternative to GMM for pattern recognition
    • 15, April
    • B. Yegnanarayana, and S. Kishore, "AANN: an alternative to GMM for pattern recognition, " Neural Networks 15, April 2002, pp. 459-469.
    • (2002) Neural Networks , pp. 459-469
    • Yegnanarayana, B.1    Kishore, S.2
  • 23
    • 0034505639 scopus 로고    scopus 로고
    • Support vector machines for speaker verification and identification
    • vol.2, doi: 10.1109/NNSP.2000.890157
    • V. Wan and W. Campbell, "Support vector machines for speaker verification and identification, " Proceedings of the 2000 IEEE Signal Processing Society Workshop, vol.2, no., pp.775-784 vol.2, 2000, doi: 10.1109/NNSP.2000.890157.
    • (2000) Proceedings of the 2000 IEEE Signal Processing Society Workshop , vol.2 , pp. 775-784
    • Wan, V.1    Campbell, W.2
  • 25
    • 0035506942 scopus 로고    scopus 로고
    • Comparison of different implementations of MFCC
    • F. Zheng, G. Zhang and Z. Song, "Comparison of Different Implementations of MFCC, " J. Computer Science & Technology, 16(6): 582-589, 2001.
    • (2001) J. Computer Science & Technology , vol.16 , Issue.6 , pp. 582-589
    • Zheng, F.1    Zhang, G.2    Song, Z.3
  • 26
    • 0030677489 scopus 로고    scopus 로고
    • Minimum variance distortionless response (MVDR) modeling of voiced speech
    • vol.3, 21-24 Apr.
    • M. Murthi, B. Rao, "Minimum variance distortionless response (MVDR) modeling of voiced speech, " In Proc. ICASSP, vol.3, no., pp.1687-1690 vol.3, 21-24 Apr 1997.
    • (1997) Proc. ICASSP , vol.3 , pp. 1687-1690
    • Murthi, M.1    Rao, B.2
  • 27
    • 85009198067 scopus 로고    scopus 로고
    • Minimum variance distortionless response on a warped frequency scale
    • M. Wolfel, J. McDonough, and A. Waibel, "Minimum variance distortionless response on a warped frequency scale, " In EUROSPEECH-2003, 1021-1024.
    • EUROSPEECH-2003 , pp. 1021-1024
    • Wolfel, M.1    McDonough, J.2    Waibel, A.3
  • 29
    • 78049405703 scopus 로고    scopus 로고
    • Speaker identification with distant microphone speech
    • Q. Jin, R. Li, Q. Yang, K. Laskowski, and T. Schultz, "Speaker identification with distant microphone speech, " IEEE ICASSP, pp. 4518-4521, 2010.
    • (2010) IEEE ICASSP , pp. 4518-4521
    • Jin, Q.1    Li, R.2    Yang, Q.3    Laskowski, K.4    Schultz, T.5
  • 30
    • 67650107416 scopus 로고    scopus 로고
    • Recognition of reverberant speech using frequency domain linear prediction
    • Thomas, S., Ganapathy, S. and Hermansky, H., "Recognition of reverberant speech using frequency domain linear prediction, " IEEE Signal Proc. Letters, Vol. 15, pp. 681-684, 2008.
    • (2008) IEEE Signal Proc. Letters , vol.15 , pp. 681-684
    • Thomas, S.1    Ganapathy, S.2    Hermansky, H.3
  • 31
    • 80051618525 scopus 로고    scopus 로고
    • Feature normalization for speaker verification in room reverberation
    • S. Ganapathy, J. Pelecanos and M. K. Omar, "Feature normalization for speaker verification in room reverberation, " IEEE ICASSP, pp. 4836-4839, 2011.
    • (2011) IEEE ICASSP , pp. 4836-4839
    • Ganapathy, S.1    Pelecanos, J.2    Omar, M.K.3
  • 34
    • 0019555090 scopus 로고
    • Cepstral analysis technique for automatic speaker verification
    • S. Furui, "Cepstral analysis technique for automatic speaker verification", IEEE Trans. on Acoustics, Speech and Signal Processing, Vol. 29, pp. 254-272, 1981.
    • (1981) IEEE Trans. on Acoustics, Speech and Signal Processing , vol.29 , pp. 254-272
    • Furui, S.1
  • 35
    • 79959839465 scopus 로고    scopus 로고
    • Assessment of single-channel speech enhancement techniques for speaker identification under mismatched conditions
    • S. O. Sadjadi and J. H. L. Hansen, "Assessment of single-channel speech enhancement techniques for speaker identification under mismatched conditions, " ISCA Interspeech, pp. 2138-2141, 2010.
    • (2010) ISCA Interspeech , pp. 2138-2141
    • Sadjadi, S.O.1    Hansen, J.H.L.2
  • 36
    • 80051641505 scopus 로고    scopus 로고
    • Hilbert envelope based features for robust speaker identification under reverberant mismatched conditions
    • S. O. Sadjadi and J. H. L. Hansen, "Hilbert envelope based features for robust speaker identification under reverberant mismatched conditions, " In Proc. ICASSP 2011, pp. 5448-5451.
    • (2011) Proc. ICASSP , pp. 5448-5451
    • Sadjadi, S.O.1    Hansen, J.H.L.2
  • 38
    • 70450161842 scopus 로고    scopus 로고
    • Analysis of band structures for speaker-specific information in fm feature extraction
    • T. Thiruvaran, E. Ambikairajah, and J. Epps, "Analysis of band structures for speaker-specific information in fm feature extraction, " Proc. INTERSPEECH, 2009.
    • (2009) Proc. INTERSPEECH
    • Thiruvaran, T.1    Ambikairajah, E.2    Epps, J.3
  • 39
    • 0027676955 scopus 로고
    • Energy separation in signal modulations with application to speech analysis
    • P. Maragos, J. F. Kaiser, and T. F. Quatieri, "Energy separation in signal modulations with application to speech analysis, " IEEE Transactions on Signal Processing, vol. 41, no. 10, pp. 3024-51, 1993.
    • (1993) IEEE Transactions on Signal Processing , vol.41 , Issue.10 , pp. 3024-3051
    • Maragos, P.1    Kaiser, J.F.2    Quatieri, T.F.3
  • 40
    • 0030376663 scopus 로고    scopus 로고
    • Robust prosodic features for speaker identification
    • Philadelphia PA, USA
    • M. Carey, E. Parris, H. Lloyd-Thomsa, and S. Bennett, "Robust prosodic features for speaker identification, " in Proc. ICSLP, Philadelphia PA, USA, 1996, pp. 1800-1803.
    • (1996) Proc. ICSLP , pp. 1800-1803
    • Carey, M.1    Parris, E.2    Lloyd-Thomsa, H.3    Bennett, S.4
  • 41
    • 85128436986 scopus 로고    scopus 로고
    • Modeling dynamic prosodic variation for speaker verification
    • Sydney, Australia
    • K. Sonmez, E. Shriberg, L. Heck, and M. Weintraub, "Modeling dynamic prosodic variation for speaker verification, " in Proc. ICSLP, Sydney, Australia, 1998, pp. 3189-3192.
    • (1998) Proc. ICSLP , pp. 3189-3192
    • Sonmez, K.1    Shriberg, E.2    Heck, L.3    Weintraub, M.4
  • 42
    • 0015476226 scopus 로고
    • Automatic speaker recognition based on pitch contours
    • B. Atal, "Automatic speaker recognition based on pitch contours, " in J. ASA, 1972, vol. 52, pp. 1687-1697.
    • (1972) J. ASA , vol.52 , pp. 1687-1697
    • Atal, B.1
  • 43
    • 0141469397 scopus 로고    scopus 로고
    • Modeling prosodic dynamics for speaker recognition
    • Hong Kong, China
    • A. Adami, R. Mihaescu, D. Reynolds, and J. Godfrey, "Modeling prosodic dynamics for speaker recognition, " in Proc. ICASSP, Hong Kong, China, 2003, pp. 19-41.
    • (2003) Proc. ICASSP , pp. 19-41
    • Adami, A.1    Mihaescu, R.2    Reynolds, D.3    Godfrey, J.4
  • 44
    • 70349209406 scopus 로고    scopus 로고
    • Modeling instantaneous intonation for speaker identification using the fundamental frequency variation spectrum
    • K. Laskowski and Q. Jin, "Modeling instantaneous intonation for speaker identification using the fundamental frequency variation spectrum, " in Proc. ICASSP 2009.
    • (2009) Proc. ICASSP
    • Laskowski, K.1    Jin, Q.2
  • 45
    • 84902655943 scopus 로고    scopus 로고
    • Learning prosodic sequences using the fundamental frequency variation spectrum
    • Campinas, Brazil
    • K. Laskowski, J. Edlund, and M. Heldner, "Learning prosodic sequences using the fundamental frequency variation spectrum, " in Proc. SPEECH PROSODY, Campinas, Brazil, 2008.
    • (2008) Proc. SPEECH PROSODY
    • Laskowski, K.1    Edlund, J.2    Heldner, M.3
  • 46
    • 51449093800 scopus 로고    scopus 로고
    • An instantaneous vector representation of delta pitch for speaker-change prediction in conversational dialogue systems
    • Las Vegas NV, USA
    • K. Laskowski, J. Edlund, and M. Heldner, "An instantaneous vector representation of delta pitch for speaker-change prediction in conversational dialogue systems, " in Proc. ICASSP, Las Vegas NV, USA, 2008, pp. 5041-5044.
    • (2008) Proc. ICASSP , pp. 5041-5044
    • Laskowski, K.1    Edlund, J.2    Heldner, M.3
  • 47
    • 85073112188 scopus 로고    scopus 로고
    • Modeling prosody for speaker recognition: Why estimating pitch may be a red herring
    • Brno, Czech Republic
    • K. Laskowski and Q. Jin, "Modeling Prosody for Speaker Recognition: Why Estimating Pitch May Be a Red Herring, " in Proc. Speaker Odyssey: the Speaker Recognition Workshop, Brno, Czech Republic, 2010.
    • (2010) Proc. Speaker Odyssey: The Speaker Recognition Workshop
    • Laskowski, K.1    Jin, Q.2
  • 48
    • 0031095319 scopus 로고    scopus 로고
    • A multiple window method for estimation of peaked spectra
    • Mar.
    • M. Hansson and G. Salomonsson, "A multiple window method for estimation of peaked spectra, " IEEE Trans. on Sign. Proc., vol. 45, no. 3, pp. 778-781, Mar. 1997.
    • (1997) IEEE Trans. on Sign. Proc. , vol.45 , Issue.3 , pp. 778-781
    • Hansson, M.1    Salomonsson, G.2
  • 49
    • 79959826333 scopus 로고    scopus 로고
    • What else is new than the Hamming window? Robust MFCCs for speaker recognition via multitapering
    • T. Kinnunen, R. Saeidi, J. Sandberg, and M. Hansson-Sandsten, "What else is new than the Hamming window? Robust MFCCs for speaker recognition via multitapering, " in Proc. InterSpeech 2010.
    • (2010) Proc. InterSpeech
    • Kinnunen, T.1    Saeidi, R.2    Sandberg, J.3    Hansson-Sandsten, M.4
  • 50
    • 0016939145 scopus 로고
    • Automatic recognition of speakers from their voices
    • B. S. Atal, "Automatic Recognition of Speakers from Their Voices, " Proceedings of the IEEE, 64, 4, (1976), 460-475.
    • (1976) Proceedings of the IEEE , vol.64 , Issue.4 , pp. 460-475
    • Atal, B.S.1
  • 51
    • 0015476226 scopus 로고
    • Automatic speaker recognition based on pitch contours
    • B.S. Atal, "Automatic speaker recognition based on pitch contours, " JASA, vol. 52, pp.1687-1697, 1972.
    • (1972) JASA , vol.52 , pp. 1687-1697
    • Atal, B.S.1
  • 53
    • 85128436986 scopus 로고    scopus 로고
    • Modeling dynamic prosodic variation for speaker verification
    • Sydney, Dec
    • K. Sonmez, E. Shriberg, L. Heck, and M. Weintraub, "Modeling dynamic prosodic variation for speaker verification, " Proc. ICSLP-98, Sydney, Dec 1998.
    • (1998) Proc. ICSLP-98
    • Sonmez, K.1    Shriberg, E.2    Heck, L.3    Weintraub, M.4
  • 56
    • 0141856298 scopus 로고    scopus 로고
    • Using prosodic and conversational features for high-performance speaker recognition: Report from JHU WS'02
    • B. Peskin, J. Navratil, J. Abramson, D. Jones, D. Klusacek, D. Reynolds, B. Xiang, "Using Prosodic and Conversational Features for High-performance Speaker Recognition: Report from JHU WS'02, " ICASSP 2003.
    • (2003) ICASSP
    • Peskin, B.1    Navratil, J.2    Abramson, J.3    Jones, D.4    Klusacek, D.5    Reynolds, D.6    Xiang, B.7
  • 57
    • 21844454996 scopus 로고    scopus 로고
    • Modeling prosodic feature sequences for speaker recognition
    • Special Issue on Quantitative Prosody Modelling for Natural Speech Description and Generation
    • E. Shriberg, L. Ferrer, S. Kajarekar, A. Venkataraman, and A. Stolcke, "Modeling prosodic feature sequences for speaker recognition, " Speech Communication, vol. 46, no. 3-4, pp. 455-472, 2005, Special Issue on Quantitative Prosody Modelling for Natural Speech Description and Generation.
    • (2005) Speech Communication , vol.46 , Issue.3-4 , pp. 455-472
    • Shriberg, E.1    Ferrer, L.2    Kajarekar, S.3    Venkataraman, A.4    Stolcke, A.5
  • 58
    • 64249101047 scopus 로고    scopus 로고
    • Modeling prosodic features with joint factor analysis for speaker verification
    • Sept.
    • N. Dehak, P. Dumouchel, and P. Kenny, "Modeling prosodic features with joint factor analysis for speaker verification, " IEEE Trans. on Audio, Speech, and Language Processing, vol. 15, no. 7, pp. 2095-2103, Sept. 2007.
    • (2007) IEEE Trans. on Audio, Speech, and Language Processing , vol.15 , Issue.7 , pp. 2095-2103
    • Dehak, N.1    Dumouchel, P.2    Kenny, P.3
  • 60
    • 33947627520 scopus 로고    scopus 로고
    • Speaker detection using acoustic event sequences
    • N. Scheffer, J. Bonastre, "Speaker Detection using Acoustic Event Sequences," In Proc. Eurospeech 2005.
    • (2005) Proc. Eurospeech
    • Scheffer, N.1    Bonastre, J.2
  • 62
    • 18144435041 scopus 로고    scopus 로고
    • Phonetic speaker recognition using maximum likelihood binary decision tree models
    • J. Navratil, Q. Jin, W. Andrews, and J. Campbell, "Phonetic Speaker Recognition Using Maximum Likelihood Binary Decision Tree Models, " ICASSP 2003.
    • (2003) ICASSP
    • Navratil, J.1    Jin, Q.2    Andrews, W.3    Campbell, J.4
  • 64
    • 33646348224 scopus 로고    scopus 로고
    • Improved phonetic speaker recognition using lattice decoding
    • A. Hatch, B. Peskin, A. Stolcke, "Improved Phonetic Speaker Recognition Using Lattice Decoding, " In Proc. ICASSP. 2005.
    • (2005) Proc. ICASSP
    • Hatch, A.1    Peskin, B.2    Stolcke, A.3
  • 65
    • 34547511465 scopus 로고    scopus 로고
    • Word-conditioned phone n-grams for speaker recognition
    • H. Lei, N. Mirghafori, "Word-Conditioned Phone N-Grams for Speaker Recognition, " In Proc. ICASSP, 2007.
    • (2007) Proc. ICASSP
    • Lei, H.1    Mirghafori, N.2
  • 68
    • 85009124414 scopus 로고    scopus 로고
    • Speaker recognition based on idiolectal differences between speakers
    • G. Doddington, "Speaker Recognition based on Idiolectal Differences between Speakers, " Eurospeech, Vol. 4, pp. 2517-2520, 2001.
    • (2001) Eurospeech , vol.4 , pp. 2517-2520
    • Doddington, G.1
  • 70
    • 56149108574 scopus 로고    scopus 로고
    • Duration and pronunciation conditioned lexical modeling for speaker verification
    • G. Tur, E. Shriberg, A. Stolcke, S. Kajarekar, "Duration and Pronunciation Conditioned Lexical Modeling for Speaker Verification, " In Proc. of Interspeech, 2007.
    • (2007) Proc. of Interspeech
    • Tur, G.1    Shriberg, E.2    Stolcke, A.3    Kajarekar, S.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.