Volume 52, Issue 1, 2010, Pages 12-40

An overview of text-independent speaker recognition: From features to supervectors

Author keywords

Discriminative models; Feature extraction; Intersession variability compensation; Speaker recognition; Statistical models; Supervectors; Text independence

Indexed keywords

DISCRIMINATIVE MODELS; INTERSESSION VARIABILITY COMPENSATION; SPEAKER RECOGNITION; STATISTICAL MODELS; SUPERVECTORS; TEXT-INDEPENDENCE;

EID: 70350125882     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2009.08.009     Document Type: Article
Times cited : (1371)

References (248)
  • 1
    • 34047164452
    • Modeling prosodic differences for speaker recognition
    • Adami A. Modeling prosodic differences for speaker recognition. Speech Comm. 49 4 (2007) 277-291
    • (2007) Speech Comm. , vol.49 , Issue.4 , pp. 277-291
    • Adami, A.1
  • 3
    • 11844290709
    • The effect of mismatched recording conditions on human and automatic speaker recognition in forensic applications
    • December
    • Alexander, A., Botti, F., Dessimoz, D., Drygajlo, A., 2004. The effect of mismatched recording conditions on human and automatic speaker recognition in forensic applications. Forensic Science International 146S, December 2004, pp. 95-99.
    • (2004) Forensic Science International , vol.146 S , pp. 95-99
    • Alexander, A.1    Botti, F.2    Dessimoz, D.3    Drygajlo, A.4
  • 4
    • 0032875050
    • A method for generating natural-sounding speech stimuli for cognitive brain research
    • Alku P., Tiitinen H., and Näätänen R. A method for generating natural-sounding speech stimuli for cognitive brain research. Clin. Neurophysiol. 110 8 (1999) 1329-1333
    • (1999) Clin. Neurophysiol. , vol.110 , Issue.8 , pp. 1329-1333
    • Alku, P.1    Tiitinen, H.2    Näätänen, R.3
  • 5
    • 18544396408
    • Speaker identification by combining multiple classifiers using Dempster-Shafer theory of evidence
    • Altincay H., and Demirekler M. Speaker identification by combining multiple classifiers using Dempster-Shafer theory of evidence. Speech Comm. 41 4 (2003) 531-547
    • (2003) Speech Comm. , vol.41 , Issue.4 , pp. 531-547
    • Altincay, H.1    Demirekler, M.2
  • 11
    • 0015476226
    • Automatic speaker recognition based on pitch contours
    • Atal B. Automatic speaker recognition based on pitch contours. J. Acoust. Soc. Amer. 52 6 (1972) 1687-1697
    • (1972) J. Acoust. Soc. Amer. , vol.52 , Issue.6 , pp. 1687-1697
    • Atal, B.1
  • 12
    • 0016067897
    • Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
    • Atal B. Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification. J. Acoust. Soc. Amer. 55 6 (1974) 1304-1312
    • (1974) J. Acoust. Soc. Amer. , vol.55 , Issue.6 , pp. 1304-1312
    • Atal, B.1
  • 15
    • 0033884857
    • Score normalization for text-independent speaker verification systems
    • Auckenthaler R., Carey M., and Lloyd-Thomas H. Score normalization for text-independent speaker verification systems. Digital Signal Process. 10 1-3 (2000) 42-54
    • (2000) Digital Signal Process. , vol.10 , Issue.1-3 , pp. 42-54
    • Auckenthaler, R.1    Carey, M.2    Lloyd-Thomas, H.3
  • 17
    • 0031238211
    • ITU-T recommendation g729 annex b: a silence compression scheme for use with g729 optimized for v.70 digital simultaneous voice and data applications
    • Benyassine A., Shlomot E., and Su H. ITU-T recommendation g729 annex b: a silence compression scheme for use with g729 optimized for v.70 digital simultaneous voice and data applications. IEEE Comm. Mag. 35 (1997) 64-73
    • (1997) IEEE Comm. Mag. , vol.35 , pp. 64-73
    • Benyassine, A.1    Shlomot, E.2    Su, H.3
  • 19
    • 33746432558
    • User-customized password speaker verification using multiple reference and background models
    • BenZeghiba M., and Bourlard H. User-customized password speaker verification using multiple reference and background models. Speech Comm. 48 9 (2006) 1200-1213
    • (2006) Speech Comm. , vol.48 , Issue.9 , pp. 1200-1213
    • BenZeghiba, M.1    Bourlard, H.2
  • 20
    • 0034227991
    • Subband architecture for automatic speaker recognition
    • Besacier L., and Bonastre J.-F. Subband architecture for automatic speaker recognition. Signal Process. 80 (2000) 1245-1259
    • (2000) Signal Process. , vol.80 , pp. 1245-1259
    • Besacier, L.1    Bonastre, J.-F.2
  • 21
    • 0033748161
    • Localization and selection of speaker-specific information with statistical modeling
    • Besacier L., Bonastre J., and Fredouille C. Localization and selection of speaker-specific information with statistical modeling. Speech Comm. 31 (2000) 89-106
    • (2000) Speech Comm. , vol.31 , pp. 89-106
    • Besacier, L.1    Bonastre, J.2    Fredouille, C.3
  • 22
    • 0029352294
    • Second-order statistical measures for text-independent speaker identification
    • Bimbot F., Magrin-Chagnolleau I., and Mathan L. Second-order statistical measures for text-independent speaker identification. Speech Comm. 17 (1995) 177-192
    • (1995) Speech Comm. , vol.17 , pp. 177-192
    • Bimbot, F.1    Magrin-Chagnolleau, I.2    Mathan, L.3
  • 26
    • 70350099727
    • Praat: doing phonetics by computer [computer program]
    • Boersma, P., Weenink, D., 2009. Praat: doing phonetics by computer [computer program]. WWW page, June 2009, .
  • 27
    • 65349113532
    • Artificial impostor voice transformation effects on false acceptance rates
    • Antwerp, Belgium, August
    • Bonastre, J.-F., Matrouf, D., Fredouille, C., 2007. Artificial impostor voice transformation effects on false acceptance rates. In: Proc. Interspeech 2007 (ICSLP), Antwerp, Belgium, August 2007, pp. 2053-2056.
    • (2007) Proc. Interspeech 2007 (ICSLP) , pp. 2053-2056
    • Bonastre, J.-F.1    Matrouf, D.2    Fredouille, C.3
  • 28
    • 29044433376
    • Application-independent evaluation of speaker detection
    • Brümmer N., and Preez J. Application-independent evaluation of speaker detection. Comput. Speech Lang. 20 (2006) 230-275
    • (2006) Comput. Speech Lang. , vol.20 , pp. 230-275
    • Brümmer, N.1    Preez, J.2
  • 32
    • 0023293466
    • Text-dependent speaker verification using vector quantization source coding
    • Burton D. Text-dependent speaker verification using vector quantization source coding. IEEE Trans. Acoustics, Speech, Signal Process. 35 2 (1987) 133-143
    • (1987) IEEE Trans. Acoustics, Speech, Signal Process. , vol.35 , Issue.2 , pp. 133-143
    • Burton, D.1
  • 33
    • 0031233424
    • Speaker recognition: a tutorial
    • Campbell J. Speaker recognition: a tutorial. Proc. IEEE 85 9 (1997) 1437-1462
    • (1997) Proc. IEEE , vol.85 , Issue.9 , pp. 1437-1462
    • Campbell, J.1
  • 35
    • 84898983965
    • Phonetic speaker recognition with support vector machines
    • Thrun S., Saul L., and Schölkopf B. (Eds), MIT Press, Cambridge, MA
    • Campbell W., Campbell J., Reynolds D., Jones D., and Leek T. Phonetic speaker recognition with support vector machines. In: Thrun S., Saul L., and Schölkopf B. (Eds). Advances in Neural Information Processing Systems Vol. 16 (2004), MIT Press, Cambridge, MA
    • (2004) Advances in Neural Information Processing Systems , vol.16
    • Campbell, W.1    Campbell, J.2    Reynolds, D.3    Jones, D.4    Leek, T.5
  • 38
    • 33645887246
    • Support vector machines using GMM supervectors for speaker verification
    • Campbell W., Sturim D., and Reynolds D. Support vector machines using GMM supervectors for speaker verification. IEEE Signal Process. Lett. 13 5 (2006) 308-311
    • (2006) IEEE Signal Process. Lett. , vol.13 , Issue.5 , pp. 308-311
    • Campbell, W.1    Sturim, D.2    Reynolds, D.3
  • 42
    • 64449086223
    • Discrimination power of vocal source and vocal tract related features for speaker segmentation
    • Chan W., Zheng N., and Lee T. Discrimination power of vocal source and vocal tract related features for speaker segmentation. IEEE Trans. Audio, Speech Language Process. 15 6 (2007) 1884-1892
    • (2007) IEEE Trans. Audio, Speech Language Process. , vol.15 , Issue.6 , pp. 1884-1892
    • Chan, W.1    Zheng, N.2    Lee, T.3
  • 44
    • 0037228684
    • Multigrained modeling with pattern specific maximum likelihood transformations for text-independent speaker recognition
    • Chaudhari U., Navratil J., and Maes S. Multigrained modeling with pattern specific maximum likelihood transformations for text-independent speaker recognition. IEEE Trans. Speech Audio Process. 11 1 (2003) 61-69
    • (2003) IEEE Trans. Speech Audio Process. , vol.11 , Issue.1 , pp. 61-69
    • Chaudhari, U.1    Navratil, J.2    Maes, S.3
  • 45
    • 0000291808
    • Methods of combining multiple classifiers with different features and their applications to text-independent speaker recognition
    • Chen K., Wang L., and Chi H. Methods of combining multiple classifiers with different features and their applications to text-independent speaker recognition. Internat. J. Pattern Recognition Artif. Intell. 11 3 (1997) 417-445
    • (1997) Internat. J. Pattern Recognition Artif. Intell. , vol.11 , Issue.3 , pp. 417-445
    • Chen, K.1    Wang, L.2    Chi, H.3
  • 47
    • 54549099008
    • Investigation on LP-residual presentations for speaker identification
    • Chetouani M., Faundez-Zanuy M., Gas B., and Zarader J. Investigation on LP-residual presentations for speaker identification. Pattern Recognition 42 3 (2009) 487-494
    • (2009) Pattern Recognition , vol.42 , Issue.3 , pp. 487-494
    • Chetouani, M.1    Faundez-Zanuy, M.2    Gas, B.3    Zarader, J.4
  • 49
    • 0038005877
    • Improving speaker identification in noise by subband processing and decision fusion
    • Damper R., and Higgins J. Improving speaker identification in noise by subband processing and decision fusion. Pattern Recognition Lett. 24 (2003) 2167-2173
    • (2003) Pattern Recognition Lett. , vol.24 , pp. 2167-2173
    • Damper, R.1    Higgins, J.2
  • 50
    • 0019053271
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Davis S., and Mermelstein P. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Trans. Acoustics, Speech, Signal Process. 28 4 (1980) 357-366
    • (1980) IEEE Trans. Acoustics, Speech, Signal Process. , vol.28 , Issue.4 , pp. 357-366
    • Davis, S.1    Mermelstein, P.2
  • 51
    • 0036214787
    • YIN, a fundamental frequency estimator for speech and music
    • DeCheveigne A., and Kawahara H. YIN, a fundamental frequency estimator for speech and music. J. Acoust. Soc. Amer. 111 4 (2002) 1917-1930
    • (2002) J. Acoust. Soc. Amer. , vol.111 , Issue.4 , pp. 1917-1930
    • DeCheveigne, A.1    Kawahara, H.2
  • 53
    • 64249101047
    • Modeling prosodic features with joint factor analysis for speaker verification
    • Dehak N., Kenny P., and Dumouchel P. Modeling prosodic features with joint factor analysis for speaker verification. IEEE Trans. Audio, Speech Language Process. 15 7 (2007) 2095-2103
    • (2007) IEEE Trans. Audio, Speech Language Process. , vol.15 , Issue.7 , pp. 2095-2103
    • Dehak, N.1    Kenny, P.2    Dumouchel, P.3
  • 54
    • 85084012249
    • Comparison between factor analysis and GMM support vector machines for speaker verification
    • Stellenbosch, South Africa, January 2008, Paper 009
    • Dehak, N., Dehak, R., Kenny, P., Dumouchel, P., 2008. Comparison between factor analysis and GMM support vector machines for speaker verification. In: The Speaker and Language Recognition Workshop (Odyssey 2008), Stellenbosch, South Africa, January 2008. Paper 009.
    • (2008) The Speaker and Language Recognition Workshop (Odyssey 2008)
    • Dehak, N.1    Dehak, R.2    Kenny, P.3    Dumouchel, P.4
  • 58
    • 0035573146
    • Speaker recognition from coded speech and the effects of score normalization
    • Pacific Grove, California, USA, November
    • Dunn, R., Quatieri, T., Reynolds, D., Campbell, J. 2001. Speaker recognition from coded speech and the effects of score normalization. In: Proc. 35th Asilomar Conf. on Signals, Systems and Computers, Vol. 2, Pacific Grove, California, USA, November 2001, pp. 1562-1567.
    • (2001) Proc. 35th Asilomar Conf. on Signals, Systems and Computers , vol.2 , pp. 1562-1567
    • Dunn, R.1    Quatieri, T.2    Reynolds, D.3    Campbell, J.4
  • 59
    • 44949136520
    • A new set of features for text-independent speaker identification
    • Pittsburgh, Pennsylvania, USA, September
    • Espy-Wilson, C., Manocha, S., Vishnubhotla, S., 2006. A new set of features for text-independent speaker identification. In: Proc. Interspeech 2006 (ICSLP), Pittsburgh, Pennsylvania, USA, September 2006, pp. 1475-1478.
    • (2006) Proc. Interspeech 2006 (ICSLP) , pp. 1475-1478
    • Espy-Wilson, C.1    Manocha, S.2    Vishnubhotla, S.3
  • 61
    • 51849149120
    • Improving speaker recognition performance using phonetically structured Gaussian mixture models
    • Aalborg, Denmark, September 2001
    • Faltlhauser, R., Ruske, G., 2001. Improving speaker recognition performance using phonetically structured Gaussian mixture models. In: Proc. Seventh European Conf. on Speech Communication and Technology (Eurospeech 2001), Aalborg, Denmark, September 2001, pp. 751-754.
    • (2001) Proc. Seventh European Conf. on Speech Communication and Technology (Eurospeech 2001) , pp. 751-754
    • Faltlhauser, R.1    Ruske, G.2
  • 62
    • 0028204659
    • Speaker recognition using neural networks and conventional classifiers
    • Farrell K., Mammone R., and Assaleh K. Speaker recognition using neural networks and conventional classifiers. IEEE Trans. Speech Audio Process. 2 1 (1994) 194-205
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.1 , pp. 194-205
    • Farrell, K.1    Mammone, R.2    Assaleh, K.3
  • 65
    • 85084011389
    • Improving the performance of text-independent short duration SVM- and GMM-based speaker verification
    • Stellenbosch, South Africa, January 2008, Paper 018
    • Fauve, B., Evans, N., Mason, J., 2008. Improving the performance of text-independent short duration SVM- and GMM-based speaker verification. In: The Speaker and Language Recognition Workshop (Odyssey 2008), Stellenbosch, South Africa, January 2008. Paper 018.
    • (2008) The Speaker and Language Recognition Workshop (Odyssey 2008)
    • Fauve, B.1    Evans, N.2    Mason, J.3
  • 68
    • 85084011884
    • An anticorrelation kernel for improved system combination in speaker verification
    • Stellenbosch, South Africa, January 2008, Paper 022
    • Ferrer, L., Sönmez, K., Shriberg, E., 2008. An anticorrelation kernel for improved system combination in speaker verification. In: The Speaker and Language Recognition Workshop (Odyssey 2008), Stellenbosch, South Africa, January 2008. Paper 022.
    • (2008) The Speaker and Language Recognition Workshop (Odyssey 2008)
    • Ferrer, L.1    Sönmez, K.2    Shriberg, E.3
  • 69
    • 0033878650
    • AMIRAL: a block-segmental multirecognizer architecture for automatic speaker recognition
    • Fredouille C., Bonastre J.-F., and Merlin T. AMIRAL: a block-segmental multirecognizer architecture for automatic speaker recognition. Digital Signal Process. 10 1-3 (2000) 172-197
    • (2000) Digital Signal Process. , vol.10 , Issue.1-3 , pp. 172-197
    • Fredouille, C.1    Bonastre, J.-F.2    Merlin, T.3
  • 70
    • 0019555090
    • Cepstral analysis technique for automatic speaker verification
    • Furui S. Cepstral analysis technique for automatic speaker verification. IEEE Trans. Acoustics, Speech Signal Process. 29 2 (1981) 254-272
    • (1981) IEEE Trans. Acoustics, Speech Signal Process. , vol.29 , Issue.2 , pp. 254-272
    • Furui, S.1
  • 71
    • 0031223555
    • Recent advances in speaker recognition
    • Furui S. Recent advances in speaker recognition. Pattern Recognition Lett. 18 9 (1997) 859-872
    • (1997) Pattern Recognition Lett. , vol.18 , Issue.9 , pp. 859-872
    • Furui, S.1
  • 75
    • 51849095864
    • Pitch synchronous based feature extraction for noise-robust speaker verification
    • May
    • Gong, W.-G., Yang, L.-P., Chen, D., 2008. Pitch synchronous based feature extraction for noise-robust speaker verification. In: Proc. Image and Signal Processing (CISP 2008), Vol. 5, (May 2008), pp. 295-298.
    • (2008) Proc. Image and Signal Processing (CISP 2008) , vol.5 , pp. 295-298
    • Gong, W.-G.1    Yang, L.-P.2    Chen, D.3
  • 77
    • 0032628065
    • A comparison of speaker identification results using features based on cepstrum and Fourier-Bessel expansion
    • Gopalan K., Anderson T., and Cupples E. A comparison of speaker identification results using features based on cepstrum and Fourier-Bessel expansion. IEEE Trans. Speech Audio Process. 7 3 (1999) 289-294
    • (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.3 , pp. 289-294
    • Gopalan, K.1    Anderson, T.2    Cupples, E.3
  • 79
    • 44049123087
    • Text-independent speaker verification based on broad phonetic segmentation of speech
    • Gupta S., and Savic M. Text-independent speaker verification based on broad phonetic segmentation of speech. Digital Signal Process. 2 2 (1992) 69-79
    • (1992) Digital Signal Process. , vol.2 , Issue.2 , pp. 69-79
    • Gupta, S.1    Savic, M.2
  • 83
    • 0017851927
    • On the use of windows for harmonic analysis with the discrete Fourier transform
    • Harris F. On the use of windows for harmonic analysis with the discrete Fourier transform. Proc. IEEE 66 1 (1978) 51-84
    • (1978) Proc. IEEE , vol.66 , Issue.1 , pp. 51-84
    • Harris, F.1
  • 84
    • 33947695242
    • Generalized linear kernels for one-versus-all classification: Application to speaker recognition
    • Toulouse, France, May
    • Hatch, A., Stolcke, A., 2006. Generalized linear kernels for one-versus-all classification: application to speaker recognition. In: Proc. Internat. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2006), Toulouse, France, May 2006, pp. 585-588.
    • (2006) Proc. Internat. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2006) , pp. 585-588
    • Hatch, A.1    Stolcke, A.2
  • 85
    • 33846200839
    • Combining feature sets with support vector machines: application to speaker recognition
    • Hatch, A., Stolcke, A., Peskin, B., 2005. Combining feature sets with support vector machines: application to speaker recognition. In: The 2005 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), November 2005, pp. 75-79.
  • 86
    • 44949114401
    • Within-class covariance normalization for SVM-based speaker recognition
    • Pittsburgh, Pennsylvania, USA, September
    • Hatch, A., Kajarekar, S., Stolcke, A., 2006. Within-class covariance normalization for SVM-based speaker recognition. In: Proc. Interspeech 2006 (ICSLP), Pittsburgh, Pennsylvania, USA, September 2006, pp. 1471-1474.
    • (2006) Proc. Interspeech 2006 (ICSLP) , pp. 1471-1474
    • Hatch, A.1    Kajarekar, S.2    Stolcke, A.3
  • 88
    • 43249122568
    • Text-independent speaker recognition using graph matching
    • Hautamäki V., Kinnunen T., and Fränti P. Text-independent speaker recognition using graph matching. Pattern Recognition Lett. 29 9 (2008) 1427-1432
    • (2008) Pattern Recognition Lett. , vol.29 , Issue.9 , pp. 1427-1432
    • Hautamäki, V.1    Kinnunen, T.2    Fränti, P.3
  • 90
    • 85024895429
    • Text-dependent speaker recognition
    • Benesty J., Sondhi M., and Huang Y. (Eds), Springer-Verlag, Heidelberg
    • Hébert M. Text-dependent speaker recognition. In: Benesty J., Sondhi M., and Huang Y. (Eds). Springer Handbook of Speech Processing (2008), Springer-Verlag, Heidelberg 743-762
    • (2008) Springer Handbook of Speech Processing , pp. 743-762
    • Hébert, M.1
  • 92
    • 85009242711
    • Combining speaker and speech recognition systems
    • Denver, Colorado, USA, September 2002
    • Heck, L., Genoud, D., 2002. Combining speaker and speech recognition systems. In: Proc. Internat. Conf. on Spoken Language Processing (ICSLP 2002), Denver, Colorado, USA, September 2002, pp. 1369-1372.
    • (2002) Proc. Internat. Conf. on Spoken Language Processing (ICSLP 2002) , pp. 1369-1372
    • Heck, L.1    Genoud, D.2
  • 93
    • 0030710668
    • Handset-dependent background models for robust text-independent speaker recognition
    • Munich, Germany, April
    • Heck, L., and Weintraub, M. 1997. Handset-dependent background models for robust text-independent speaker recognition. In Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 1997) (Munich, Germany, April 1997), pp. 1071-1074.
    • (1997) Proc. Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 1997) , pp. 1071-1074
    • Heck, L.1    Weintraub, M.2
  • 94
    • 0033746018
    • Robustness to telephone handset distortion in speaker recognition by discriminative feature design
    • Heck L., Konig Y., Sönmez M., and Weintraub M. Robustness to telephone handset distortion in speaker recognition by discriminative feature design. Speech Comm. 31 (2000) 181-192
    • (2000) Speech Comm. , vol.31 , pp. 181-192
    • Heck, L.1    Konig, Y.2    Sönmez, M.3    Weintraub, M.4
  • 95
    • 4544293687
    • Application of the modified group delay function to speaker identification and discrimination
    • Montreal, Canada, May
    • Hegde, R., Murthy, H., Rao, G., 2004. Application of the modified group delay function to speaker identification and discrimination. In: Proc. Internat. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2004), Vol. 1, Montreal, Canada, May 2004, pp. 517-520.
    • (2004) Proc. Internat. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2004) , vol.1 , pp. 517-520
    • Hegde, R.1    Murthy, H.2    Rao, G.3
  • 96
    • 0032675222
    • A discriminative training algorithm for VQ-based speaker identification
    • He J., Liu L., and Palm G. A discriminative training algorithm for VQ-based speaker identification. IEEE Trans. Speech Audio Process. 7 3 (1999) 353-356
    • (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.3 , pp. 353-356
    • He, J.1    Liu, L.2    Palm, G.3
  • 97
    • 0025041264
    • Perceptual linear prediction (PLP) analysis for speech
    • Hermansky H. Perceptual linear prediction (PLP) analysis for speech. J. Acoust. Soc. Amer. 87 (1990) 1738-1752
    • (1990) J. Acoust. Soc. Amer. , vol.87 , pp. 1738-1752
    • Hermansky, H.1
  • 98
    • 0032139768
    • Should recognizers have ears?
    • Hermansky H. Should recognizers have ears?. Speech Comm. 25 1-3 (1998) 3-27
    • (1998) Speech Comm. , vol.25 , Issue.1-3 , pp. 3-27
    • Hermansky, H.1
  • 101
    • 0003019863
    • Speaker verification using randomized phrase prompting
    • Higgins A., Bahler L., and Porter J. Speaker verification using randomized phrase prompting. Digital Signal Process. 1 (1991) 89-106
    • (1991) Digital Signal Process. , vol.1 , pp. 89-106
    • Higgins, A.1    Bahler, L.2    Porter, J.3
  • 103
    • 0031224204
    • A study of harmonic features for the speaker recognition
    • Imperl B., Kacic Z., and Horvat B. A study of harmonic features for the speaker recognition. Speech Comm. 22 4 (1997) 385-402
    • (1997) Speech Comm. , vol.22 , Issue.4 , pp. 385-402
    • Imperl, B.1    Kacic, Z.2    Horvat, B.3
  • 105
    • 0036947697
    • Learning statistically efficient features for speaker recognition
    • Jang G.-J., Lee T.-W., and Oh Y.-H. Learning statistically efficient features for speaker recognition. Neurocomputing 49 (2002) 329-348
    • (2002) Neurocomputing , vol.49 , pp. 329-348
    • Jang, G.-J.1    Lee, T.-W.2    Oh, Y.-H.3
  • 108
    • 51449096374
    • A new kernel for SVM MLLR based speaker recognition
    • Antwerp, Belgium, August
    • Karam, Z., Campbell, W., 2007. A new kernel for SVM MLLR based speaker recognition. In: Proc. Interspeech 2007 (ICSLP), Antwerp, Belgium, August 2007, pp. 290-293.
    • (2007) Proc. Interspeech 2007 (ICSLP) , pp. 290-293
    • Karam, Z.1    Campbell, W.2
  • 110
    • 33947637189
    • Joint factor analysis of speaker and session variability: Theory and algorithms
    • Technical Report CRIM-06/08-14
    • Kenny, P., 2006. Joint factor analysis of speaker and session variability: theory and algorithms. Technical Report CRIM-06/08-14.
    • (2006)
    • Kenny, P.1
  • 113
    • 85009240235
    • Designing a speaker-discriminative adaptive filter bank for speaker recognition
    • Denver, Colorado, USA, September 2002
    • Kinnunen, T., 2002. Designing a speaker-discriminative adaptive filter bank for speaker recognition. In: Proc. Internat. Conf. on Spoken Language Processing (ICSLP 2002), Denver, Colorado, USA, September 2002, pp. 2325-2328.
    • (2002) Proc. Internat. Conf. on Spoken Language Processing (ICSLP 2002) , pp. 2325-2328
    • Kinnunen, T.1
  • 116
  • 119
    • 33845543841
    • Fusion of spectral feature sets for accurate speaker identification
    • St. Petersburg, Russia, September 2004
    • Kinnunen, T., Hautamäki, V., Fränti, P., 2004. Fusion of spectral feature sets for accurate speaker identification. In: Proc. Ninth Internat. Conf. on Speech and Computer (SPECOM 2004), St. Petersburg, Russia, September 2004, pp. 361-365.
    • (2004) Proc. Ninth Internat. Conf. on Speech and Computer (SPECOM 2004) , pp. 361-365
    • Kinnunen, T.1    Hautamäki, V.2    Fränti, P.3
  • 123
    • 37849045632
    • Speaker verification with adaptive spectral subband centroids
    • Seoul, Korea, August 2007
    • Kinnunen, T., Zhang, B., Zhu, J., Wang, Y., 2007. Speaker verification with adaptive spectral subband centroids. In: Proc. Internat. Conf. on Biometrics (ICB 2007), Seoul, Korea, August 2007, pp. 58-66.
    • (2007) Proc. Internat. Conf. on Biometrics (ICB 2007) , pp. 58-66
    • Kinnunen, T.1    Zhang, B.2    Zhu, J.3    Wang, Y.4
  • 124
    • 85084015242
    • Dimension reduction of the modulation spectrogram for speaker verification
    • Stellenbosch, South Africa, January 2008
    • Kinnunen, T., Lee, K.-A., Li, H., 2008. Dimension reduction of the modulation spectrogram for speaker verification. In: The Speaker and Language Recognition Workshop (Odyssey 2008), Stellenbosch, South Africa, January 2008.
    • (2008) The Speaker and Language Recognition Workshop (Odyssey 2008)
    • Kinnunen, T.1    Lee, K.-A.2    Li, H.3
  • 125
    • 58349105008
    • Comparative evaluation of maximum a posteriori vector quantization and Gaussian mixture models in speaker verification
    • Kinnunen T., Saastamoinen J., Hautamäki V., Vinni M., and Fränti P. Comparative evaluation of maximum a posteriori vector quantization and Gaussian mixture models in speaker verification. Pattern Recognition Lett. 30 4 (2009) 341-347
    • (2009) Pattern Recognition Lett. , vol.30 , Issue.4 , pp. 341-347
    • Kinnunen, T.1    Saastamoinen, J.2    Hautamäki, V.3    Vinni, M.4    Fränti, P.5
  • 126
    • 84867211827
    • Acoustic analysis of imitated voice produced by a professional impersonator
    • September 2008
    • Kitamura, T., 2008. Acoustic analysis of imitated voice produced by a professional impersonator. In: Proc. Interspeech 2008, September 2008, pp. 813-816.
    • (2008) Proc. Interspeech , pp. 813-816
    • Kitamura, T.1
  • 130
    • 0036650810
    • Unsupervised speaker recognition based on competition between self-organizing maps
    • Lapidot I., Guterman H., and Cohen A. Unsupervised speaker recognition based on competition between self-organizing maps. IEEE Trans. Neural Networks 13 (2002) 877-887
    • (2002) IEEE Trans. Neural Networks , vol.13 , pp. 877-887
    • Lapidot, I.1    Guterman, H.2    Cohen, A.3
  • 131
    • 70349209406
    • Modeling instantaneous intonation for speaker identification using the fundamental frequency variation spectrum
    • Taipei, Taiwan, April
    • Laskowski, K., Jin, Q., 2009. Modeling instantaneous intonation for speaker identification using the fundamental frequency variation spectrum. In: Proc. Internat. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, April 2009, pp. 4541-4544.
    • (2009) Proc. Internat. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2009) , pp. 4541-4544
    • Laskowski, K.1    Jin, Q.2
  • 132
    • 51449124381
    • A GMM-based probabilistic sequence kernel for speaker verification
    • Antwerp, Belgium, August
    • Lee, K.-A., You, C., Li, H., Kinnunen, T., 2007. A GMM-based probabilistic sequence kernel for speaker verification. In: Proc. Interspeech 2007 (ICSLP), Antwerp, Belgium, August 2007, pp. 294-297.
    • (2007) Proc. Interspeech 2007 (ICSLP) , pp. 294-297
    • Lee, K.-A.1    You, C.2    Li, H.3    Kinnunen, T.4
  • 133
    • 84867218500
    • Characterizing speech utterances for speaker verification with sequence kernel SVM
    • Brisbane, Australia, September 2008
    • Lee, K., You, C., Li, H., Kinnunen, T., Zhu, D., 2008. Characterizing speech utterances for speaker verification with sequence kernel SVM. In: Proc. Ninth Interspeech (Interspeech 2008), Brisbane, Australia, September 2008, pp. 1397-1400.
    • (2008) Proc. Ninth Interspeech (Interspeech 2008) , pp. 1397-1400
    • Lee, K.1    You, C.2    Li, H.3    Kinnunen, T.4    Zhu, D.5
  • 134
    • 29044433161
    • NIST and NFI-TNO evaluations of automatic speaker recognition
    • Leeuwen D., Martin A., Przybocki M., and Bouten J. NIST and NFI-TNO evaluations of automatic speaker recognition. Comput. Speech Lang. 20 (2006) 128-158
    • (2006) Comput. Speech Lang. , vol.20 , pp. 128-158
    • Leeuwen, D.1    Martin, A.2    Przybocki, M.3    Bouten, J.4
  • 135
    • 0029288633
    • Maximum likelihood linear regression for speaker adaptation of continuous density HMMs
    • Leggetter C., and Woodland P. Maximum likelihood linear regression for speaker adaptation of continuous density HMMs. Comput. Speech Lang. 9 (1995) 171-185
    • (1995) Comput. Speech Lang. , vol.9 , pp. 171-185
    • Leggetter, C.1    Woodland, P.2
  • 136
    • 85164647214
    • Word-conditioned HMM supervectors for speaker recognition
    • Antwerp, Belgium, August
    • Lei, H., Mirghafori, N., 2007. Word-conditioned HMM supervectors for speaker recognition. In: Proc. Interspeech 2007 (ICSLP), Antwerp, Belgium, August 2007, pp. 746-749.
    • (2007) Proc. Interspeech 2007 (ICSLP) , pp. 746-749
    • Lei, H.1    Mirghafori, N.2
  • 137
    • 28644451546
    • Adaptive articulatory feature-based conditional pronunciation modeling for speaker verification
    • Leung K., Mak M., Siu M., and Kung S. Adaptive articulatory feature-based conditional pronunciation modeling for speaker verification. Speech Comm. 48 1 (2006) 71-84
    • (2006) Speech Comm. , vol.48 , Issue.1 , pp. 71-84
    • Leung, K.1    Mak, M.2    Siu, M.3    Kung, S.4
  • 140
    • 0018918171
    • An algorithm for vector quantizer design
    • Linde Y., Buzo A., and Gray R. An algorithm for vector quantizer design. IEEE Trans. Comm. 28 1 (1980) 84-95
    • (1980) IEEE Trans. Comm. , vol.28 , Issue.1 , pp. 84-95
    • Linde, Y.1    Buzo, A.2    Gray, R.3
  • 141
    • 70350101559
    • Combining derivative and parametric kernels for speaker verification
    • Longworth C., and Gales M. Combining derivative and parametric kernels for speaker verification. IEEE Trans. Audio, Speech Language Process. 6 1 (2007) 1-10
    • (2007) IEEE Trans. Audio, Speech Language Process. , vol.6 , Issue.1 , pp. 1-10
    • Longworth, C.1    Gales, M.2
  • 144
    • 40249090511
    • An investigation of dependencies between frequency components and speaker characteristics for text-independent speaker identification
    • Lu X., and Dang J. An investigation of dependencies between frequency components and speaker characteristics for text-independent speaker identification. Speech Comm. 50 4 (2007) 312-322
    • (2007) Speech Comm. , vol.50 , Issue.4 , pp. 312-322
    • Lu, X.1    Dang, J.2
  • 146
    • 44949143337
    • Speaker cluster based GMM tokenization for speaker recognition
    • Pittsburgh, Pennsylvania, USA, September
    • Ma, B., Zhu, D., Tong, R., Li, H., 2006. Speaker cluster based GMM tokenization for speaker recognition. In: Proc. Interspeech 2006 (ICSLP), Pittsburgh, Pennsylvania, USA, September 2006, pp. 505-508.
    • (2006) Proc. Interspeech 2006 (ICSLP) , pp. 505-508
    • Ma, B.1    Zhu, D.2    Tong, R.3    Li, H.4
  • 147
    • 60849102345
    • Spoken language recognition with ensemble classifiers
    • Ma B., Li H., and Tong R. Spoken language recognition with ensemble classifiers. IEEE Trans. Audio, Speech Language Process. 15 7 (2007) 2053-2062
    • (2007) IEEE Trans. Audio, Speech Language Process. , vol.15 , Issue.7 , pp. 2053-2062
    • Ma, B.1    Li, H.2    Tong, R.3
  • 148
    • 0036754056
    • Application of time-frequency principal component analysis to text-independent speaker identification
    • Magrin-Chagnolleau I., Durou G., and Bimbot F. Application of time-frequency principal component analysis to text-independent speaker identification. IEEE Trans. Speech Audio Process. 10 6 (2002) 371-378
    • (2002) IEEE Trans. Speech Audio Process. , vol.10 , Issue.6 , pp. 371-378
    • Magrin-Chagnolleau, I.1    Durou, G.2    Bimbot, F.3
  • 149
    • 2942532899
    • Stochastic feature transformation with divergence-based out-of-handset rejection for robust speaker verification
    • Mak M.-W., and Tsang C.-L. Stochastic feature transformation with divergence-based out-of-handset rejection for robust speaker verification. EURASIP J. Appl. Signal Process. 4 (2004) 452-465
    • (2004) EURASIP J. Appl. Signal Process. , vol.4 , pp. 452-465
    • Mak, M.-W.1    Tsang, C.-L.2
  • 150
    • 0141676592
    • Robust speaker verification from GSM-transcoded speech based on decision fusion and feature transformation
    • Hong Kong, China, April
    • Mak, M.-W., Cheung, M., Kung, S., 2003. Robust speaker verification from GSM-transcoded speech based on decision fusion and feature transformation. In: Proc. Internat. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2003), Vol. 2, Hong Kong, China, April 2003, pp. 745-748.
    • (2003) Proc. Internat. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2003) , vol.2 , pp. 745-748
    • Mak, M.-W.1    Cheung, M.2    Kung, S.3
  • 151
    • 33947670488
    • A comparison of various adaptation methods for speaker verification with limited enrollment data
    • Toulouse, France, May
    • Mak, M.-W., Hsiao, R., Mak, B., 2006. A comparison of various adaptation methods for speaker verification with limited enrollment data. In: Proc. Internat. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2006), Vol. 1, Toulouse, France, May 2006, pp. 929-932.
    • (2006) Proc. Internat. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2006) , vol.1 , pp. 929-932
    • Mak, M.-W.1    Hsiao, R.2    Mak, B.3
  • 152
    • 0016495091
    • Linear prediction: a tutorial review
    • Makhoul J. Linear prediction: a tutorial review. Proc. IEEE 63 4 (1975) 561-580
    • (1975) Proc. IEEE , vol.63 , Issue.4 , pp. 561-580
    • Makhoul, J.1
  • 153
    • 0033885411
    • Data-driven temporal filters and alternatives to GMM in speaker verification
    • Malayath N., Hermansky H., Kajarekar S., and Yegnanarayana B. Data-driven temporal filters and alternatives to GMM in speaker verification. Digital Signal Process. 10 1-3 (2000) 55-74
    • (2000) Digital Signal Process. , vol.10 , Issue.1-3 , pp. 55-74
    • Malayath, N.1    Hermansky, H.2    Kajarekar, S.3    Yegnanarayana, B.4
  • 154
    • 29444456613
    • Speaker recognition by location in the space of reference speakers
    • Mami Y., and Charlet D. Speaker recognition by location in the space of reference speakers. Speech Comm. 48 2 (2006) 127-141
    • (2006) Speech Comm. , vol.48 , Issue.2 , pp. 127-141
    • Mami, Y.1    Charlet, D.2
  • 155
  • 156
    • 85009266907
    • A comparative study of adaptation methods for speaker verification
    • Denver, Colorado, USA, September, 2002
    • Mariéthoz, J., Bengio, S., 2002. A comparative study of adaptation methods for speaker verification. In: Proc. Internat. Conf. on Spoken Language Processing (ICSLP 2002), Denver, Colorado, USA, September 2002, pp. 581-584.
    • (2002) Proc. Internat. Conf. on Spoken Language Processing (ICSLP 2002) , pp. 581-584
    • Mariéthoz, J.1    Bengio, S.2
  • 159
    • 44949117862
    • Prosodic features for speaker verification
    • Pittsburgh, Pennsylvania, USA, September
    • Mary, L., Yegnanarayana, B., 2006. Prosodic features for speaker verification. In: Proc. Interspeech 2006 (ICSLP), Pittsburgh, Pennsylvania, USA, September 2006, pp. 917-920.
    • (2006) Proc. Interspeech 2006 (ICSLP) , pp. 917-920
    • Mary, L.1    Yegnanarayana, B.2
  • 160
    • 52949094265
    • Extraction and representation of prosodic features for language and speaker recognition
    • Mary L., and Yegnanarayana B. Extraction and representation of prosodic features for language and speaker recognition. Speech Comm. 50 10 (2008) 782-796
    • (2008) Speech Comm. , vol.50 , Issue.10 , pp. 782-796
    • Mary, L.1    Yegnanarayana, B.2
  • 161
    • 33745207323
    • Data-driven clustering for blind feature mapping in speaker verification
    • Lisboa, Portugal, September, 2005
    • Mason, M., Vogt, R., Baker, B., Sridharan, S., 2005. Data-driven clustering for blind feature mapping in speaker verification. In: Proc. Interspeech 2005, Lisboa, Portugal, September 2005, pp. 3109-3112.
    • (2005) Proc. Interspeech 2005 , pp. 3109-3112
    • Mason, M.1    Vogt, R.2    Baker, B.3    Sridharan, S.4
  • 163
    • 0037290421
    • Speaker-specific mapping for text-independent speaker recognition
    • Misra H., Ikbal S., and Yegnanarayana B. Speaker-specific mapping for text-independent speaker recognition. Speech Comm. 39 3-4 (2003) 301-310
    • (2003) Speech Comm. , vol.39 , Issue.3-4 , pp. 301-310
    • Misra, H.1    Ikbal, S.2    Yegnanarayana, B.3
  • 164
    • 0035478790
    • A new approach to designing a feature extractor in speaker identification based on discriminative feature extraction
    • Miyajima C., Watanabe H., Tokuda K., Kitamura T., and Katagiri S. A new approach to designing a feature extractor in speaker identification based on discriminative feature extraction. Speech Comm. 35 (2001) 203-218
    • (2001) Speech Comm. , vol.35 , pp. 203-218
    • Miyajima, C.1    Watanabe, H.2    Tokuda, K.3    Kitamura, T.4    Katagiri, S.5
  • 165
    • 0034850631
    • A committee of neural networks for automatic speaker recognition (ASR) systems
    • Washington, DC, USA, July, 2001
    • Moonasar, V., Venayagamoorthy, G., 2001. A committee of neural networks for automatic speaker recognition (ASR) systems. In: Proc. Internat. Joint Conf. on Neural Networks (IJCNN 2001), Washington, DC, USA, July 2001, pp. 2936-2940.
    • (2001) Proc. Internat. Joint Conf. on Neural Networks (IJCNN 2001) , pp. 2936-2940
    • Moonasar, V.1    Venayagamoorthy, G.2
  • 166
    • 84871413152
    • Müller C. (Ed), Springer
    • In: Müller C. (Ed). Speaker Classification I: Fundamentals, Features, and Methods. Lecture Notes in Computer Science Vol. 4343 (2007), Springer
    • (2007) Lecture Notes in Computer Science , vol.4343
  • 167
    • 38149019216
    • Müller C. (Ed), Springer
    • In: Müller C. (Ed). Speaker Classification II: Selected Projects. Lecture Notes in Computer Science Vol. 4441 (2007), Springer
    • (2007) Lecture Notes in Computer Science , vol.4441
  • 169
    • 30444446629
    • Combining evidence from residual phase and MFCC features for speaker recognition
    • Murty K., and Yegnanarayana B. Combining evidence from residual phase and MFCC features for speaker recognition. IEEE Signal Process. Lett. 13 1 (2006) 52-55
    • (2006) IEEE Signal Process. Lett. , vol.13 , Issue.1 , pp. 52-55
    • Murty, K.1    Yegnanarayana, B.2
  • 172
    • 0012078715
    • Statistical language modeling using leaving-one-out
    • Young S., and Bloothooft G. (Eds), Kluwer Academic Publishers
    • Ney H., Martin S., and Wessel F. Statistical language modeling using leaving-one-out. In: Young S., and Bloothooft G. (Eds). Corpus-based Methods in Language and Speech Processing (1997), Kluwer Academic Publishers 174-207
    • (1997) Corpus-based Methods in Language and Speech Processing , pp. 174-207
    • Ney, H.1    Martin, S.2    Wessel, F.3
  • 174
    • 70350112676
    • September
    • NIST 2008 SRE results page, September 2008.
    • (2008) SRE results page
  • 179
    • 85009291564
    • ASR dependent techniques for speaker identification
    • Denver, Colorado, USA, September, 2002
    • Park, A., Hazen, T., 2002. ASR dependent techniques for speaker identification. In: Proc. Internat. Conf. on Spoken Language Processing (ICSLP 2002), Denver, Colorado, USA, September 2002, pp. 1337-1340.
    • (2002) Proc. Internat. Conf. on Spoken Language Processing (ICSLP 2002) , pp. 1337-1340
    • Park, A.1    Hazen, T.2
  • 182
    • 0032207163
    • An efficient scoring algorithm for gaussian mixture model based speaker identification
    • Pellom B., and Hansen J. An efficient scoring algorithm for gaussian mixture model based speaker identification. IEEE Signal Process. Lett. 5 11 (1998) 281-284
    • (1998) IEEE Signal Process. Lett. , vol.5 , Issue.11 , pp. 281-284
    • Pellom, B.1    Hansen, J.2
  • 185
    • 0032595183
    • Modeling of the glottal flow derivative waveform with application to speaker identification
    • Plumpe M., Quatieri T., and Reynolds D. Modeling of the glottal flow derivative waveform with application to speaker identification. IEEE Trans. Speech Audio Process. 7 5 (1999) 569-586
    • (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.5 , pp. 569-586
    • Plumpe, M.1    Quatieri, T.2    Reynolds, D.3
  • 186
    • 4544304036
    • Why do multi-stream, multi-band and multi-modal approaches work on biometric user authentication tasks?
    • Montreal, Canada, May
    • Poh, N., Bengio, S., 2004. Why do multi-stream, multi-band and multi-modal approaches work on biometric user authentication tasks? In: Proc. Internat. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2004), Vol. 5, Montreal, Canada, May 2004, pp. 893-896.
    • (2004) Proc. Internat. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2004) , vol.5 , pp. 893-896
    • Poh, N.1    Bengio, S.2
  • 187
    • 33748443739
    • Extraction of speaker-specific excitation information from linear prediction residual of speech
    • Prasanna S., Gupta C., and Yegnanarayana B. Extraction of speaker-specific excitation information from linear prediction residual of speech. Speech Comm. 48 (2006) 1243-1261
    • (2006) Speech Comm. , vol.48 , pp. 1243-1261
    • Prasanna, S.1    Gupta, C.2    Yegnanarayana, B.3
  • 188
    • 62349134627
    • NIST speaker recognition evaluations utilizing the mixer corpora - 2004, 2005, 2006
    • Przybocki M.A., Martin A., and Le A. NIST speaker recognition evaluations utilizing the mixer corpora - 2004, 2005, 2006. IEEE Trans. Audio, Speech Language Process. 15 7 (2007) 1951-1959
    • (2007) IEEE Trans. Audio, Speech Language Process. , vol.15 , Issue.7 , pp. 1951-1959
    • Przybocki, M.A.1    Martin, A.2    Le, A.3
  • 190
    • 0036887596
    • Speaker recognition - general classifier approaches and data fusion methods
    • Ramachandran R., Farrell K., Ramachandran R., and Mammone R. Speaker recognition - general classifier approaches and data fusion methods. Pattern Recognition 35 (2002) 2801-2821
    • (2002) Pattern Recognition , vol.35 , pp. 2801-2821
    • Ramachandran, R.1    Farrell, K.2    Ramachandran, R.3    Mammone, R.4
  • 191
    • 1842476689
    • Efficient voice activity detection algorithms using long-term speech information
    • Ramirez J., Segura J., Benítez C., de la Torre A., and Rubio A. Efficient voice activity detection algorithms using long-term speech information. Speech Comm. 42 3-4 (2004) 271-287
    • (2004) Speech Comm. , vol.42 , Issue.3-4 , pp. 271-287
    • Ramirez, J.1    Segura, J.2    Benítez, C.3    de la Torre, A.4    Rubio, A.5
  • 193
    • 0029355999
    • Speaker identification and verification using Gaussian mixture speaker models
    • Reynolds D. Speaker identification and verification using Gaussian mixture speaker models. Speech Comm. 17 (1995) 91-108
    • (1995) Speech Comm. , vol.17 , pp. 91-108
    • Reynolds, D.1
  • 195
    • 0029209272
    • Robust text-independent speaker identification using Gaussian mixture speaker models
    • Reynolds D., and Rose R. Robust text-independent speaker identification using Gaussian mixture speaker models. IEEE Trans. Speech Audio Process. 3 (1995) 72-83
    • (1995) IEEE Trans. Speech Audio Process. , vol.3 , pp. 72-83
    • Reynolds, D.1    Rose, R.2
  • 196
    • 0033884858
    • Speaker verification using adapted gaussian mixture models
    • Reynolds D., Quatieri T., and Dunn R. Speaker verification using adapted gaussian mixture models. Digital Signal Process. 10 1 (2000) 19-41
    • (2000) Digital Signal Process. , vol.10 , Issue.1 , pp. 19-41
    • Reynolds, D.1    Quatieri, T.2    Dunn, R.3
  • 199
    • 28644435158
    • Gaussian-selection-based non-optimal search for speaker identification
    • Roch M. Gaussian-selection-based non-optimal search for speaker identification. Speech Comm. 48 (2006) 85-95
    • (2006) Speech Comm. , vol.48 , pp. 85-95
    • Roch, M.1
  • 205
    • 0038035138
    • Sub-band based text-dependent speaker verification
    • Sivakumaran P., Ariyaeeinia A., and Loomes M. Sub-band based text-dependent speaker verification. Speech Comm. 41 (2003) 485-509
    • (2003) Speech Comm. , vol.41 , pp. 485-509
    • Sivakumaran, P.1    Ariyaeeinia, A.2    Loomes, M.3
  • 207
    • 85009209978
    • A comparison of fusion techniques in mel-cepstral based speaker identification
    • Sydney, Australia, November, 1998
    • Slomka, S., Sridharan, S., Chandran, V., 1998. A comparison of fusion techniques in mel-cepstral based speaker identification. In: Proc. Internat. Conf. on Spoken Language Processing (ICSLP 1998), Sydney, Australia, November 1998, pp. 225-228.
    • (1998) Proc. Internat. Conf. on Spoken Language Processing (ICSLP 1998) , pp. 225-228
    • Slomka, S.1    Sridharan, S.2    Chandran, V.3
  • 209
    • 64249133832
    • Using post-classifiers to enhance fusion of low- and high-level speaker recognition
    • Solewicz Y., and Koppel M. Using post-classifiers to enhance fusion of low- and high-level speaker recognition. IEEE Trans. Audio, Speech Language Process. 15 7 (2007) 2063-2071
    • (2007) IEEE Trans. Audio, Speech Language Process. , vol.15 , Issue.7 , pp. 2063-2071
    • Solewicz, Y.1    Koppel, M.2
  • 213
    • 0024035182
    • On the use of instantaneous and transitional spectral information in speaker recognition
    • Soong F., and Rosenberg A. On the use of instantaneous and transitional spectral information in speaker recognition. IEEE Trans. Acoustics, Speech Signal Process. 36 6 (1988) 871-879
    • (1988) IEEE Trans. Acoustics, Speech Signal Process. , vol.36 , Issue.6 , pp. 871-879
    • Soong, F.1    Rosenberg, A.2
  • 215
    • 51449111842
    • Speaker recognition with session variability normalization based on MLLR adaptation transforms
    • Stolcke A., Kajarekar S., Ferrer L., and Shriberg E. Speaker recognition with session variability normalization based on MLLR adaptation transforms. IEEE Trans. Audio, Speech Language Process. 15 7 (2007) 1987-1998
    • (2007) IEEE Trans. Audio, Speech Language Process. , vol.15 , Issue.7 , pp. 1987-1998
    • Stolcke, A.1    Kajarekar, S.2    Ferrer, L.3    Shriberg, E.4
  • 217
  • 219
    • 84988224855
    • A model-based transformational approach to robust speaker recognition
    • Beijing, China, October, 2000
    • Teunen, R., Shahshahani, B., Heck, L., 2000. A model-based transformational approach to robust speaker recognition. In: Proc. Internat. Conf. on Spoken Language Processing (ICSLP 2000), Vol. 2, Beijing, China, October 2000, pp. 495-498.
    • (2000) Proc. Internat. Conf. on Spoken Language Processing (ICSLP 2000) , vol.2 , pp. 495-498
    • Teunen, R.1    Shahshahani, B.2    Heck, L.3
  • 220
    • 0029356550
    • Usefulness of the LPC-residue in text-independent speaker verification
    • Thévenaz P., and Hügli H. Usefulness of the LPC-residue in text-independent speaker verification. Speech Comm. 17 1-2 (1995) 145-157
    • (1995) Speech Comm. , vol.17 , Issue.1-2 , pp. 145-157
    • Thévenaz, P.1    Hügli, H.2
  • 221
    • 35048877248
    • Spectral subband centroids as complementary features for speaker authentication
    • Hong Kong, China, July, 2004
    • Thian, N., Sanderson, C., Bengio, S., 2004. Spectral subband centroids as complementary features for speaker authentication. In: Proc. First Internat. Conf. on Biometric Authentication (ICBA 2004), Hong Kong, China, July 2004, pp. 631-639.
    • (2004) Proc. First Internat. Conf. on Biometric Authentication (ICBA 2004) , pp. 631-639
    • Thian, N.1    Sanderson, C.2    Bengio, S.3
  • 222
    • 40749129313
    • Extraction of FM components from speech signals using all-pole model
    • Thiruvaran T., Ambikairajah E., and Epps J. Extraction of FM components from speech signals using all-pole model. Electronics Lett. 44 6 (2008)
    • (2008) Electronics Lett. , vol.44 , Issue.6
    • Thiruvaran, T.1    Ambikairajah, E.2    Epps, J.3
  • 223
    • 84867205929
    • FM features for automatic forensic speaker recognition
    • Brisbane, Australia, September, 2008
    • Thiruvaran, T., Ambikairajah, E., Epps, J., 2008. FM features for automatic forensic speaker recognition. In: Proc. Interspeech 2008, Brisbane, Australia, September 2008, pp. 1497-1500.
    • (2008) Proc. Interspeech 2008 , pp. 1497-1500
    • Thiruvaran, T.1    Ambikairajah, E.2    Epps, J.3
  • 228
    • 0032141206
    • Cepstral domain segmental feature vector normalization for noise robust speech recognition
    • Viikki O., and Laurila K. Cepstral domain segmental feature vector normalization for noise robust speech recognition. Speech Comm. 25 (1998) 133-147
    • (1998) Speech Comm. , vol.25 , pp. 133-147
    • Viikki, O.1    Laurila, K.2
  • 229
    • 34548248573
    • Explicit modeling of session variability for speaker verification
    • Vogt R., and Sridharan S. Explicit modeling of session variability for speaker verification. Comput. Speech Lang. 22 1 (2008) 17-38
    • (2008) Comput. Speech Lang. , vol.22 , Issue.1 , pp. 17-38
    • Vogt, R.1    Sridharan, S.2
  • 230
    • 33745210768
    • Modelling session variability in text-independent speaker verification
    • Lisboa, Portugal, September, 2005
    • Vogt, R., Baker, B., Sridharan, S., 2005. Modelling session variability in text-independent speaker verification. In: Proc. Interspeech 2005, Lisboa, Portugal, September 2005, pp. 3117-3120.
    • (2005) Proc. Interspeech 2005 , pp. 3117-3120
    • Vogt, R.1    Baker, B.2    Sridharan, S.3
  • 232
    • 14644412368
    • Speaker verification using sequence discriminant support vector machines
    • Wan V., and Renals S. Speaker verification using sequence discriminant support vector machines. IEEE Trans. Speech Audio Process. 13 2 (2005) 203-210
    • (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.2 , pp. 203-210
    • Wan, V.1    Renals, S.2
  • 234
    • 84953683778
    • Efficient acoustic parameters for speaker recognition
    • (Part 2)
    • Wolf J. Efficient acoustic parameters for speaker recognition. J. Acoust. Soc. Amer. 51 6 (1972) 2044-2056 (Part 2)
    • (1972) J. Acoust. Soc. Amer. , vol.51 , Issue.6 , pp. 2044-2056
    • Wolf, J.1
  • 235
    • 0038681940
    • Text-independent speaker verification with dynamic trajectory model
    • Xiang B. Text-independent speaker verification with dynamic trajectory model. IEEE Signal Process. Lett. 10 (2003) 141-143
    • (2003) IEEE Signal Process. Lett. , vol.10 , pp. 141-143
    • Xiang, B.1
  • 236
    • 0041360472
    • Efficient text-independent speaker verification with structural gaussian mixture models and neural network
    • Xiang B., and Berger T. Efficient text-independent speaker verification with structural gaussian mixture models and neural network. IEEE Trans. Speech Audio Process. 11 (2003) 447-456
    • (2003) IEEE Trans. Speech Audio Process. , vol.11 , pp. 447-456
    • Xiang, B.1    Berger, T.2
  • 238
    • 33748586494
    • A tree-based kernel selection approach to efficient Gaussian mixture model-universal background model based speaker identification
    • Xiong Z., Zheng T., Song Z., Soong F., and Wu W. A tree-based kernel selection approach to efficient Gaussian mixture model-universal background model based speaker identification. Speech Comm. 48 (2006) 1273-1282
    • (2006) Speech Comm. , vol.48 , pp. 1273-1282
    • Xiong, Z.1    Zheng, T.2    Song, Z.3    Soong, F.4    Wu, W.5
  • 239
    • 0035989168
    • AANN: an alternative to GMM for pattern recognition
    • Yegnanarayana B., and Kishore S. AANN: an alternative to GMM for pattern recognition. Neural Networks 15 (2002) 459-469
    • (2002) Neural Networks , vol.15 , pp. 459-469
    • Yegnanarayana, B.1    Kishore, S.2
  • 240
    • 85008056687
    • An SVM kernel with GMM-supervector based on the Bhattacharyya distance for speaker recognition
    • You C., Lee K., and Li H. An SVM kernel with GMM-supervector based on the Bhattacharyya distance for speaker recognition. IEEE Signal Process. Lett. 16 1 (2009) 49-52
    • (2009) IEEE Signal Process. Lett. , vol.16 , Issue.1 , pp. 49-52
    • You, C.1    Lee, K.2    Li, H.3
  • 241
    • 0033154048
    • Joint estimation of feature transformation parameters and Gaussian mixture model for speaker identification
    • Yuo K.-H., and Wang H.-C. Joint estimation of feature transformation parameters and Gaussian mixture model for speaker identification. Speech Comm. 28 3 (1999) 227-241
    • (1999) Speech Comm. , vol.28 , Issue.3 , pp. 227-241
    • Yuo, K.-H.1    Wang, H.-C.2
  • 242
    • 33947583290
    • Integration of complementary acoustic features for speaker recognition
    • Zheng N., Lee T., and Ching P. Integration of complementary acoustic features for speaker recognition. IEEE Signal Process. Lett. 14 3 (2007) 181-184
    • (2007) IEEE Signal Process. Lett. , vol.14 , Issue.3 , pp. 181-184
    • Zheng, N.1    Lee, T.2    Ching, P.3
  • 244
    • 84867218530
    • Using MAP estimation of feature transformation for speaker recognition
    • Brisbane, Australia, September, 2008
    • Zhu, D., Ma, B., Li, H., 2008. Using MAP estimation of feature transformation for speaker recognition. In: Proc. Interspeech 2008, Brisbane, Australia, September 2008.
    • (2008) Proc. Interspeech 2008
    • Zhu, D.1    Ma, B.2    Li, H.3
  • 245
    • 70349200796
    • Joint MAP adaptation of feature transformation and gaussian mixture model for speaker recognition
    • Taipei, Taiwan, April
    • Zhu, D., Ma, B., Li, H., 2009. Joint MAP adaptation of feature transformation and gaussian mixture model for speaker recognition. In: Proc. Internat. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, April 2009, pp. 4045-4048.
    • (2009) Proc. Internat. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2009) , pp. 4045-4048
    • Zhu, D.1    Ma, B.2    Li, H.3
  • 246
    • 0036753895
    • Text-independent speaker verification using utterance level scoring and covariance modeling
    • Zilca R. Text-independent speaker verification using utterance level scoring and covariance modeling. IEEE Trans. Speech Audio Process. 10 6 (2002) 363-370
    • (2002) IEEE Trans. Speech Audio Process. , vol.10 , Issue.6 , pp. 363-370
    • Zilca, R.1
  • 248
    • 0029733178
    • Comparison of four approaches to automatic language identification of telephone speech
    • Zissman M. Comparison of four approaches to automatic language identification of telephone speech. IEEE Trans. Speech Audio Process. 4 1 (1996) 31-44
    • (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.1 , pp. 31-44
    • Zissman, M.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.