Volume 32, Issue 13, 2011, Pages 1604-1617

Comparison of clustering methods: A case study of text-independent speaker modeling

Author keywords

Clustering methods; Gaussian mixture model; Speaker recognition; Universal background model; Vector quantization

Indexed keywords

CLUSTERING METHODS; CLUSTERING QUALITY; CLUSTERING VALIDITY; CRITICAL TASKS; EXPECTATION MAXIMIZATION; EXPERIMENTAL COMPARISON; FUZZY C MEAN; GAUSSIAN MIXTURE MODEL; K-MEANS; LOW ORDER MODELS; NEAREST NEIGHBORS; NON-PARAMETRIC; PERSON AUTHENTICATION; PROCESSING TIME; RECOGNITION ACCURACY; SELF ORGANIZING; SPEAKER MODELING; SPEAKER RECOGNITION; SPLIT-AND-MERGE; TEXT-INDEPENDENT SPEAKER VERIFICATION; UNIVERSAL BACKGROUND MODEL;

EID: 79960442111     PISSN: 01678655     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.patrec.2011.06.023     Document Type: Article
Times cited: 51

References (81)
  • 2
    • 34547912516
    • Automatic classification of musical genres using inter-genre similarity
    • DOI 10.1109/LSP.2006.891320
    • U. Bagci, and E. Erzin Automatic classification of musical genres using inter-genre similarity IEEE Signal Process. Lett. 14 8 2007 521 524 (Pubitemid 47250021)
    • (2007) IEEE Signal Processing Letters , vol.14 , Issue.8 , pp. 521-524
    • Bagci, U.1    Erzin, E.2
  • 8
    • 0023293466
    • Text-dependent speaker verification using vector quantization source coding
    • D. Burton Text-dependent speaker verification using vector quantization source coding IEEE Trans. Acoust. Speech Signal Process. 35 2 1987 133 143 (Pubitemid 17549544)
    • (1987) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.ASSP-35 , Issue.2 , pp. 133-143
    • Burton, D.K.1
  • 9
    • 0031233424
    • Speaker recognition: A tutorial
    • PII S0018921997069478
    • J. Campbell Speaker recognition: A tutorial Proc. IEEE 85 9 1997 1437 1462 (Pubitemid 127745630)
    • (1997) Proceedings of the IEEE , vol.85 , Issue.9 , pp. 1437-1462
    • Campbell, J.P.1
  • 10
    • 29044444825
    • Support vector machines for speaker and language recognition
    • DOI 10.1016/j.csl.2005.06.003, PII S0885230805000318, Odyssey 2004: The Speaker and Language Recognition Workshop Odyssey-04
    • W.M. Campbell, J.P. Campbell, D.A. Reynolds, E. Singer, and P.A. Torres-Carrasquillo Support vector machines for speaker and language recognition Comput. Speech Lang. 20 2-3 2006 210 229 (Pubitemid 41787537)
    • (2006) Computer Speech and Language , vol.20 , Issue.SPEC. ISS. , pp. 210-229
    • Campbell, W.M.1    Campbell, J.P.2    Reynolds, D.A.3    Singer, E.4    Torres-Carrasquillo, P.A.5
  • 11
    • 33645887246
    • Support vector machines using GMM supervectors for speaker verification
    • W.M. Campbell, D.E. Sturim, and D.A. Reynolds Support vector machines using GMM supervectors for speaker verification IEEE Signal Process. Lett. 13 5 2006 308 311
    • (2006) IEEE Signal Process. Lett. , vol.13 , Issue.5 , pp. 308-311
    • Campbell, W.M.1    Sturim, D.E.2    Reynolds, D.A.3
  • 15
    • 0015644825
    • A fuzzy relative of the ISODATA process and its use in detecting compact well-separated clusters
    • J.C. Dunn A fuzzy relative of the ISODATA process and its use in detecting compact well-separated clusters J. Cybernet. 3 3 1974 32 57
    • (1974) J. Cybernet. , vol.3 , Issue.3 , pp. 32-57
    • Dunn, J.C.1
  • 16
    • 0024752328
    • A new vector quantization clustering algorithm
    • W.H. Equitz A new vector quantization clustering algorithm IEEE Trans. Acoust. Speech Signal Process. 37 10 1989 1568 1575
    • (1989) IEEE Trans. Acoust. Speech Signal Process. , vol.37 , Issue.10 , pp. 1568-1575
    • Equitz, W.H.1
  • 19
    • 0001300260
    • On the splitting method for vector quantization codebook generation
    • P. Fränti, T. Kaukoranta, and O. Nevalainen On the splitting method for vector quantization codebook generation Opt. Eng. 36 11 1997 3043 3051 (Pubitemid 127629715)
    • (1997) Optical Engineering , vol.36 , Issue.11 , pp. 3043-3051
    • Fränti, P.1    Kaukoranta, T.2    Nevalainen, O.3
  • 20
    • 25744477768
    • On the usefulness of self-organizing maps for the clustering problem in vector quantization
    • Kangerlussuaq, Greenland
    • Fränti, P., 1999. On the usefulness of self-organizing maps for the clustering problem in vector quantization. In: Proc. 11th Scandinavian Conf. on Image Analysis (SCIA99), Kangerlussuaq, Greenland, pp. 415-422.
    • (1999) Proc. 11th Scandinavian Conf. on Image Analysis (SCIA99) , pp. 415-422
    • Fränti, P.1
  • 21
    • 0033877047
    • Genetic algorithm with deterministic crossover for vector quantization
    • DOI 10.1016/S0167-8655(99)00133-6
    • P. Fränti Genetic algorithm with deterministic crossover for vector quantization Pattern Recognition Lett. 21 1 2000 61 68 (Pubitemid 30534049)
    • (2000) Pattern Recognition Letters , vol.21 , Issue.1 , pp. 61-68
    • Fränti, P.1
  • 22
    • 0034345982
    • Randomized local search algorithm for the clustering problem
    • DOI 10.1007/s100440070007
    • P. Fränti, and J. Kivijärvi Randomized local search algorithm for the clustering problem Pattern Anal. Appl. 3 4 2000 358 369 (Pubitemid 33213769)
    • (2000) Pattern Analysis and Applications , vol.3 , Issue.4 , pp. 358-369
    • Fränti, P.1    Kivijärvi, J.2
  • 24
    • 33244462619
    • Iterative shrinking method for clustering problems
    • P. Fränti, and O. Virmajoki Iterative shrinking method for clustering problems Pattern Recognition 39 5 2006 761 765
    • (2006) Pattern Recognition , vol.39 , Issue.5 , pp. 761-765
    • Fränti, P.1    Virmajoki, O.2
  • 25
    • 77957934034
    • Probabilistic clustering by random swap algorithm
    • Tampa, Florida, USA.
    • Fränti, P., Virmajoki, O., Hautamäki, V., 2008. Probabilistic clustering by random swap algorithm. In: IAPR Internat. Conf. on Pattern Recognition (ICPR'08), Tampa, Florida, USA.
    • (2008) IAPR Internat. Conf. on Pattern Recognition (ICPR'08)
    • Fränti, P.1
  • 27
    • 0033739050
    • A comparison of cluster validity criteria for a mixture of normal distributed data
    • A.B. Geva, Y. Steinber, S. Bruckmair, and G. Nahum A comparison of cluster validity criteria for a mixture of normal distributed data Pattern Recognition Lett. 21 2000 511 529
    • (2000) Pattern Recognition Lett. , vol.21 , pp. 511-529
    • Geva, A.B.1    Steinber, Y.2    Bruckmair, S.3    Nahum, G.4
  • 28
    • 0032675222
    • A discriminative training algorithm for VQ-based speaker identification
    • J. He, L. Liu, and G. Palm A discriminative training algorithm for VQ-based speaker identification IEEE Trans. Speech Audio Process. 7 3 1999 353 356
    • (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.3 , pp. 353-356
    • He, J.1    Liu, L.2    Palm, G.3
  • 29
    • 79251600402
    • Comparison of the impact of some Minkowski metrics on VQ/GMM based speaker recognition
    • C. Hanilci, and F. Ertas Comparison of the impact of some Minkowski metrics on VQ/GMM based speaker recognition Comput. Electr. Eng. 37 2011 41 56
    • (2011) Comput. Electr. Eng. , vol.37 , pp. 41-56
    • Hanilci, C.1    Ertas, F.2
  • 31
    • 67650196186
    • Improving speaker verification by periodicity based voice activity detection
    • Moscow
    • Hautamäki, V., Tuononen, M., Niemi-Laitinen, T., Fränti, P., 2007. Improving speaker verification by periodicity based voice activity detection. In: Proc. 12th Internat. Conf. on Speech and Computer (SPECOM 2007), vol. 2, Moscow, pp. 645-650.
    • (2007) Proc. 12th Internat. Conf. on Speech and Computer (SPECOM 2007) , vol.2 , pp. 645-650
    • Hautamäki, V.1
  • 35
    • 70350324949
    • Robustness of ANOVA and MANOVA test procedures
    • P.R. Krishnaiah, North-Holland Publishing Company
    • P.K. Ito Robustness of ANOVA and MANOVA test procedures P.R. Krishnaiah, Handbook of Statistics 1: Analysis of Variance 1980 North-Holland Publishing Company pp. 199-236
    • (1980) Handbook of Statistics 1: Analysis of Variance , pp. 199-236
    • Ito, P.K.1
  • 37
    • 77950369345
    • Data clustering: 50 years beyond K-means
    • A.K. Jain Data clustering: 50 years beyond K-means Pattern Recognition Lett. 31 8 2010 651 666
    • (2010) Pattern Recognition Lett. , vol.31 , Issue.8 , pp. 651-666
    • Jain, A.K.1
  • 38
    • 0042830826
    • Speaker adaptation based on MAP estimation using fuzzy controller
    • DOI 10.1016/S0167-8655(03)00125-9
    • Y.-T. Juang, K.-C. Huang, and I.-J. Ding Speaker adaptation based on MAP estimation using fuzzy controller Pattern Recognition Lett. 24 2003 2807 2813 (Pubitemid 37027800)
    • (2003) Pattern Recognition Letters , vol.24 , Issue.15 , pp. 2807-2813
    • Juang, Y.-T.1    Huang, K.-C.2    Ding, I.-J.3
  • 39
    • 0001174605
    • Iterative split-and-merge algorithm for vector quantization codebook generation
    • T. Kaukoranta, P. Fränti, and O. Nevalainen Iterative split-and-merge algorithm for VQ codebook generation Opt. Eng. 37 10 1998 2726 2732 (Pubitemid 128495293)
    • (1998) Optical Engineering , vol.37 , Issue.10 , pp. 2726-2732
    • Kaukoranta, T.1    Fränti, P.2    Nevalainen, O.3
  • 42
    • 70350125882
    • An overview of text-independent speaker recognition: From features to supervectors
    • T. Kinnunen, and H. Li An overview of text-independent speaker recognition: From features to supervectors Speech Commun. 52 1 2010 12 40
    • (2010) Speech Commun. , vol.52 , Issue.1 , pp. 12-40
    • Kinnunen, T.1    Li, H.2
  • 44
    • 78651081288
    • Is speech data clustered? - Statistical analysis of cepstral features
    • Aalborg, Denmark
    • Kinnunen, T., Kärkkäinen, I., Fränti, P., 2001. Is speech data clustered? - Statistical analysis of cepstral features. In: Proc. 7th European Conf. on Speech Communication and Technology, (Eurospeech 2001), vol. 4, Aalborg, Denmark, pp. 2627-2630.
    • (2001) Proc. 7th European Conf. on Speech Communication and Technology, (Eurospeech 2001) , vol.4 , pp. 2627-2630
    • Kinnunen, T.1
  • 46
    • 58349105008
    • Comparative evaluation of maximum a posteriori vector quantization and Gaussian mixture models in speaker verification
    • T. Kinnunen, J. Saastamoinen, V. Hautamäki, M. Vinni, and P. Fränti Comparative evaluation of maximum a posteriori vector quantization and Gaussian mixture models in speaker verification Pattern Recognition Lett. 30 4 2009 341 347
    • (2009) Pattern Recognition Lett. , vol.30 , Issue.4 , pp. 341-347
    • Kinnunen, T.1    Saastamoinen, J.2    Hautamäki, V.3    Vinni, M.4    Fränti, P.5
  • 49
    • 84867218500
    • Characterizing speech utterances for speaker verification with sequence kernel SVM
    • Brisbane, Australia
    • Lee, K.A., You, C., Li, H., Kinnunen, T., Zhu, D., 2008. Characterizing speech utterances for speaker verification with sequence kernel SVM. In: Proc. Interspeech 2008, Brisbane, Australia, pp. 1397-1400.
    • (2008) Proc. Interspeech 2008 , pp. 1397-1400
    • Lee, K.A.1    You, C.2    Li, H.3    Kinnunen, T.4    Zhu, D.5
  • 50
    • 33745202264
    • Mixture of support vector machines for text-independent speaker recognition
    • 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
    • Lei, Z., Yang, Y., Wu, Z., 2005. Mixture of support vector machines for text-independent speaker recognition. In: Proc. 9th European Conf. on Speech Communication and Technology (Interspeech'2005), pp. 2041-2044. (Pubitemid 43908492)
    • (2005) 9th European Conference on Speech Communication and Technology , pp. 2041-2044
    • Lei, Z.1    Yang, Y.2    Wu, Z.3
  • 51
    • 29044433161
    • NIST and NFI-TNO evaluations of automatic speaker recognition
    • DOI 10.1016/j.csl.2005.07.001, PII S088523080500032X, Odyssey 2004: The Speaker and Language Recognition Workshop Odyssey-04
    • D.A.v. Leeuwen, A.F. Martin, M.A. Przybocki, and J.S. Bouten NIST and NFI-TNO evaluations of automatic speaker recognition Comput. Speech Lang. 20 2006 128 158 (Pubitemid 41787534)
    • (2006) Computer Speech and Language , vol.20 , Issue.SPEC. ISS. , pp. 128-158
    • Van Leeuwen, D.A.1    Martin, A.F.2    Przybocki, M.A.3    Bouten, J.S.4
  • 52
    • 0018918171
    • An algorithm for vector quantizer design
    • Y. Linde, A. Buzo, and R.M. Gray An algorithm for vector quantizer design IEEE Trans. Commun. 28 1 1980 84 95
    • (1980) IEEE Trans. Commun. , vol.28 , Issue.1 , pp. 84-95
    • Linde, Y.1    Buzo, A.2    Gray, R.M.3
  • 56
    • 0033901151
    • NIST 1999 Speaker Recognition Evaluation - an overview
    • DOI 10.1006/dspr.1999.0355
    • A. Martin, and M. Przybocki The NIST 1999 speaker recognition evaluation - An overview Digital Signal Process. 10 2000 1 18 (Pubitemid 30592167)
    • (2000) Digital Signal Processing: A Review Journal , vol.10 , Issue.1 , pp. 1-18
    • Martin, A.1    Przybocki, M.2
  • 59
    • 0034826101
    • Experimental comparison of model-based clustering methods
    • DOI 10.1023/A:1007648401407
    • M. Meila, and D. Heckerman An experimental comparison of model-based clustering methods Machine Learn. 42 2001 9 29 (Pubitemid 32872397)
    • (2001) Machine Learning , vol.42 , Issue.1-2 , pp. 9-29
    • Meila, M.1    Heckerman, D.2
  • 60
    • 0000228352
    • A Monte Carlo study of thirty internal criterion measures for cluster analysis
    • G.W. Milligan A Monte Carlo study of thirty internal criterion measures for cluster analysis Psychometrika 46 2 1981 187 199
    • (1981) Psychometrika , vol.46 , Issue.2 , pp. 187-199
    • Milligan, G.W.1
  • 61
    • 0024167764
    • Vector quantization of images based upon the Kohonen self-organization feature maps
    • N.M. Nasrabadi, and Y. Feng Vector quantization of images based upon the Kohonen self-organization feature maps Neural Networks 1 1988 518
    • (1988) Neural Networks , vol.1 , pp. 518
    • Nasrabadi, N.M.1    Feng, Y.2
  • 65
    • 0029209272
    • Robust text-independent speaker identification using Gaussian mixture speaker models
    • D.A. Reynolds, and R.C. Rose Robust text-independent speaker identification using Gaussian mixture speaker models IEEE Trans. Speech Audio Process. 3 1 1995 72 83
    • (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.1 , pp. 72-83
    • Reynolds, D.A.1    Rose, R.C.2
  • 68
    • 28644435158
    • Gaussian-selection-based non-optimal search for speaker identification
    • DOI 10.1016/j.specom.2005.06.003, PII S016763930500141X
    • M. Roch Gaussian-selection-based non-optimal search for speaker identification Speech Commun. 48 2006 85 95 (Pubitemid 41751109)
    • (2006) Speech Communication , vol.48 , Issue.1 , pp. 85-95
    • Roch, M.1
  • 70
    • 0031248740
    • A clustering algorithm using an evolutionary programming-based approach
    • PII S0167865597001220
    • M. Sarkar, B. Yegnanarayana, and D. Khemani A clustering algorithm using an evolutionary programming-based approach Pattern Recognition Lett. 18 10 1997 975 986 (Pubitemid 127434126)
    • (1997) Pattern Recognition Letters , vol.18 , Issue.10 , pp. 975-986
    • Sarkar, M.1    Yegnanarayana, B.2    Khemani, D.3
  • 78
    • 84945251208
    • Fuzzy C-Means Clustering-based speaker verification
    • Calcutta, India, 2002
    • Tran, D., Wagner, M., 2002. Fuzzy C-Means Clustering-based speaker verification. In: Proc. Advances in Soft Computing (AFSS 2002), Calcutta, India, 2002, pp. 318-324.
    • (2002) Proc. Advances in Soft Computing (AFSS 2002) , pp. 318-324
    • Tran, D.1    Wagner, M.2
  • 80
    • 61849130623
    • α-Gaussian mixture modelling for speaker recognition
    • D. Wu, J. Li, and H. Wu α-Gaussian mixture modelling for speaker recognition Pattern Recognition Lett. 30 2009 589 594
    • (2009) Pattern Recognition Lett. , vol.30 , pp. 589-594
    • Wu, D.1    Li, J.2    Wu, H.3
  • 81
    • 0035989168
    • AANN: An alternative to GMM for pattern recognition
    • DOI 10.1016/S0893-6080(02)00019-9, PII S0893608002000199
    • B. Yegnanarayana, and S.P. Kishore AANN: An alternative to GMM for pattern recognition Neural Networks 15 2002 459 469 (Pubitemid 34518411)
    • (2002) Neural Networks , vol.15 , Issue.3 , pp. 459-469
    • Yegnanarayana, B.1    Kishore, S.P.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.