메뉴 건너뛰기




Volumn 11, Issue 2, 2007, Pages 155-173

Mixture-model cluster analysis using information theoretical criteria

Author keywords

Cluster analysis; Finite mixture models; Information theoretical criteria; Model selection

Indexed keywords


EID: 41849101656     PISSN: 1088467X     EISSN: 15714128     Source Type: Journal    
DOI: 10.3233/ida-2007-11204     Document Type: Article
Times cited : (73)

References (68)
  • 1
    • 41849146187 scopus 로고    scopus 로고
    • H. Akaike, Information Theory and an Extension of Maximum Likelihood Principle, in: Selected Papers of Hirotugu Akaike, in Proceedings of the Second International Symposium on Information Theory, B.N. Petrov and F. caski, eds, Akademiai Kiado, Budapest, 1973, pp. 267-281, K.T. Emanuel Parzen and Genshiro Kitagawa, eds, Springer-Verlag New York, Inc, Texas, 1973, p. 434.
    • H. Akaike, Information Theory and an Extension of Maximum Likelihood Principle, in: Selected Papers of Hirotugu Akaike, in Proceedings of the Second International Symposium on Information Theory, B.N. Petrov and F. caski, eds, Akademiai Kiado, Budapest, 1973, pp. 267-281, K.T. Emanuel Parzen and Genshiro Kitagawa, eds, Springer-Verlag New York, Inc, Texas, 1973, p. 434.
  • 2
    • 85041932998 scopus 로고
    • Maximum likelihood identification of Gaussian autorregressive moving average models
    • H. Akaike, Maximum likelihood identification of Gaussian autorregressive moving average models, Biometrika 60 (1973), 255-265.
    • (1973) Biometrika , vol.60 , pp. 255-265
    • Akaike, H.1
  • 3
    • 0011611162 scopus 로고    scopus 로고
    • A comparison of segment retention for finite Mixture Logit Models
    • R.L. Andrews and I.S. Currim, A comparison of segment retention for finite Mixture Logit Models, Journal of Marketing Research XI (2003a), 235-243.
    • (2003) Journal of Marketing Research , vol.11 , pp. 235-243
    • Andrews, R.L.1    Currim, I.S.2
  • 5
    • 0027453616 scopus 로고
    • Model-Based Gaussian and Non-Gaussian Clustering
    • J.D. Banfield and A.E. Raftery, Model-Based Gaussian and Non-Gaussian Clustering, Biometrics 49 (1993), 803-821.
    • (1993) Biometrics , vol.49 , pp. 803-821
    • Banfield, J.D.1    Raftery, A.E.2
  • 6
    • 0037899007 scopus 로고    scopus 로고
    • Model-based Clustering with Noise: Bayesian Inference and Estimation
    • H. Bensmail and J.J. Meulman, Model-based Clustering with Noise: Bayesian Inference and Estimation, Journal of Classification 20 (2003), 49-76.
    • (2003) Journal of Classification , vol.20 , pp. 49-76
    • Bensmail, H.1    Meulman, J.J.2
  • 7
    • 0001684051 scopus 로고    scopus 로고
    • A geometric approach to cluster validity for normal mixtures
    • J.C. Bezdek, W.Q. Li, Y. Attikiouzel and M. Windham, A geometric approach to cluster validity for normal mixtures, Soft Computing 1 (1997), 166-179.
    • (1997) Soft Computing , vol.1 , pp. 166-179
    • Bezdek, J.C.1    Li, W.Q.2    Attikiouzel, Y.3    Windham, M.4
  • 8
    • 1842832753 scopus 로고    scopus 로고
    • A latent class segmentation analysis of e-shoppers
    • A. Bhatnagar and S. Ghose, A latent class segmentation analysis of e-shoppers, Journal of Business Research 57 (2004), 758-767.
    • (2004) Journal of Business Research , vol.57 , pp. 758-767
    • Bhatnagar, A.1    Ghose, S.2
  • 11
    • 0037469122 scopus 로고    scopus 로고
    • Choosing starting values for the EM algorithm for getting the highest likelihood in multivariate Gaussian mixture models
    • C. Biernacki, G. Celeux and G. Govaert, Choosing starting values for the EM algorithm for getting the highest likelihood in multivariate Gaussian mixture models, Computational Statistics & Data Analysis 41 (2003), 561-575.
    • (2003) Computational Statistics & Data Analysis , vol.41 , pp. 561-575
    • Biernacki, C.1    Celeux, G.2    Govaert, G.3
  • 12
    • 0033105385 scopus 로고    scopus 로고
    • An improvement of the NEC criterion for assessing the number of clusters in mixture model
    • C. Biernacki, G. Celeux and G. Govaert, An improvement of the NEC criterion for assessing the number of clusters in mixture model, Pattern Recognition Letters 20 (1999), 267-272.
    • (1999) Pattern Recognition Letters , vol.20 , pp. 267-272
    • Biernacki, C.1    Celeux, G.2    Govaert, G.3
  • 13
    • 0000308948 scopus 로고    scopus 로고
    • Using the classification likelihood to choose the number of clusters
    • C Biernacki and G. Govaert, Using the classification likelihood to choose the number of clusters, Computing Science and Statistics 29 (1997), 451-457.
    • (1997) Computing Science and Statistics , vol.29 , pp. 451-457
    • Biernacki, C.1    Govaert, G.2
  • 15
    • 0002259289 scopus 로고
    • A New Model Selection criterion
    • ICOMP:, Hans H. Bock ed, North-Holland, Amesterdam
    • H. Bozdogan, ICOMP: A New Model Selection criterion, in: Classification and Related Methods of Data Analysis, Hans H. Bock (ed.), North-Holland, Amesterdam, 1988, pp. 599-608.
    • (1988) Classification and Related Methods of Data Analysis , pp. 599-608
    • Bozdogan, H.1
  • 16
    • 0002728524 scopus 로고
    • Mixture-Model Cluster Analysis using Model Selection criteria and a new Informational Measure of Complexity
    • H. Bozdogan, ed, Kluwer Academic Publishers
    • H. Bozdogan, Mixture-Model Cluster Analysis using Model Selection criteria and a new Informational Measure of Complexity, in: Proceedings of the First US/Japan Conference on the Frontiers of Statistical Modeling: An Approach, 69-113, H. Bozdogan, ed., Kluwer Academic Publishers, 1994, pp. 69-113.
    • (1994) Proceedings of the First US/Japan Conference on the Frontiers of Statistical Modeling: An Approach, 69-113 , pp. 69-113
    • Bozdogan, H.1
  • 17
    • 34250108028 scopus 로고
    • Model Selection and Akaikes's Information Criterion (AIC): The General Theory and its Analytical Extensions
    • H. Bozdogan, Model Selection and Akaikes's Information Criterion (AIC): The General Theory and its Analytical Extensions, Psycometrika 52 (1987), 345-370.
    • (1987) Psycometrika , vol.52 , pp. 345-370
    • Bozdogan, H.1
  • 18
    • 0037409188 scopus 로고    scopus 로고
    • Information complexity criteria for detecting influential observations in dynamic multivariate linear models using the genetic algorithm
    • H. Bozdogan and P. Bearse, Information complexity criteria for detecting influential observations in dynamic multivariate linear models using the genetic algorithm, Journal of Statistical Planning and Inference 114 (2003), 31-44.
    • (2003) Journal of Statistical Planning and Inference , vol.114 , pp. 31-44
    • Bozdogan, H.1    Bearse, P.2
  • 19
    • 8744307994 scopus 로고    scopus 로고
    • Multimodel Inference. Understanding AIC and BIC in Model Selection
    • K.P. Burnham and D.R. Anderson, Multimodel Inference. Understanding AIC and BIC in Model Selection, Sociological Methods & Research 33 (2004), 261-304.
    • (2004) Sociological Methods & Research , vol.33 , pp. 261-304
    • Burnham, K.P.1    Anderson, D.R.2
  • 20
    • 0030351528 scopus 로고    scopus 로고
    • An entropy criterion for acessing the number of clusters in a mixture model
    • G. Celeux and G. Soromenho, An entropy criterion for acessing the number of clusters in a mixture model, Journal of Classification 13(1996), 195-212.
    • (1996) Journal of Classification , vol.13 , pp. 195-212
    • Celeux, G.1    Soromenho, G.2
  • 22
    • 0001381164 scopus 로고
    • Estimating the Components of a mixture of normal Distributions
    • N.E. Day, Estimating the Components of a mixture of normal Distributions, Biometrika 56 (1969), 463-474.
    • (1969) Biometrika , vol.56 , pp. 463-474
    • Day, N.E.1
  • 24
    • 33744807766 scopus 로고    scopus 로고
    • Finite Mixture Models; Review, Applications, and Computer-intensive Methods
    • Groningen University, PhD Thesis, Groningen, 199
    • J.G. Dias, Finite Mixture Models; Review, Applications, and Computer-intensive Methods, Econimics, Groningen University, PhD Thesis, Groningen, 2004, 199.
    • (2004) Econimics
    • Dias, J.G.1
  • 25
    • 0001739222 scopus 로고
    • Latent structure and other mixture models in marketing: An integrative survey and overview
    • chapter 9, R.P. Bagozi, ed, Cambridge: blackwell Publishers
    • W.R. Dillon and A. Kumar, Latent structure and other mixture models in marketing: An integrative survey and overview, chapter 9, in: Advanced methods of Marketing Research, R.P. Bagozi, ed., Cambridge: blackwell Publishers, 1994, pp. 352-388.
    • (1994) Advanced methods of Marketing Research , pp. 352-388
    • Dillon, W.R.1    Kumar, A.2
  • 26
    • 3543054548 scopus 로고    scopus 로고
    • Evaluation and optimization of clustering in gene expression data analysis
    • A.F. Famili, G. Liu and Z. Liu, Evaluation and optimization of clustering in gene expression data analysis, Bioinfonnatics 20 (2004), 1535-1545.
    • (2004) Bioinfonnatics , vol.20 , pp. 1535-1545
    • Famili, A.F.1    Liu, G.2    Liu, Z.3
  • 28
    • 33744782583 scopus 로고    scopus 로고
    • R.S. Fonseca, Jaime and G.M.S. Cardoso, Margarida, Retail Clients Latent Segments. Proceedings of the 12th Portuguese Conference on Artificial Intelligence, EPIA 2005, December 5-8, in: Progress in Artificial Intelligence, C. Bento, A. Cardoso and G. Dias, eds, Springer-Verlag, Lecture Notes in Computing Science, 3808, 2005, pp. 348-358.
    • R.S. Fonseca, Jaime and G.M.S. Cardoso, Margarida, Retail Clients Latent Segments. Proceedings of the 12th Portuguese Conference on Artificial Intelligence, EPIA 2005, December 5-8, in: Progress in Artificial Intelligence, C. Bento, A. Cardoso and G. Dias, eds, Springer-Verlag, Lecture Notes in Computing Science, Vol. 3808, 2005, pp. 348-358.
  • 29
    • 0037847280 scopus 로고
    • A Comparision of Some methods for Estimating Mixed Normal Distributions
    • J.G. Fryer and C.A. Robertson, A Comparision of Some methods for Estimating Mixed Normal Distributions, Biometrika 59 (1972), 639-648.
    • (1972) Biometrika , vol.59 , pp. 639-648
    • Fryer, J.G.1    Robertson, C.A.2
  • 32
    • 0001354471 scopus 로고
    • A Constrained Formulation of Maximum-Likelihood Estimation for Normal Mixture Distributions
    • R.J. Hataway, A Constrained Formulation of Maximum-Likelihood Estimation for Normal Mixture Distributions, The Annals of Statistics 13 (1985), 795-800.
    • (1985) The Annals of Statistics , vol.13 , pp. 795-800
    • Hataway, R.J.1
  • 34
    • 0037469115 scopus 로고    scopus 로고
    • Mixture model clustering for mixed data with missing information
    • L. Hunt and M. Jorgensen, Mixture model clustering for mixed data with missing information, Computational Statistics & Data Analysis 41 (2003), 429-440.
    • (2003) Computational Statistics & Data Analysis , vol.41 , pp. 429-440
    • Hunt, L.1    Jorgensen, M.2
  • 35
    • 0033235690 scopus 로고    scopus 로고
    • Fitting a Mixture Model to Three-Mode Trhee-Way Data with Categorical and Continuous Variables
    • L.A. Hunt and K.E. Basford, Fitting a Mixture Model to Three-Mode Trhee-Way Data with Categorical and Continuous Variables, Journal of Classification 16 (1999), 283-296.
    • (1999) Journal of Classification , vol.16 , pp. 283-296
    • Hunt, L.A.1    Basford, K.E.2
  • 36
    • 70349119250 scopus 로고
    • regression and Time Series Model Selection in Small Samples
    • C.M. Hurvich and C.-L. Tsai, regression and Time Series Model Selection in Small Samples, Biometrika 76 (1989), 297-307.
    • (1989) Biometrika , vol.76 , pp. 297-307
    • Hurvich, C.M.1    Tsai, C.-L.2
  • 38
    • 21144471169 scopus 로고
    • A Maximum Likelihood Method for Latent Class Regression involving a Censored dependent Variable
    • K. Jedidi, V. Ramaswamy and W.S. DeSarbo, A Maximum Likelihood Method for Latent Class Regression involving a Censored dependent Variable, Psycometrika 58 (1993), 375-394.
    • (1993) Psycometrika , vol.58 , pp. 375-394
    • Jedidi, K.1    Ramaswamy, V.2    DeSarbo, W.S.3
  • 40
    • 0031616571 scopus 로고    scopus 로고
    • Estimation consistante de Forde de modèes de mélange, Comptes Rendues de l'Academie des Sciences, Paris, t
    • I
    • C. Keribin, Estimation consistante de Forde de modèes de mélange, Comptes Rendues de l'Academie des Sciences, Paris, t. 326 Séie I (1998), 243-248.
    • (1998) 326 Séie , pp. 243-248
    • Keribin, C.1
  • 41
    • 24344503628 scopus 로고    scopus 로고
    • Evolutionary model selection in unsupervised learning
    • Y. Kim, W.N. Street and F. Menezer, Evolutionary model selection in unsupervised learning, Intelligent Data Analysis 6 (2002), 531-556.
    • (2002) Intelligent Data Analysis , vol.6 , pp. 531-556
    • Kim, Y.1    Street, W.N.2    Menezer, F.3
  • 42
    • 21144475085 scopus 로고
    • Consistent Estimation of a Mixing Distribution
    • B.G. Leroux, Consistent Estimation of a Mixing Distribution, The Annals of Statistics 20 (1992), 1350-1360.
    • (1992) The Annals of Statistics , vol.20 , pp. 1350-1360
    • Leroux, B.G.1
  • 43
    • 0026652789 scopus 로고
    • Maximum-Penalized-Likelihood Estimation for Independent and Markov-Dependent Mixture Models
    • B.G. Leroux and M.L. Puterman, Maximum-Penalized-Likelihood Estimation for Independent and Markov-Dependent Mixture Models, Biometrics 48 (1992), 545-558.
    • (1992) Biometrics , vol.48 , pp. 545-558
    • Leroux, B.G.1    Puterman, M.L.2
  • 44
    • 0016203667 scopus 로고
    • Multivariate Clustering Procedures with variable Metrics
    • R. Maronna and P.M. Jacovkis, Multivariate Clustering Procedures with variable Metrics, Biometrics 30 (1974), 499-505.
    • (1974) Biometrics , vol.30 , pp. 499-505
    • Maronna, R.1    Jacovkis, P.M.2
  • 46
    • 23744437299 scopus 로고    scopus 로고
    • On a resampling approach for tests on the number of clusters with mixture model-based clustering of tissue samples
    • G.F. McLachlan and N. Khan, On a resampling approach for tests on the number of clusters with mixture model-based clustering of tissue samples, Journal of multivariate Analysis 90 (2004), 90-105.
    • (2004) Journal of multivariate Analysis , vol.90 , pp. 90-105
    • McLachlan, G.F.1    Khan, N.2
  • 48
    • 0023570352 scopus 로고
    • On Bootstrapping the Likelihood Ratio Test Statistic for the Number of components in a Normal Mixture
    • G.J. McLachlan, On Bootstrapping the Likelihood Ratio Test Statistic for the Number of components in a Normal Mixture, Appllied Statistics 36 (1987), 318-324.
    • (1987) Appllied Statistics , vol.36 , pp. 318-324
    • McLachlan, G.J.1
  • 52
    • 0001371071 scopus 로고
    • A Generalized Theory of the Combination of Observations so as to Obtain the Best Result
    • S. Newcomb, A Generalized Theory of the Combination of Observations so as to Obtain the Best Result, American journal of Mathematics 8 (1886), 343-366.
    • (1886) American journal of Mathematics , vol.8 , pp. 343-366
    • Newcomb, S.1
  • 54
    • 77956982581 scopus 로고
    • On the probability that two independent Distributions of frequency are really samples of the same population with special reference to recent work on the identity of trypanosome strains
    • K. Pearson, On the probability that two independent Distributions of frequency are really samples of the same population with special reference to recent work on the identity of trypanosome strains, Biometrika 10 (1914), 85-143.
    • (1914) Biometrika , vol.10 , pp. 85-143
    • Pearson, K.1
  • 55
    • 4444356084 scopus 로고    scopus 로고
    • Supervised cluster analysis for microarray data based on multivariate Gaussian mixture
    • Y. Qu and S. Xu, Supervised cluster analysis for microarray data based on multivariate Gaussian mixture, Bioinformatics 20 (2004), 1905-1913.
    • (2004) Bioinformatics , vol.20 , pp. 1905-1913
    • Qu, Y.1    Xu, S.2
  • 56
    • 0018015137 scopus 로고
    • Modeling by shortest data description
    • J. Rissanen, Modeling by shortest data description, Automatica 14 (1978), 465-471.
    • (1978) Automatica , vol.14 , pp. 465-471
    • Rissanen, J.1
  • 57
    • 0001475895 scopus 로고
    • A graphical Technique for Determining the Number of components in a Mixture of Normals
    • K. Roeder, A graphical Technique for Determining the Number of components in a Mixture of Normals, Journal of the American Statistical Association 89 (1994), 487-495.
    • (1994) Journal of the American Statistical Association , vol.89 , pp. 487-495
    • Roeder, K.1
  • 58
    • 0000120766 scopus 로고
    • Estimating the Dimenson of a Model
    • G. Schwarz, Estimating the Dimenson of a Model, The Annals of Statistics 6 (1978), 461-464.
    • (1978) The Annals of Statistics , vol.6 , pp. 461-464
    • Schwarz, G.1
  • 59
    • 3543133355 scopus 로고    scopus 로고
    • Interactively optimizing signal-to-noise ratios in expression profiling: Project-specific algorithm selection and detection p-value weighting in Affymetrix mocroarrays
    • J. Seo, M. Bakay, Y.-W. Chen, S. Hilmer, B. Shneiderman and E.P. Hoffman, Interactively optimizing signal-to-noise ratios in expression profiling: project-specific algorithm selection and detection p-value weighting in Affymetrix mocroarrays, Bioinformatics 20 (2004), 2534-2544.
    • (2004) Bioinformatics , vol.20 , pp. 2534-2544
    • Seo, J.1    Bakay, M.2    Chen, Y.-W.3    Hilmer, S.4    Shneiderman, B.5    Hoffman, E.P.6
  • 62
    • 3042684740 scopus 로고    scopus 로고
    • Drug risk assessment with determining the number of sub-populations under finite mixture normal models
    • J. Tao, N.-Z. Shi and S.-Y. Lee, Drug risk assessment with determining the number of sub-populations under finite mixture normal models, Computational Statistics & Data Analysis 46 (2004), 661-676.
    • (2004) Computational Statistics & Data Analysis , vol.46 , pp. 661-676
    • Tao, J.1    Shi, N.-Z.2    Lee, S.-Y.3
  • 63
    • 0012253128 scopus 로고    scopus 로고
    • Latent class cluster analysis
    • J.A. Hagenaars and A.L. McCutcheon, eds, Cambridge University Press
    • J.K. Vermunt and J. Magidson, Latent class cluster analysis., in: Applied Latent Class Analysis, J.A. Hagenaars and A.L. McCutcheon, eds, Cambridge University Press, 2002, 89-106.
    • (2002) Applied Latent Class Analysis , pp. 89-106
    • Vermunt, J.K.1    Magidson, J.2
  • 64
    • 33746656457 scopus 로고    scopus 로고
    • Market Segmentation. Analytical Developments and Applications Guidlines
    • Millward Brown IntelliQuest
    • M. Vriens, Market Segmentation. Analytical Developments and Applications Guidlines, Technical Overview Series, Millward Brown IntelliQuest, 2001, 1-42.
    • (2001) Technical Overview Series , pp. 1-42
    • Vriens, M.1
  • 65
    • 1842714898 scopus 로고    scopus 로고
    • Robust mixture modelling using multivariate t-distribution with missing information
    • H.x. Wang, Q.b. Zhang, B. Luo and S. Wei, Robust mixture modelling using multivariate t-distribution with missing information, Pattern Recognition Letters 25 (2004), 701-710.
    • (2004) Pattern Recognition Letters , vol.25 , pp. 701-710
    • Wang, H.X.1    Zhang, Q.B.2    Luo, B.3    Wei, S.4
  • 66
    • 21644476820 scopus 로고    scopus 로고
    • Model-Based Clustering for Image Segmentation and Large Datasets via Sampling
    • R. Wehrens, L.M.C Buydens, C. Fraley and A.E. Raftery, Model-Based Clustering for Image Segmentation and Large Datasets via Sampling, Journal of Classification 21 (2004), 231-253.
    • (2004) Journal of Classification , vol.21 , pp. 231-253
    • Wehrens, R.1    Buydens, L.M.C.2    Fraley, C.3    Raftery, A.E.4
  • 68
    • 2942587071 scopus 로고    scopus 로고
    • Image segmentation using finite mixtures and spatial information
    • X. Yang and S.M. Krishnan, Image segmentation using finite mixtures and spatial information, Image and Vision Computing 22 (2004), 735-745.
    • (2004) Image and Vision Computing , vol.22 , pp. 735-745
    • Yang, X.1    Krishnan, S.M.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.