메뉴 건너뛰기




Volumn 177, Issue 20, 2007, Pages 4474-4492

Hierarchical clustering of mixed data based on distance hierarchy

Author keywords

Categorical data; Distance hierarchy; Hierarchical clustering; k Means; Mixed data

Indexed keywords

CATEGORICAL DATA; DISTANCE HIERARCHY; HIERARCHICAL CLUSTERING; K-MEANS; MIXED DATA;

EID: 34447501949     PISSN: 00200255     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.ins.2007.05.003     Document Type: Article
Times cited : (85)

References (46)
  • 3
    • 0038494682 scopus 로고    scopus 로고
    • D. Barbara, J. Couto, Y. Li, COOLCAT: an entropy-based algorithm for categorical clustering, in: Proceedings of the 7th International Conference on Information and Knowledge Management, McLean, VI, USA, 2002, pp. 582-589.
  • 4
    • 0002546079 scopus 로고
    • Attribute-oriented induction in relational databases
    • Piatetsky-Shapiro G., and Frawley W.J. (Eds), AAA/MIT Press, Cambridge, MA
    • Cai Y., Cercone N., and Han J. Attribute-oriented induction in relational databases. In: Piatetsky-Shapiro G., and Frawley W.J. (Eds). Knowledge Discovery in Databases (1991), AAA/MIT Press, Cambridge, MA 213-228
    • (1991) Knowledge Discovery in Databases , pp. 213-228
    • Cai, Y.1    Cercone, N.2    Han, J.3
  • 5
    • 0035788889 scopus 로고    scopus 로고
    • T. Chiu, D. Fang, J. Chen, Y. Wang, C. Jeris, A robust and scalable clustering algorithm for mixed type attributes in large database environment, in: Proceedings of the 2001 International Conference on Knowledge Discovery and Data Mining (KDD'01), San Francisco, CA, 2001, pp. 263-268.
  • 6
    • 34447568079 scopus 로고    scopus 로고
    • G. Das, H. Mannila, P. Ronkainen, Similarity of attributes by external probes, in: Proceedings of the 4th International Conference on Knowledge Discovery and Data Mining, Los Alamitos, CA, 1998, pp. 23-29.
  • 9
    • 0030289446 scopus 로고    scopus 로고
    • Data mining and knowledge discovery in databases
    • Fayyad U., and Uthurusammy R. Data mining and knowledge discovery in databases. Communications of the ACM 39 (1996) 24-26
    • (1996) Communications of the ACM , vol.39 , pp. 24-26
    • Fayyad, U.1    Uthurusammy, R.2
  • 10
    • 33750972001 scopus 로고    scopus 로고
    • Anomaly detection in web documents using crisp and fuzzy-based cosine clustering methodology
    • Friedman M., Last M., Makover Y., and Kandel A. Anomaly detection in web documents using crisp and fuzzy-based cosine clustering methodology. Information Sciences 177 2 (2007) 467-475
    • (2007) Information Sciences , vol.177 , Issue.2 , pp. 467-475
    • Friedman, M.1    Last, M.2    Makover, Y.3    Kandel, A.4
  • 11
    • 34447562491 scopus 로고    scopus 로고
    • V. Ganti, J. Gehrke, R. Ramakrishnan, Cactus-clustering categorical data using summaries, in: Proceedings of the International Conference on Knowledge Discovery and Data Mining, San Diego, CA, 1999, pp. 73-84.
  • 12
    • 34447545409 scopus 로고    scopus 로고
    • D. Gibson, J. Kleinberg, P. Raghavan, Clustering categorical data: an approach based on dynamical systems, in: Proceedings of the 24th VLDB Conference, New York, 1998, pp. 311-322.
  • 13
    • 34447557590 scopus 로고    scopus 로고
    • M. Gluck, J. Corter, Information, uncertainty, and the utility of categories, in: Proceedings of the 7th Annual Conference Cognitive Soc., Hillsdale, NJ, 1985, pp. 283-287.
  • 14
    • 0001337675 scopus 로고
    • A new similarity index based on probability
    • Goodall D.W. A new similarity index based on probability. Biometrics 22 (1966) 882-907
    • (1966) Biometrics , vol.22 , pp. 882-907
    • Goodall, D.W.1
  • 15
    • 85175741615 scopus 로고    scopus 로고
    • S. Guha, R. Rastogi, K. Shim, Rock: A robust clustering algorithm for categorical attributes, in: Proceedings of the IEEE International Conference on Data Engineering, Sydney, Australia, 1999, pp. 512-521.
  • 17
    • 34447565799 scopus 로고    scopus 로고
    • J. Han, Y. Fu, Dynamic generation and refinement of concept hierarchies for knowledge discovery in databases, in: Proceedings of the AAAI'94 Workshop Knowledge Discovery in Databases (KDD'94), Seattle, WA, 1994, pp. 157-168.
  • 19
    • 0036740348 scopus 로고    scopus 로고
    • Squeezer: an efficient algorithm for clustering categorical data
    • He Z., Xu X., and Deng S. Squeezer: an efficient algorithm for clustering categorical data. Journal of Computational Science and Technology 17 5 (2002) 611-624
    • (2002) Journal of Computational Science and Technology , vol.17 , Issue.5 , pp. 611-624
    • He, Z.1    Xu, X.2    Deng, S.3
  • 20
    • 27844433509 scopus 로고    scopus 로고
    • Scalable algorithms for clustering large datasets with mixed type attributes
    • He Z., Xu X., and Deng S. Scalable algorithms for clustering large datasets with mixed type attributes. International Journal of Intelligent Systems 20 10 (2005) 1077-1089
    • (2005) International Journal of Intelligent Systems , vol.20 , Issue.10 , pp. 1077-1089
    • He, Z.1    Xu, X.2    Deng, S.3
  • 22
    • 2942589031 scopus 로고    scopus 로고
    • Extending attribute-oriented induction algorithm for major values and numeric values
    • Hsu C.C. Extending attribute-oriented induction algorithm for major values and numeric values. Expert Systems with Applications 27 2 (2004) 187-202
    • (2004) Expert Systems with Applications , vol.27 , Issue.2 , pp. 187-202
    • Hsu, C.C.1
  • 23
    • 33644899363 scopus 로고    scopus 로고
    • Generalizing self-organizing map for categorical data
    • Hsu C.C. Generalizing self-organizing map for categorical data. IEEE Transactions on Neural Networks 17 2 (2006) 294-304
    • (2006) IEEE Transactions on Neural Networks , vol.17 , Issue.2 , pp. 294-304
    • Hsu, C.C.1
  • 24
    • 33746632119 scopus 로고    scopus 로고
    • Modified adaptive resonance theory network of mixed data based on distance hierarchy
    • Hsu C.C., Huang Y.-P., and Hsiao C.-M. Modified adaptive resonance theory network of mixed data based on distance hierarchy. Lecture Notes in Computer Science 3994 (2006) 757-764
    • (2006) Lecture Notes in Computer Science , vol.3994 , pp. 757-764
    • Hsu, C.C.1    Huang, Y.-P.2    Hsiao, C.-M.3
  • 26
    • 27144536001 scopus 로고    scopus 로고
    • Extensions to the K-means algorithm for clustering large data sets with categorical values
    • Huang Z. Extensions to the K-means algorithm for clustering large data sets with categorical values. Data Mining Knowledge Discovery 2 3 (1998) 283-304
    • (1998) Data Mining Knowledge Discovery , vol.2 , Issue.3 , pp. 283-304
    • Huang, Z.1
  • 27
    • 0032595161 scopus 로고    scopus 로고
    • A fuzzy k-modes algorithm for clustering categorical data
    • Huang Z., and Ng M.K. A fuzzy k-modes algorithm for clustering categorical data. IEEE Transactions on Fuzzy Systems 7 4 (1999) 446-452
    • (1999) IEEE Transactions on Fuzzy Systems , vol.7 , Issue.4 , pp. 446-452
    • Huang, Z.1    Ng, M.K.2
  • 32
    • 23844536246 scopus 로고    scopus 로고
    • Fuzzy clustering of categorical data using fuzzy centroids
    • Kim D.W., Lee K.H., and Lee D. Fuzzy clustering of categorical data using fuzzy centroids. Pattern Recognition Letters 25 (2004) 1263-1271
    • (2004) Pattern Recognition Letters , vol.25 , pp. 1263-1271
    • Kim, D.W.1    Lee, K.H.2    Lee, D.3
  • 35
    • 31944451673 scopus 로고    scopus 로고
    • Looking into the seeds of time: discovering temporal patterns in large transaction sets
    • Li Y., Zhu S., Wang X., and Jajodia S. Looking into the seeds of time: discovering temporal patterns in large transaction sets. Information Sciences 176 8 (2006) 1003-1031
    • (2006) Information Sciences , vol.176 , Issue.8 , pp. 1003-1031
    • Li, Y.1    Zhu, S.2    Wang, X.3    Jajodia, S.4
  • 36
    • 18144396167 scopus 로고    scopus 로고
    • Temporal analysis of clusters of supermarket customers: conventional versus interval set approach
    • Lingras P., Hogo M., Snorek M., and West C. Temporal analysis of clusters of supermarket customers: conventional versus interval set approach. Information Sciences 172 1-2 (2005) 215-240
    • (2005) Information Sciences , vol.172 , Issue.1-2 , pp. 215-240
    • Lingras, P.1    Hogo, M.2    Snorek, M.3    West, C.4
  • 37
    • 0018910525 scopus 로고
    • The validation of four ultrametric clustering algorithms
    • Milligan G.W., and Isaac P.D. The validation of four ultrametric clustering algorithms. Pattern Recognition 12 (1980) 41-50
    • (1980) Pattern Recognition , vol.12 , pp. 41-50
    • Milligan, G.W.1    Isaac, P.D.2
  • 38
    • 34447557589 scopus 로고    scopus 로고
    • P.M. Murphy, D.W. Aha., UCI repository of Machine Learning Databases, , 1992.
  • 39
    • 0035792377 scopus 로고    scopus 로고
    • C.H. Oh, K. Honda, H. Ichihashi, Fuzzy clustering for categorical multivariate data, in: Joint 9th IFSA World Congress and 20th NAFIPS International Conference, (4), Vancouver, BC, Canada, 2001, pp. 2154-2159.
  • 40
    • 7444235466 scopus 로고    scopus 로고
    • C.R. Palmer, C. Faloutsos, Electricity based external similarity of categorical attributes, in: Proceedings of the 7th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD'03), Seoul, South Korea, 2003, pp. 486-500.
  • 43
    • 0035504070 scopus 로고    scopus 로고
    • A discrete-valued clustering algorithm with applications to bimolecular data
    • Wong A.K.C., Chiu K.Y., and Huang W. A discrete-valued clustering algorithm with applications to bimolecular data. Information Sciences 139 (2001) 97-112
    • (2001) Information Sciences , vol.139 , pp. 97-112
    • Wong, A.K.C.1    Chiu, K.Y.2    Huang, W.3
  • 44
    • 0036925783 scopus 로고    scopus 로고
    • X.-R. Yang, J.-Y. Shen, Q. Liu, A novel clustering algorithm based on weighted support and its application, In: Proceedings of the First International Conference on Machine Learning and Cybernetics, Beijing, China, 2002, pp. 95-100.
  • 45
    • 2442705676 scopus 로고    scopus 로고
    • GeneScout: a data mining system for predicting vertebrate genes in genomic DNA sequences
    • Yin M.M., and Wang T.L. GeneScout: a data mining system for predicting vertebrate genes in genomic DNA sequences. Information Sciences 163 1-3 (2004) 201-218
    • (2004) Information Sciences , vol.163 , Issue.1-3 , pp. 201-218
    • Yin, M.M.1    Wang, T.L.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.