



Volume 1, 2008, Pages 243-254

Similarity measures for categorical data: A comparative evaluation

Author keywords

[No Author keywords available]

Indexed keywords

CLUSTERING ALGORITHMS;

EID: 52649136576     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1137/1.9781611972788.22     Document Type: Conference Paper
Times cited: 442

References (37)
  • 1
    • 33750473714
    • A method to compute distance between two categorical values of same attribute in unsupervised learning for categorical data set
    • A. Ahmad and L. Dey. A method to compute distance between two categorical values of same attribute in unsupervised learning for categorical data set. Pattern Recogn. Lett., 28(1):110-118, 2007.
    • (2007) Pattern Recogn. Lett , vol.28 , Issue.1 , pp. 110-118
    • Ahmad, A.1    Dey, L.2
  • 3
    • 36948999941
    • Irvine, CA: University of California, School of Information and Computer Science
    • A. Asuncion and D. J. Newman. UCI machine learning repository, [http://archive.ics.uci.edu/ml]. Irvine, CA: University of California, School of Information and Computer Science, 2007.
    • (2007) UCI machine learning repository
    • Asuncion, A.1    Newman, D.J.2
  • 5
    • 0344448953
    • On a method for character weighting a similarity coefficient, employing the concept of information
    • T. Burnaby. On a method for character weighting a similarity coefficient, employing the concept of information. Mathematical Geology, 2(1):25-38, 1970.
    • (1970) Mathematical Geology , vol.2 , Issue.1 , pp. 25-38
    • Burnaby, T.1
  • 6
    • 52649147493
    • Similarity measures for categorical data - a comparative study
    • Technical Report 07-022, Department of Computer Science & Engineering, University of Minnesota, October
    • V. Chandola, S. Boriah, and V. Kumar. Similarity measures for categorical data - a comparative study. Technical Report 07-022, Department of Computer Science & Engineering, University of Minnesota, October 2007.
    • (2007)
    • Chandola, V.1    Boriah, S.2    Kumar, V.3
  • 9
    • 0141797880
    • A geometric framework for unsupervised anomaly detection
    • D. Barbara and S. Jajodia, editors, Kluwer Academic Publishers, Norwell, MA
    • E. Eskin, A. Arnold, M. Prerau, L. Portnoy, and S. Stolfo. A geometric framework for unsupervised anomaly detection. In D. Barbara and S. Jajodia, editors, Applications of Data Mining in Computer Security, pages 78-100. Kluwer Academic Publishers, Norwell, MA, 2002.
    • (2002) Applications of Data Mining in Computer Security , pp. 78-100
    • Eskin, E.1    Arnold, A.2    Prerau, M.3    Portnoy, L.4    Stolfo, S.5
  • 10
    • 0002593344
    • Multi-interval discretization of continuous-valued attributes for classification learning
    • San Francisco, CA, Morgan Kaufmann
    • U. M. Fayyad and K. B. Irani. Multi-interval discretization of continuous-valued attributes for classification learning. In Proceedings of the 13th International Joint Conference on Artificial Intelligence, pages 1022-1029, San Francisco, CA, 1993. Morgan Kaufmann.
    • (1993) Proceedings of the 13th International Joint Conference on Artificial Intelligence , pp. 1022-1029
    • Fayyad, U.M.1    Irani, K.B.2
  • 11
    • 52649180500
    • A mathematical model of taxonomy
    • P. Gambaryan. A mathematical model of taxonomy. Izvest. Akad. Nauk Armen. SSR, 17(12):47-53, 1964.
    • (1964) Izvest. Akad. Nauk Armen. SSR , vol.17 , Issue.12 , pp. 47-53
    • Gambaryan, P.1
  • 13
    • 0034133769
    • Clustering categorical data: An approach based on dynamical systems
    • D. Gibson, J. Kleinberg, and P. Raghavan. Clustering categorical data: an approach based on dynamical systems. The VLDB Journal, 8(3):222-236, 2000.
    • (2000) The VLDB Journal , vol.8 , Issue.3 , pp. 222-236
    • Gibson, D.1    Kleinberg, J.2    Raghavan, P.3
  • 14
    • 0001337675
    • A new similarity index based on probability
    • D. W. Goodall. A new similarity index based on probability. Biometrics, 22(4):882-907, 1966.
    • (1966) Biometrics , vol.22 , Issue.4 , pp. 882-907
    • Goodall, D.W.1
  • 15
    • 0034228041
    • ROCK: A robust clustering algorithm for categorical attributes
    • S. Guha, R. Rastogi, and K. Shim. ROCK: A robust clustering algorithm for categorical attributes. Information Systems, 25(5):345-366, 2000.
    • (2000) Information Systems , vol.25 , Issue.5 , pp. 345-366
    • Guha, S.1    Rastogi, R.2    Shim, K.3
  • 17
    • 27144536001
    • Extensions to the k-means algorithm for clustering large data sets with categorical values
    • Z. Huang. Extensions to the k-means algorithm for clustering large data sets with categorical values. Data Mining and Knowledge Discovery, 2(3):283-304, 1998.
    • (1998) Data Mining and Knowledge Discovery , vol.2 , Issue.3 , pp. 283-304
    • Huang, Z.1
  • 19
    • 2942631525
    • A statistical interpretation of term specificity and its application in retrieval
    • Document Retrieval Systems, Taylor Graham Publishing, London, UK, ISBN 0-947568-21-2
    • K. S. Jones. A statistical interpretation of term specificity and its application in retrieval. In Document Retrieval Systems, volume 3 of Taylor Graham Series In Foundations Of Information Science, pages 132-142. Taylor Graham Publishing, London, UK, 1988. ISBN 0-947568-21-2.
    • (1988) Taylor Graham Series In Foundations Of Information Science , vol.3 , pp. 132-142
    • Jones, K.S.1
  • 20
    • 0023454367
    • Pictures of relevance: A geometric analysis of similarity measures
    • W. P. Jones and G. W. Furnas. Pictures of relevance: a geometric analysis of similarity measures. J. Am. Soc. Inf. Sci., 38(6):420-442, 1987.
    • (1987) J. Am. Soc. Inf. Sci , vol.38 , Issue.6 , pp. 420-442
    • Jones, W.P.1    Furnas, G.W.2
  • 22
    • 27644451472
    • An association-based dissimilarity measure for categorical data
    • S. Q. Le and T. B. Ho. An association-based dissimilarity measure for categorical data. Pattern Recogn. Lett., 26(16):2549-2557, 2005.
    • (2005) Pattern Recogn. Lett , vol.26 , Issue.16 , pp. 2549-2557
    • Le, S.Q.1    Ho, T.B.2
  • 23
    • 0005180705
    • An information-theoretic definition of similarity
    • San Francisco, CA, USA, Morgan Kaufmann Publishers Inc
    • D. Lin. An information-theoretic definition of similarity. In ICML '98: Proceedings of the 15th International Conference on Machine Learning, pages 296-304, San Francisco, CA, USA, 1998. Morgan Kaufmann Publishers Inc.
    • (1998) ICML '98: Proceedings of the 15th International Conference on Machine Learning , pp. 296-304
    • Lin, D.1
  • 24
    • 0039726245
    • Measurement of association in a contingency table with special reference to the pigmentation of hair and eye colours of Scottish school children
    • K. Maung. Measurement of association in a contingency table with special reference to the pigmentation of hair and eye colours of Scottish school children. Annals of Eugenics, 11:189-223, 1941.
    • (1941) Annals of Eugenics , vol.11 , pp. 189-223
    • Maung, K.1
  • 27
    • 0009981746
    • A comparative assessment of measures of similarity of fuzzy values
    • C. P. Pappis and N. I. Karacapilidis. A comparative assessment of measures of similarity of fuzzy values. Fuzzy Sets and Systems, 56(2):171-174, 1993.
    • (1993) Fuzzy Sets and Systems , vol.56 , Issue.2 , pp. 171-174
    • Pappis, C.P.1    Karacapilidis, N.I.2
  • 28
    • 0013003935
    • On the general theory of multiple contingency with special reference to partial contingency
    • K. Pearson. On the general theory of multiple contingency with special reference to partial contingency. Biometrika, 11(3):145-158, 1916.
    • (1916) Biometrika , vol.11 , Issue.3 , pp. 145-158
    • Pearson, K.1
  • 31
    • 52649179865
    • On exact methods in systematics
    • E. S. Smirnov. On exact methods in systematics. Systematic Zoology, 17(1):1-13, 1968.
    • (1968) Systematic Zoology , vol.17 , Issue.1 , pp. 1-13
    • Smirnov, E.S.1
  • 33
    • 0022909661
    • Toward memory-based reasoning
    • C. Stanfill and D. Waltz. Toward memory-based reasoning. Commun. ACM, 29(12):1213-1228, 1986.
    • (1986) Commun. ACM , vol.29 , Issue.12 , pp. 1213-1228
    • Stanfill, C.1    Waltz, D.2
  • 35
    • 0000220933
    • A comparative study of similarity measures
    • X. Wang, B. De Baets, and E. Kerre. A comparative study of similarity measures. Fuzzy Sets and Systems, 73(2):259-268, 1995.
    • (1995) Fuzzy Sets and Systems , vol.73 , Issue.2 , pp. 259-268
    • Wang, X.1    De Baets, B.2    Kerre, E.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.