-
1
-
-
33749563831
-
K-means clustering versus validation measures: A data distribution perspective
-
H. Xiong, J. Wu, and J. Chen, "K-means clustering versus validation measures: A data distribution perspective," in Proc. 12th ACM SIGKDD Int. Conf. Knowl. Discov. Data Mining, 2006, pp. 779-784.
-
(2006)
Proc. 12th ACM SIGKDD Int. Conf. Knowl. Discov. Data Mining
, pp. 779-784
-
-
Xiong, H.1
Wu, J.2
Chen, J.3
-
3
-
-
0001457509
-
Some methods for classification and analysis of multivariate observations
-
L. M. Le Cam and J. Neyman, Eds. Berkeley, CA: Univ. California Press
-
J. MacQueen, "Some methods for classification and analysis of multivariate observations," in Proc. 5th Berkeley Symp. Math. Stat. Probab., L. M. Le Cam and J. Neyman, Eds. Berkeley, CA: Univ. California Press, 1967, vol. I.
-
(1967)
Proc. 5th Berkeley Symp. Math. Stat. Probab
, vol.1
-
-
MacQueen, J.1
-
5
-
-
0032665804
-
Genetic K-means algorithm
-
Jun
-
K. Krishna and M. Narasimha Murty, "Genetic K-means algorithm," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 29, no. 3, pp. 433-439, Jun. 1999.
-
(1999)
IEEE Trans. Syst., Man, Cybern. B, Cybern
, vol.29
, Issue.3
, pp. 433-439
-
-
Krishna, K.1
Narasimha Murty, M.2
-
6
-
-
2442439674
-
A comparison of document clustering techniques
-
Aug
-
M. Steinbach, G. Karypis, and V. Kumar, "A comparison of document clustering techniques," in Proc. Workshop Text Mining, 6th ACM SIGKDD Int. Conf. Knowl. Discov. Data Mining, Aug. 2000, pp. 20-23.
-
(2000)
Proc. Workshop Text Mining, 6th ACM SIGKDD Int. Conf. Knowl. Discov. Data Mining
, pp. 20-23
-
-
Steinbach, M.1
Karypis, G.2
Kumar, V.3
-
7
-
-
3543085722
-
Criterion functions for document clustering: Experiments and analysis
-
Jun
-
Y. Zhao and G. Karypis, "Criterion functions for document clustering: Experiments and analysis," Mach. Learn., vol. 55, no. 3, pp. 311-331, Jun. 2004.
-
(2004)
Mach. Learn
, vol.55
, Issue.3
, pp. 311-331
-
-
Zhao, Y.1
Karypis, G.2
-
9
-
-
0037284901
-
Using self-similarity to cluster large data sets
-
Apr
-
D. Barbará and P. Chen, "Using self-similarity to cluster large data sets," Data Mining Knowl. Discov., vol. 7, no. 2, pp. 123-152, Apr. 2003.
-
(2003)
Data Mining Knowl. Discov
, vol.7
, Issue.2
, pp. 123-152
-
-
Barbará, D.1
Chen, P.2
-
10
-
-
0038494682
-
Coolcat: An entropy-based algorithm for categorical clustering
-
D. Barbará, Y. Li, and J. Couto, "Coolcat: An entropy-based algorithm for categorical clustering," in Proc. 11th ACM Int. Conf. Inf. Knowl. Manag., 2002, pp. 582-589.
-
(2002)
Proc. 11th ACM Int. Conf. Inf. Knowl. Manag
, pp. 582-589
-
-
Barbará, D.1
Li, Y.2
Couto, J.3
-
11
-
-
64049099517
-
Size regularized cut for data clustering
-
Y. Chen, Y. Zhang, and X. Ji, "Size regularized cut for data clustering," in Proc. NIPS, 2005, pp. 211-218.
-
(2005)
Proc. NIPS
, pp. 211-218
-
-
Chen, Y.1
Zhang, Y.2
Ji, X.3
-
12
-
-
0141767425
-
Graph-based hierarchical conceptual clustering
-
I. Jonyer, D. J. Cook, and L. B. Holder, "Graph-based hierarchical conceptual clustering," J. Mach. Learn. Res., vol. 2, no. 2, pp. 19-43, 2001.
-
(2001)
J. Mach. Learn. Res
, vol.2
, Issue.2
, pp. 19-43
-
-
Jonyer, I.1
Cook, D.J.2
Holder, L.B.3
-
13
-
-
33748875128
-
Kernel principle component analysis in pixels clustering
-
J. Li, D. Tao, W. Hu, and X. Li, "Kernel principle component analysis in pixels clustering," in Proc. Web Intell., 2005, pp. 786-789.
-
(2005)
Proc. Web Intell
, pp. 786-789
-
-
Li, J.1
Tao, D.2
Hu, W.3
Li, X.4
-
14
-
-
34249827286
-
Enhancing the effectiveness of clustering with spectra analysis
-
Jul
-
W. Li, W. K. Ng, Y. Liu, and K.-L. Ong, "Enhancing the effectiveness of clustering with spectra analysis," IEEE Trans. Knowl. Data Eng., vol. 19, no. 7, pp. 887-902, Jul. 2007.
-
(2007)
IEEE Trans. Knowl. Data Eng
, vol.19
, Issue.7
, pp. 887-902
-
-
Li, W.1
Ng, W.K.2
Liu, Y.3
Ong, K.-L.4
-
15
-
-
0141860731
-
Cluster validity methods: Part I
-
M. Halkidi, Y. Batistakis, and M. Vazirgiannis, "Cluster validity methods: Part I," SIGMOD Rec., vol. 31, no. 2, pp. 40-45, 2002.
-
(2002)
SIGMOD Rec
, vol.31
, Issue.2
, pp. 40-45
-
-
Halkidi, M.1
Batistakis, Y.2
Vazirgiannis, M.3
-
19
-
-
64049090261
-
-
-
G. Karypis, Cluto: Software for clustering high-dimensional datasets 2006. version 2.1.1. [Online]. Available: http://glaros.dtc.umn.edu/gkhome/views/cluto
-
-
-
-
20
-
-
64049118973
-
-
-
TREC, Text Retrieval Conference, 1996. [Online]. Available: http://trec.nist.gov
-
(1996)
Text Retrieval Conference
-
-
-
21
-
-
85030313899
-
Ohsumed: An interactive retrieval evaluation and new large test collection for research
-
Jul
-
W. Hersh, C. Buckley, T. J. Leone, and D. Hickam, "Ohsumed: An interactive retrieval evaluation and new large test collection for research," in Proc. 17th Annu. Int. ACM SIGIR Conf. Res. Develop. Inf. Retr., Jul. 1994, pp. 192-201.
-
(1994)
Proc. 17th Annu. Int. ACM SIGIR Conf. Res. Develop. Inf. Retr
, pp. 192-201
-
-
Hersh, W.1
Buckley, C.2
Leone, T.J.3
Hickam, D.4
-
23
-
-
0031710353
-
Webace: A web agent for document categorization and exploration
-
E.-H. Han, D. Boley, M. Gini, R. Gross, K. Hastings, G. Karypis, V. Kumar, B. Mobasher, and J. Moore, "Webace: A web agent for document categorization and exploration," in Proc. 2nd Int. Conf. Auton. Agents, 1998, pp. 408-415.
-
(1998)
Proc. 2nd Int. Conf. Auton. Agents
, pp. 408-415
-
-
Han, E.-H.1
Boley, D.2
Gini, M.3
Gross, R.4
Hastings, K.5
Karypis, G.6
Kumar, V.7
Mobasher, B.8
Moore, J.9
-
24
-
-
84948481845
-
An algorithm for suffix stripping
-
Jul
-
M. F. Porter, "An algorithm for suffix stripping," Program, vol. 14, no. 3, pp. 130-137, Jul. 1980.
-
(1980)
Program
, vol.14
, Issue.3
, pp. 130-137
-
-
Porter, M.F.1
-
25
-
-
0035923521
-
-
-
A. Bhattacharjee et al., "Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses," Proc. Natl. Acad. Sci. U. S. A., vol. 98, no. 24, pp. 13 790-13 795, Nov. 2001.
-
-
-
-
26
-
-
19044399684
-
Classification, subtype discovery, and prediction of outcome in pediatric acute lymphoblastic leukemia by gene expression profiling
-
Mar
-
E.-J. Yeoh et al., "Classification, subtype discovery, and prediction of outcome in pediatric acute lymphoblastic leukemia by gene expression profiling," Cancer Cell, vol. 1, no. 2, pp. 133-143, Mar. 2002.
-
(2002)
Cancer Cell
, vol.1
, Issue.2
, pp. 133-143
-
-
Yeoh, E.-J.1
-
28
-
-
33745834241
-
-
-
D. Newman, S. Hettich, C. Blake, and C. Merz, UCI Repository of Machine Learning Databases, 1998. [Online]. Available: http://www.ics.uci.edu/~mlearn/MLRepository.html
-
(1998)
UCI Repository of Machine Learning Databases
-
-
Newman, D.1
Hettich, S.2
Blake, C.3
Merz, C.4
-
32
-
-
26944478556
-
Comparing dimension reduction techniques for document clustering
-
B. Tang, M. Shepherd, M. I. Heywood, and X. Luo, "Comparing dimension reduction techniques for document clustering," in Proc. Can. Conf. AI, 2005, pp. 292-296.
-
(2005)
Proc. Can. Conf. AI
, pp. 292-296
-
-
Tang, B.1
Shepherd, M.2
Heywood, M.I.3
Luo, X.4
-
33
-
-
0015680655
-
Clustering using a similarity measure based on shared near neighbors
-
Nov
-
R. Jarvis and E. Patrick, "Clustering using a similarity measure based on shared near neighbors," IEEE Trans. Comput., vol. C-22, no. 11, pp. 1025-1034, Nov. 1973.
-
(1973)
IEEE Trans. Comput
, vol.C-22
, Issue.11
, pp. 1025-1034
-
-
Jarvis, R.1
Patrick, E.2
-
34
-
-
0032091595
-
Cure: An efficient clustering algorithm for large databases
-
Jun
-
S. Guha, R. Rastogi, and K. Shim, "Cure: An efficient clustering algorithm for large databases," in Proc. ACM SIGMOD Int. Conf. Manag. Data, Jun. 1998, pp. 73-84.
-
(1998)
Proc. ACM SIGMOD Int. Conf. Manag. Data
, pp. 73-84
-
-
Guha, S.1
Rastogi, R.2
Shim, K.3
-
35
-
-
0034133513
-
Distance-based outliers: Algorithms and applications
-
Feb
-
E. Knorr, R. Ng, and V. Tucakov, "Distance-based outliers: Algorithms and applications," VLDB J., vol. 8, no. 3/4, pp. 237-253, Feb. 2000.
-
(2000)
VLDB J
, vol.8
, Issue.3-4
, pp. 237-253
-
-
Knorr, E.1
Ng, R.2
Tucakov, V.3
-
36
-
-
0344827211
-
Robust clustering by pruning outliers
-
Dec
-
J.-S. Zhang and Y.-W. Leung, "Robust clustering by pruning outliers," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 33, no. 6, pp. 983-998, Dec. 2003.
-
(2003)
IEEE Trans. Syst., Man, Cybern. B, Cybern
, vol.33
, Issue.6
, pp. 983-998
-
-
Zhang, J.-S.1
Leung, Y.-W.2
-
37
-
-
34548773378
-
Distributed data stream clustering: A fast EM-based approach
-
A. Zhou, F. Cao, Y. Yan, C. Sha, and X. He, "Distributed data stream clustering: A fast EM-based approach," in Proc. 23rd Int. Conf. Data Eng., 2007, pp. 736-745.
-
(2007)
Proc. 23rd Int. Conf. Data Eng
, pp. 736-745
-
-
Zhou, A.1
Cao, F.2
Yan, Y.3
Sha, C.4
He, X.5
-
38
-
-
0039253819
-
LOF: Identifying density based local outliers
-
M. Breunig, H.-P. Kriegel, R. Ng, and J. Sander, "LOF: Identifying density based local outliers," in Proc. ACM SIGMOD Int. Conf. Manag. Data, 2000, pp. 427-438.
-
(2000)
Proc. ACM SIGMOD Int. Conf. Manag. Data
, pp. 427-438
-
-
Breunig, M.1
Kriegel, H.-P.2
Ng, R.3
Sander, J.4
-
39
-
-
0038663185
-
Intrusion detection with unlabeled data using clustering
-
L. Portnoy, E. Eskin, and S. Stolfo, "Intrusion detection with unlabeled data using clustering," in Proc. ACM CSS Workshop DMSA, 2001, pp. 5-8.
-
(2001)
Proc. ACM CSS Workshop DMSA
, pp. 5-8
-
-
Portnoy, L.1
Eskin, E.2
Stolfo, S.3
-
40
-
-
85170282443
-
A density-based algorithm for discovering clusters in large spatial databases with noise
-
Aug
-
M. Ester, H.-P. Kriegel, J. Sander, and X. Xu, "A density-based algorithm for discovering clusters in large spatial databases with noise," in Proc. 2nd ACM SIGKDD Int. Conf. Knowl. Discov. Data Mining, Aug. 1996, pp. 226-231.
-
(1996)
Proc. 2nd ACM SIGKDD Int. Conf. Knowl. Discov. Data Mining
, pp. 226-231
-
-
Ester, M.1
Kriegel, H.-P.2
Sander, J.3
Xu, X.4
-
41
-
-
27844564572
-
A new shared nearest neighbor clustering algorithm and its applications
-
L. Ertoz, M. Steinbach, and V. Kumar, "A new shared nearest neighbor clustering algorithm and its applications," in Proc. Workshop Clustering High Dimensional Data Appl., 2002, pp. 105-115.
-
(2002)
Proc. Workshop Clustering High Dimensional Data Appl
, pp. 105-115
-
-
Ertoz, L.1
Steinbach, M.2
Kumar, V.3
-
42
-
-
84953806973
-
Scaling clustering algorithms to large databases
-
Aug
-
P. Bradley, U. Fayyad, and C. Reina, "Scaling clustering algorithms to large databases," in Proc. 4th ACM SIGKDD Int. Conf. Knowl. Discov. Data Mining, Aug. 1998, pp. 9-15.
-
(1998)
Proc. 4th ACM SIGKDD Int. Conf. Knowl. Discov. Data Mining
, pp. 9-15
-
-
Bradley, P.1
Fayyad, U.2
Reina, C.3
-
44
-
-
0030157145
-
BIRCH: An efficient data clustering method for very large databases
-
Jun
-
T. Zhang, R. Ramakrishnan, and M. Livny, "BIRCH: An efficient data clustering method for very large databases," in Proc. ACM SIGMOD Int. Conf. Manag. Data, Jun. 1996, pp. 103-114.
-
(1996)
Proc. ACM SIGMOD Int. Conf. Manag. Data
, pp. 103-114
-
-
Zhang, T.1
Ramakrishnan, R.2
Livny, M.3
-
45
-
-
0032098774
-
Some new indexes of cluster validity
-
Jun
-
J. Bezdek and N. Pal, "Some new indexes of cluster validity," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 28, no. 3, pp. 427-436, Jun. 1998.
-
(1998)
IEEE Trans. Syst., Man, Cybern. B, Cybern
, vol.28
, Issue.3
, pp. 427-436
-
-
Bezdek, J.1
Pal, N.2