-
4
-
-
4644233828
-
The Star Clustering Algorithm For Static And Dynamic Information Organization
-
J. A. Aslam, E. Pelekhov, and D. Rus. The Star Clustering Algorithm For Static And Dynamic Information Organization. Journal of Graph Algorithms and Applications, 8(1):95-129, 2004.
-
(2004)
Journal of Graph Algorithms and Applications
, vol.8
, Issue.1
, pp. 95-129
-
-
Aslam, J.A.1
Pelekhov, E.2
Rus, D.3
-
5
-
-
3142665421
-
Correlation Clustering
-
N. Bansal, A. Blum, and S. Chawla. Correlation Clustering. Machine Learning, 56(1-3):89-113, 2004.
-
(2004)
Machine Learning
, vol.56
, Issue.1-3
, pp. 89-113
-
-
Bansal, N.1
Blum, A.2
Chawla, S.3
-
6
-
-
84922874074
-
Seeking Stable Clusters In The Blogosphere
-
Vienna, Austria
-
N. Bansal, F. Chiang, N. Koudas, and F. W. Tompa. Seeking Stable Clusters In The Blogosphere. In Proc. of the Int'l Conf. on Very Large Data Bases (VLDB), pages 806-817, Vienna, Austria, 2007.
-
(2007)
Proc. of the Int'l Conf. on Very Large Data Bases (VLDB)
, pp. 806-817
-
-
Bansal, N.1
Chiang, F.2
Koudas, N.3
Tompa, F.W.4
-
7
-
-
35348849154
-
Scaling Up All Pairs Similarity Search
-
Banff, Canada
-
R. J. Bayardo, Y. Ma, and R. Srikant. Scaling Up All Pairs Similarity Search. In Int'l World Wide Web Conference (WWW), pages 131-140, Banff, Canada, 2007.
-
(2007)
Int'l World Wide Web Conference (WWW)
, pp. 131-140
-
-
Bayardo, R.J.1
Ma, Y.2
Srikant, R.3
-
8
-
-
33745448357
-
A Latent Dirichlet Model for Unsupervised Entity Resolution
-
Bethesda, MD, USA
-
I. Bhattacharya and L. Getoor. A Latent Dirichlet Model for Unsupervised Entity Resolution. In Proc. of the SIAM International Conference on Data Mining (SDM), pages 47-58, Bethesda, MD, USA, 2006.
-
(2006)
Proc. of the SIAM International Conference on Data Mining (SDM)
, pp. 47-58
-
-
Bhattacharya, I.1
Getoor, L.2
-
11
-
-
33751255087
-
Evaluation of Clustering Algorithms for Protein-Protein Interaction Networks
-
S. Brohee and J. van Helden. Evaluation of Clustering Algorithms for Protein-Protein Interaction Networks. BMC Bioinformatics, 7:488+, 2006.
-
(2006)
BMC Bioinformatics
, vol.7
-
-
Brohee, S.1
van Helden, J.2
-
12
-
-
24644456480
-
Clustering with Qualitative Information
-
M. Charikar, V. Guruswami, and A. Wirth. Clustering with Qualitative Information. J. Comput. Syst. Sci., 71(3):360-383, 2005.
-
(2005)
J. Comput. Syst. Sci.
, vol.71
, Issue.3
, pp. 360-383
-
-
Charikar, M.1
Guruswami, V.2
Wirth, A.3
-
13
-
-
26444550791
-
Robust Identification of Fuzzy Duplicates
-
Washington, DC, USA
-
S. Chaudhuri, V. Ganti, and R. Motwani. Robust Identification of Fuzzy Duplicates. In IEEE Proc. of the Int'l Conf. on Data Eng., pages 865-876, Washington, DC, USA, 2005.
-
(2005)
IEEE Proc. of the Int'l Conf. on Data Eng.
, pp. 865-876
-
-
Chaudhuri, S.1
Ganti, V.2
Motwani, R.3
-
14
-
-
4243136572
-
On a Recursive Spectral Algorithm for Clustering from Pairwise Similarities
-
Technical Report MIT-LCS-TR-906, MIT LCS
-
D. Cheng, R. Kannan, S. Vempala, and G. Wang. On a Recursive Spectral Algorithm for Clustering from Pairwise Similarities. Technical Report MIT-LCS-TR-906, MIT LCS, 2003.
-
(2003)
-
-
Cheng, D.1
Kannan, R.2
Vempala, S.3
Wang, G.4
-
15
-
-
35448996113
-
Finding Near Neighbors Through Cluster Pruning.
-
Beijing, China
-
F. Chierichetti, A. Panconesi, P. Raghavan, M. Sozio, A. Tiberi, and E. Upfal. Finding Near Neighbors Through Cluster Pruning. In Proc. of the ACM Symp. on Principles of Database Systems (PODS), pages 103-112, Beijing, China, 2007.
-
(2007)
Proc. of the ACM Symp. on Principles of Database Systems (PODS)
, pp. 103-112
-
-
Chierichetti, F.1
Panconesi, A.2
Raghavan, P.3
Sozio, M.4
Tiberi, A.5
Upfal, E.6
-
16
-
-
11144240583
-
A Comparison of String Distance Metrics for Name-Matching Tasks
-
Acapulco, Mexico
-
W. W. Cohen, P. Ravikumar, and S. E. Fienberg. A Comparison of String Distance Metrics for Name-Matching Tasks. In Proc. of IJCAI-03 Workshop on Information Integration on the Web (IIWeb-03), pages 73-78, Acapulco, Mexico, 2003.
-
(2003)
Proc. of IJCAI-03 Workshop on Information Integration on the Web (IIWeb-03)
, pp. 73-78
-
-
Cohen, W.W.1
Ravikumar, P.2
Fienberg, S.E.3
-
18
-
-
0002546287
-
Efficient Algorithms for Agglomerative Hierarchical Clustering Methods
-
W. H. Day and H. Edelsbrunner. Efficient Algorithms for Agglomerative Hierarchical Clustering Methods. Journal of Classification, 1(1):7-24, 1984.
-
(1984)
Journal of Classification
, vol.1
, Issue.1
, pp. 7-24
-
-
Day, W.H.1
Edelsbrunner, H.2
-
19
-
-
33746868385
-
Immorlica. Correlation Clustering In General Weighted Graphs
-
E. D. Demaine, D. Emanuel, A. Fiat, and N. Immorlica. Correlation Clustering In General Weighted Graphs. Theor. Comput. Sci., 361(2):172-187, 2006.
-
(2006)
Theor. Comput. Sci.
, vol.361
, Issue.2
, pp. 172-187
-
-
Demaine, E.D.1
Emanuel, D.2
Fiat, A.3
Immorlica, N.4
-
20
-
-
0000891810
-
Algorithm for Solution of a Problem Of Maximum Flow in Networks with Power Estimation
-
E. A. Dinic. Algorithm for Solution of a Problem Of Maximum Flow in Networks with Power Estimation. Soviet Math. Dokl, 11:1277-1280, 1970.
-
(1970)
Soviet Math. Dokl
, vol.11
, pp. 1277-1280
-
-
Dinic, E.A.1
-
21
-
-
0015330635
-
Theoretical Improvements in Algorithmic Efficiency for Network Flow Problems
-
J. Edmonds and R. M. Karp. Theoretical Improvements in Algorithmic Efficiency for Network Flow Problems. J. ACM, 19(2):248-264, 1972.
-
(1972)
J. ACM
, vol.19
, Issue.2
, pp. 248-264
-
-
Edmonds, J.1
Karp, R.M.2
-
23
-
-
34548025132
-
A Survey of Kernel and Spectral Methods for Clustering
-
M. Filippone, F. Camastra, F. Masulli, and S. Rovetta. A Survey of Kernel and Spectral Methods for Clustering. Pattern Recognition, 41(1):176-190, 2008.
-
(2008)
Pattern Recognition
, vol.41
, Issue.1
, pp. 176-190
-
-
Filippone, M.1
Camastra, F.2
Masulli, F.3
Rovetta, S.4
-
24
-
-
84906283185
-
Graph Clustering and Minimum Cut Trees
-
G. W. Flake, R. E. Tarjan, and K. Tsioutsiouliklis. Graph Clustering and Minimum Cut Trees. Internet Mathematics, 1(4):385-408, 2004.
-
(2004)
Internet Mathematics
, vol.1
, Issue.4
, pp. 385-408
-
-
Flake, G.W.1
Tarjan, R.E.2
Tsioutsiouliklis, K.3
-
25
-
-
0001261128
-
Maximal Flow Through a Network
-
L. Ford and D. Fulkerson. Maximal Flow Through a Network. Canadian J. Math, 8:399-404, 1956.
-
(1956)
Canadian J. Math
, vol.8
, pp. 399-404
-
-
Ford, L.1
Fulkerson, D.2
-
27
-
-
0024090156
-
A New Approach to the Maximum-Flow Problem
-
A. V. Goldberg and R. E. Tarjan. A New Approach to the Maximum-Flow Problem. Journal of the ACM, 35(4):921-940, 1988.
-
(1988)
Journal of the ACM
, vol.35
, Issue.4
, pp. 921-940
-
-
Goldberg, A.V.1
Tarjan, R.E.2
-
28
-
-
0035676057
-
On clustering validation techniques
-
M. Halkidi, Y. Batistakis, and M. Vazirgiannis. On clustering validation techniques. journal, 17(2-3):107-145, 2001.
-
(2001)
journal
, vol.17
, Issue.2-3
, pp. 107-145
-
-
Halkidi, M.1
Batistakis, Y.2
Vazirgiannis, M.3
-
29
-
-
70349814264
-
Benchmarking Declarative Approximate Selection Predicates
-
Master's thesis, University of Toronto
-
O. Hassanzadeh. Benchmarking Declarative Approximate Selection Predicates. Master's thesis, University of Toronto, February 2007.
-
(2007)
-
-
Hassanzadeh, O.1
-
30
-
-
84865063889
-
Creating Probabilistic Databases from Duplicated Data
-
Technical Report CSRG-568, University of Toronto, To appear in The VLDB Journal, Accepted on 26
-
O. Hassanzadeh and R. J. Miller. Creating Probabilistic Databases from Duplicated Data. Technical Report CSRG-568, University of Toronto, To appear in The VLDB Journal, Accepted on 26 June 2009.
-
(2009)
-
-
Hassanzadeh, O.1
Miller, R.J.2
-
31
-
-
63149183776
-
Accuracy of Approximate String Joins Using Grams
-
Vienna, Austria
-
O. Hassanzadeh, M. Sadoghi, and R. J. Miller. Accuracy of Approximate String Joins Using Grams. In Proc. of the International Workshop on Quality in Databases (QDB), pages 11-18, Vienna, Austria, 2007.
-
(2007)
Proc. of the International Workshop on Quality in Databases (QDB)
, pp. 11-18
-
-
Hassanzadeh, O.1
Sadoghi, M.2
Miller, R.J.3
-
32
-
-
0003067623
-
Scalable Techniques for Clustering the Web
-
Dallas, Texas, USA
-
T. H. Haveliwala, A. Gionis, and P. Indyk. Scalable Techniques for Clustering the Web. In Proc. of the Int'l Workshop on the Web and Databases (WebDB), pages 129-134, Dallas, Texas, USA, 2000.
-
(2000)
Proc. of the Int'l Workshop on the Web and Databases (WebDB)
, pp. 129-134
-
-
Haveliwala, T.H.1
Gionis, A.2
Indyk, P.3
-
33
-
-
0013331361
-
Real-world Data is Dirty: Data Cleansing and The Merge/Purge Problem
-
M. A. Hernández and S. J. Stolfo. Real-world Data is Dirty: Data Cleansing and The Merge/Purge Problem. Data Mining and Knowledge Discovery, 2(1):9-37, 1998.
-
(1998)
Data Mining and Knowledge Discovery
, vol.2
, Issue.1
, pp. 9-37
-
-
Hernández, M.A.1
Stolfo, S.J.2
-
34
-
-
0032729435
-
Exploring Expression Data: Identification and Analysis of Coexpressed Genes
-
L. J. Heyer, S. Kruglyak, and S. Yooseph. Exploring Expression Data: Identification and Analysis of Coexpressed Genes. Genome Res., 9(11):1106-1115, 1999.
-
(1999)
Genome Res.
, vol.9
, Issue.11
, pp. 1106-1115
-
-
Heyer, L.J.1
Kruglyak, S.2
Yooseph, S.3
-
36
-
-
84893405732
-
Data Clustering: A Review
-
A. K. Jain, M. N. Murty, and P. J. Flynn. Data Clustering: A Review. ACM Computing Surveys, 31(3):264-323, 1999.
-
(1999)
ACM Computing Surveys
, vol.31
, Issue.3
, pp. 264-323
-
-
Jain, A.K.1
Murty, M.N.2
Flynn, P.J.3
-
37
-
-
4243128193
-
On clusterings: Good, bad and spectral
-
R. Kannan, S. Vempala, and A. Vetta. On clusterings: Good, bad and spectral. Journal of the ACM, 51(3):497-515, 2004.
-
(2004)
Journal of the ACM
, vol.51
, Issue.3
, pp. 497-515
-
-
Kannan, R.1
Vempala, S.2
Vetta, A.3
-
38
-
-
33750410010
-
A Framework for Protein Structure Classification and Identification of Novel Protein Structures
-
Y. J. Kim and J. M. Patel. A Framework for Protein Structure Classification and Identification of Novel Protein Structures. BMC Bioinformatics, 7:456+, 2006.
-
(2006)
BMC Bioinformatics
, vol.7
-
-
Kim, Y.J.1
Patel, J.M.2
-
39
-
-
10244276179
-
Graph Clustering with Restricted Neighbourhood Search
-
Master's thesis, University of Toronto
-
A. D. King. Graph Clustering with Restricted Neighbourhood Search. Master's thesis, University of Toronto, 2004.
-
(2004)
-
-
King, A.D.1
-
41
-
-
85011032600
-
VGRAM: Improving Performance of Approximate Queries on String Collections Using Variable-Length Grams
-
Vienna, Austria
-
C. Li, B. Wang, and X. Yang. VGRAM: Improving Performance of Approximate Queries on String Collections Using Variable-Length Grams. In Proc. of the Int'l Conf. on Very Large Data Bases (VLDB), pages 303-314, Vienna, Austria, 2007.
-
(2007)
Proc. of the Int'l Conf. on Very Large Data Bases (VLDB)
, pp. 303-314
-
-
Li, C.1
Wang, B.2
Yang, X.3
-
42
-
-
0001820920
-
X-Means: Extending K-Means with Efficient Estimation of the Number of Clusters
-
San Francisco, CA, USA
-
A. M. D. Pelleg. X-Means: Extending K-Means with Efficient Estimation of the Number of Clusters. In Proc. of the Int'l Conf. on Machine Learning, pages 727-734, San Francisco, CA, USA, 2000.
-
(2000)
Proc. of the Int'l Conf. on Machine Learning
, pp. 727-734
-
-
Pelleg, A.M.D.1
-
44
-
-
9444222292
-
The Information Bottleneck: Theory And Applications
-
PhD thesis, The Hebrew University
-
N. Slonim. The Information Bottleneck: Theory And Applications. PhD thesis, The Hebrew University, 2003.
-
(2003)
-
-
Slonim, N.1
-
45
-
-
1842435182
-
Correlation Clustering: Maximizing Agreements Via Semidefinite Programming
-
New Orleans, Louisiana, USA
-
C. Swamy. Correlation Clustering: Maximizing Agreements Via Semidefinite Programming. In Proc. of the Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 526-527, New Orleans, Louisiana, USA, 2004.
-
(2004)
Proc. of the Annual ACM-SIAM Symposium on Discrete Algorithms (SODA)
, pp. 526-527
-
-
Swamy, C.1
-
47
-
-
0005924596
-
Graph Clustering By Flow Simulation
-
PhD thesis, University of Utrecht
-
S. van Dongen. Graph Clustering By Flow Simulation. PhD thesis, University of Utrecht, 2000.
-
(2000)
-
-
van Dongen, S.1
-
48
-
-
84865100525
-
Graph Clustering With Overlap
-
Master's thesis, University of Toronto
-
J. A. Whitney. Graph Clustering With Overlap. Master's thesis, University of Toronto, 2006.
-
(2006)
-
-
Whitney, J.A.1
-
51
-
-
0004045546
-
Clustering of Large Data Sets
-
Research Studies Press
-
J. Zupan. Clustering of Large Data Sets. Research Studies Press, 1982.
-
(1982)
-
-
Zupan, J.1
|