-
1
-
-
85012236181
-
A framework for clustering evolving data streams
-
C. C. Aggarwal, J. Han, J. Wang, and P. S. Yu. A framework for clustering evolving data streams. In VLDB, pages 81-92, 2003.
-
(2003)
VLDB
, pp. 81-92
-
-
Aggarwal, C.C.1
Han, J.2
Wang, J.3
Yu, P.S.4
-
2
-
-
58149472338
-
Swoosh: a generic approach to entity resolution
-
O. Benjelloun, H. Garcia-Molina, D. Menestrina, Q. Su, S. E. Whang, and J. Widom. Swoosh: a generic approach to entity resolution. VLDB J., 18(1):255-276, 2009.
-
(2009)
VLDB J.
, vol.18
, Issue.1
, pp. 255-276
-
-
Benjelloun, O.1
Garcia-Molina, H.2
Menestrina, D.3
Su, Q.4
Whang, S.E.5
Widom, J.6
-
3
-
-
29844458352
-
Iterative record linkage for cleaning and integration
-
I. Bhattacharya and L. Getoor. Iterative record linkage for cleaning and integration. In DMKD, 2004.
-
(2004)
DMKD
-
-
Bhattacharya, I.1
Getoor, L.2
-
4
-
-
0030675156
-
Incremental clustering and dynamic information retrieval
-
M. Charikar, C. Chekuri, T. Feder, and R. Motwani. Incremental clustering and dynamic information retrieval. In STOC, pages 626-635, 1997.
-
(1997)
STOC
, pp. 626-635
-
-
Charikar, M.1
Chekuri, C.2
Feder, T.3
Motwani, R.4
-
5
-
-
26444550791
-
-
Proc. of ICDE, Tokyo, Japan
-
S. Chaudhuri, V. Ganti, and R. Motwani. Robust identification of fuzzy duplicates. In Proc. of ICDE, Tokyo, Japan, 2005.
-
(2005)
Robust identification of fuzzy duplicates
-
-
Chaudhuri, S.1
Ganti, V.2
Motwani, R.3
-
6
-
-
0029237323
-
Optimizing queries with materialized views
-
S. Chaudhuri, R. Krishnamurthy, S. Potamianos, and K. Shim. Optimizing queries with materialized views. In ICDE, pages 190-200, 1995.
-
(1995)
ICDE
, pp. 190-200
-
-
Chaudhuri, S.1
Krishnamurthy, R.2
Potamianos, S.3
Shim, K.4
-
7
-
-
33845667955
-
Duplicate record detection: A survey
-
A. K. Elmagarmid, P. G. Ipeirotis, and V. S. Verykios. Duplicate record detection: A survey. IEEE Trans. Knowl. Data Eng., 19(1):1-16, 2007.
-
(2007)
IEEE Trans. Knowl. Data Eng.
, vol.19
, Issue.1
, pp. 1-16
-
-
Elmagarmid, A.K.1
Ipeirotis, P.G.2
Verykios, V.S.3
-
8
-
-
0001120850
-
Minimum spanning trees and single linkage cluster analysis
-
J. C. Gower and G. J. S. Ross. Minimum spanning trees and single linkage cluster analysis. Applied Statistics, 18(1):54-64, 1969.
-
(1969)
Applied Statistics
, vol.18
, Issue.1
, pp. 54-64
-
-
Gower, J.C.1
Ross, G.J.S.2
-
9
-
-
77953181128
-
Schema and data: A holistic approach to mapping, resolution and fusion in information integration
-
L. M. Haas, M. Hentschel, D. Kossmann, and R. J. Miller. Schema and data: A holistic approach to mapping, resolution and fusion in information integration. In ER, pages 27-40, 2009.
-
(2009)
ER
, pp. 27-40
-
-
Haas, L.M.1
Hentschel, M.2
Kossmann, D.3
Miller, R.J.4
-
11
-
-
84976856849
-
The merge/purge problem for large databases
-
M. A. Hernández and S. J. Stolfo. The merge/purge problem for large databases. In Proc. of ACM SIGMOD, pages 127-138, 1995.
-
(1995)
Proc. of ACM SIGMOD
, pp. 127-138
-
-
Hernández, M.A.1
Stolfo, S.J.2
-
12
-
-
0344612511
-
A small approximately min-wise independent family of hash functions
-
P. Indyk. A small approximately min-wise independent family of hash functions. J. Algorithms, 38(1):84-90, 2001.
-
(2001)
J. Algorithms
, vol.38
, Issue.1
, pp. 84-90
-
-
Indyk, P.1
-
13
-
-
84893405732
-
Data clustering: A review
-
A. K. Jain, M. N. Murty, and P. J. Flynn. Data clustering: A review. ACM Comput. Surv., 31(3):264-323, 1999.
-
(1999)
ACM Comput. Surv.
, vol.31
, Issue.3
, pp. 264-323
-
-
Jain, A.K.1
Murty, M.N.2
Flynn, P.J.3
-
14
-
-
34548080780
-
-
Cambridge University Press, New York, NY, USA
-
C. D. Manning, P. Raghavan, and H. Schtze. Introduction to Information Retrieval. Cambridge University Press, New York, NY, USA, 2008.
-
(2008)
Introduction to Information Retrieval
-
-
Manning, C.D.1
Raghavan, P.2
Schtze, H.3
-
15
-
-
0034592784
-
Efficient clustering of high-dimensional data sets with application to reference matching
-
Boston, MA
-
A. K. McCallum, K. Nigam, and L. Ungar. Efficient clustering of high-dimensional data sets with application to reference matching. In Proc. of KDD, pages 169-178, Boston, MA, 2000.
-
(2000)
Proc. of KDD
, pp. 169-178
-
-
McCallum, A.K.1
Nigam, K.2
Ungar, L.3
-
16
-
-
4944248042
-
An efficient domain-independent algorithm for detecting approximately duplicate database records
-
A. E. Monge and C. Elkan. An efficient domain-independent algorithm for detecting approximately duplicate database records. In DMKD, pages 23-29, 1997.
-
(1997)
DMKD
, pp. 23-29
-
-
Monge, A.E.1
Elkan, C.2
-
17
-
-
0001139918
-
Record linkage: making maximum use of the discriminating power of identifying information
-
H. B. Newcombe and J. M. Kennedy. Record linkage: making maximum use of the discriminating power of identifying information. Commun. ACM, 5(11):563-566, 1962.
-
(1962)
Commun. ACM
, vol.5
, Issue.11
, pp. 563-566
-
-
Newcombe, H.B.1
Kennedy, J.M.2
-
18
-
-
71349088450
-
Generic entity resolution with negative rules
-
S. E. Whang, O. Benjelloun, and H. Garcia-Molina. Generic entity resolution with negative rules. VLDB J., 18(6):1261-1277, 2009.
-
(2009)
VLDB J.
, vol.18
, Issue.6
, pp. 1261-1277
-
-
Whang, S.E.1
Benjelloun, O.2
Garcia-Molina, H.3
-
20
-
-
70849098813
-
Entity resolution with iterative blocking
-
S. E. Whang, D. Menestrina, G. Koutrika, M. Theobald, and H. Garcia-Molina. Entity resolution with iterative blocking. In SIGMOD Conference, pages 219-232, 2009.
-
(2009)
SIGMOD Conference
, pp. 219-232
-
-
Whang, S.E.1
Menestrina, D.2
Koutrika, G.3
Theobald, M.4
Garcia-Molina, H.5
-
21
-
-
33845615644
-
-
Technical report, Statistical Research Division, U.S. Bureau of the Census, Washington, DC
-
W. Winkler. Overview of record linkage and current research directions. Technical report, Statistical Research Division, U.S. Bureau of the Census, Washington, DC, 2006.
-
(2006)
Overview of record linkage and current research directions
-
-
Winkler, W.1
|