-
1
-
-
34250670467
-
Record linkage: Similarity measures and algorithms
-
June
-
N. Koudas, S. Sarawagi, and D. Srivastava, "Record linkage: similarity measures and algorithms," in Proc. of the 2006 ACM SIGMOD Intl. Conf. on Management of Data, June 2006, pp. 802-803.
-
(2006)
Proc. of the 2006 ACM SIGMOD Intl. Conf. on Management of Data
, pp. 802-803
-
-
Koudas, N.1
Sarawagi, S.2
Srivastava, D.3
-
2
-
-
33845667955
-
Duplicate record detection: A survey
-
A. K. Elmagarmid, P. G. Ipeirotis, and V. S. Verykios, "Duplicate record detection: A survey," IEEE Trans. on Knowledge and Data Engg., vol. 19, no. 1, pp. 1-16, 2007.
-
(2007)
IEEE Trans. on Knowledge and Data Engg
, vol.19
, Issue.1
, pp. 1-16
-
-
Elmagarmid, A.K.1
Ipeirotis, P.G.2
Verykios, V.S.3
-
3
-
-
52649122574
-
-
United States Postal Service, http://www.usps.com.
-
-
-
-
4
-
-
52649107408
-
-
"Wikipedia," http://en.wikipedia.org/.
-
-
-
-
5
-
-
52649174699
-
-
DBLP
-
"DBLP," http://www.informatik.uni-trier.de/~ley/db/index.html.
-
-
-
-
6
-
-
52649098304
-
Advances in record linkage methodology as applied to matching the 1985 census of tampa
-
M. A. Jaro, "Advances in record linkage methodology as applied to matching the 1985 census of tampa," American Statistical Association, 1984.
-
(1984)
American Statistical Association
-
-
Jaro, M.A.1
-
7
-
-
0012866045
-
The state of record linkage and current research problems
-
W. E. Winkler, "The state of record linkage and current research problems." US Bureau of Census, 1999.
-
(1999)
US Bureau of Census
-
-
Winkler, W.E.1
-
8
-
-
52649142555
-
-
Trillium Software
-
Trillium Software, www.trilliumsoft.com/trilliumsoft.nsf.
-
-
-
-
9
-
-
0014757386
-
A general method applicable to the search for similarities in the amino acid sequences of two proteins
-
S. B. Needleman and C. D. Wunsch, "A general method applicable to the search for similarities in the amino acid sequences of two proteins," Journal of Molecular Biology, vol. 48, pp. 443-453, 1970.
-
(1970)
Journal of Molecular Biology
, vol.48
, pp. 443-453
-
-
Needleman, S.B.1
Wunsch, C.D.2
-
11
-
-
85009259903
-
A hidden markov model information retrieval system
-
Aug
-
D. R. H. Miller, T. Leek, and R. M. Schwartz, "A hidden markov model information retrieval system," in Proc. of the 22nd ACM SIGIR Conf. on Research and Development in Information Retrieval, Aug. 1999, pp. 214-221.
-
(1999)
Proc. of the 22nd ACM SIGIR Conf. on Research and Development in Information Retrieval
, pp. 214-221
-
-
Miller, D.R.H.1
Leek, T.2
Schwartz, R.M.3
-
13
-
-
85011029434
-
Example-driven design of efficient record matching queries
-
Sept
-
S. Chaudhuri, B.-C. Chen, V. Ganti, and R. Kaushik, "Example-driven design of efficient record matching queries," in Proc. of the 33rd Intl. Conf. on Very Large Data Bases, Sept. 2007, pp. 23-27.
-
(2007)
Proc. of the 33rd Intl. Conf. on Very Large Data Bases
, pp. 23-27
-
-
Chaudhuri, S.1
Chen, B.-C.2
Ganti, V.3
Kaushik, R.4
-
14
-
-
0242456803
-
Learning domain-independent string transformation weights for high accuracy object identification
-
July
-
S. Tejada, C. Knoblock, and S. Minton, "Learning domain-independent string transformation weights for high accuracy object identification," in Proc. of the 8th ACM SIGKDD Intl. Conf. on Knowledge Discovery and Data Mining, July 2002, pp. 350-359.
-
(2002)
Proc. of the 8th ACM SIGKDD Intl. Conf. on Knowledge Discovery and Data Mining
, pp. 350-359
-
-
Tejada, S.1
Knoblock, C.2
Minton, S.3
-
16
-
-
2342576574
-
Eliminating fuzzy duplicates in data warehouses
-
Aug
-
R. Ananthakrishna, S. Chaudhuri, and V. Ganti, "Eliminating fuzzy duplicates in data warehouses," in Proc. of the 28th Intl. Conf. on Very Large Data Bases, Aug. 2002, pp. 586-597.
-
(2002)
Proc. of the 28th Intl. Conf. on Very Large Data Bases
, pp. 586-597
-
-
Ananthakrishna, R.1
Chaudhuri, S.2
Ganti, V.3
-
17
-
-
29844452555
-
Reference reconciliation in complex information spaces
-
June
-
X. Dong, A. Y. Halevy, and J. Madhavan, "Reference reconciliation in complex information spaces," in Proc. of the 2005 ACM SIGMOD Intl. Conf on Management of Data, June 2005, pp. 85-96.
-
(2005)
Proc. of the 2005 ACM SIGMOD Intl. Conf on Management of Data
, pp. 85-96
-
-
Dong, X.1
Halevy, A.Y.2
Madhavan, J.3
-
18
-
-
24944535349
-
Multi-relational record linkage
-
P. Singla and P. Domingos, "Multi-relational record linkage." in MRDM, 2004.
-
(2004)
MRDM
-
-
Singla, P.1
Domingos, P.2
-
19
-
-
36348996876
-
Collective entity resolution in relational data
-
I. Bhattacharya and L. Getoor, "Collective entity resolution in relational data," IEEE Data Engineering Bulletin, vol. 29, no. 2, pp. 4-12, 2006.
-
(2006)
IEEE Data Engineering Bulletin
, vol.29
, Issue.2
, pp. 4-12
-
-
Bhattacharya, I.1
Getoor, L.2
-
21
-
-
84944318804
-
Approximate string joins in a database (almost) for free
-
Sept
-
L. Gravano, P. G. Ipeirotis, H. V. Jagadish, N. Koudas, et al., "Approximate string joins in a database (almost) for free," in Proc. of the 27th Intl. Conf. on Very Large Data Bases, Sept. 2001, pp. 491-500.
-
(2001)
Proc. of the 27th Intl. Conf. on Very Large Data Bases
, pp. 491-500
-
-
Gravano, L.1
Ipeirotis, P.G.2
Jagadish, H.V.3
Koudas, N.4
-
23
-
-
85104914015
-
Efficient exact set-similarity joins
-
Sept
-
A. Arasu, V. Ganti, and R. Kaushik, "Efficient exact set-similarity joins," in Proc. of the 32nd Intl. Conf. on Very Large Data Bases, Sept. 2006, pp. 918-929.
-
(2006)
Proc. of the 32nd Intl. Conf. on Very Large Data Bases
, pp. 918-929
-
-
Arasu, A.1
Ganti, V.2
Kaushik, R.3
-
25
-
-
3142665421
-
Correlation clustering
-
N. Bansal, A. Blum, and S. Chawla, "Correlation clustering," Mach. Learn., vol. 56, no. 1-3, pp. 89-113, 2002.
-
(2002)
Mach. Learn
, vol.56
, Issue.1-3
, pp. 89-113
-
-
Bansal, N.1
Blum, A.2
Chawla, S.3
-
28
-
-
1142279457
-
Robust and efficient fuzzy match for online data cleaning
-
June
-
S. Chaudhuri, K. Ganjam, V. Ganti, and R. Motwani, "Robust and efficient fuzzy match for online data cleaning," in Proc. of the 2003 ACM SIGMOD Intl. Conf. on Management of Data, June 2003, pp. 313-324.
-
(2003)
Proc. of the 2003 ACM SIGMOD Intl. Conf. on Management of Data
, pp. 313-324
-
-
Chaudhuri, S.1
Ganjam, K.2
Ganti, V.3
Motwani, R.4
-
29
-
-
35448984015
-
Benchmarking declarative approximate selection predicates
-
June
-
A. Chandel, O. Hassanzadeh, N. Koudas, M. Sadoghi, and D. Srivastava, "Benchmarking declarative approximate selection predicates." in Proc. of the 2007 ACM SIGMOD Intl. Conf. on Management of Data, June 2007, pp. 353-364.
-
(2007)
Proc. of the 2007 ACM SIGMOD Intl. Conf. on Management of Data
, pp. 353-364
-
-
Chandel, A.1
Hassanzadeh, O.2
Koudas, N.3
Sadoghi, M.4
Srivastava, D.5
-
30
-
-
0001944742
-
Similarity search in high dimensions via hashing
-
Sept
-
A. Gionis, P. Indyk, and R. Motwani, "Similarity search in high dimensions via hashing," in Proc. of the 25th Intl. Conf. on Very Large Data Bases, Sept. 1999, pp. 518-529.
-
(1999)
Proc. of the 25th Intl. Conf. on Very Large Data Bases
, pp. 518-529
-
-
Gionis, A.1
Indyk, P.2
Motwani, R.3
|