-
2
-
-
29844458352
-
Iterative record linkage for cleaning and integration
-
I. Bhattacharya and L. Getoor. Iterative record linkage for cleaning and integration. In DMKD, 2004.
-
(2004)
DMKD
-
-
Bhattacharya, I.1
Getoor, L.2
-
3
-
-
77952372966
-
Adaptive duplicate detection using learnable string similarity measures
-
M. Bilenko and R. Mooney. Adaptive duplicate detection using learnable string similarity measures. In SIGKDD, 2003.
-
(2003)
SIGKDD
-
-
Bilenko, M.1
Mooney, R.2
-
4
-
-
2342447399
-
Adaptive name matching in information integration
-
September
-
M. Bilenko, R. Mooney, W. Cohen, P. Ravikumar, and S. Fienberg. Adaptive name matching in information integration. IEEE Intelligent Systems Special Issue on Information Integration on the Web, September 2003.
-
(2003)
IEEE Intelligent Systems Special Issue on Information Integration on the Web
-
-
Bilenko, M.1
Mooney, R.2
Cohen, W.3
Ravikumar, P.4
Fienberg, S.5
-
10
-
-
11144240583
-
A comparison of string distance metrics for name-matching tasks
-
W. W. Cohen, P. Ravikumar, and S. E. Fienberg. A comparison of string distance metrics for name-matching tasks. In IIWEB, pages 73-78, 2003.
-
(2003)
IIWEB
, pp. 73-78
-
-
Cohen, W.W.1
Ravikumar, P.2
Fienberg, S.E.3
-
11
-
-
84858526851
-
-
http://www.cs.umass.edu/~mccallum/data/cora-refs.tar.gz.
-
-
-
-
12
-
-
18744368587
-
Object matching for information integration: A profiler-based approach
-
A. Doan, Y. Lu, Y. Lee, and J. Han. Object matching for information integration: a profiler-based approach. In II Web, 2003.
-
(2003)
II Web
-
-
Doan, A.1
Lu, Y.2
Lee, Y.3
Han, J.4
-
13
-
-
33745613666
-
A platform for personal information management and integration
-
X. Dong and A. Halevy. A Platform for Personal Information Management and Integration. In Proc. of CIDR, 2005.
-
(2005)
Proc. of CIDR
-
-
Dong, X.1
Halevy, A.2
-
14
-
-
29844452045
-
Reference reconciliation in complex information spaces
-
Univ. of Washington
-
X. Dong, A. Halevy, and J. Madhavan. Reference Reconciliation in Complex Information Spaces. Technical Report 2005-03-04, Univ. of Washington, 2005.
-
(2005)
Technical Report 2005-03-04
-
-
Dong, X.1
Halevy, A.2
Madhavan, J.3
-
15
-
-
65449161206
-
Semex: Toward on-the-fly personal information integration
-
X. Dong, A. Halevy, E. Nemes, S. Sigurdsson, and P. Domingos. Semex: Toward on-the-fly personal information integration. In II Web, 2004.
-
(2004)
II Web
-
-
Dong, X.1
Halevy, A.2
Nemes, E.3
Sigurdsson, S.4
Domingos, P.5
-
16
-
-
1542287504
-
Stuff i've seen: A system for personal information retrieval and re-use
-
S. Dumais, E. Cutrell, J. Cadiz, G. Jancke, R. Sarin, and D. C. Robbins. Stuff i've seen: A system for personal information retrieval and re-use. In SIGIR, 2003.
-
(2003)
SIGIR
-
-
Dumais, S.1
Cutrell, E.2
Cadiz, J.3
Jancke, G.4
Sarin, R.5
Robbins, D.C.6
-
18
-
-
0344756845
-
Declarative data cleaning: Language, model, and algorithms
-
H. Galhardas, D. Florescu, D. Shasha, E. Simon, and C.-A. Saita. Declarative data cleaning: language, model, and algorithms. In VLDB, pages 371-380, 2001.
-
(2001)
VLDB
, pp. 371-380
-
-
Galhardas, H.1
Florescu, D.2
Shasha, D.3
Simon, E.4
Saita, C.-A.5
-
19
-
-
84858521243
-
-
Google. http://desktop.google.com/, 2004.
-
(2004)
-
-
-
21
-
-
84976856849
-
The merge/purge problem for large databases
-
M. A. Hernandez and S. J. Stolfo. The merge/purge problem for large databases. In SIGMOD, 1995.
-
(1995)
SIGMOD
-
-
Hernandez, M.A.1
Stolfo, S.J.2
-
22
-
-
84943425383
-
Efficient record linkage in large data sets
-
L. Jin, C. Li, and S. Mehrotra. Efficient Record Linkage in Large Data Sets. In DASFAA, 2003.
-
(2003)
DASFAA
-
-
Jin, L.1
Li, C.2
Mehrotra, S.3
-
24
-
-
0034592786
-
Intelliclean: A knowledge-based intelligent data cleaner
-
M. L. Lee, T. W. Ling, and W. L. Low. Intelliclean: a knowledge-based intelligent data cleaner. In SIGKDD, pages 290-294, 2000.
-
(2000)
SIGKDD
, pp. 290-294
-
-
Lee, M.L.1
Ling, T.W.2
Low, W.L.3
-
26
-
-
33646398530
-
Toward conditional models of identity uncertainty with application to proper noun coreference
-
A. McCallum and B. Wellner. Toward conditional models of identity uncertainty with application to proper noun coreference. In IIWEB, 2003.
-
(2003)
IIWEB
-
-
McCallum, A.1
Wellner, B.2
-
27
-
-
0034592784
-
Efficient clustering of high-dimensional data sets with application to reference matching
-
A. K. McCallum, K. Nigam, and L. H. Ungar. Efficient Clustering of High-Dimensional Data Sets with Application to Reference Matching. In SIGKDD, 2000.
-
(2000)
SIGKDD
-
-
McCallum, A.K.1
Nigam, K.2
Ungar, L.H.3
-
28
-
-
33645967226
-
Exploiting secondary sources for unsupervised record linkage
-
M. Michalowski, S. Thakkar, and C. A. Knoblock. Exploiting secondary sources for unsupervised record linkage. In IIWeb, 2004.
-
(2004)
IIWeb
-
-
Michalowski, M.1
Thakkar, S.2
Knoblock, C.A.3
-
29
-
-
0001592068
-
Automatic linkage of vital records
-
1959
-
H. Newcombe, J. Kennedy, S. Axford, and A. James. Automatic linkage of vital records. In Science 130 (1959), no. 3381, pages 954-959, 1959.
-
(1959)
Science
, vol.130
, Issue.3381
, pp. 954-959
-
-
Newcombe, H.1
Kennedy, J.2
Axford, S.3
James, A.4
-
30
-
-
24944535349
-
Multi-relational record linkage
-
Parag and P. Domingos. Multi-relational record linkage. In MRDM, 2004.
-
(2004)
MRDM
-
-
Parag1
Domingos, P.2
-
31
-
-
85156206690
-
Identity uncertainty and citation matching
-
H. Pasula, B. Marthi, B. Milch, S. Russell, and I. Shpitser. Identity uncertainty and citation matching. In NIPS, 2002.
-
(2002)
NIPS
-
-
Pasula, H.1
Marthi, B.2
Milch, B.3
Russell, S.4
Shpitser, I.5
-
32
-
-
85166310944
-
Methods for linking and mining massive heterogeneous databases
-
J. C. Pinheiro and D. X. Sun. Methods for linking and mining massive heterogeneous databases. In SIGKDD, 1998.
-
(1998)
SIGKDD
-
-
Pinheiro, J.C.1
Sun, D.X.2
-
33
-
-
2942740136
-
Haystack: A platform for authoring end user semantic web applications
-
D. Quan, D. Huynh, and D. R. Karger. Haystack: A platform for authoring end user semantic web applications. In ISWC, 2003.
-
(2003)
ISWC
-
-
Quan, D.1
Huynh, D.2
Karger, D.R.3
-
34
-
-
0242456811
-
Interactive deduplication using active learning
-
S. Sarawagi and A. Bhamidipaty. Interactive deduplication using active learning. In SIGKDD, 2002.
-
(2002)
SIGKDD
-
-
Sarawagi, S.1
Bhamidipaty, A.2
-
35
-
-
0242456803
-
Learning domain-independent string transformation weights for high accuracy object identification
-
S. Tejada, C. Knoblock, and S. Minton. Learning domain-independent string transformation weights for high accuracy object identification. In SIGKDD, 2002.
-
(2002)
SIGKDD
-
-
Tejada, S.1
Knoblock, C.2
Minton, S.3
-
36
-
-
0002940254
-
Using the em algorithm for weight computation in the fellegi-sunter model of record linkage
-
W. E. Winkler. Using the em algorithm for weight computation in the fellegi-sunter model of record linkage. In Section on Survey Research Methods, 1988.
-
(1988)
Section on Survey Research Methods
-
-
Winkler, W.E.1
-
37
-
-
0012866045
-
The state of record linkage and current research problems
-
U.S. Bureau of the Census, Wachington, DC
-
W. E. Winkler. The state of record linkage and current research problems. Technical report, U.S. Bureau of the Census, Wachington, DC, 1999.
-
(1999)
Technical Report
-
-
Winkler, W.E.1
|