-
1
-
-
34250670467
-
Record linkage: Similarity measures and algorithms
-
N. Koudas, S. Sarawagi, and D. Srivastava, "Record linkage: Similarity measures and algorithms," in Proc. ACM SIGMOD Int. Conf. Manage. Data, 2006, pp. 802-803.
-
Proc. ACM SIGMOD Int. Conf. Manage. Data, 2006
, pp. 802-803
-
-
Koudas, N.1
Sarawagi, S.2
Srivastava, D.3
-
3
-
-
0032091575
-
Integration of heterogeneous databases without common domains using queries based on textual similarity
-
W. W. Cohen, "Integration of heterogeneous databases without common domains using queries based on textual similarity," ACM SIGMOD Rec., vol. 27, no. 2, pp. 201-212, 1998.
-
(1998)
ACM SIGMOD Rec.
, vol.27
, Issue.2
, pp. 201-212
-
-
Cohen, W.W.1
-
4
-
-
84880467474
-
Text joins in an RDBMS for web data integration
-
L. Gravano, P. G. Ipeirotis, N. Koudas, and D. Srivastava, "Text joins in an RDBMS for web data integration," in Proc. 12th Int. Conf. World Wide Web, 2003, pp. 90-101.
-
Proc. 12th Int. Conf. World Wide Web, 2003
, pp. 90-101
-
-
Gravano, L.1
Ipeirotis, P.G.2
Koudas, N.3
Srivastava, D.4
-
5
-
-
84950419860
-
Advances in record-linkage methodology as applied to matching the 1985 census of Tampa, Florida
-
M. A. Jaro, "Advances in record-linkage methodology as applied to matching the 1985 census of Tampa, Florida," J. Amer. Statist. Assoc., vol. 84, no. 406, pp. 414-420, 1989.
-
(1989)
J. Amer. Statist. Assoc.
, vol.84
, Issue.406
, pp. 414-420
-
-
Jaro, M.A.1
-
6
-
-
52649137537
-
Transformation-based framework for record matching
-
A. Arasu, S. Chaudhuri, and R. Kaushik, "Transformation-based framework for record matching," in Proc. 24th Int. Conf. Data Eng., 2008, pp. 40-49.
-
Proc. 24th Int. Conf. Data Eng., 2008
, pp. 40-49
-
-
Arasu, A.1
Chaudhuri, S.2
Kaushik, R.3
-
7
-
-
85011029434
-
Exampledriven design of efficient record matching queries
-
S. Chaudhuri, B. C. Chen, V. Ganti, and R. Kaushik, "Exampledriven design of efficient record matching queries," in Proc. 33rd Int. Conf. Very Large Databases, 2007, pp. 327-338.
-
Proc. 33rd Int. Conf. Very Large Databases, 2007
, pp. 327-338
-
-
Chaudhuri, S.1
Chen, B.C.2
Ganti, V.3
Kaushik, R.4
-
8
-
-
77954696920
-
Learning string transformations from examples
-
A. Arasu, S. Chaudhuri, and R. Kaushik, "Learning string transformations from examples," Proc. VLDB Endowment, vol. 2, no. 1, pp. 514-525, 2009.
-
(2009)
Proc. VLDB Endowment
, vol.2
, Issue.1
, pp. 514-525
-
-
Arasu, A.1
Chaudhuri, S.2
Kaushik, R.3
-
10
-
-
0035545848
-
Learning object identification rules for information integration
-
S. Tejada, C. Knoblock, and S. Minton, "Learning object identification rules for information integration," Inf. Syst., vol. 26, no. 8, pp. 607-633, 2001.
-
(2001)
Inf. Syst.
, vol.26
, Issue.8
, pp. 607-633
-
-
Tejada, S.1
Knoblock, C.2
Minton, S.3
-
11
-
-
79951738363
-
On graph-based name disambiguation
-
X. Fan, J. Wang, X. Pu, L. Zhou, and B. Lv, "On graph-based name disambiguation," J. Data Inf. Quality, vol. 2, no. 2, p. 10, 2011.
-
(2011)
J. Data Inf. Quality
, vol.2
, Issue.2
, pp. 10
-
-
Fan, X.1
Wang, J.2
Pu, X.3
Zhou, L.4
Lv, B.5
-
12
-
-
67649669734
-
A latent topic model for complete entity resolution
-
L. Shu, B. Long, and W. Meng, "A latent topic model for complete entity resolution," in Proc. 25th Int. Conf. Data Eng., 2009, pp. 880-891.
-
Proc. 25th Int. Conf. Data Eng., 2009
, pp. 880-891
-
-
Shu, L.1
Long, B.2
Meng, W.3
-
13
-
-
83055169894
-
Large-scale collective entity matching
-
R. Vibhor, N. D. Nilesh, and N. G. Minos, "Large-scale collective entity matching," Proc. VLDB Endowment, vol. 4, no. 4, pp. 208-218, 2011.
-
(2011)
Proc. VLDB Endowment
, vol.4
, Issue.4
, pp. 208-218
-
-
Vibhor, R.1
Nilesh, N.D.2
Minos, N.G.3
-
14
-
-
83055176299
-
Entity resolution with evolving rules
-
S. E. Whang and H. Garcia-Molina, "Entity resolution with evolving rules," Proc. VLDB Endowment, vol. 3, no. 1, pp. 1326-1337, 2010.
-
(2010)
Proc. VLDB Endowment
, vol.3
, Issue.1
, pp. 1326-1337
-
-
Whang, S.E.1
Garcia-Molina, H.2
-
15
-
-
3142665421
-
Correlation clustering
-
N. Bansal, A. Blum, and S. Chawla, "Correlation clustering," Mach. Learn., vol. 56, no. 1-3, pp. 89-113, 2004.
-
(2004)
Mach. Learn.
, vol.56
, Issue.1-3
, pp. 89-113
-
-
Bansal, N.1
Blum, A.2
Chawla, S.3
-
16
-
-
1142279457
-
Robust and efficient fuzzy match for online data cleaning
-
S. Chaudhuri, K. Ganjam, V. Ganti, and R. Motwani, "Robust and efficient fuzzy match for online data cleaning," in Proc. ACM SIGMOD Int. Conf. Manage. Data, 2003, pp. 313-324.
-
Proc. ACM SIGMOD Int. Conf. Manage. Data, 2003
, pp. 313-324
-
-
Chaudhuri, S.1
Ganjam, K.2
Ganti, V.3
Motwani, R.4
-
17
-
-
26444550791
-
Robust identification of fuzzy duplicates
-
S. Chaudhuri, V. Ganti, and R. Motwani, "Robust identification of fuzzy duplicates," in Proc. 21st Int. Conf. Data Eng., 2005, pp. 865-876.
-
Proc. 21st Int. Conf. Data Eng., 2005
, pp. 865-876
-
-
Chaudhuri, S.1
Ganti, V.2
Motwani, R.3
-
19
-
-
0038208065
-
A Bayesian decision model for cost optimal record matching
-
V. S. Verykios, G. V. Moustakides, and M. G. Elfeky, "A Bayesian decision model for cost optimal record matching," VLDB J., vol. 12, no. 1, pp. 28-40, 2003.
-
(2003)
VLDB J.
, vol.12
, Issue.1
, pp. 28-40
-
-
Verykios, V.S.1
Moustakides, G.V.2
Elfeky, M.G.3
-
20
-
-
70849098813
-
Entity resolution with iterative blocking
-
S. E. Whang, D. Menestrina, G. Koutrika, M. Theobald, and H. Garcia-Molina, "Entity resolution with iterative blocking," in Proc. ACM SIGMOD Int. Conf. Manage. Data, 2009, pp. 219-232.
-
Proc. ACM SIGMOD Int. Conf. Manage. Data, 2009
, pp. 219-232
-
-
Whang, S.E.1
Menestrina, D.2
Koutrika, G.3
Theobald, M.4
Garcia-Molina, H.5
-
21
-
-
58149472338
-
Swoosh: A generic approach to entity resolution
-
O. Benjelloun, H. Garcia-Molina, D. Menestrina, Q. Su, S. E. Whang, and J. Widom, "Swoosh: A generic approach to entity resolution," VLDB J., vol. 18, no. 1, pp. 255-276, 2009.
-
(2009)
VLDB J.
, vol.18
, Issue.1
, pp. 255-276
-
-
Benjelloun, O.1
Garcia-Molina, H.2
Menestrina, D.3
Su, Q.4
Whang, S.E.5
Widom, J.6
-
22
-
-
84878049861
-
Adaptive blocking: Learning to scale up record linkage
-
M. Bilenko, B. Kamath, and R. J. Mooney, "Adaptive blocking: Learning to scale up record linkage," in Proc. IEEE Int. Conf. Data Mining, 2006, pp. 87-96.
-
Proc. IEEE Int. Conf. Data Mining, 2006
, pp. 87-96
-
-
Bilenko, M.1
Kamath, B.2
Mooney, R.J.3
-
23
-
-
84920595044
-
A survey of indexing techniques for scalable record linkage and deduplication
-
Sep.
-
P. Christen, "A survey of indexing techniques for scalable record linkage and deduplication," IEEE Trans. Knowl. Data Eng., vol. 24, no. 9, pp. 1537-1555, Sep. 2012.
-
(2012)
IEEE Trans. Knowl. Data Eng.
, vol.24
, Issue.9
, pp. 1537-1555
-
-
Christen, P.1
-
24
-
-
57349141410
-
Efficient similarity joins for near duplicate detection
-
C. Xiao, W. Wang, X. Lin, and J. X. Yu, "Efficient similarity joins for near duplicate detection," in Proc. 17th Int. Conf. World Wide Web, 2008, pp. 131-140.
-
Proc. 17th Int. Conf. World Wide Web, 2008
, pp. 131-140
-
-
Xiao, C.1
Wang, W.2
Lin, X.3
Yu, J.X.4
-
25
-
-
67649653766
-
Top-k set similarity joins
-
C. Xiao, W. Wang, X. Lin, and H. Shang, "Top-k set similarity joins," in Proc. IEEE Int. Conf. Data Eng., 2009, pp. 916-927.
-
Proc. IEEE Int. Conf. Data Eng., 2009
, pp. 916-927
-
-
Xiao, C.1
Wang, W.2
Lin, X.3
Shang, H.4
-
26
-
-
34548791703
-
Object distinction: Distinguishing objects with identical names
-
X. Yin, J. Han, and P. S. Yu, "Object distinction: Distinguishing objects with identical names," in Proc. IEEE 23rd Int. Conf. Data Eng., 2007, pp. 1242-1246.
-
Proc. IEEE 23rd Int. Conf. Data Eng., 2007
, pp. 1242-1246
-
-
Yin, X.1
Han, J.2
Yu, P.S.3
-
27
-
-
33845667955
-
Duplicate record detection: A survey
-
Jan.
-
A. K. Elmagarmid, G. I. Panagiotis, and S. V. Vassilios, "Duplicate record detection: A survey," IEEE Trans. Knowl. Data Eng., vol. 19, no. 1, pp. 1-16, Jan. 2007.
-
(2007)
IEEE Trans. Knowl. Data Eng.
, vol.19
, Issue.1
, pp. 1-16
-
-
Elmagarmid, A.K.1
Panagiotis, G.I.2
Vassilios, S.V.3
-
28
-
-
0003422462
-
-
New York, NY, USA: Springer
-
V. V. Vazirani, Approximation Algorithms, New York, NY, USA: Springer, 2001, pp. 1-378.
-
(2001)
Approximation Algorithms
, pp. 1-378
-
-
Vazirani, V.V.1
-
29
-
-
83055161604
-
Context-based entity description rule for entity resolution
-
L. Li, J. Li, H. Wang, and H. Gao, "Context-based entity description rule for entity resolution," in Proc. 20th ACM Int. Conf. Inf. knowl. Manag., 2011, pp. 1725-1730.
-
Proc. 20th ACM Int. Conf. Inf. Knowl. Manag., 2011
, pp. 1725-1730
-
-
Li, L.1
Li, J.2
Wang, H.3
Gao, H.4
-
30
-
-
80455148340
-
Evaluation of entity resolution approaches on real-world match problems
-
H. Kopcke, A. Thor, and E. Rahm, "Evaluation of entity resolution approaches on real-world match problems," Proc. VLDB Endowment, vol. 3, no. 1, pp. 484-493, 2010.
-
(2010)
Proc. VLDB Endowment
, vol.3
, Issue.1
, pp. 484-493
-
-
Kopcke, H.1
Thor, A.2
Rahm, E.3
-
31
-
-
84916932541
-
Collective entity resolution in relational data
-
I. Bhattacharya and L. Getoor, "Collective entity resolution in relational data," Proc. VLDB Endowment, vol. 3, no. 1, p. 5, 2010.
-
(2010)
Proc. VLDB Endowment
, vol.3
, Issue.1
, pp. 5
-
-
Bhattacharya, I.1
Getoor, L.2
-
32
-
-
33745266392
-
Domain-independent data cleaning via analysis of entity-relationship graph
-
H. Kopcke, A. Thor, and E. Rahm, "Domain-independent data cleaning via analysis of entity-relationship graph," ACM Trans. Database Syst., vol. 31, no. 2, pp. 716-767, 2006.
-
(2006)
ACM Trans. Database Syst.
, vol.31
, Issue.2
, pp. 716-767
-
-
Kopcke, H.1
Thor, A.2
Rahm, E.3
-
33
-
-
84866951187
-
Scalable iterative graph duplicate detection
-
Nov.
-
M. Herschel, F. Naumann, S. Szott, and M. Taubert, "Scalable iterative graph duplicate detection," IEEE Trans. Knowl. Data Eng., vol. 24, no. 11, pp. 2094-2108, Nov. 2011.
-
(2011)
IEEE Trans. Knowl. Data Eng.
, vol.24
, Issue.11
, pp. 2094-2108
-
-
Herschel, M.1
Naumann, F.2
Szott, S.3
Taubert, M.4
-
34
-
-
0004116989
-
-
Cambridge, MA, USA: MIT press
-
C. E. Leiserson, R. L. Rivest, and C. Stein, Introduction to Algorithms. Cambridge, MA, USA: MIT press, 2001.
-
(2001)
Introduction to Algorithms
-
-
Leiserson, C.E.1
Rivest, R.L.2
Stein, C.3
-
35
-
-
0027189241
-
Entity identification in database integration
-
E.-P. Lim, J. Srivastava, S. Prabhakar, and J. Richardson, "Entity identification in database integration," in Proc. 9th Int. Conf. Data Eng., 1993, pp. 294-301.
-
Proc. 9th Int. Conf. Data Eng., 1993
, pp. 294-301
-
-
Lim, E.-P.1
Srivastava, J.2
Prabhakar, S.3
Richardson, J.4
-
36
-
-
84865086832
-
Reasoning about record matching rules
-
F. Wenfei, J. Xibei, L. Jianzhong, and M. Shuai, "Reasoning about record matching rules," Proc. VLDB Endowment, vol. 2, no. 1, pp. 407-418, 2009.
-
(2009)
Proc. VLDB Endowment
, vol.2
, Issue.1
, pp. 407-418
-
-
Wenfei, F.1
Xibei, J.2
Jianzhong, L.3
Shuai, M.4
-
37
-
-
85013841922
-
-
San Mateo, CA, USA: Morgan Kaufmann
-
D. Loshin, Master Data Management. San Mateo, CA, USA: Morgan Kaufmann, 2009.
-
(2009)
Master Data Management
-
-
Loshin, D.1
-
38
-
-
70849095483
-
A grammar-based entity representation framework for data cleaning
-
A. Arasu and R. Kaushik, "A grammar-based entity representation framework for data cleaning," in Proc. ACM SIGMOD, Int. Conf. Manag. data, 2009, pp. 233-244.
-
Proc. ACM SIGMOD, Int. Conf. Manag. Data, 2009
, pp. 233-244
-
-
Arasu, A.1
Kaushik, R.2
-
39
-
-
0003928313
-
-
Reading, MA, USA: Addison-Wesley
-
S. Abiteboul, H. Richard, and V. Victor, Foundations of Databases, vol. 8, Reading, MA, USA: Addison-Wesley, 1995.
-
(1995)
Foundations of Databases
, vol.8
-
-
Abiteboul, S.1
Richard, H.2
Victor, V.3
-
40
-
-
85146685284
-
The microsoft academic search dataset and kdd cup 2013
-
S. B. Roy, M. D. Cock, V. Mandava, S. Savanna, B. Dalessandro, C. Perlich, W. Cukierski, and B. Hamner, "The microsoft academic search dataset and kdd cup 2013," in Proc. KDD Cup 2013 Workshop, 2013, p. 1.
-
Proc. KDD Cup 2013 Workshop, 2013
, pp. 1
-
-
Roy, S.B.1
Cock, M.D.2
Mandava, V.3
Savanna, S.4
Dalessandro, B.5
Perlich, C.6
Cukierski, W.7
Hamner, B.8
-
41
-
-
85146715632
-
Combination of feature engineering and ranking models for paper-author identification in KDD Cup 2013
-
C. Li, Y. Su, T. Lin, C. Tsai, W. Chang, K. Huang, T. Kuo, S. Lin, Y. Lin, Y. Lu, C. Yang, C. Chang, W. Chin, Y. Juan, H. Tung, J. Wang, C. Wei, F. Wu, T. Yin, T. Yu, Y. Zhuang, S. Lin, H. Lin, and C. Lin, "Combination of feature engineering and ranking models for paper-author identification in KDD Cup 2013," in Proc. KDD Cup 2013 Workshop, 2013, p. 2.
-
Proc. KDD Cup 2013 Workshop, 2013
, pp. 2
-
-
Li, C.1
Su, Y.2
Lin, T.3
Tsai, C.4
Chang, W.5
Huang, K.6
Kuo, T.7
Lin, S.8
Lin, Y.9
Lu, Y.10
Yang, C.11
Chang, C.12
Chin, W.13
Juan, Y.14
Tung, H.15
Wang, J.16
Wei, C.17
Wu, F.18
Yin, T.19
Yu, T.20
Zhuang, Y.21
Lin, S.22
Lin, H.23
Lin, C.24
more..
-
42
-
-
0001172265
-
Learning logical definitions from relations
-
J. R. Quinlan, "Learning logical definitions from relations," Mach. Learn., vol. 5, no. 3, pp. 239-266, 1990.
-
(1990)
Mach. Learn.
, vol.5
, Issue.3
, pp. 239-266
-
-
Quinlan, J.R.1
|