-
1
-
-
33845667955
-
Duplicate record detection: A survey
-
A. K. Elmagarmid, p. G. Ipeirotis, and V. S. Verykios, "Duplicate record detection: A survey," Knowledge and Data Engineering, IEEE Transactions on, vol. 19, no. 1, pp. 1-16, 2007.
-
(2007)
Knowledge and Data Engineering, IEEE Transactions on
, vol.19
, Issue.1
, pp. 1-16
-
-
Elmagarmid, A.K.1
Ipeirotis, P.G.2
Verykios, V.S.3
-
2
-
-
84920595044
-
A survey of indexing techniques for scalable record linkage and deduplication
-
p. Christen, "A survey of indexing techniques for scalable record linkage and deduplication," Knowledge and Data Engineering, IEEE Transactions on, vol. 24, no. 9, pp. 1537-1555, 2012.
-
(2012)
Knowledge and Data Engineering, IEEE Transactions on
, vol.24
, Issue.9
, pp. 1537-1555
-
-
Christen, P.1
-
3
-
-
33750728911
-
Learning blocking schemes for record linkage
-
Menlo Park, CA; Cambridge, MA; London; AAAI Press; MIT Press; 2006
-
M. Michelson and C. A. Knoblock, "Learning blocking schemes for record linkage," in Proceedings of the National Conference on Artificial Intelligence, vol. 21, no. 1. Menlo Park, CA; Cambridge, MA; London; AAAI Press; MIT Press; 1999, 2006, p. 440.
-
(1999)
Proceedings of the National Conference on Artificial Intelligence
, vol.21
, Issue.1
, pp. 440
-
-
Michelson, M.1
Knoblock, C.A.2
-
4
-
-
84878049861
-
Adaptive blocking: Learning to scale up record linkage
-
M. Bilenko, B. Kamath, and R. J. Mooney, "Adaptive blocking: Learning to scale up record linkage," in Data Mining, 2006. ICDM'06. Sixth International Conference on. IEEE, 2006, pp. 87-96.
-
(2006)
Data Mining, 2006. ICDM'06. Sixth International Conference On. IEEE
, pp. 87-96
-
-
Bilenko, M.1
Kamath, B.2
Mooney, R.J.3
-
5
-
-
84976856849
-
The merge/purge problem for large databases
-
ACM
-
M. A. Hernández and S. J. Stolfo, "The merge/purge problem for large databases," in ACM SIGMOD Record, vol. 24, no. 2. ACM, 1995, pp. 127-138.
-
(1995)
ACM SIGMOD Record
, vol.24
, Issue.2
, pp. 127-138
-
-
Hernández, M.A.1
Stolfo, S.J.2
-
6
-
-
0036203458
-
Tailor: A record linkage toolbox
-
M. G. Elfeky, V. S. Verykios, and A. K. Elmagarmid, "Tailor: A record linkage toolbox," in Data Engineering, 2002. Proceedings. 18th International Conference on. IEEE, 2002, pp. 17-28.
-
(2002)
Data Engineering, 2002. Proceedings. 18th International Conference On. IEEE
, pp. 17-28
-
-
Elfeky, M.G.1
Verykios, V.S.2
Elmagarmid, A.K.3
-
7
-
-
0003159066
-
On the red-blue set cover problem
-
Society for Industrial and Applied Mathematics
-
R. D. Carr, S. Doddi, G. Konjevod, and M. Marathe, "On the red-blue set cover problem," in Proceedings of the eleventh annual ACM-SIAM symposium on Discrete algorithms. Society for Industrial and Applied Mathematics, 2000, pp. 345-353.
-
(2000)
Proceedings of the Eleventh Annual ACM-SIAM Symposium on Discrete Algorithms.
, pp. 345-353
-
-
Carr, R.D.1
Doddi, S.2
Konjevod, G.3
Marathe, M.4
-
8
-
-
84956869850
-
Approximation algorithms for the label-cover max and redblue set cover problems
-
Springer
-
D. Peleg, "Approximation algorithms for the label-cover max and redblue set cover problems," in Algorithm Theory-SWAT 2000. Springer, 2000, pp. 220-231.
-
(2000)
Algorithm Theory-SWAT 2000.
, pp. 220-231
-
-
Peleg, D.1
-
9
-
-
0000301097
-
A greedy heuristic for the set-covering problem
-
V. Chvatal, "A greedy heuristic for the set-covering problem," Mathematics of operations research, vol. 4, no. 3, pp. 233-235, 1979.
-
(1979)
Mathematics of Operations Research
, vol.4
, Issue.3
, pp. 233-235
-
-
Chvatal, V.1
-
11
-
-
0003922190
-
-
2nd ed. Wiley Chichester, ch. 3
-
p. E. Hart, R. O. Duda, and D. G. Stork, Pattern classification, 2nd ed. Wiley Chichester, 2001, ch. 3, pp. 117-121.
-
(2001)
Pattern Classification
, pp. 117-121
-
-
Hart, P.E.1
Duda, R.O.2
Stork, D.G.3
-
14
-
-
9444249661
-
On evaluation and training-set construction for duplicate detection
-
M. Bilenko and R. J. Mooney, "On evaluation and training-set construction for duplicate detection," in Proceedings of the KDD-2003 Workshop on Data Cleaning, Record Linkage, and Object Consolidation, 2003, pp. 7-12.
-
(2003)
Proceedings of the KDD-2003 Workshop on Data Cleaning, Record Linkage, and Object Consolidation
, pp. 7-12
-
-
Bilenko, M.1
Mooney, R.J.2
-
15
-
-
84880832861
-
Constructing diverse classifier ensembles using artificial training examples
-
Citeseer
-
p. Melville and R. J. Mooney, "Constructing diverse classifier ensembles using artificial training examples," in International Joint Conference on Artificial Intelligence, vol. 18. Citeseer, 2003, pp. 505-512.
-
(2003)
International Joint Conference on Artificial Intelligence
, vol.18
, pp. 505-512
-
-
Melville, P.1
Mooney, R.J.2
-
16
-
-
37549003336
-
Mapreduce: Simplified data processing on large clusters
-
J. Dean and S. Ghemawat, "Mapreduce: simplified data processing on large clusters," Communications of the ACM, vol. 51, no. 1, pp. 107-113, 2008.
-
(2008)
Communications of the ACM
, vol.51
, Issue.1
, pp. 107-113
-
-
Dean, J.1
Ghemawat, S.2
|