[9] Kolb, L. and Rahm, E. (2013): Parallel Entity Resolution with Dedoop. Datenbank-Spektrum 13(1): 1-10.
[11] Kolb, L., Thor, A., and Rahm, E. (2012a): Dedoop: efficient deduplication with Hadoop. Proc. VLDB Endowment 5(12): 1878-1881.
[13] Kolb, L., Thor, A., and Rahm, E. (2011a): Multi-pass sorted neighborhood blocking with MapReduce. Computer Science - Research and Development 27(1): 45-63.
[14] Kolb, L., Thor, A., and Rahm, E. (2011b): Parallel sorted neighborhood blocking with MapReduce. Proc. Database Systems for Business, Technology, and Web: 45-64.
[16] Lu, W., Shen, Y., Chen, S., and Ooi, B. C. (2012): Efficient processing of k nearest neighbor joins using MapReduce. Proc. VLDB Endowment 5(10): 1016-1027.
[17] Metwally, A. and Faloutsos, C. (2012): V-SMART-Join: a scalable MapReduce framework for all-pair similarity joins of multisets and vectors. Proc. VLDB Endowment 5(8): 704-715.
[18] Pasula, H., Marthi, B., Milch, B., Russell, S., and Shpitser, I. (2002): Identity uncertainty and citation matching. Proc. Neural Information Processing Systems: 1401-1408.
[20] Zhang, Q., Zhang, Y., Yu, H., and Huang, X. (2010): Efficient partial-duplicate detection based on sequence matching. Proc. ACM SIGIR Conference on Research and Development in Information Retrieval: 675-682.
[21] CiteSeerX dataset URL. http://asterix.ics.uci.edu/data/csx.raw.txt.gz (Accessed: 07/01/2012).
[22] Hadoop URL. http://hadoop.apache.org/ (Accessed: 08/24/2012).