-
1
-
-
84957555217
-
-
http://www.cs.umass.edu/~mccallum/data/cora-refs.tar.gz.
-
-
-
-
2
-
-
84957606985
-
-
http://www.cs.utexas.edu/users/ml/riddle/data/restaurant.tar.gz.
-
-
-
-
3
-
-
84957605500
-
-
http://dbs.uni-leipzig.de/Abt-Buy.zip.
-
-
-
-
4
-
-
84957564023
-
-
https://sourceforge.net/p/acd2015/.
-
-
-
-
5
-
-
56349095491
-
Aggregating inconsistent information: Ranking and clustering
-
N. Ailon, M. Charikar, and A. Newman. Aggregating inconsistent information: ranking and clustering. Journal of the ACM (JACM), 55(5):23, 2008.
-
(2008)
Journal of the ACM (JACM)
, vol.55
, Issue.5
, pp. 23
-
-
Ailon, N.1
Charikar, M.2
Newman, A.3
-
6
-
-
84880528401
-
Crowd mining
-
ACM
-
Y. Amsterdamer, Y. Grossman, T. Milo, and P. Senellart. Crowd mining. In Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, pages 241-252. ACM, 2013.
-
(2013)
Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
, pp. 241-252
-
-
Amsterdamer, Y.1
Grossman, Y.2
Milo, T.3
Senellart, P.4
-
8
-
-
67649649597
-
Large-scale deduplication with constraints using dedupalog
-
IEEE
-
A. Arasu, C. Ré, and D. Suciu. Large-scale deduplication with constraints using dedupalog. In Data Engineering, 2009. ICDE'09. IEEE 25th International Conference on, pages 952-963. IEEE, 2009.
-
(2009)
Data Engineering, 2009. ICDE'09. IEEE 25th International Conference On
, pp. 952-963
-
-
Arasu, A.1
Ré, C.2
Suciu, D.3
-
9
-
-
0036949730
-
Correlation clustering
-
N. Bansal, A. Blum, and S. Chawla. Correlation clustering. In FOCS, pages 238-, 2002.
-
(2002)
FOCS
, pp. 238
-
-
Bansal, N.1
Blum, A.2
Chawla, S.3
-
10
-
-
0344981444
-
Clustering with qualitative information
-
IEEE
-
M. Charikar, V. Guruswami, and A. Wirth. Clustering with qualitative information. In Foundations of Computer Science, 2003. Proceedings. 44th Annual IEEE Symposium on, pages 524-533. IEEE, 2003.
-
(2003)
Foundations of Computer Science, 2003. Proceedings. 44th Annual IEEE Symposium On
, pp. 524-533
-
-
Charikar, M.1
Guruswami, V.2
Wirth, A.3
-
12
-
-
0032091575
-
Integration of heterogeneous databases without common domains using queries based on textual similarity
-
ACM
-
W. W. Cohen. Integration of heterogeneous databases without common domains using queries based on textual similarity. In ACM SIGMOD Record, volume 27, pages 201-212. ACM, 1998.
-
(1998)
ACM SIGMOD Record
, vol.27
, pp. 201-212
-
-
Cohen, W.W.1
-
15
-
-
84875617425
-
Using the crowd for top-k and group-by queries
-
ACM
-
S. B. Davidson, S. Khanna, T. Milo, and S. Roy. Using the crowd for top-k and group-by queries. In Proceedings of the 16th International Conference on Database Theory, pages 225-236. ACM, 2013.
-
(2013)
Proceedings of the 16th International Conference on Database Theory
, pp. 225-236
-
-
Davidson, S.B.1
Khanna, S.2
Milo, T.3
Roy, S.4
-
16
-
-
84860873929
-
Zencrowd: Leveraging probabilistic reasoning and crowdsourcing techniques for large-scale entity linking
-
G. Demartini, D. E. Difallah, and P. Cudré-Mauroux. Zencrowd: leveraging probabilistic reasoning and crowdsourcing techniques for large-scale entity linking. In WWW, pages 469-478, 2012.
-
(2012)
WWW
, pp. 469-478
-
-
Demartini, G.1
Difallah, D.E.2
Cudré-Mauroux, P.3
-
17
-
-
33845667955
-
Duplicate record detection: A survey
-
A. K. Elmagarmid, P. G. Ipeirotis, and V. S. Verykios. Duplicate record detection: A survey. Knowledge and Data Engineering, IEEE Transactions on, 19(1):1-16, 2007.
-
(2007)
Knowledge and Data Engineering, IEEE Transactions On
, vol.19
, Issue.1
, pp. 1-16
-
-
Elmagarmid, A.K.1
Ipeirotis, P.G.2
Verykios, V.S.3
-
18
-
-
84901808813
-
-
Technical report, Technical report, National University of Singapore
-
J. Fan, M. Lu, B. C. Ooi, W.-C. Tan, and M. Zhang. A hybrid machine-crowdsourcing system for matching web tables. Technical report, Technical report, National University of Singapore, 2013.
-
(2013)
A Hybrid Machine-crowdsourcing System for Matching Web Tables
-
-
Fan, J.1
Lu, M.2
Ooi, B.C.3
Tan, W.-C.4
Zhang, M.5
-
19
-
-
84860868443
-
Crowddb: Query processing with the vldb crowd
-
A. Feng, M. Franklin, D. Kossmann, T. Kraska, S. Madden, S. Ramesh, A. Wang, and R. Xin. Crowddb: Query processing with the vldb crowd. Proceedings of the VLDB Endowment, 4(12), 2011.
-
(2011)
Proceedings of the VLDB Endowment
, vol.4
, Issue.12
-
-
Feng, A.1
Franklin, M.2
Kossmann, D.3
Kraska, T.4
Madden, S.5
Ramesh, S.6
Wang, A.7
Xin, R.8
-
20
-
-
79959958767
-
Crowddb: Answering queries with crowdsourcing
-
ACM
-
M. J. Franklin, D. Kossmann, T. Kraska, S. Ramesh, and R. Xin. Crowddb: answering queries with crowdsourcing. In Proceedings of the 2011 ACM SIGMOD International Conference on Management of data, pages 61-72. ACM, 2011.
-
(2011)
Proceedings of the 2011 ACM SIGMOD International Conference on Management of Data
, pp. 61-72
-
-
Franklin, M.J.1
Kossmann, D.2
Kraska, T.3
Ramesh, S.4
Xin, R.5
-
22
-
-
34248168069
-
Clustering aggregation
-
A. Gionis, H. Mannila, and P. Tsaparas. Clustering aggregation. ACM Transactions on Knowledge Discovery from Data (TKDD), 1(1):4, 2007.
-
(2007)
ACM Transactions on Knowledge Discovery from Data (TKDD)
, vol.1
, Issue.1
, pp. 4
-
-
Gionis, A.1
Mannila, H.2
Tsaparas, P.3
-
23
-
-
58349089453
-
Consensus clustering algorithms: Comparison and refinement
-
San Francisco, California, USA, January 19, 2008
-
A. Goder and V. Filkov. Consensus clustering algorithms: Comparison and refinement. In Proceedings of the Tenth Workshop on Algorithm Engineering and Experiments, ALENEX 2008, San Francisco, California, USA, January 19, 2008, 2008.
-
(2008)
Proceedings of the Tenth Workshop on Algorithm Engineering and Experiments, ALENEX 2008
-
-
Goder, A.1
Filkov, V.2
-
24
-
-
84904317392
-
Corleone: Hands-off crowdsourcing for entity matching
-
C. Gokhale, S. Das, A. Doan, J. F. Naughton, N. Rampalli, J. W. Shavlik, and X. Zhu. Corleone: hands-off crowdsourcing for entity matching. In Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, 2014.
-
(2014)
Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data
-
-
Gokhale, C.1
Das, S.2
Doan, A.3
Naughton, J.F.4
Rampalli, N.5
Shavlik, J.W.6
Zhu, X.7
-
25
-
-
85162363474
-
Crowdclustering
-
R. Gomes, P. Welinder, A. Krause, and P. Perona. Crowdclustering. In NIPS, pages 558-566, 2011.
-
(2011)
NIPS
, pp. 558-566
-
-
Gomes, R.1
Welinder, P.2
Krause, A.3
Perona, P.4
-
27
-
-
72649086387
-
Framework for evaluating clustering algorithms in duplicate detection
-
O. Hassanzadeh, F. Chiang, H. C. Lee, and R. J. Miller. Framework for evaluating clustering algorithms in duplicate detection. Proceedings of the VLDB Endowment, 2(1):1282-1293, 2009.
-
(2009)
Proceedings of the VLDB Endowment
, vol.2
, Issue.1
, pp. 1282-1293
-
-
Hassanzadeh, O.1
Chiang, F.2
Lee, H.C.3
Miller, R.J.4
-
28
-
-
84976856849
-
The merge/purge problem for large databases
-
ACM
-
M. A. Hernández and S. J. Stolfo. The merge/purge problem for large databases. In ACM SIGMOD Record, volume 24, pages 127-138. ACM, 1995.
-
(1995)
ACM SIGMOD Record
, vol.24
, pp. 127-138
-
-
Hernández, M.A.1
Stolfo, S.J.2
-
31
-
-
85084014515
-
Arnold: Declarative crowd-machine data integration
-
S. R. Jeffery, L. Sun, M. DeLand, N. Pendar, R. Barber, and A. Galdi. Arnold: Declarative crowd-machine data integration. In CIDR, 2013.
-
(2013)
CIDR
-
-
Jeffery, S.R.1
Sun, L.2
DeLand, M.3
Pendar, N.4
Barber, R.5
Galdi, A.6
-
32
-
-
0001116877
-
Binary codes capable of correcting deletions, insertions and reversals
-
V. I. Levenshtein. Binary codes capable of correcting deletions, insertions and reversals. In Soviet physics doklady, volume 10, page 707, 1966.
-
(1966)
Soviet Physics Doklady
, vol.10
, pp. 707
-
-
Levenshtein, V.I.1
-
33
-
-
84873191280
-
Cdas: A crowdsourcing data analytics system
-
X. Liu, M. Lu, B. C. Ooi, Y. Shen, S. Wu, and M. Zhang. Cdas: a crowdsourcing data analytics system. Proceedings of the VLDB Endowment, 5(10):1040-1051, 2012.
-
(2012)
Proceedings of the VLDB Endowment
, vol.5
, Issue.10
, pp. 1040-1051
-
-
Liu, X.1
Lu, M.2
Ooi, B.C.3
Shen, Y.4
Wu, S.5
Zhang, M.6
-
34
-
-
0022807929
-
A simple parallel algorithm for the maximal independent set problem
-
M. Luby. A simple parallel algorithm for the maximal independent set problem. SIAM journal on computing, 15(4):1036-1053, 1986.
-
(1986)
SIAM Journal on Computing
, vol.15
, Issue.4
, pp. 1036-1053
-
-
Luby, M.1
-
35
-
-
84860851183
-
Human-powered sorts and joins
-
A. Marcus, E. Wu, D. Karger, S. Madden, and R. Miller. Human-powered sorts and joins. Proceedings of the VLDB Endowment, 5(1):13-24, 2011.
-
(2011)
Proceedings of the VLDB Endowment
, vol.5
, Issue.1
, pp. 13-24
-
-
Marcus, A.1
Wu, E.2
Karger, D.3
Madden, S.4
Miller, R.5
-
36
-
-
33646398530
-
Conditional models of identity uncertainty with application to noun coreference
-
A. McCallum and B. Wellner. Conditional models of identity uncertainty with application to noun coreference. In NIPS, 2004.
-
(2004)
NIPS
-
-
McCallum, A.1
Wellner, B.2
-
37
-
-
84862645517
-
Crowdscreen: Algorithms for filtering data with humans
-
ACM
-
A. G. Parameswaran, H. Garcia-Molina, H. Park, N. Polyzotis, A. Ramesh, and J. Widom. Crowdscreen: Algorithms for filtering data with humans. In Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data, pages 361-372. ACM, 2012.
-
(2012)
Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data
, pp. 361-372
-
-
Parameswaran, A.G.1
Garcia-Molina, H.2
Park, H.3
Polyzotis, N.4
Ramesh, A.5
Widom, J.6
-
38
-
-
84871076960
-
Deco: Declarative crowdsourcing
-
ACM
-
A. G. Parameswaran, H. Park, H. Garcia-Molina, N. Polyzotis, and J. Widom. Deco: declarative crowdsourcing. In Proceedings of the 21st ACM international conference on Information and knowledge management, pages 1203-1212. ACM, 2012.
-
(2012)
Proceedings of the 21st ACM International Conference on Information and Knowledge Management
, pp. 1203-1212
-
-
Parameswaran, A.G.1
Park, H.2
Garcia-Molina, H.3
Polyzotis, N.4
Widom, J.5
-
39
-
-
0012620707
-
Hanging on the metaphone
-
December
-
L. Philips. Hanging on the metaphone. Computer Language, 7 (12(December)), 1990.
-
(1990)
Computer Language
, vol.7
, Issue.12
-
-
Philips, L.1
-
40
-
-
0002490026
-
Data cleaning: Problems and current approaches
-
E. Rahm and H. H. Do. Data cleaning: Problems and current approaches. IEEE Data Eng. Bull., 23(4):3-13, 2000.
-
(2000)
IEEE Data Eng. Bull.
, vol.23
, Issue.4
, pp. 3-13
-
-
Rahm, E.1
Do, H.H.2
-
41
-
-
0039891959
-
A machine learning approach to coreference resolution of noun phrases
-
W. M. Soon, H. T. Ng, and D. C. Y. Lim. A machine learning approach to coreference resolution of noun phrases. Computational linguistics, 27(4):521-544, 2001.
-
(2001)
Computational Linguistics
, vol.27
, Issue.4
, pp. 521-544
-
-
Soon, W.M.1
Ng, H.T.2
Lim, D.C.Y.3
-
42
-
-
70349272838
-
Deterministic pivoting algorithms for constrained ranking and clustering problems
-
A. Van Zuylen and D. P. Williamson. Deterministic pivoting algorithms for constrained ranking and clustering problems. Mathematics of Operations Research, 34(3):594-620, 2009.
-
(2009)
Mathematics of Operations Research
, vol.34
, Issue.3
, pp. 594-620
-
-
Van Zuylen, A.1
Williamson, D.P.2
-
45
-
-
84855849323
-
Towards building a high-quality workforce with mechanical turk
-
P. Wais, S. Lingamneni, D. Cook, J. Fennell, B. Goldenberg, D. Lubarov, D. Marin, and H. Simons. Towards building a high-quality workforce with mechanical turk. Proceedings of computational social science and the wisdom of crowds (NIPS), pages 1-5, 2010.
-
(2010)
Proceedings of Computational Social Science and the Wisdom of Crowds (NIPS)
, pp. 1-5
-
-
Wais, P.1
Lingamneni, S.2
Cook, D.3
Fennell, J.4
Goldenberg, B.5
Lubarov, D.6
Marin, D.7
Simons, H.8
-
46
-
-
84872946975
-
Crowder: Crowdsourcing entity resolution
-
J. Wang, T. Kraska, M. J. Franklin, and J. Feng. Crowder: Crowdsourcing entity resolution. Proceedings of the VLDB Endowment, 5(11):1483-1494, 2012.
-
(2012)
Proceedings of the VLDB Endowment
, vol.5
, Issue.11
, pp. 1483-1494
-
-
Wang, J.1
Kraska, T.2
Franklin, M.J.3
Feng, J.4
-
47
-
-
84880551539
-
Leveraging transitive relations for crowdsourced joins
-
ACM
-
J. Wang, G. Li, T. Kraska, M. J. Franklin, and J. Feng. Leveraging transitive relations for crowdsourced joins. In Proceedings of the 2013 international conference on Management of data, pages 229-240. ACM, 2013.
-
(2013)
Proceedings of the 2013 International Conference on Management of Data
, pp. 229-240
-
-
Wang, J.1
Li, G.2
Kraska, T.3
Franklin, M.J.4
Feng, J.5
-
48
-
-
84881231558
-
Question selection for crowd entity resolution
-
S. E. Whang, P. Lofgren, and H. Garcia-Molina. Question selection for crowd entity resolution. Proceedings of the VLDB Endowment, 6(6):349-360, 2013.
-
(2013)
Proceedings of the VLDB Endowment
, vol.6
, Issue.6
, pp. 349-360
-
-
Whang, S.E.1
Lofgren, P.2
Garcia-Molina, H.3
|