-
2
-
-
33749588820
-
Clean answers over dirty databases: A probabilistic approach
-
P. Andritsos, A. Fuxman, and R. J. Miller. Clean answers over dirty databases: A probabilistic approach. In ICDE, 2006.
-
(2006)
ICDE
-
-
Andritsos, P.1
Fuxman, A.2
Miller, R.J.3
-
3
-
-
85104914015
-
Efficient exact set-similarity joins
-
A. Arasu, V. Ganti, and R. Kaushik. Efficient exact set-similarity joins. In VLDB, 2006.
-
(2006)
VLDB
-
-
Arasu, A.1
Ganti, V.2
Kaushik, R.3
-
4
-
-
0036949730
-
Correlation clustering
-
Washington, DC, USA, IEEE Computer Society
-
N. Bansal, A. Blum, and S. Chawla. Correlation clustering. In FOGS '02: Proceedings of the 43rd Symposium on Foundations of Computer Science, page 238, Washington, DC, USA, 2002. IEEE Computer Society.
-
(2002)
FOGS '02: Proceedings of the 43rd Symposium on Foundations of Computer Science
, pp. 238
-
-
Bansal, N.1
Blum, A.2
Chawla, S.3
-
5
-
-
34248229658
-
Collective entity resolution in relational data
-
I. Bhattacharya and L. Getoor. Collective entity resolution in relational data. TKDD, 1(1), 2007.
-
(2007)
TKDD
, vol.1
, Issue.1
-
-
Bhattacharya, I.1
Getoor, L.2
-
6
-
-
9444281954
-
Learnable similarity functions and their applications to clustering and record linkage
-
San Jose, California, USA, AAAI Press/The MIT Press
-
M. Bilenko. Learnable similarity functions and their applications to clustering and record linkage. In Proceedings of the Nineteenth National Conference on Artificial Intelligence, Sixteenth Conference on Innovative Applications of Artificial Intelligence, July 25-29, 2004, San Jose, California, USA, pages 981-982. AAAI Press/The MIT Press, 2004.
-
(2004)
Proceedings of the Nineteenth National Conference on Artificial Intelligence, Sixteenth Conference on Innovative Applications of Artificial Intelligence, July 25-29, 2004
, pp. 981-982
-
-
Bilenko, M.1
-
7
-
-
33746054079
-
Adaptive product normalization: Using online learning for record linkage in comparison shopping
-
M. Bilenko, S. Basu, and M. Sahami. Adaptive product normalization: Using online learning for record linkage in comparison shopping. In ICDM, 2005.
-
(2005)
ICDM
-
-
Bilenko, M.1
Basu, S.2
Sahami, M.3
-
8
-
-
84878049861
-
Adaptive blocking: Learning to scale up record linkage
-
M. Bilenko, B. Kamath, and R. J. Mooney. Adaptive blocking: Learning to scale up record linkage. In ICDM, 2006.
-
(2006)
ICDM
-
-
Bilenko, M.1
Kamath, B.2
Mooney, R.J.3
-
9
-
-
2342447399
-
Adaptive name-matching in information integration
-
M. Bilenko, R. Mooney, W. Cohen, P. Ravikumar, and S. Fienberg. Adaptive name-matching in information integration. IEEE Intelligent Systems, 2003.
-
(2003)
IEEE Intelligent Systems
-
-
Bilenko, M.1
Mooney, R.2
Cohen, W.3
Ravikumar, P.4
Fienberg, S.5
-
10
-
-
24644456480
-
Clustering with qualitative information
-
M. Charikar, V. Guruswami, and A. Wirth. Clustering with qualitative information. J. Comput. Syst. Sci., 71(3):360-383, 2005.
-
(2005)
J. Comput. Syst. Sci
, vol.71
, Issue.3
, pp. 360-383
-
-
Charikar, M.1
Guruswami, V.2
Wirth, A.3
-
11
-
-
85011029434
-
Example-driven design of efficient record matching queries
-
S. Chaudhuri, B.-C. Chen, V. Ganti, and R. Kaushik. Example-driven design of efficient record matching queries. In VLDB, pages 327-338, 2007.
-
(2007)
VLDB
, pp. 327-338
-
-
Chaudhuri, S.1
Chen, B.-C.2
Ganti, V.3
Kaushik, R.4
-
13
-
-
26444550791
-
Robust identification of fuzzy duplicates
-
S. Chaudhuri, V. Ganti, and R. Motwani. Robust identification of fuzzy duplicates. In ICDE, 2005.
-
(2005)
ICDE
-
-
Chaudhuri, S.1
Ganti, V.2
Motwani, R.3
-
14
-
-
33846213661
-
A divide-and-merge methodology for clustering
-
D. Cheng, R. Kannan, S. Vempala, and G. Wang. A divide-and-merge methodology for clustering. ACM Trans. Database Syst., 31(4):1499-1525, 2006.
-
(2006)
ACM Trans. Database Syst
, vol.31
, Issue.4
, pp. 1499-1525
-
-
Cheng, D.1
Kannan, R.2
Vempala, S.3
Wang, G.4
-
16
-
-
0000666461
-
Data integration using similarity joins and a word-based information representation language
-
July
-
W. W. Cohen. Data integration using similarity joins and a word-based information representation language. ACM Transactions on Information Systems, 18(3):288-321, July 2000.
-
(2000)
ACM Transactions on Information Systems
, vol.18
, Issue.3
, pp. 288-321
-
-
Cohen, W.W.1
-
19
-
-
84944318804
-
Approximate string joins in a database (almost) for free
-
Rome, Italy
-
L. Gravano, P. Ipeirotis, H. V. Jagadish, N. Koudas, S. Muthukrishnan, and D. Srivastava. Approximate string joins in a database (almost) for free. In Proc. of the 27th Int'l Conference on Very Large Databases (VLDB), Rome, Italy, 2001.
-
(2001)
Proc. of the 27th Int'l Conference on Very Large Databases (VLDB)
-
-
Gravano, L.1
Ipeirotis, P.2
Jagadish, H.V.3
Koudas, N.4
Muthukrishnan, S.5
Srivastava, D.6
-
22
-
-
63449083945
-
-
J. Ko, T. Mitamura, and E. Nyberg. Language-independent probabilistic answer ranking for question answering. In ACL, 2007.
-
J. Ko, T. Mitamura, and E. Nyberg. Language-independent probabilistic answer ranking for question answering. In ACL, 2007.
-
-
-
-
24
-
-
84888516789
-
-
Y. Koren and D. Harel. A multi-scale algorithm for the linear arrangement problem. In WG, 2002.
-
Y. Koren and D. Harel. A multi-scale algorithm for the linear arrangement problem. In WG, 2002.
-
-
-
-
26
-
-
0034592784
-
Efficient clustering of high-dimensional data sets with application to reference matching
-
A. McCallum, K. Nigam, and L. H. Ungar. Efficient clustering of high-dimensional data sets with application to reference matching. In Knowledge Discovery and Data Mining, pages 169-178, 2000.
-
(2000)
Knowledge Discovery and Data Mining
, pp. 169-178
-
-
McCallum, A.1
Nigam, K.2
Ungar, L.H.3
-
29
-
-
84898987614
-
Identity uncertainty and citation matching
-
Vancouver, British Columbia, MIT Press
-
H. Pasula, B. Marthi, B. Milch, S. Russell, and I. Shpitser. Identity uncertainty and citation matching. In Advances in Neural Processing Systems 15, Vancouver, British Columbia, 2002. MIT Press.
-
(2002)
Advances in Neural Processing Systems 15
-
-
Pasula, H.1
Marthi, B.2
Milch, B.3
Russell, S.4
Shpitser, I.5
-
36
-
-
65449139953
-
-
M. L. Wick, K. Rohanimanesh, K. Schultz, and A. McCallum. A unified approach for schema matching, coreference and canonicalization. In KDD, 2008.
-
M. L. Wick, K. Rohanimanesh, K. Schultz, and A. McCallum. A unified approach for schema matching, coreference and canonicalization. In KDD, 2008.
-
-
-
-
37
-
-
57149130672
-
Cost-based variable-length-gram selection for string collections to support approximate queries efficiently
-
X. Yang, B. Wang, and C. Li. Cost-based variable-length-gram selection for string collections to support approximate queries efficiently. In SIGMOD Conference, pages 353-364, 2008.
-
(2008)
SIGMOD Conference
, pp. 353-364
-
-
Yang, X.1
Wang, B.2
Li, C.3
|