-
1
-
-
2342576574
-
Eliminating fuzzy duplicates in data warehouses
-
Hong Kong, China
-
Ananthakrishna, R., Chaudhuri, S., & Ganti, V. (2002). Eliminating fuzzy duplicates in data warehouses. In The International Conference on Very Large Databases (VLDB), Hong Kong, China.
-
(2002)
The International Conference on Very Large Databases (VLDB)
-
-
Ananthakrishna, R.1
Chaudhuri, S.2
Ganti, V.3
-
2
-
-
33750452514
-
Swoosh: A generic approach to entity resolution
-
Tech. rep, Stanford University
-
Benjelloun, O., Garcia-Molina, H., Su, Q., & Widom, J. (2005). Swoosh: A generic approach to entity resolution. Tech. rep., Stanford University.
-
(2005)
-
-
Benjelloun, O.1
Garcia-Molina, H.2
Su, Q.3
Widom, J.4
-
4
-
-
80052662115
-
Relational clustering for multi-type entity resolution
-
Chicago, IL, USA
-
Bhattacharya, I., & Getoor, L. (2005). Relational clustering for multi-type entity resolution. In The ACM SIGKDD Workshop on Multi Relational Data Mining (MRDM), Chicago, IL, USA.
-
(2005)
The ACM SIGKDD Workshop on Multi Relational Data Mining (MRDM)
-
-
Bhattacharya, I.1
Getoor, L.2
-
7
-
-
33749549918
-
Query-time entity resolution
-
Philadelphia, PA, USA
-
Bhattacharya, I., Licamele, L., & Getoor, L. (2006). Query-time entity resolution. In The ACMInternational Conference on Knowledge Discovery and Data Mining (SIGKDD), Philadelphia, PA, USA.
-
(2006)
The ACMInternational Conference on Knowledge Discovery and Data Mining (SIGKDD)
-
-
Bhattacharya, I.1
Licamele, L.2
Getoor, L.3
-
9
-
-
2342447399
-
Adaptive name matching in information integration
-
Bilenko, M., Mooney, R., Cohen, W., Ravikumar, P., & Fienberg, S. (2003). Adaptive name matching in information integration.. IEEE Intelligent Systems, 18(5), 16-23.
-
(2003)
IEEE Intelligent Systems
, vol.18
, Issue.5
, pp. 16-23
-
-
Bilenko, M.1
Mooney, R.2
Cohen, W.3
Ravikumar, P.4
Fienberg, S.5
-
11
-
-
33749624541
-
Efficient batch top-k search for dictionary-based entity recognition
-
Washington, DC, USA
-
Chandel, A., Nagesh, P. C., & Sarawagi, S. (2006). Efficient batch top-k search for dictionary-based entity recognition. In The IEEE International Conference on Data Engineering (ICDE), Washington, DC, USA.
-
(2006)
The IEEE International Conference on Data Engineering (ICDE)
-
-
Chandel, A.1
Nagesh, P.C.2
Sarawagi, S.3
-
12
-
-
1142279457
-
Robust and efficient fuzzy match for online data cleaning
-
San Diego, CA, USA
-
Chaudhuri, S., Ganjam, K., Ganti, V., & Motwani, R. (2003). Robust and efficient fuzzy match for online data cleaning. In The ACM International Conference on Management of Data (SIGMOD), San Diego, CA, USA.
-
(2003)
The ACM International Conference on Management of Data (SIGMOD)
-
-
Chaudhuri, S.1
Ganjam, K.2
Ganti, V.3
Motwani, R.4
-
13
-
-
0034592802
-
Hardening soft information sources
-
Boston, MA, USA
-
Cohen, W., Kautz, H., & McAllester, D. (2000). Hardening soft information sources. In The ACMInternational Conference on Knowledge Discovery and Data Mining (SIGKDD), Boston, MA, USA.
-
(2000)
The ACMInternational Conference on Knowledge Discovery and Data Mining (SIGKDD)
-
-
Cohen, W.1
Kautz, H.2
McAllester, D.3
-
14
-
-
29844452555
-
Reference reconciliation in complex information spaces
-
Baltimore, MD, USA
-
Dong, X., Halevy, A., & Madhavan, J. (2005). Reference reconciliation in complex information spaces. In The ACM International Conference on Management of Data (SIGMOD), Baltimore, MD, USA.
-
(2005)
The ACM International Conference on Management of Data (SIGMOD)
-
-
Dong, X.1
Halevy, A.2
Madhavan, J.3
-
15
-
-
0006408706
-
Localized partial evaluation of belief networks
-
Seattle, WA, USA
-
Draper, D., & Hanks, S. (1994). Localized partial evaluation of belief networks. In The Annual Conference on Uncertainty in Artificial Intelligence (UAI), Seattle, WA, USA.
-
(1994)
The Annual Conference on Uncertainty in Artificial Intelligence (UAI)
-
-
Draper, D.1
Hanks, S.2
-
18
-
-
29844448776
-
Conquer: Efficient management of inconsistent databases
-
Baltimore, MD, USA
-
Fuxman, A., Fazli, E., & Miller, R. (2005). Conquer: Efficient management of inconsistent databases. In The ACM International Conference on Management of Data (SIGMOD), Baltimore, MD, USA.
-
(2005)
The ACM International Conference on Management of Data (SIGMOD)
-
-
Fuxman, A.1
Fazli, E.2
Miller, R.3
-
19
-
-
0344927353
-
Text joins for data cleansing and integration in an rdbms
-
Bangalore, India
-
Gravano, L., Ipeirotis, P., Koudas, N., & Srivastava, D. (2003). Text joins for data cleansing and integration in an rdbms. In The IEEE International Conference on Data Engineering (ICDE), Bangalore, India.
-
(2003)
The IEEE International Conference on Data Engineering (ICDE)
-
-
Gravano, L.1
Ipeirotis, P.2
Koudas, N.3
Srivastava, D.4
-
20
-
-
84976856849
-
The merge/purge problem for large databases
-
San Jose, CA, USA
-
Hernández, M., & Stolfo, S. (1995). The merge/purge problem for large databases. In The ACM International Conference on Management of Data (SIGMOD), San Jose, CA, USA.
-
(1995)
The ACM International Conference on Management of Data (SIGMOD)
-
-
Hernández, M.1
Stolfo, S.2
-
21
-
-
84880127702
-
Exploiting relationships for domain-independent data cleaning
-
Newport Beach, CA, USA
-
Kalashnikov, D., Mehrotra, S., & Chen, Z. (2005). Exploiting relationships for domain-independent data cleaning. In SIAM International Conference on Data Mining (SIAM SDM), Newport Beach, CA, USA.
-
(2005)
SIAM International Conference on Data Mining (SIAM SDM)
-
-
Kalashnikov, D.1
Mehrotra, S.2
Chen, Z.3
-
23
-
-
17244368453
-
Semantic integration in text: From ambiguous names to identifiable entities
-
Li, X., Morie, P., & Roth, D. (2005). Semantic integration in text: From ambiguous names to identifiable entities. AI Magazine. Special Issue on Semantic Integration, 26(1).
-
(2005)
AI Magazine. Special Issue on Semantic Integration
, vol.26
, Issue.1
-
-
Li, X.1
Morie, P.2
Roth, D.3
-
24
-
-
33749235905
-
Spectral clustering for multi-type relational data
-
Long, B., Zhang, Z. M., Wu, X., & Yu, P. S. (2006). Spectral clustering for multi-type relational data. In Proceedings of the 23rd International Conference on Machine Learning (ICML).
-
(2006)
Proceedings of the 23rd International Conference on Machine Learning (ICML)
-
-
Long, B.1
Zhang, Z.M.2
Wu, X.3
Yu, P.S.4
-
25
-
-
0034592784
-
Efficient clustering of high-dimensional data sets with application to reference matching
-
Boston, MA, USA
-
McCallum, A., Nigam, K., & Ungar, L. (2000). Efficient clustering of high-dimensional data sets with application to reference matching. In The ACM International Conference On Knowledge Discovery and Data Mining (SIGKDD), Boston, MA, USA.
-
(2000)
The ACM International Conference On Knowledge Discovery and Data Mining (SIGKDD)
-
-
McCallum, A.1
Nigam, K.2
Ungar, L.3
-
26
-
-
33646398530
-
Conditional models of identity uncertainty with application to noun coreference
-
Vancouver, BC, Canada
-
McCallum, A., & Wellner, B. (2004). Conditional models of identity uncertainty with application to noun coreference. In Advances In Neural Information Processing Systems (NIPS), Vancouver, BC, Canada.
-
(2004)
Advances In Neural Information Processing Systems (NIPS)
-
-
McCallum, A.1
Wellner, B.2
-
27
-
-
85018108837
-
The field matching problem: Algorithms and applications
-
Portland, OR, USA
-
Monge, A., & Elkan, C. (1996). The field matching problem: Algorithms and applications. In The ACM International Conference on Knowledge Discovery and Data Mining (SIGKDD), Portland, OR, USA.
-
(1996)
The ACM International Conference on Knowledge Discovery and Data Mining (SIGKDD)
-
-
Monge, A.1
Elkan, C.2
-
28
-
-
0004043396
-
An efficient domain-independent algorithm for detecting approximately duplicate database records
-
Tuscon, AZ, USA
-
Monge, A., & Elkan, C. (1997). An efficient domain-independent algorithm for detecting approximately duplicate database records. In The SIGMOD Workshop on Research Issues on Data Mining and Knowledge Discovery (DMKD), Tuscon, AZ, USA.
-
(1997)
The SIGMOD Workshop on Research Issues on Data Mining and Knowledge Discovery (DMKD)
-
-
Monge, A.1
Elkan, C.2
-
29
-
-
0345566149
-
A guided tour to approximate string matching
-
Navarro, G. (2001). A guided tour to approximate string matching. ACM Computing Surveys, 33(1), 31-88.
-
(2001)
ACM Computing Surveys
, vol.33
, Issue.1
, pp. 31-88
-
-
Navarro, G.1
-
30
-
-
84898987614
-
Identity uncertainty and citation matching
-
Vancouver, BC, Canada
-
Pasula, H., Marthi, B., Milch, B., Russell, S., & Shpitser, I. (2003). Identity uncertainty and citation matching. In Advances in Neural Information Processing Systems (NIPS), Vancouver, BC, Canada.
-
(2003)
Advances in Neural Information Processing Systems (NIPS)
-
-
Pasula, H.1
Marthi, B.2
Milch, B.3
Russell, S.4
Shpitser, I.5
-
31
-
-
26844557708
-
A hierarchical graphical model for record linkage
-
Banff, Alberta, Canada
-
Ravikumar, P., & Cohen, W. (2004). A hierarchical graphical model for record linkage. In The Conference on Uncertainty in Artificial Intelligence (UAI), Banff, Alberta, Canada.
-
(2004)
The Conference on Uncertainty in Artificial Intelligence (UAI)
-
-
Ravikumar, P.1
Cohen, W.2
-
32
-
-
0242456811
-
Interactive deduplication using active learning
-
Edmonton, Alberta, Canada
-
Sarawagi, S., & Bhamidipaty, A. (2002). Interactive deduplication using active learning. In Proceedings of the Eighth ACM International Conference on Knowledge Discovery and Data Mining (SIGKDD), Edmonton, Alberta, Canada.
-
(2002)
Proceedings of the Eighth ACM International Conference on Knowledge Discovery and Data Mining (SIGKDD)
-
-
Sarawagi, S.1
Bhamidipaty, A.2
-
33
-
-
24944535349
-
Multi-relational record linkage
-
Seattle, WA, USA
-
Singla, P., & Domingos, P. (2004). Multi-relational record linkage. In The SIGKDD Workshop on Multi-Relational Data Mining (MRDM), Seattle, WA, USA.
-
(2004)
The SIGKDD Workshop on Multi-Relational Data Mining (MRDM)
-
-
Singla, P.1
Domingos, P.2
-
34
-
-
0035545848
-
Learning object identification rules for information integration
-
Tejada, S., Knoblock, C., & Minton, S. (2001). Learning object identification rules for information integration. Information Systems Journal, 26(8), 635-656.
-
(2001)
Information Systems Journal
, vol.26
, Issue.8
, pp. 635-656
-
-
Tejada, S.1
Knoblock, C.2
Minton, S.3
-
35
-
-
2942741943
-
Methods for record linkage and Bayesian networks
-
Tech. rep, Statistical Research Division, U.S. Census Bureau, Washington, DC
-
Winkler, W. (2002). Methods for record linkage and Bayesian networks. Tech. rep., Statistical Research Division, U.S. Census Bureau, Washington, DC.
-
(2002)
-
-
Winkler, W.1
|