-
1
-
-
10944272139
-
Friends and neighbors on the Web
-
July
-
ADAMIC, L. AND ADAR, E. 2003. Friends and neighbors on the Web. Social Networks 25, 3 (July), 211-230.
-
(2003)
Social Networks
, vol.25
, Issue.3
, pp. 211-230
-
-
ADAMIC, L.1
ADAR, E.2
-
2
-
-
2342576574
-
Eliminating fuzzy duplicates in data warehouses
-
Hong Kong, China
-
ANANTHAKRISHNA, R., CHAUDHURI, S., AND GANTI, V. 2002. Eliminating fuzzy duplicates in data warehouses. In The International Conference on Very Large Databases (VLDB), Hong Kong, China.
-
(2002)
The International Conference on Very Large Databases (VLDB)
-
-
ANANTHAKRISHNA, R.1
CHAUDHURI, S.2
GANTI, V.3
-
3
-
-
33750452514
-
Swoosh: A generic approach to entity resolution
-
Tech. rep., Stanford University, March
-
BENJELLOUN, O., GARCIA-MOLINA, H., SU, Q., AND WIDOM, J. 2005. Swoosh: A generic approach to entity resolution. Tech. rep., Stanford University. (March)
-
(2005)
-
-
BENJELLOUN, O.1
GARCIA-MOLINA, H.2
SU, Q.3
WIDOM, J.4
-
9
-
-
2342447399
-
Adaptive name matching in information integration
-
BILENKO, M., MOONEY, R., COHEN, W., RAVIKUMAR, P., AND FIENBERG, S. 2003. Adaptive name matching in information integration. IEEE Intellig. Syst. 18, 5, 16-23.
-
(2003)
IEEE Intellig. Syst
, vol.18
, Issue.5
, pp. 16-23
-
-
BILENKO, M.1
MOONEY, R.2
COHEN, W.3
RAVIKUMAR, P.4
FIENBERG, S.5
-
10
-
-
1142279457
-
Robust and efficient fuzzy match for online data cleaning
-
San Diego, CA
-
CHAUDHURI, S., GANJAM, K., GANTI, V., AND MOTWANI, R. 2003. Robust and efficient fuzzy match for online data cleaning. In The ACM International Conference on Management of Data (SIGMOD). San Diego, CA.
-
(2003)
The ACM International Conference on Management of Data (SIGMOD)
-
-
CHAUDHURI, S.1
GANJAM, K.2
GANTI, V.3
MOTWANI, R.4
-
11
-
-
0000666461
-
Data integration using similarity joins and a word-based information representation language
-
COHEN, W. 2000. Data integration using similarity joins and a word-based information representation language. ACM Trans. Inform. Syst. 18, 288-321.
-
(2000)
ACM Trans. Inform. Syst
, vol.18
, pp. 288-321
-
-
COHEN, W.1
-
12
-
-
11144240583
-
A comparison of string distance metrics for name-matching tasks
-
Acapulco, Mexico
-
COHEN, W., RAVIKUMAR, P., AND FIENBERG, S. 2003. A comparison of string distance metrics for name-matching tasks. In The IJCAI Workshop on Information Integration on the Web (IIWeb). Acapulco, Mexico.
-
(2003)
The IJCAI Workshop on Information Integration on the Web (IIWeb)
-
-
COHEN, W.1
RAVIKUMAR, P.2
FIENBERG, S.3
-
14
-
-
29844452555
-
Reference reconciliation in complex information spaces
-
Baltimore, MD
-
DONG, X., HALEVY, A., AND MADHAVAN, J. 2005. Reference reconciliation in complex information spaces. In The ACM International Conference on Management of Data (SIGMOD). Baltimore, MD.
-
(2005)
The ACM International Conference on Management of Data (SIGMOD)
-
-
DONG, X.1
HALEVY, A.2
MADHAVAN, J.3
-
16
-
-
0031622479
-
CiteSeer: An automatic citation indexing system
-
Pittsburgh, PA
-
GILES, C. L., BOLLACKER, K., AND LAWRENCE, S. 1998. CiteSeer: An automatic citation indexing system. In The ACM Conference on Digital Libraries, Pittsburgh, PA.
-
(1998)
The ACM Conference on Digital Libraries
-
-
GILES, C.L.1
BOLLACKER, K.2
LAWRENCE, S.3
-
17
-
-
0344927353
-
Text joins for data cleansing and integration in an RDBMS
-
Bangalore, India
-
GRAVANO, L., IPEIROTIS, P., KOUDAS, N., AND SRIVASTAVA, D. 2003. Text joins for data cleansing and integration in an RDBMS. In The IEEE International Conference on Data Engineering (ICDE). Bangalore, India.
-
(2003)
The IEEE International Conference on Data Engineering (ICDE)
-
-
GRAVANO, L.1
IPEIROTIS, P.2
KOUDAS, N.3
SRIVASTAVA, D.4
-
19
-
-
84880127702
-
Exploiting relationships for domain-independent data cleaning
-
Newport Beach, CA
-
KALASHNIKOV, D., MEHROTRA, S., AND CHEN, Z. 2005. Exploiting relationships for domain-independent data cleaning. In The SIAM International Conference on Data Mining (SIAM SDM). Newport Beach, CA.
-
(2005)
The SIAM International Conference on Data Mining (SIAM SDM)
-
-
KALASHNIKOV, D.1
MEHROTRA, S.2
CHEN, Z.3
-
20
-
-
17244368453
-
Semantic integration in text: From ambiguous names to identifiable entities
-
AI Magazine. Special Issue on Semantic Integration 26, 1
-
LI, X., MORIE, P., AND ROTH, D. 2005. Semantic integration in text: From ambiguous names to identifiable entities. AI Magazine. Special Issue on Semantic Integration 26, 1, 45-58.
-
(2005)
, pp. 45-58
-
-
LI, X.1
MORIE, P.2
ROTH, D.3
-
22
-
-
0034592784
-
Efficient clustering of high-dimensional data sets with application to reference matching
-
Boston, MA
-
MCCALLUM, A., NIGAM, K., AND UNGAR, L. 2000. Efficient clustering of high-dimensional data sets with application to reference matching. In The International Conference On Knowledge Discovery and Data Mining (SIGKDD). Boston, MA.
-
(2000)
The International Conference On Knowledge Discovery and Data Mining (SIGKDD)
-
-
MCCALLUM, A.1
NIGAM, K.2
UNGAR, L.3
-
26
-
-
0345566149
-
A guided tour to approximate string matching
-
NAVARRO, G. 2001. A guided tour to approximate string matching. ACM Comp. Sur. 33, 1, 31-88.
-
(2001)
ACM Comp. Sur
, vol.33
, Issue.1
, pp. 31-88
-
-
NAVARRO, G.1
-
27
-
-
0001592068
-
Automatic linkage of vital records
-
NEWCOMBE, H., KENNEDY, J., AXFORD, S., AND JAMES, A. 1959. Automatic linkage of vital records. Science 130, 954-959.
-
(1959)
Science
, vol.130
, pp. 954-959
-
-
NEWCOMBE, H.1
KENNEDY, J.2
AXFORD, S.3
JAMES, A.4
-
28
-
-
84898987614
-
Identity uncertainty and citation matching
-
Vancouver, Canada
-
PASULA, H., MARTHI, B., MILCH, B., RUSSELL, S., AND SHPITSER, I. 2003. Identity uncertainty and citation matching. In The Annual Conference on Neural Information Processing Systems (NIPS). Vancouver, Canada.
-
(2003)
The Annual Conference on Neural Information Processing Systems (NIPS)
-
-
PASULA, H.1
MARTHI, B.2
MILCH, B.3
RUSSELL, S.4
SHPITSER, I.5
-
33
-
-
0035545848
-
Learning object identification rules for information integration
-
TEJADA, S., KNOBLOCK, C., AND MINTON, S. 2001. Learning object identification rules for information integration. Inform. Syst. J. 26, 8, 635-656.
-
(2001)
Inform. Syst. J
, vol.26
, Issue.8
, pp. 635-656
-
-
TEJADA, S.1
KNOBLOCK, C.2
MINTON, S.3
-
34
-
-
34248202425
-
-
WINKLER, W. 1999. The state of record linkage and current research problems. Tech. rep., Statistical Research Division, U.S. Census Bureau, Washington, DC.
-
-
-
-
35
-
-
2942741943
-
Methods for record linkage and Bayesian networks
-
Tech. rep., Statistical Research Division, U.S. Census Bureau, Washington, DC
-
WINKLER, W. 2002. Methods for record linkage and Bayesian networks. Tech. rep., Statistical Research Division, U.S. Census Bureau, Washington, DC.
-
(2002)
-
-
WINKLER, W.1