-
1
-
-
84858463554
-
-
Riddle:http://www.cs.utexas.edu/users/ml/riddle/.
-
Riddle
-
-
-
2
-
-
36349014619
-
-
Kdd workshop on data cleaning, record linkage, and object consolidation, 2003
-
Kdd workshop on data cleaning, record linkage, and object consolidation, 2003.
-
-
-
-
4
-
-
36348996326
-
-
R. Bekkerman and A. McCallum. Disambiguating web appearances of people in a social network. In WWW, 2005.
-
R. Bekkerman and A. McCallum. Disambiguating web appearances of people in a social network. In WWW, 2005.
-
-
-
-
5
-
-
33745225977
-
Iterative record linkage for cleaning and integration
-
I. Bhattacharya and L. Getoor. Iterative record linkage for cleaning and integration. In DMKD Workshop, 2004.
-
(2004)
DMKD Workshop
-
-
Bhattacharya, I.1
Getoor, L.2
-
6
-
-
80052662115
-
Relational clustering for multi-type entity resolution
-
I. Bhattacharya and L. Getoor. Relational clustering for multi-type entity resolution. In MRDM Workshop, 2005.
-
(2005)
MRDM Workshop
-
-
Bhattacharya, I.1
Getoor, L.2
-
8
-
-
33745448357
-
A latent dirichlet model for unsupervised entity resolution
-
I. Bhattacharya and L. Getoor. A latent dirichlet model for unsupervised entity resolution. In SIAM Data Mining (SDM), 2006.
-
(2006)
SIAM Data Mining (SDM)
-
-
Bhattacharya, I.1
Getoor, L.2
-
9
-
-
77952372966
-
-
M. Bilenko and R. Mooney. Adaptive duplicate detection using learnable string similarity measures. In KDD, 2003.
-
M. Bilenko and R. Mooney. Adaptive duplicate detection using learnable string similarity measures. In KDD, 2003.
-
-
-
-
10
-
-
29844447256
-
Data cleaning in Microsoft SQL Server 2005
-
S. Chaudhuri, K. Ganjam, V. Ganti, R. Kapoor, V. Narasayya, and T. Vassilakis. Data cleaning in Microsoft SQL Server 2005. In SIGMOD, 2005.
-
(2005)
SIGMOD
-
-
Chaudhuri, S.1
Ganjam, K.2
Ganti, V.3
Kapoor, R.4
Narasayya, V.5
Vassilakis, T.6
-
12
-
-
0242540438
-
Learning to match and cluster large high-dimensional data sets for data integration
-
W. W. Cohen and J. Richman. Learning to match and cluster large high-dimensional data sets for data integration. In SIGKDD, 2002.
-
(2002)
SIGKDD
-
-
Cohen, W.W.1
Richman, J.2
-
13
-
-
34247205876
-
Learning metadata from the evidence in an on-line citation matching scheme
-
I. G. Councill, H. Li, Z. Zhuang, S. Debnath, L. Bolelli, W. C. Lee, A. Sivasubramaniam, and C. L. Giles. Learning metadata from the evidence in an on-line citation matching scheme. In JCDL, 2006.
-
(2006)
JCDL
-
-
Councill, I.G.1
Li, H.2
Zhuang, Z.3
Debnath, S.4
Bolelli, L.5
Lee, W.C.6
Sivasubramaniam, A.7
Giles, C.L.8
-
14
-
-
33745776306
-
Joint deduplication of multiple record types in relational data
-
A. Culotta and A. McCallum. Joint deduplication of multiple record types in relational data. In CIKM, 2005.
-
(2005)
CIKM
-
-
Culotta, A.1
McCallum, A.2
-
15
-
-
29844452555
-
Reference reconciliation in complex information spaces
-
X. Dong, A. Y. Halevy, and J. Madhavan. Reference reconciliation in complex information spaces. In SIGMOD, 2005.
-
(2005)
SIGMOD
-
-
Dong, X.1
Halevy, A.Y.2
Madhavan, J.3
-
17
-
-
4944235920
-
Two supervised learning approaches for name disambiguation in author citations
-
H. Han, L. Giles, H. Zha, C. Li, and K. Tsioutsiouliklis. Two supervised learning approaches for name disambiguation in author citations. In JCDL, 2004.
-
(2004)
JCDL
-
-
Han, H.1
Giles, L.2
Zha, H.3
Li, C.4
Tsioutsiouliklis, K.5
-
18
-
-
84976856849
-
The merge/purge problem for large databases
-
M. Hernandez and S. Stolfo. The merge/purge problem for large databases. In SIGMOD, 1995.
-
(1995)
SIGMOD
-
-
Hernandez, M.1
Stolfo, S.2
-
21
-
-
33745266392
-
Domain-independent data cleaning via analysis of entity-relationship graph
-
June
-
D. V. Kalashnikov and S. Mehrotra. Domain-independent data cleaning via analysis of entity-relationship graph. ACM TODS, 31(2), June 2006.
-
(2006)
ACM TODS
, vol.31
, Issue.2
-
-
Kalashnikov, D.V.1
Mehrotra, S.2
-
23
-
-
34548733091
-
Disambiguation algorithm for people search on the web
-
D. V. Kalashnikov, S. Mehrotra, Z. Chen, R. Nuray-Turan, and N. Ashish. Disambiguation algorithm for people search on the web. In ICDE poster, 2007.
-
(2007)
ICDE poster
-
-
Kalashnikov, D.V.1
Mehrotra, S.2
Chen, Z.3
Nuray-Turan, R.4
Ashish, N.5
-
25
-
-
36348983081
-
Identification and tracing of ambiguous names: Discriminative and generative approaches
-
X. Li, P. Morie, and D. Roth. Identification and tracing of ambiguous names: Discriminative and generative approaches. In AAAI, 2004.
-
(2004)
AAAI
-
-
Li, X.1
Morie, P.2
Roth, D.3
-
27
-
-
31844455911
-
Conditional models of identity uncertainty with application to noun coreference
-
A. McCallum and B. Wellner. Conditional models of identity uncertainty with application to noun coreference. In NIPS, 2004.
-
(2004)
NIPS
-
-
McCallum, A.1
Wellner, B.2
-
28
-
-
0034592784
-
Efficient clustering of high-dimensional data sets with application to reference matching
-
A. K. McCallum, K. Nigam, and L. Ungar. Efficient clustering of high-dimensional data sets with application to reference matching. In SIGKDD, 2000.
-
(2000)
SIGKDD
-
-
McCallum, A.K.1
Nigam, K.2
Ungar, L.3
-
29
-
-
33750347523
-
Contextual search and name disambiguation in email using graphs
-
E. Minkov, W. W. Cohen, and A. Y. Ng. Contextual search and name disambiguation in email using graphs. In ACM SIGIR, 2006.
-
(2006)
ACM SIGIR
-
-
Minkov, E.1
Cohen, W.W.2
Ng, A.Y.3
-
31
-
-
47249101877
-
Improving grouped-entity resolution using quasi-cliques
-
B.-W. On, E. Elmacioglu, D. Lee, J. Kang, and J. Pei. Improving grouped-entity resolution using quasi-cliques. In ICDM, 2006.
-
(2006)
ICDM
-
-
On, B.-W.1
Elmacioglu, E.2
Lee, D.3
Kang, J.4
Pei, J.5
-
32
-
-
27544460727
-
Comparative study of name disambiguation problem using a scalable blocking-based framework
-
B.-W. On, D. Lee, J. Kang, and P. Mitra. Comparative study of name disambiguation problem using a scalable blocking-based framework. In JCDL, 2005.
-
(2005)
JCDL
-
-
On, B.-W.1
Lee, D.2
Kang, J.3
Mitra, P.4
-
33
-
-
85156206690
-
Identity uncertainty and citation matching
-
H. Pasula, B. Marthi, B. Milch, S. Russell, and I. Shpitser. Identity uncertainty and citation matching. In NIPS, 2002.
-
(2002)
NIPS
-
-
Pasula, H.1
Marthi, B.2
Milch, B.3
Russell, S.4
Shpitser, I.5
-
34
-
-
0242456811
-
Interactive deduplication using active learning
-
S. Sarawagi and A. Bhamidipaty. Interactive deduplication using active learning. In SIGKDD, 2002.
-
(2002)
SIGKDD
-
-
Sarawagi, S.1
Bhamidipaty, A.2
-
36
-
-
0242456803
-
Learning domain-independent string transformation weights for high accuracy object identification
-
S. Tejada, C. A. Knoblock, and S. Minton. Learning domain-independent string transformation weights for high accuracy object identification. In SIGKDD, 2002.
-
(2002)
SIGKDD
-
-
Tejada, S.1
Knoblock, C.A.2
Minton, S.3
|