-
3
-
-
77952372966
-
Adaptive duplicate detection using learnable string similarity measures
-
M. Bilenko and R. Mooney. Adaptive duplicate detection using learnable string similarity measures. In Proc. KDD-03, pages 39-48, 2003.
-
(2003)
Proc. KDD-03
, pp. 39-48
-
-
Bilenko, M.1
Mooney, R.2
-
6
-
-
33646696837
-
A comparison of string metrics for matching names and records
-
W. Cohen, P. Ravikumar, and S. Fienberg. A comparison of string metrics for matching names and records. In Proc. KDD-03 Wkshp. on Data Cleaning, Record Linkage, and Object Consolidation, pages 13-18, 2003.
-
(2003)
Proc. KDD-03 Wkshp. on Data Cleaning, Record Linkage, and Object Consolidation
, pp. 13-18
-
-
Cohen, W.1
Ravikumar, P.2
Fienberg, S.3
-
7
-
-
0242540438
-
Learning to match and cluster large high-dimensional data sets for data integration
-
W. Cohen and J. Richman. Learning to match and cluster large high-dimensional data sets for data integration. In Proc. KDD-02, pages 475-480, 2002.
-
(2002)
Proc. KDD-02
, pp. 475-480
-
-
Cohen, W.1
Richman, J.2
-
8
-
-
85127836544
-
Discriminative training methods for hidden Markov models: Theory and experiments with perceptron algorithms
-
Philadelphia, PA
-
M. Collins. Discriminative training methods for hidden Markov models: Theory and experiments with perceptron algorithms. In Proc. EMNLP-02, Philadelphia, PA, 2002.
-
(2002)
Proc. EMNLP-02
-
-
Collins, M.1
-
9
-
-
33745776306
-
Joint deduplication of multiple record types in relational data
-
A. Culotta and A. McCallum. Joint deduplication of multiple record types in relational data. In Proc. CIKM-05, pages 257-258, 2005.
-
(2005)
Proc. CIKM-05
, pp. 257-258
-
-
Culotta, A.1
McCallum, A.2
-
10
-
-
84878049792
-
-
J. Davis, I. Dutra, D. Page, and V. Costa. Establishing identity equivalence in multi-relational domains. In Proc. ICIA-05, 2005.
-
J. Davis, I. Dutra, D. Page, , and V. Costa. Establishing identity equivalence in multi-relational domains. In Proc. ICIA-05, 2005.
-
-
-
-
12
-
-
29844452555
-
Reference reconciliation in complex information spaces
-
X. Dong, A. Halevy, and J. Madhavan. Reference reconciliation in complex information spaces. In Proc. SIGMOD-05, pages 85-96, 2005.
-
(2005)
Proc. SIGMOD-05
, pp. 85-96
-
-
Dong, X.1
Halevy, A.2
Madhavan, J.3
-
15
-
-
34748863355
-
-
and, editors, IMLS, Pittsburgh, PA, USA
-
A. Fern, L. Getoor, and B. Milch, editors. Proceedings of the ICML-2006 Workshop on Open Problems in Statistical Relational Learning. IMLS, Pittsburgh, PA, USA, 2006.
-
(2006)
Proceedings of the ICML-2006 Workshop on Open Problems in Statistical Relational Learning
-
-
-
16
-
-
84878052668
-
-
Online bibliography, SIGKDD, 2003
-
J. Gehrke, P. Ginsparg, and J. Kleinberg. KDD cup 2003. Online bibliography, SIGKDD, 2003. http://-www.cs.cornell.edu/projects/kddcup.
-
(2003)
KDD cup
-
-
Gehrke, J.1
Ginsparg, P.2
Kleinberg, J.3
-
18
-
-
0003860037
-
-
W. R. Gilks, S. Richardson, and D. J. Spiegelhalter, editors, Chapman and Hall, London, UK
-
W. R. Gilks, S. Richardson, and D. J. Spiegelhalter, editors. Markov Chain Monte Carlo in Practice. Chapman and Hall, London, UK, 1996.
-
(1996)
Markov Chain Monte Carlo in Practice
-
-
-
19
-
-
0012249331
-
Using q-grams in a DBMS for approximate string processing
-
L. Gravano, P. Ipeirotis, H. Jagadish, N. Koudas, S. Muthukrishnan, L. Pietarinen, and D. Srivastava. Using q-grams in a DBMS for approximate string processing. IEEE Data Engineering Bulletin, 24(4): 28-34, 2001.
-
(2001)
IEEE Data Engineering Bulletin
, vol.24
, Issue.4
, pp. 28-34
-
-
Gravano, L.1
Ipeirotis, P.2
Jagadish, H.3
Koudas, N.4
Muthukrishnan, S.5
Pietarinen, L.6
Srivastava, D.7
-
20
-
-
84976856849
-
The merge/purge problem for large databases
-
M. Hernandez and S. Stolfo. The merge/purge problem for large databases. In Proc. SIGMOD-95, pages 127-138, 1995.
-
(1995)
Proc. SIGMOD-95
, pp. 127-138
-
-
Hernandez, M.1
Stolfo, S.2
-
21
-
-
0032131393
-
Object identification: A Bayesian analysis with application to traffic surveillance
-
T. Huang and S. Russell. Object identification: A Bayesian analysis with application to traffic surveillance. Artificial Intelligence, 103:1-17, 1998.
-
(1998)
Artificial Intelligence
, vol.103
, pp. 1-17
-
-
Huang, T.1
Russell, S.2
-
22
-
-
0041849711
-
A general stochastic approach to solving problems with hard and soft constraints
-
D. Gu, J. Du, and P. Pardalos, editors, American Mathematical Society, New York, NY
-
H. Kautz, B. Selman, and Y Jiang. A general stochastic approach to solving problems with hard and soft constraints. In D. Gu, J. Du, and P. Pardalos, editors, The Satisfiability Problem: Theory and Applications, pages 573-586. American Mathematical Society, New York, NY, 1997.
-
(1997)
The Satisfiability Problem: Theory and Applications
, pp. 573-586
-
-
Kautz, H.1
Selman, B.2
Jiang, Y.3
-
23
-
-
31844432693
-
Learning the structure of Markov logic networks
-
Bonn, Germany, ACM Press
-
S. Kok and P. Domingos. Learning the structure of Markov logic networks. In Proc. ICML-05, pages 441-448, Bonn, Germany, 2005. ACM Press.
-
(2005)
Proc. ICML-05
, pp. 441-448
-
-
Kok, S.1
Domingos, P.2
-
24
-
-
38049122217
-
-
Department of Computer Science and Engineering, University of Washington, Seattle, Seattle, WA
-
S. Kok, P. Singla, M. Richardson, and P. Domingos. The Alchemy system for statistical relational AL Technical report, Department of Computer Science and Engineering, University of Washington, Seattle, WA, 2005. http://www.cs. washington.edu/ai/alchemy.
-
(2005)
The Alchemy system for statistical relational AL Technical report
-
-
Kok, S.1
Singla, P.2
Richardson, M.3
Domingos, P.4
-
25
-
-
17244368453
-
Semantic integration in text: From ambiguous names to identifiable entities
-
X. Li, P. Morie, and D. Roth. Semantic integration in text: from ambiguous names to identifiable entities. AIMagazine, 26(1):45-58, 2005.
-
(2005)
AIMagazine
, vol.26
, Issue.1
, pp. 45-58
-
-
Li, X.1
Morie, P.2
Roth, D.3
-
26
-
-
0034592784
-
Efficient clustering of high-dimensional data sets with application to reference matching
-
A. McCallum, K. Nigam, and L. Ungar. Efficient clustering of high-dimensional data sets with application to reference matching. In Proc. KDD-00, pages 169-178, 2000.
-
(2000)
Proc. KDD-00
, pp. 169-178
-
-
McCallum, A.1
Nigam, K.2
Ungar, L.3
-
27
-
-
34748869965
-
-
A. McCallum, S. Tejada, and D. Quass, editors, ACM Press
-
A. McCallum, S. Tejada, and D. Quass, editors. Proceedings of the KDD-2003 Workshop on Data Cleaning, Record Linkage, and Object Consolidation. ACM Press, 2003.
-
(2003)
Proceedings of the KDD-2003 Workshop on Data Cleaning, Record Linkage, and Object Consolidation
-
-
-
28
-
-
33646765912
-
Conditional models of identity uncertainty with application to noun coreference
-
A. McCallum and B. Wellner. Conditional models of identity uncertainty with application to noun coreference. In Adv. NIPS 17, pages 905-912, 2005.
-
(2005)
Adv. NIPS
, vol.17
, pp. 905-912
-
-
McCallum, A.1
Wellner, B.2
-
29
-
-
84880739933
-
BLOG: Probabilistic models with unknown objects
-
Edinburgh, Scotland
-
B. Milch, B. Marthi, D. Sontag, S. Russell, and D. L. Ong. BLOG: Probabilistic models with unknown objects. In Proc. IJCAI-05, pages 1352-1359, Edinburgh, Scotland, 2005.
-
(2005)
Proc. IJCAI-05
, pp. 1352-1359
-
-
Milch, B.1
Marthi, B.2
Sontag, D.3
Russell, S.4
Ong, D.L.5
-
30
-
-
0004043396
-
An efficient domain-independent algorithm for detecting approximately duplicate database records
-
A. Monge and C. Elkan. An efficient domain-independent algorithm for detecting approximately duplicate database records. In Proc. SIGMOD-97 DMKD Wkshp., 1997.
-
(1997)
Proc. SIGMOD-97 DMKD Wkshp
-
-
Monge, A.1
Elkan, C.2
-
31
-
-
0001592068
-
Automatic linkage of vital records
-
H. Newcombe, J. Kennedy, S. Axford, and A. James. Automatic linkage of vital records. Science, 130:954-959, 1959.
-
(1959)
Science
, vol.130
, pp. 954-959
-
-
Newcombe, H.1
Kennedy, J.2
Axford, S.3
James, A.4
-
33
-
-
24344507522
-
A hit-miss model for duplicate detection in the WHO Drug safety Database
-
Chicago, IL
-
G. Norén, R. Orre, and A. Bate. A hit-miss model for duplicate detection in the WHO Drug safety Database. In Proc. KDD-05, pages 459-468, Chicago, IL, 2005.
-
(2005)
Proc. KDD-05
, pp. 459-468
-
-
Norén, G.1
Orre, R.2
Bate, A.3
-
34
-
-
84898987614
-
Identity uncertainty and citation matching
-
H. Pasula, B. Marthi, B. Milch, S. Russell, and I. Shpitser. Identity uncertainty and citation matching. In Adv. NIPS 15, pages 1401-1408, 2003.
-
(2003)
Adv. NIPS
, vol.15
, pp. 1401-1408
-
-
Pasula, H.1
Marthi, B.2
Milch, B.3
Russell, S.4
Shpitser, I.5
-
37
-
-
0030120958
-
On the hardness of approximate reasoning
-
D. Roth. On the hardness of approximate reasoning. Artificial Intelligence, 82:273-302, 1996.
-
(1996)
Artificial Intelligence
, vol.82
, pp. 273-302
-
-
Roth, D.1
-
38
-
-
0242456811
-
Interactive deduplication using active learning
-
S. Sarawagi and A. Bhamidipaty. Interactive deduplication using active learning. In Proc. KDD-02, pages 269-278, 2002.
-
(2002)
Proc. KDD-02
, pp. 269-278
-
-
Sarawagi, S.1
Bhamidipaty, A.2
-
39
-
-
0001445381
-
Local search strategies for satisfiability testing
-
D. S. Johnson and M. A. Trick, editors, American Mathematical Society, Washington, DC
-
B. Selman, H. Kautz, and B. Cohen. Local search strategies for satisfiability testing. In D. S. Johnson and M. A. Trick, editors, Cliques, Coloring, and Satisfiability: Second DIMACS Implementation Challenge, pages 521-532. American Mathematical Society, Washington, DC, 1996.
-
(1996)
Cliques, Coloring, and Satisfiability: Second DIMACS Implementation Challenge
, pp. 521-532
-
-
Selman, B.1
Kautz, H.2
Cohen, B.3
-
40
-
-
29344435802
-
Constraint-based entity matching
-
Pittsburgh, PA, AAAI Press
-
W. Shen, X. Li, and A. Doan. Constraint-based entity matching. In Proc. AAAI-05, pages 862-867, Pittsburgh, PA, 2005. AAAI Press.
-
(2005)
Proc. AAAI-05
, pp. 862-867
-
-
Shen, W.1
Li, X.2
Doan, A.3
-
41
-
-
29344465423
-
Discriminative training of Markov logic networks
-
Pittsburgh, PA, AAAI Press
-
P. Singla and P. Domingos. Discriminative training of Markov logic networks. In Proc. AAAI-05, pages 868-873, Pittsburgh, PA, 2005. AAAI Press.
-
(2005)
Proc. AAAI-05
, pp. 868-873
-
-
Singla, P.1
Domingos, P.2
-
42
-
-
33646416742
-
Object identification with attribute-mediated dependences
-
Porto, Portugal, springer
-
P. Singla and P. Domingos. Object identification with attribute-mediated dependences. In Proc. PKDD-05, pages 297-308, Porto, Portugal, 2005. springer.
-
(2005)
Proc. PKDD-05
, pp. 297-308
-
-
Singla, P.1
Domingos, P.2
-
43
-
-
33750696315
-
Memory-efficient inference in relational domains
-
Boston, MA, AAAI Press
-
P. Singla and P. Domingos. Memory-efficient inference in relational domains. In Proc. AAAI-06, pages 488-493, Boston, MA, 2006. AAAI Press.
-
(2006)
Proc. AAAI-06
, pp. 488-493
-
-
Singla, P.1
Domingos, P.2
-
44
-
-
0035545848
-
Learning object identification rules for information integration
-
S. Tejada, C. Knoblock, and S. Minton. Learning object identification rules for information integration. Information Systems, 26(8): 607-633, 2001.
-
(2001)
Information Systems
, vol.26
, Issue.8
, pp. 607-633
-
-
Tejada, S.1
Knoblock, C.2
Minton, S.3
-
45
-
-
0242456803
-
Learning domain-independent string transformation weights for high accuracy object identification
-
S. Tejada, C. Knoblock, and S. Minton. Learning domain-independent string transformation weights for high accuracy object identification. In Proc. KDD-02, pages 350-359, 2002.
-
(2002)
Proc. KDD-02
, pp. 350-359
-
-
Tejada, S.1
Knoblock, C.2
Minton, S.3
-
46
-
-
84878031915
-
-
W. Winkler. The state of record linkage and current research problems. Technical report, Statistical Research Division, U.S. Census Bureau, 1999.
-
W. Winkler. The state of record linkage and current research problems. Technical report, Statistical Research Division, U.S. Census Bureau, 1999.
-
-
-
|