-
1
-
-
33845667955
-
Duplicate record detection: A survey
-
Elmagarmid,A.K., Ipeirotis,P.G. and Verykios,V.S. (2007) Duplicate record detection: a survey. IEEE Trans. Know. Data Eng., 19, 1-16.
-
(2007)
IEEE Trans. Know. Data Eng.
, vol.19
, pp. 1-16
-
-
Elmagarmid, A.K.1
Ipeirotis, P.G.2
Verykios, V.S.3
-
2
-
-
0036990263
-
Probabilistic record linkage and a method to calculate the positive predictive value
-
Blakely,T. and Salmond,C. (2002) Probabilistic record linkage and a method to calculate the positive predictive value. Int. J. Epidemiol., 31, 1246-1252.
-
(2002)
Int. J. Epidemiol.
, vol.31
, pp. 1246-1252
-
-
Blakely, T.1
Salmond, C.2
-
3
-
-
84898987614
-
Identity uncertainty and citation matching
-
Becker,S., Thrun,S. and Obermayer,K. (eds), MIT Press, Cambridge, MA
-
Pasula,H., Marthi,B., Milch,B. et al. (2003) Identity uncertainty and citation matching. In: Becker,S., Thrun,S. and Obermayer,K. (eds), Proceedings of 16th Annual Advances in Neural Information Processing Systems (NIPS 2002), Vol. 15. MIT Press, Cambridge, MA, pp. 1425-1432.
-
(2003)
Proceedings of 16th Annual Advances in Neural Information Processing Systems (NIPS 2002)
, vol.15
, pp. 1425-1432
-
-
Pasula, H.1
Marthi, B.2
Milch, B.3
-
4
-
-
84865086832
-
Reasoning about record matching rules
-
ACM
-
Fan,W., Jia,X., Li,J. and Ma,S. (2009) Reasoning about record matching rules. In: The 35th International Conference on Very Large Data Bases (VLDB), Lyon, France. ACM, pp. 407-418.
-
(2009)
The 35th International Conference on Very Large Data Bases (VLDB), Lyon, France
, pp. 407-418
-
-
Fan, W.1
Jia, X.2
Li, J.3
Ma, S.4
-
5
-
-
0001592068
-
Automatic linkage of vital records
-
Newcombe,H.B., Kennedy,J.M., Axford,S.J. and James,A.P. (1959) Automatic linkage of vital records. Science, 130, 954-959.
-
(1959)
Science
, vol.130
, pp. 954-959
-
-
Newcombe, H.B.1
Kennedy, J.M.2
Axford, S.J.3
James, A.P.4
-
6
-
-
84947399464
-
A theory for record linkage
-
Fellegi,I. and Sunter,A. (1969) A theory for record linkage. J. Am. Stat. Soc., 64, 1183-1210.
-
(1969)
J. Am. Stat. Soc.
, vol.64
, pp. 1183-1210
-
-
Fellegi, I.1
Sunter, A.2
-
8
-
-
77954003729
-
Iterative record linkage for cleaning and integration
-
ACM, New York
-
Bhattacharya,I. and Getoor,L. (2004) Iterative record linkage for cleaning and integration. In: ACM SIGMOD Workshop on Research Issues in DataMining and Knowledge Discovery (DMKD), Paris, France. ACM, New York, pp. 11-18.
-
(2004)
ACM SIGMOD Workshop on Research Issues in DataMining and Knowledge Discovery (DMKD), Paris, France
, pp. 11-18
-
-
Bhattacharya, I.1
Getoor, L.2
-
9
-
-
77952372966
-
Adaptive duplicate detection using learnable string similarity measures
-
ACM, New York
-
Bilenko,M. and Mooney,R.J. (2003) Adaptive duplicate detection using learnable string similarity measures. In: ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC. ACM, New York, pp. 39-48.
-
(2003)
ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC
, pp. 39-48
-
-
Bilenko, M.1
Mooney, R.J.2
-
10
-
-
0242456811
-
Interactive deduplication using active learning
-
ACM, New York
-
Sarawagi,S. and Bhamidipaty,A. (2002) Interactive deduplication using active learning. In: ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Edmonton, Alberta. ACM, New York, pp. 269-278.
-
(2002)
ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Edmonton, Alberta
, pp. 269-278
-
-
Sarawagi, S.1
Bhamidipaty, A.2
-
11
-
-
79960524941
-
An unsupervised heuristic-based approach for bibliographic metadata deduplication
-
Borges,E.N., de Carvalho,M.G., Galante,R. et al. (2011) An unsupervised heuristic-based approach for bibliographic metadata deduplication. Inf. Process. Manag., 47, 706-718.
-
(2011)
Inf. Process. Manag.
, vol.47
, pp. 706-718
-
-
Borges, E.N.1
De Carvalho, M.G.2
Galante, R.3
-
12
-
-
65449178105
-
Febrl: An open source data cleaning, deduplication and record linkage system with a graphical user interface
-
ACM, New York
-
Christen,P. (2008) Febrl: an open source data cleaning, deduplication and record linkage system with a graphical user interface. In: Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Las Vegas, NV. ACM, New York, pp. 1065-1068.
-
(2008)
Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Las Vegas, NV
, pp. 1065-1068
-
-
Christen, P.1
-
13
-
-
73949088415
-
FRIL: A tool for comparative record linkage
-
Jurczyk,P., Lu,J.J., Xiong,L. et al. (2008) FRIL: a tool for comparative record linkage. Proc. AMIA Symp., 2008, 440-444.
-
(2008)
Proc. AMIA Symp., 2008
, pp. 440-444
-
-
Jurczyk, P.1
Lu, J.J.2
Xiong, L.3
-
14
-
-
84892704342
-
Design and implementation of Metta, a metasearch engine for biomedical literature intended for systematic reviewers
-
in press
-
Smalheiser,N.R., Lin,C., Jia,L. et al. (2013) Design and implementation of Metta, a metasearch engine for biomedical literature intended for systematic reviewers. Health Information Science and Systems, in press.
-
(2013)
Health Information Science and Systems
-
-
Smalheiser, N.R.1
Lin, C.2
Jia, L.3
-
15
-
-
84882598022
-
Find duplicates among the PubMed, EMBASE, and Cochrane library databases in systematic reviews
-
Qi,X., Yang,M., Ren,W. et al. (2013) Find duplicates among the PubMed, EMBASE, and Cochrane library databases in systematic reviews. PLoS One, 8, e71838.
-
(2013)
PLoS One
, vol.8
-
-
Qi, X.1
Yang, M.2
Ren, W.3
-
16
-
-
0013331361
-
Real-world data is dirty: Data cleansing and the merge/purge problem
-
Hernandez,M., Andez,M.A.H., Stolfo,S. and Fayyad,U. (1998) Real-world data is dirty: data cleansing and the merge/purge problem. Data Min. Knowl. Discov., 2, 9-37.
-
(1998)
Data Min. Knowl. Discov.
, vol.2
, pp. 9-37
-
-
Hernandez, M.1
Andez, M.A.H.2
Stolfo, S.3
Fayyad, U.4
-
17
-
-
0034592784
-
Efficient clustering of high dimensional data sets with application to reference matching
-
ACM, NewYork
-
McCallum,A., Nigam,K. and Ungar,L. (2000) Efficient clustering of high dimensional data sets with application to reference matching. In: Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Boston, MA . ACM, NewYork, pp. 169-178.
-
(2000)
Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Boston, MA
, pp. 169-178
-
-
McCallum, A.1
Nigam, K.2
Ungar, L.3
-
18
-
-
79957852084
-
Efficient spectral neighborhood blocking for entity resolution
-
IEEE Computer Society, Washington, DC
-
Shu,L., Chen,A., Xiong,M. and Meng,W. (2011) Efficient spectral neighborhood blocking for entity resolution. In: IEEE International Conference on Data Engineering (ICDE), Hannover, Germany . IEEE Computer Society, Washington, DC, pp. 1067-1078.
-
(2011)
IEEE International Conference on Data Engineering (ICDE), Hannover, Germany
, pp. 1067-1078
-
-
Shu, L.1
Chen, A.2
Xiong, M.3
Meng, W.4
-
19
-
-
0004116989
-
-
3rd edn. MIT Press, Cambridge, MA
-
Corman,T.H., Leiserson,C.E., Rivest,R.L. and Stein,C. (2009) Introduction to Algorithms, 3rd edn. MIT Press, Cambridge, MA.
-
(2009)
Introduction to Algorithms
-
-
Corman, T.H.1
Leiserson, C.E.2
Rivest, R.L.3
Stein, C.4
-
20
-
-
0041627757
-
A simple algorithm for identifying abbreviation definitions in biomedical text
-
Schwartz,A.S. and Hearst,M.A. (2003) A simple algorithm for identifying abbreviation definitions in biomedical text. Pac. Symp. Biocomput., 4, 451-462.
-
(2003)
Pac. Symp. Biocomput.
, vol.4
, pp. 451-462
-
-
Schwartz, A.S.1
Hearst, M.A.2
|