-
1
-
-
33750452514
-
Swoosh: A generic approach to entity resolution
-
March
-
Benjelloun, O., Garcia-Molina, H., Su, Q.,Widom, J.: Swoosh: A generic approach to entity resolution. Stanford University technical report (March 2005)
-
(2005)
Stanford University Technical Report
-
-
Benjelloun, O.1
Garcia-Molina, H.2
Su, Q.3
Widom, J.4
-
4
-
-
84884417241
-
Preparation of name and address data for record linkage using hidden markov models
-
Churches, T., Christen, P., Lu, J., Zhu, J.X.: Preparation of name and address data for record linkage using hidden markov models. BioMed Central Medical Informatics and Decision Making 2(9) (2002)
-
(2002)
BioMed Central Medical Informatics and Decision Making
, vol.2
, Issue.9
-
-
Churches, T.1
Christen, P.2
Lu, J.3
Zhu, J.X.4
-
5
-
-
18744363418
-
A comparison of string metrics for matching names and addresses
-
Cohen, W.W., Ravikumar, P., Fienberg, S.E.: A comparison of string metrics for matching names and addresses. In: International Joint Conference on Artificial Intelligence, Proceedings of the Workshop on Information Integration on the Web (August 2003)
-
International Joint Conference on Artificial Intelligence, Proceedings of the Workshop on Information Integration on the Web (August 2003)
-
-
Cohen, W.W.1
Ravikumar, P.2
Fienberg, S.E.3
-
6
-
-
3042649466
-
From authority control to informed retrieval: Framing the expanded domain of subject access
-
Dalrymple, P.W., Young, J.A.: From authority control to informed retrieval: Framing the expanded domain of subject access. College & Research Libraries 52, 139-149 (1991)
-
(1991)
College & Research Libraries
, vol.52
, pp. 139-149
-
-
Dalrymple, P.W.1
Young, J.A.2
-
7
-
-
33845667955
-
Duplicate record detection: A survey
-
Elmagarmid, A., Ipeirotis, P.,Verykios, V.: Duplicate record detection: A survey. IEEE Transactions on Knowledge and Data Engineering 19(1), 1-16 (2007)
-
(2007)
IEEE Transactions on Knowledge and Data Engineering
, vol.19
, Issue.1
, pp. 1-16
-
-
Elmagarmid, A.1
Ipeirotis, P.2
Verykios, V.3
-
9
-
-
44649164477
-
Detecting near-duplicates in large-scale short text databases
-
DOI 10.1007/978-3-540-68125-0-87, Advances in Knowledge Discovery and Data Mining - 12th Pacific-Asia Conference, PAKDD 2008, Proceedings
-
Gong, C., Huang, Y., Cheng, X., Bai, S.: Detecting near-duplicates in large-scale short text databases. In: Washio, T., Suzuki, E., Ting, K.M., Inokuchi, A. (eds.) PAKDD 2008. LNCS (LNAI), vol. 5012, pp. 877-883. Springer, Heidelberg (2008) (Pubitemid 351776381)
-
(2008)
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
, vol.5012 LNAI
, pp. 877-883
-
-
Gong, C.1
Huang, Y.2
Cheng, X.3
Bai, S.4
-
11
-
-
77955933052
-
Cassandra: A decentralized structured storage system
-
Lakshman, A., Malik, P.: Cassandra: a decentralized structured storage system. SIGOPS Oper. Syst. Rev. 44, 35-40 (2010)
-
(2010)
SIGOPS Oper. Syst. Rev.
, vol.44
, pp. 35-40
-
-
Lakshman, A.1
Malik, P.2
-
12
-
-
35348911985
-
Detecting near-duplicates for web crawling
-
Manku, G., Jain, A., Sarma, A.D.: Detecting near-duplicates for web crawling. In: 16th International World Wide Conference, Banff, Alberta, Canada (May 2007)
-
16th International World Wide Conference, Banff, Alberta, Canada (May 2007)
-
-
Manku, G.1
Jain, A.2
Sarma, A.D.3
-
13
-
-
70449112601
-
Viaf (virtual international authority file): Linking the deutsche nationalbibliothek and library of congress name authority files
-
Rick, B., Hengel-Dittrich, C., O'Neill, E.T., Tillett, B.: Viaf (virtual international authority file): Linking the deutsche nationalbibliothek and library of congress name authority files. International Cataloging and Bibliographic Control 36(1), 12-19 (2007)
-
(2007)
International Cataloging and Bibliographic Control
, vol.36
, Issue.1
, pp. 12-19
-
-
Rick, B.1
Hengel-Dittrich, C.2
O'Neill, E.T.3
Tillett, B.4
-
14
-
-
0035545848
-
Learning object identification rules for information extraction
-
Tejada, S., Knoblock, C., Minton, S.: Learning object identification rules for information extraction. Information Systems 26(8), 607-633 (2001)
-
(2001)
Information Systems
, vol.26
, Issue.8
, pp. 607-633
-
-
Tejada, S.1
Knoblock, C.2
Minton, S.3
-
16
-
-
77954716614
-
Mapdupreducer: Detecting near duplicates over massive datasets
-
ACM, New York
-
Wang, C.,Wang, J., Lin, X.,Wang,W.,Wang, H., Li, H., Tian, W., Xu, J., Li, R.: Mapdupreducer: detecting near duplicates over massive datasets. In: Proceedings of the 2010 International Conference on Management of Data, SIGMOD 2010, pp. 1119-1122. ACM, New York (2010)
-
(2010)
Proceedings of the 2010 International Conference on Management of Data, SIGMOD 2010
, pp. 1119-1122
-
-
Wang, C.1
Wang, J.2
Lin, X.3
Wang, W.4
Wang, H.5
Li, H.6
Tian, W.7
Xu, J.8
Li, R.9
-
18
-
-
0008976521
-
String comparator metrics and enhanced decision rules in the fellegi-sunter model of record linkage
-
American Statistical Association
-
Winkler, W.E.: String comparator metrics and enhanced decision rules in the fellegi-sunter model of record linkage. In: Proceedings of the Section on Survey Research Methods, American Statistical Association, pp. 354-359 (1990)
-
(1990)
Proceedings of the Section on Survey Research Methods
, pp. 354-359
-
-
Winkler, W.E.1
|