-
3
-
-
2342447399
-
Adaptive name matching in information integration
-
Mikhail Bilenko, Raymond Mooney, William Cohen, Pradeep Ravikumar, and Stephen Fienberg. 2003. Adaptive name matching in information integration. Intelligent Systems, 18(5):16–23.
-
(2003)
Intelligent Systems
, vol.18
, Issue.5
, pp. 16-23
-
-
Bilenko, Mikhail1
Mooney, Raymond2
Cohen, William3
Ravikumar, Pradeep4
Fienberg, Stephen5
-
4
-
-
0035478854
-
Random forests
-
Leo Breiman. 2001. Random forests. Machine Learning, 45(1):5–32.
-
(2001)
Machine Learning
, vol.45
, Issue.1
, pp. 5-32
-
-
Breiman, Leo1
-
6
-
-
84873149287
-
-
Springer Publishing Company, Incorporated
-
Peter Christen. 2012. Data matching. Springer Publishing Company, Incorporated.
-
(2012)
Data matching
-
-
Christen, Peter1
-
10
-
-
84944264324
-
A baseline method for genealogical entity resolution
-
Julia Efremova, Bijan Ranjbar-Sahraei, Frans A. Oliehoek, Toon Calders, and Karl Tuyls. 2014. A baseline method for genealogical entity resolution. In Proceedings of the Workshop on Population Reconstruction, organized in the framework of the LINKS project.
-
(2014)
Proceedings of the Workshop on Population Reconstruction, organized in the framework of the LINKS project
-
-
Efremova, Julia1
Ranjbar-Sahraei, Bijan2
Oliehoek, Frans A.3
Calders, Toon4
Tuyls, Karl5
-
11
-
-
33845667955
-
Duplicate record detection: A survey
-
Ahmed K. Elmagarmid, Panagiotis G. Ipeirotis, and Vassilios S. Verykios. 2007. Duplicate record detection: A survey. Knowledge and Data Engineering, IEEE Transactions on, 19(1):1–16.
-
(2007)
Knowledge and Data Engineering, IEEE Transactions on
, vol.19
, Issue.1
, pp. 1-16
-
-
Elmagarmid, Ahmed K.1
Ipeirotis, Panagiotis G.2
Verykios, Vassilios S.3
-
16
-
-
85047926860
-
-
IISG Amsterdam
-
Kees Mandemakers, Sanne Muurling, Ineke Maas, Bart Van de Putte, Richard L. Zijdeman, Paul Lambert, Marco H.D. van Leeuwen, Frans van Poppel, and Andrew Miles. 2013. HSN standardized, HISCO-coded and classified occupational titles. IISG Amsterdam.
-
(2013)
HSN standardized, HISCO-coded and classified occupational titles
-
-
Mandemakers, Kees1
Muurling, Sanne2
Maas, Ineke3
Van de Putte, Bart4
Zijdeman, Richard L.5
Lambert, Paul6
van Leeuwen, Marco H.D.7
van Poppel, Frans8
Miles, Andrew9
-
17
-
-
3843127500
-
Character n-gram tokenization for european language text retrieval
-
Paul McNamee and James Mayfield. 2004. Character n-gram tokenization for european language text retrieval. Information Retrieval, 7(1-2):73–97.
-
(2004)
Information Retrieval
, vol.7
, Issue.1-2
, pp. 73-97
-
-
McNamee, Paul1
Mayfield, James2
-
19
-
-
0345566149
-
A guided tour to approximate string matching
-
Gonzalo Navarro. 2001. A guided tour to approximate string matching. ACM Comput. Surv., 33(1):31–88.
-
(2001)
ACM Comput. Surv
, vol.33
, Issue.1
, pp. 31-88
-
-
Navarro, Gonzalo1
-
21
-
-
35748932917
-
A review of feature selection techniques in bioinformatics
-
September
-
Yvan Saeys, Iñaki Inza, and Pedro Larrañaga. 2007. A review of feature selection techniques in bioinformatics. Bioinformatics, 23(19):2507–2517, September.
-
(2007)
Bioinformatics
, vol.23
, Issue.19
, pp. 2507-2517
-
-
Saeys, Yvan1
Inza, Iñaki2
Larrañaga, Pedro3
-
25
-
-
0003363140
-
Matching and record linkage
-
Wiley
-
William E. Winkler. 1995. Matching and record linkage. In Business Survey Methods, pages 355–384. Wiley.
-
(1995)
Business Survey Methods
, pp. 355-384
-
-
Winkler, William E.1
|