



Volume 69, Issue 2, 2010, Pages 197-210

Frameworks for entity matching: A comparison

Author keywords

Entity matching; Entity resolution; Match optimization; Matcher combination; Training selection

Indexed keywords

DATA INTEGRATION; ENTITY MATCHING; RESEARCH PROTOTYPE; STATE OF THE ART; TRAINING DATA;

EID: 72649095071     PISSN: 0169023X     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.datak.2009.10.003     Document Type: Article
Times cited : (325)

References (63)
  • 14
    • 72649102401
    • Mining document collections to facilitate accurate approximate entity matching
    • Chaudhuri S., Ganti V., and Xin D. Mining document collections to facilitate accurate approximate entity matching. PVLDB 2 1 (2009) 395-406
    • (2009) PVLDB , vol.2 , Issue.1 , pp. 395-406
    • Chaudhuri, S.1    Ganti, V.2    Xin, D.3
  • 23
    • 84893947008
    • A comparison and generalization of blocking and windowing algorithms for duplicate detection
    • U. Draisbach, F. Naumann, A comparison and generalization of blocking and windowing algorithms for duplicate detection, in: Proceedings of QDB 2009 Workshop at VLDB, 2009.
    • (2009) Proceedings of QDB 2009 Workshop at VLDB
    • Draisbach, U.1    Naumann, F.2
  • 26
    • 84947399464
    • A theory for record linkage
    • Fellegi I.P., and Sunter A.B. A theory for record linkage. J. Am. Stat. Assoc. 64 328 (1969) 1183-1210
    • (1969) J. Am. Stat. Assoc. , vol.64 , Issue.328 , pp. 1183-1210
    • Fellegi, I.P.1    Sunter, A.B.2
  • 28
    • 33845350152
    • Record linkage: Current practice and future directions
    • Tech. Rep., CSIRO Mathematical and Information Sciences
    • L. Gu, R. Baxter, D. Vickers, C. Rainsford, Record linkage: current practice and future directions, Tech. Rep., CSIRO Mathematical and Information Sciences, 2003.
    • (2003)
    • Gu, L.1    Baxter, R.2    Vickers, D.3    Rainsford, C.4
  • 32
    • 72649086387
    • Framework for evaluating clustering algorithms in duplicate detection
    • Hassanzadeh O., Chiang F., Miller R.J., and Lee H.C. Framework for evaluating clustering algorithms in duplicate detection. PVLDB 2 1 (2009) 1282-1293
    • (2009) PVLDB , vol.2 , Issue.1 , pp. 1282-1293
    • Hassanzadeh, O.1    Chiang, F.2    Miller, R.J.3    Lee, H.C.4
  • 34
    • 47649126673
    • Interactive entity resolution in relational data: a visual analytic tool and its evaluation
    • Kang H., Getoor L., Shneiderman B., Bilgic M., and Licamele L. Interactive entity resolution in relational data: a visual analytic tool and its evaluation. IEEE Trans. Vis. Comput. Graph. 14 5 (2008) 999-1014
    • (2008) IEEE Trans. Vis. Comput. Graph. , vol.14 , Issue.5 , pp. 999-1014
    • Kang, H.1    Getoor, L.2    Shneiderman, B.3    Bilgic, M.4    Licamele, L.5
  • 41
    • 33646398530
    • Conditional models of identity uncertainty with application to noun coreference
    • A. McCallum, B. Wellner, Conditional models of identity uncertainty with application to noun coreference, in: Advances in Neural Information Processing Systems, vol. 17. 2004, pp. 905-912.
    • (2004) Advances in Neural Information Processing Systems , vol.17 , pp. 905-912
    • McCallum, A.1    Wellner, B.2
  • 47
    • 0002490026
    • Data cleaning: problems and current approaches
    • Rahm E., and Do H.H. Data cleaning: problems and current approaches. IEEE Data Eng. Bull. 23 4 (2000) 3-13
    • (2000) IEEE Data Eng. Bull. , vol.23 , Issue.4 , pp. 3-13
    • Rahm, E.1    Do, H.H.2
  • 53
    • 0035545848
    • Learning object identification rules for information integration
    • Tejada S., Knoblock C.A., and Minton S. Learning object identification rules for information integration. Inf. Syst. 26 8 (2001) 607-633
    • (2001) Inf. Syst. , vol.26 , Issue.8 , pp. 607-633
    • Tejada, S.1    Knoblock, C.A.2    Minton, S.3
  • 56
    • 0038208065
    • A Bayesian decision model for cost optimal record matching
    • Verykios V.S., Moustakides G.V., and Elfeky M.G. A Bayesian decision model for cost optimal record matching. VLDB J. 12 1 (2003) 28-40
    • (2003) VLDB J. , vol.12 , Issue.1 , pp. 28-40
    • Verykios, V.S.1    Moustakides, G.V.2    Elfeky, M.G.3
  • 60
    • 33845615644
    • Overview of record linkage and current research directions
    • Tech. Rep., US Bureau of the Census, Washington, DC
    • W.E. Winkler, Overview of record linkage and current research directions, Tech. Rep., US Bureau of the Census, Washington, DC, 2006.
    • (2006)
    • Winkler, W.E.1
  • 61
    • 70849105253
    • Ed-join: an efficient algorithm for similarity joins with edit distance constraints
    • Xiao C., Wang W., and Lin X. Ed-join: an efficient algorithm for similarity joins with edit distance constraints. PVLDB 1 1 (2008) 933-944
    • (2008) PVLDB , vol.1 , Issue.1 , pp. 933-944
    • Xiao, C.1    Wang, W.2    Lin, X.3
  • 62
    • 5644287747
    • Entity identification for heterogeneous database integration - a Multiple Classifier System approach and empirical evaluation
    • Zhao H., and Ram S. Entity identification for heterogeneous database integration - a Multiple Classifier System approach and empirical evaluation. Inf. Syst. 30 2 (2005) 119-132
    • (2005) Inf. Syst. , vol.30 , Issue.2 , pp. 119-132
    • Zhao, H.1    Ram, S.2
  • 63
    • 47849087202
    • Entity matching across heterogeneous data sources: an approach based on constrained cascade generalization
    • Zhao H., and Ram S. Entity matching across heterogeneous data sources: an approach based on constrained cascade generalization. Data Knowl. Eng. 66 3 (2008) 368-381
    • (2008) Data Knowl. Eng. , vol.66 , Issue.3 , pp. 368-381
    • Zhao, H.1    Ram, S.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.