메뉴 건너뛰기




Volumn 24, Issue 3, 2012, Pages 399-412

A genetic programming approach to record deduplication

Author keywords

Database administration; database integration.; evolutionary computing and genetic algorithms

Indexed keywords

COMPUTATIONAL TIME; DATA CONTENTS; DATABASE INTEGRATION; DEDUPLICATION; EVOLUTIONARY COMPUTING; GOVERNMENT ORGANIZATIONS; HIGH QUALITY; ITS DATA; QUALITY INFORMATION; STATE-OF-THE-ART METHODS;

EID: 84856428590     PISSN: 10414347     EISSN: None     Source Type: Journal    
DOI: 10.1109/TKDE.2010.234     Document Type: Article
Times cited : (80)

References (33)
  • 1
    • 84856415685 scopus 로고    scopus 로고
    • Operation clean data
    • Aug.
    • M. Wheatley, "Operation Clean Data," CIO Asia Magazine, http:// www.cio-asia.com, Aug. 2004.
    • (2004) CIO Asia Magazine
    • Wheatley, M.1
  • 5
    • 84947399464 scopus 로고
    • A theory for record linkage
    • I.P. Fellegi and A.B. Sunter, "A Theory for Record Linkage," J. Am. Statistical Assoc., vol. 66, no. 1, pp. 1183-1210, 1969.
    • (1969) J. Am. Statistical Assoc. , vol.66 , Issue.1 , pp. 1183-1210
    • Fellegi, I.P.1    Sunter, A.B.2
  • 6
    • 0038208065 scopus 로고    scopus 로고
    • A Bayesian decision model for cost optimal record matching
    • DOI 10.1007/s00778-002-0072-y
    • V.S. Verykios, G.V. Moustakides, and M.G. Elfeky, "A Bayesian Decision Model for Cost Optimal Record Matching," The Very Large Databases J., vol. 12, no. 1, pp. 28-40, 2003. (Pubitemid 36752332)
    • (2003) VLDB Journal , vol.12 , Issue.1 , pp. 28-40
    • Verykios, V.S.1    Moustakides, G.V.2    Elfeky, M.G.3
  • 7
    • 84856482017 scopus 로고    scopus 로고
    • Is you data dirty? and Does that matter?
    • R. Bell and F. Dravis, "Is You Data Dirty? and Does that Matter?," Accenture Whiter Paper, http://www.accenture.com, 2006.
    • (2006) Accenture Whiter Paper
    • Bell, R.1    Dravis, F.2
  • 13
    • 53449089815 scopus 로고    scopus 로고
    • A genetic programming framework for content-based image retrieval
    • R.d.S. Torres, A.X. Falcao, M.A. Gonçalves, J.P. Papa, B. Zhang, W. Fan, and E.A. Fox, "A Genetic Programming Framework for Content-Based Image Retrieval," Pattern Recognition, vol. 42, no. 2, pp. 283-292, 2009.
    • (2009) Pattern Recognition , vol.42 , Issue.2 , pp. 283-292
  • 20
    • 0032640910 scopus 로고    scopus 로고
    • Digital libraries and autonomous citation indexing
    • June
    • S. Lawrence, L. Giles, and K. Bollacker, "Digital Libraries and Autonomous Citation Indexing," Computer, vol. 32, no. 6, pp. 67-71, June 1999.
    • (1999) Computer , vol.32 , Issue.6 , pp. 67-71
    • Lawrence, S.1    Giles, L.2    Bollacker, K.3
  • 23
    • 0000666461 scopus 로고    scopus 로고
    • Data integration using similarity joins and a word-based information representation language
    • W.W. Cohen, "Data Integration Using Similarity Joins and a Word-Based Information Representation Language," ACM Trans. Information Systems, vol. 18, no. 3, pp. 288-321, 2000.
    • (2000) ACM Trans. Information Systems , vol.18 , Issue.3 , pp. 288-321
    • Cohen, W.W.1
  • 25
    • 0001592068 scopus 로고
    • Automatic linkage of vital records
    • Oct.
    • H.B. Newcombe, J.M. Kennedy, S. Axford, and A. James, "Automatic Linkage of Vital Records," Science, vol. 130, no. 3381, pp. 954-959, Oct. 1959.
    • (1959) Science , vol.130 , Issue.3381 , pp. 954-959
    • Newcombe, H.B.1    Kennedy, J.M.2    Axford, S.3    James, A.4
  • 26
    • 84856482021 scopus 로고    scopus 로고
    • Freely Extensible Biomedical Record Linkage
    • "Freely Extensible Biomedical Record Linkage," http:// sourceforge.net/projects/febrl, 2011.
    • (2011)
  • 28
    • 0035545848 scopus 로고    scopus 로고
    • Learning object identification rules for information integration
    • DOI 10.1016/S0306-4379(01)00042-4, Data Extraction, Cleaning and Reconciliation
    • S. Tejada, C.A. Knoblock, and S. Minton, "Learning Object Identification Rules for Information Integration," Information Systems, vol. 26, no. 8, pp. 607-633, 2001. (Pubitemid 33046273)
    • (2001) Information Systems , vol.26 , Issue.8 , pp. 607-633
    • Tejada, S.1    Knoblock, C.A.2    Minton, S.3
  • 30
    • 0042312958 scopus 로고    scopus 로고
    • Genetic programming's continued evolution
    • ch. 1, MIT Press
    • P.J. Angeline, "Genetic Programming's Continued Evolution," Advances in Genetic Programming, vol. 2, ch. 1, MIT Press, 1996.
    • (1996) Advances in Genetic Programming , vol.2
    • Angeline, P.J.1
  • 32
    • 26444478506 scopus 로고    scopus 로고
    • Probabilistic data generation for deduplication and data linkage
    • Intelligent Data Engineering and Automated Learning - IDEAL 2005: 6th International Conference. Proceedings
    • P. Christen, "Probabilistic Data Generation for Deduplication and Data Linkage," Intelligent Data Eng. and Automated Learning, pp. 109-116, Springer, 2005. (Pubitemid 41431651)
    • (2005) Lecture Notes in Computer Science , vol.3578 , pp. 109-116
    • Christen, P.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.