메뉴 건너뛰기




Volumn , Issue , 2011, Pages 1027-1032

A fast approach for parallel deduplication on multicore processors

Author keywords

data integration; deduplication; parallel systems

Indexed keywords

BLOCKING METHOD; DATA INTEGRATION; DEDUPLICATION; DISTRIBUTED ENVIRONMENTS; EMPIRICAL EVALUATIONS; LOW DEGREE; MULTI-CORE PROCESSOR; PARALLEL DATA; PARALLEL SYSTEM; PROGRAMMING MODELS; ROBUST DATUM; SUB-BLOCKS; UNBALANCED LOADS;

EID: 79959294918     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1982185.1982411     Document Type: Conference Paper
Times cited : (26)

References (15)
  • 4
    • 26444478506 scopus 로고    scopus 로고
    • Probabilistic data generation for deduplication and data linkage
    • Springer Berlin / Heidelberg
    • P. Christen. Probabilistic data generation for deduplication and data linkage. In Intelligent Data Engineering and Automated Learning - IDEAL 2005, pages 109-116. Springer Berlin / Heidelberg, 2005.
    • (2005) Intelligent Data Engineering and Automated Learning - IDEAL 2005 , pp. 109-116
    • Christen, P.1
  • 5
    • 7444251738 scopus 로고    scopus 로고
    • Febrl - A parallel open source data linkage system
    • P. Christen, T. Churches, and M. Hegland. Febrl - a parallel open source data linkage system. In PAKDD, pages 638-647, 2004.
    • (2004) PAKDD , pp. 638-647
    • Christen, P.1    Churches, T.2    Hegland, M.3
  • 8
    • 37549003336 scopus 로고    scopus 로고
    • Mapreduce: Simplified data processing on large clusters
    • J. Dean and S. Ghemawat. Mapreduce: simplified data processing on large clusters. Commun. ACM, 51(1):107-113, 2008.
    • (2008) Commun. ACM , vol.51 , Issue.1 , pp. 107-113
    • Dean, J.1    Ghemawat, S.2
  • 11
    • 0034592784 scopus 로고    scopus 로고
    • Efficient clustering of high-dimensional data sets with application to reference matching
    • ACM
    • A. McCallum, K. Nigam, and L. H. Ungar. Efficient clustering of high-dimensional data sets with application to reference matching. In KDD '00: ACM SIGKDD. ACM, 2000.
    • (2000) KDD '00: ACM SIGKDD
    • McCallum, A.1    Nigam, K.2    Ungar, L.H.3
  • 12
    • 79959324551 scopus 로고    scopus 로고
    • Performance and scalability of fast blocking techniques for deduplication and data linkage
    • C. Peter. Performance and scalability of fast blocking techniques for deduplication and data linkage. Proc. VLDB Endow., 1(2):1253-1264, 2007.
    • (2007) Proc. VLDB Endow. , vol.1 , Issue.2 , pp. 1253-1264
    • Peter, C.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.