2012, Pages 53-62

Beyond 100 million entities: Large-scale blocking-based resolution for heterogeneous data

Author keywords

Attribute agnostic blocking; Data cleaning; Entity resolution

Indexed keywords

ATTRIBUTE-AGNOSTIC BLOCKING; BLOCKING METHOD; BLOCKING TECHNIQUE; COMPUTATIONAL COSTS; DATA CLEANING; EFFICIENCY; ENTITY IDENTIFIERS; EXPERIMENTAL EVALUATION; HETEROGENEOUS DATA; LARGE DATASETS; REAL-WORLD OBJECTS; RELATIONSHIPS BETWEEN ENTITIES; SCHEMA INFORMATION; SEMI-STRUCTURED; VOLUMINOUS DATA

EID: 84858041897     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2124295.2124305     Document Type: Conference Paper
Times cited: 53

References (26)
  • 1
    • 84878049861
    • Adaptive blocking: Learning to scale up record linkage
    • M. Bilenko, B. Kamath, and R. J. Mooney. Adaptive blocking: Learning to scale up record linkage. In ICDM, 2006.
    • (2006) ICDM
    • Bilenko, M.1    Kamath, B.2    Mooney, R.J.3
  • 4
    • 11144240583
    • A comparison of string distance metrics for name-matching tasks
    • W. W. Cohen, P. D. Ravikumar, and S. E. Fienberg. A comparison of string distance metrics for name-matching tasks. In IIWeb, 2003.
    • (2003) IIWeb
    • Cohen, W.W.1    Ravikumar, P.D.2    Fienberg, S.E.3
  • 5
    • 29844452555
    • Reference reconciliation in complex information spaces
    • X. Dong, A. Halevy, and J. Madhavan. Reference reconciliation in complex information spaces. In SIGMOD, 2005.
    • (2005) SIGMOD
    • Dong, X.1    Halevy, A.2    Madhavan, J.3
  • 11
    • 79959927816
    • On-the-fly entity-aware query processing in the presence of linkage
    • E. Ioannou, W. Nejdl, C. Niederée, and Y. Velegrakis. On-the-fly entity-aware query processing in the presence of linkage. PVLDB, 3(1), 2010.
    • (2010) PVLDB , vol.3 , Issue.1
    • Ioannou, E.1    Nejdl, W.2    Niederée, C.3    Velegrakis, Y.4
  • 12
    • 35248813379
    • Architecture of the world wide web, volume one
    • I. Jacobs and N. Walsh. Architecture of the world wide web, volume one. W3C Recommendation, December 2004.
    • (2004) W3C Recommendation
    • Jacobs, I.1    Walsh, N.2
  • 13
    • 77952280581
    • HARRA: Fast iterative hashed record linkage for large-scale data collections
    • H. Kim and D. Lee. HARRA: fast iterative hashed record linkage for large-scale data collections. In EDBT, 2010.
    • (2010) EDBT
    • Kim, H.1    Lee, D.2
  • 14
    • 33846320077
    • Supporting efficient record linkage for large data sets using mapping techniques
    • C. Li, L. Jin, and S. Mehrotra. Supporting efficient record linkage for large data sets using mapping techniques. WWW Journal, 9(4), 2006.
    • (2006) WWW Journal , vol.9 , Issue.4
    • Li, C.1    Jin, L.2    Mehrotra, S.3
  • 16
    • 0034592784
    • Efficient clustering of high-dimensional data sets with application to reference matching
    • A. McCallum, K. Nigam, and L. H. Ungar. Efficient clustering of high-dimensional data sets with application to reference matching. In KDD, 2000.
    • (2000) KDD
    • McCallum, A.1    Nigam, K.2    Ungar, L.H.3
  • 17
    • 36348932551
    • Learning blocking schemes for record linkage
    • M. Michelson and C. A. Knoblock. Learning blocking schemes for record linkage. In AAAI, 2006.
    • (2006) AAAI
    • Michelson, M.1    Knoblock, C.A.2
  • 18
    • 79956009487
    • The missing links: Discovering hidden same-as links among a billion of triples
    • G. Papadakis, G. Demartini, P. Kärger, and P. Fankhauser. The missing links: Discovering hidden same-as links among a billion of triples. In iiWAS, 2010.
    • (2010) iiWAS
    • Papadakis, G.1    Demartini, G.2    Kärger, P.3    Fankhauser, P.4
  • 19
    • 79952386495
    • Efficient entity resolution for large heterogeneous information spaces
    • G. Papadakis, E. Ioannou, C. Niederée, and P. Fankhauser. Efficient entity resolution for large heterogeneous information spaces. In WSDM, 2011.
    • (2011) WSDM
    • Papadakis, G.1    Ioannou, E.2    Niederée, C.3    Fankhauser, P.4
  • 20
    • 79960519872
    • Eliminating the redundancy in blocking-based entity resolution methods
    • G. Papadakis, E. Ioannou, C. Niederée, T. Palpanas, and W. Nejdl. Eliminating the redundancy in blocking-based entity resolution methods. In JCDL, pages 85-94, 2011.
    • (2011) JCDL , pp. 85-94
    • Papadakis, G.1    Ioannou, E.2    Niederée, C.3    Palpanas, T.4    Nejdl, W.5
  • 22
    • 83055169894
    • Large-scale collective entity matching
    • V. Rastogi, N. N. Dalvi, and M. N. Garofalakis. Large-scale collective entity matching. PVLDB, 4(4), 2011.
    • (2011) PVLDB , vol.4 , Issue.4
    • Rastogi, V.1    Dalvi, N.N.2    Garofalakis, M.N.3
  • 23
    • 0242456803
    • Learning domain-independent string transformation weights for high accuracy object identification
    • S. Tejada, C. A. Knoblock, and S. Minton. Learning domain-independent string transformation weights for high accuracy object identification. In KDD, 2002.
    • (2002) KDD
    • Tejada, S.1    Knoblock, C.A.2    Minton, S.3
  • 24
    • 74549152150
    • Robust record linkage blocking using suffix arrays
    • T. Vries, H. Ke, S. Chawla, and P. Christen. Robust record linkage blocking using suffix arrays. In CIKM, 2009.
    • (2009) CIKM
    • Vries, T.1    Ke, H.2    Chawla, S.3    Christen, P.4
  • 26
    • 51949107423
    • Modeling heterogeneous data in dataspace
    • M. Zhong, M. Liu, and Q. Chen. Modeling heterogeneous data in dataspace. In IRI, 2008.
    • (2008) IRI
    • Zhong, M.1    Liu, M.2    Chen, Q.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.