Volume 9, Issue 9, 2016, Pages 684-695

Comparative analysis of approximate blocking techniques for entity resolution

Author keywords

[No Author keywords available]

Indexed keywords

BLOCKING TECHNIQUE; COMPARATIVE ANALYSIS; EMPIRICAL SURVEYS; ENTITY RESOLUTIONS; PAIR-WISE COMPARISON; QUADRATIC COMPLEXITY; SYNTHETIC DATASETS; TIME EFFICIENCIES;

EID: 84975801848     PISSN: None     EISSN: 21508097     Source Type: Conference Proceeding    
DOI: 10.14778/2947618.2947624     Document Type: Chapter
Times cited: 144

References (29)
  • 1
    • 33845363891
    • A fast linkage detection scheme for multi-source information integration
    • A. N. Aizawa and K. Oyama. A fast linkage detection scheme for multi-source information integration. In WIRI, pages 30-39, 2005.
    • (2005) WIRI , pp. 30-39
    • Aizawa, A.N.1    Oyama, K.2
  • 2
    • 84878049861
    • Adaptive blocking: Learning to scale up record linkage
    • M. Bilenko, B. Kamath, and R. J. Mooney. Adaptive blocking: Learning to scale up record linkage. In ICDM, pages 87-96, 2006.
    • (2006) ICDM , pp. 87-96
    • Bilenko, M.1    Kamath, B.2    Mooney, R.J.3
  • 3
    • 26444550791
    • Robust identification of fuzzy duplicates
    • S. Chaudhuri, V. Ganti, and R. Motwani. Robust identification of fuzzy duplicates. In ICDE, pages 865-876, 2005.
    • (2005) ICDE , pp. 865-876
    • Chaudhuri, S.1    Ganti, V.2    Motwani, R.3
  • 4
    • 65449178105
    • Febrl: An open source data cleaning, deduplication and record linkage system with a graphical user interface
    • P. Christen. Febrl: An open source data cleaning, deduplication and record linkage system with a graphical user interface. In KDD, pages 1065-1068, 2008.
    • (2008) KDD , pp. 1065-1068
    • Christen, P.1
  • 5
    • 84920595044
    • A survey of indexing techniques for scalable record linkage and deduplication
    • P. Christen. A survey of indexing techniques for scalable record linkage and deduplication. IEEE Trans. Knowl. Data Eng., 24(9):1537-1555, 2012.
    • (2012) IEEE Trans. Knowl. Data Eng , vol.24 , Issue.9 , pp. 1537-1555
    • Christen, P.1
  • 8
    • 84954139187
    • A clustering-based framework to control block sizes for entity resolution
    • J. Fisher, P. Christen, Q. Wang, and E. Rahm. A clustering-based framework to control block sizes for entity resolution. In KDD, pages 279-288, 2015.
    • (2015) KDD , pp. 279-288
    • Fisher, J.1    Christen, P.2    Wang, Q.3    Rahm, E.4
  • 9
    • 84905818198
    • Uncertain entity resolution
    • A. Gal. Uncertain entity resolution. PVLDB, 7(13):1711-1712, 2014.
    • (2014) PVLDB , vol.7 , Issue.13 , pp. 1711-1712
    • Gal, A.1
  • 10
    • 84975846124
    • Entity resolution in the big data era: Probabilistic db support to entity resolution
    • A. Gal and B. Kimelfeld. Entity resolution in the big data era: Probabilistic db support to entity resolution. In EDBT (tutorial), 2015.
    • (2015) EDBT (tutorial)
    • Gal, A.1    Kimelfeld, B.2
  • 11
    • 0344756845
    • Declarative Data Cleaning: Language, Model and Algorithms
    • H. Galhardas, D. Florescu, D. Shasha, E. Simon, and C. Saita. Declarative Data Cleaning: Language, Model and Algorithms. In VLDB, pages 371-380, 2001.
    • (2001) VLDB , pp. 371-380
    • Galhardas, H.1    Florescu, D.2    Shasha, D.3    Simon, E.4    Saita, C.5
  • 12
    • 84873162472
    • Entity resolution: Theory, practice & open challenges
    • L. Getoor and A. Machanavajjhala. Entity resolution: Theory, practice & open challenges. PVLDB, 5(12):2018-2019, 2012.
    • (2012) PVLDB , vol.5 , Issue.12 , pp. 2018-2019
    • Getoor, L.1    Machanavajjhala, A.2
  • 14
    • 27644510774
    • Fast algorithms for frequent itemset mining using fp-trees
    • G. Grahne and J. Zhu. Fast algorithms for frequent itemset mining using fp-trees. IEEE Trans. Knowl. Data Eng., 17(10):1347-1362, 2005.
    • (2005) IEEE Trans. Knowl. Data Eng , vol.17 , Issue.10 , pp. 1347-1362
    • Grahne, G.1    Zhu, J.2
  • 16
    • 84976856849
    • The merge/purge problem for large databases
    • M. A. Hernández and S. J. Stolfo. The merge/purge problem for large databases. SIGMOD Rec., 24(2):127-138, 1995.
    • (1995) SIGMOD Rec , vol.24 , Issue.2 , pp. 127-138
    • Hernández, M.A.1    Stolfo, S.J.2
  • 17
    • 84861729507
    • Efficient multidimensional blocking for link discovery without losing recall
    • R. Isele, A. Jentzsch, and C. Bizer. Efficient multidimensional blocking for link discovery without losing recall. In WebDB, 2011.
    • (2011) WebDB
    • Isele, R.1    Jentzsch, A.2    Bizer, C.3
  • 18
    • 84892971761
    • MFIBlocks: An effective blocking algorithm for entity resolution
    • B. Kenig and A. Gal. MFIBlocks: An effective blocking algorithm for entity resolution. Inf. Syst., 38(6):908-926, 2013.
    • (2013) Inf. Syst , vol.38 , Issue.6 , pp. 908-926
    • Kenig, B.1    Gal, A.2
  • 19
    • 72649095071
    • Frameworks for entity matching: A comparison
    • H. Köpcke and E. Rahm. Frameworks for entity matching: A comparison. Data Knowl. Eng., 69(2):197-210, 2010.
    • (2010) Data Knowl. Eng , vol.69 , Issue.2 , pp. 197-210
    • Köpcke, H.1    Rahm, E.2
  • 20
    • 84874283130
    • Typimatch: type-specific unsupervised learning of keys and key values for heterogeneous data integration
    • Y. Ma and T. Tran. Typimatch: type-specific unsupervised learning of keys and key values for heterogeneous data integration. In WSDM, pages 325-334, 2013.
    • (2013) WSDM , pp. 325-334
    • Ma, Y.1    Tran, T.2
  • 21
    • 0034592784
    • Efficient clustering of high-dimensional data sets with application to reference matching
    • A. McCallum, K. Nigam, and L. Ungar. Efficient clustering of high-dimensional data sets with application to reference matching. In KDD, pages 169-178, 2000.
    • (2000) KDD , pp. 169-178
    • McCallum, A.1    Nigam, K.2    Ungar, L.3
  • 22
    • 33750728911
    • Learning blocking schemes for record linkage
    • M. Michelson and C. A. Knoblock. Learning blocking schemes for record linkage. In AAAI, pages 440-445, 2006.
    • (2006) AAAI , pp. 440-445
    • Michelson, M.1    Knoblock, C.A.2
  • 23
    • 84881069805
    • LIMES - A time-efficient approach for large-scale link discovery on the web of data
    • A. N. Ngomo and S. Auer. LIMES - A time-efficient approach for large-scale link discovery on the web of data. In IJCAI, pages 2312-2317, 2011.
    • (2011) IJCAI , pp. 2312-2317
    • Ngomo, A.N.1    Auer, S.2
  • 24
    • 84976519185
    • Schema-agnostic vs schema-based configurations for blocking methods on homogeneous data
    • G. Papadakis, G. Alexiou, G. Papastefanatos, and G. Koutrika. Schema-agnostic vs schema-based configurations for blocking methods on homogeneous data. PVLDB, pages 312-323, 2015.
    • (2015) PVLDB , pp. 312-323
    • Papadakis, G.1    Alexiou, G.2    Papastefanatos, G.3    Koutrika, G.4
  • 25
    • 79960519872
    • Eliminating redundancy in blocking-based entity resolution
    • G. Papadakis, E. Ioannou, C. Niederée, T. Palpanas, and W. Nejdl. Eliminating redundancy in blocking-based entity resolution. In JCDL, pages 85-94, 2011.
    • (2011) JCDL , pp. 85-94
    • Papadakis, G.1    Ioannou, E.2    Niederée, C.3    Palpanas, T.4    Nejdl, W.5
  • 26
    • 84887673907
    • A blocking framework for entity resolution in highly heterogeneous information spaces
    • G. Papadakis, E. Ioannou, T. Palpanas, C. Niederée, and W. Nejdl. A blocking framework for entity resolution in highly heterogeneous information spaces. IEEE Trans. Knowl. Data Eng., 25(12):2665-2682, 2013.
    • (2013) IEEE Trans. Knowl. Data Eng , vol.25 , Issue.12 , pp. 2665-2682
    • Papadakis, G.1    Ioannou, E.2    Palpanas, T.3    Niederée, C.4    Nejdl, W.5
  • 28
    • 85042903743
    • Scaling entity resolution to large, heterogeneous data with enhanced meta-blocking
    • G. Papadakis, G. Papastefanatos, T. Palpanas, and M. Koubarakis. Scaling entity resolution to large, heterogeneous data with enhanced meta-blocking. In EDBT, pages 221-232, 2016.
    • (2016) EDBT , pp. 221-232
    • Papadakis, G.1    Papastefanatos, G.2    Palpanas, T.3    Koubarakis, M.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.