메뉴 건너뛰기




Volumn 28, Issue 1, 2016, Pages 166-180

Semantic-Aware Blocking for Entity Resolution

Author keywords

blocking; Data matching; deduplication; entity resolution; indexing; locality sensitive hashing; record linkage; semantic features; semantic hashing; semantic similarity; taxonomy tree

Indexed keywords

IMAGE RETRIEVAL; INDEXING (OF INFORMATION); TREES (MATHEMATICS); VIRTUAL REALITY;

EID: 84961639084     PISSN: 10414347     EISSN: None     Source Type: Journal    
DOI: 10.1109/TKDE.2015.2468711     Document Type: Conference Paper
Times cited : (36)

References (42)
  • 1
    • 84920600570 scopus 로고    scopus 로고
    • Efficient record linkage using a double embedding scheme
    • N. Adly, "Efficient record linkage using a double embedding scheme," in Proc. Int. Conf. Data Mining, 2009, pp. 274-281.
    • (2009) Proc. Int. Conf. Data Mining , pp. 274-281
    • Adly, N.1
  • 3
    • 37549058056 scopus 로고    scopus 로고
    • Near-optimal hashing algorithms for near neighbor problem in high dimension
    • A. Andoni and P. Indyk, "Near-optimal hashing algorithms for near neighbor problem in high dimension," Commun. ACM, vol. 51, no. 1, pp. 117-122, 2008.
    • (2008) Commun. ACM , vol.51 , Issue.1 , pp. 117-122
    • Andoni, A.1    Indyk, P.2
  • 10
    • 65449165865 scopus 로고    scopus 로고
    • Towards parameter-free blocking for scalable record linkage
    • Australian Nat. Univ., CA, Australia, Tech. Rep. TR-CS-07-03
    • P. Christen, "Towards parameter-free blocking for scalable record linkage," Dept. Comput. Sci., Australian Nat. Univ., CA, Australia, Tech. Rep. TR-CS-07-03, 2007.
    • (2007) Dept. Comput. Sci.
    • Christen, P.1
  • 12
    • 84920595044 scopus 로고    scopus 로고
    • A survey of indexing techniques for scalable record linkage and deduplication
    • Sep.
    • P. Christen, "A survey of indexing techniques for scalable record linkage and deduplication," IEEE Trans. Knowl. Data Eng., vol. 24, no. 9, pp. 1537-1555, Sep. 2012.
    • (2012) IEEE Trans. Knowl. Data Eng. , vol.24 , Issue.9 , pp. 1537-1555
    • Christen, P.1
  • 16
    • 0345566262 scopus 로고    scopus 로고
    • Learning to match ontologies on the semantic web
    • A. Doan, J. Madhavan, R. Dhamankar, P. Domingos, and A. Halevy, "Learning to match ontologies on the semantic web," VLDB J., vol. 12, no. 4, pp. 303-319, 2003.
    • (2003) VLDB J. , vol.12 , Issue.4 , pp. 303-319
    • Doan, A.1    Madhavan, J.2    Dhamankar, R.3    Domingos, P.4    Halevy, A.5
  • 18
    • 84947399464 scopus 로고
    • A theory for record linkage
    • I. Fellegi and A. Sunter, "A theory for record linkage," J. Amer. Stat. Assoc., vol. 64, no. 328, pp. 1183-1210, 1969.
    • (1969) J. Amer. Stat. Assoc. , vol.64 , Issue.328 , pp. 1183-1210
    • Fellegi, I.1    Sunter, A.2
  • 19
    • 84880915872 scopus 로고    scopus 로고
    • Computing semantic relatedness using Wikipedia-based explicit semantic analysis
    • E. Gabrilovich and S. Markovitch, "Computing semantic relatedness using Wikipedia-based explicit semantic analysis," in Proc. 20th Int. Joint Conf. Artif. Intell., 2007, pp. 1606-1611.
    • (2007) Proc. 20th Int. Joint Conf. Artif. Intell. , pp. 1606-1611
    • Gabrilovich, E.1    Markovitch, S.2
  • 21
    • 84976856849 scopus 로고
    • The merge/purge problem for large databases
    • M. A. Hernández and S. J. Stolfo, "The merge/purge problem for large databases," ACM SIGMOD Rec., vol. 24, pp. 127-138, 1995.
    • (1995) ACM SIGMOD Rec. , vol.24 , pp. 127-138
    • Hernández, M.A.1    Stolfo, S.J.2
  • 22
    • 0013331361 scopus 로고    scopus 로고
    • Real-world data is dirty: Data cleansing and the merge/purge problem
    • M. A. Hernández and S. J. Stolfo, "Real-world data is dirty: Data cleansing and the merge/purge problem," Data Mining Knowl. Discovery, vol. 2, no. 1, pp. 9-37, 1998.
    • (1998) Data Mining Knowl. Discovery , vol.2 , Issue.1 , pp. 9-37
    • Hernández, M.A.1    Stolfo, S.J.2
  • 23
    • 0031644241 scopus 로고    scopus 로고
    • Approximate nearest neighbors: Towards removing the curse of dimensionality
    • P. Indyk and R. Motwani, "Approximate nearest neighbors: Towards removing the curse of dimensionality," in Proc. Annu. ACM Symp. Theory Comput., 1998, pp. 604-613.
    • (1998) Proc. Annu. ACM Symp. Theory Comput. , pp. 604-613
    • Indyk, P.1    Motwani, R.2
  • 26
    • 84894647271 scopus 로고    scopus 로고
    • An unsupervised algorithm for learning blocking schemes
    • M. Kejriwal and D. P. Miranker, "An unsupervised algorithm for learning blocking schemes," in Proc. Int. Conf. Data Mining, 2013, pp. 340-349.
    • (2013) Proc. Int. Conf. Data Mining , pp. 340-349
    • Kejriwal, M.1    Miranker, D.P.2
  • 27
    • 84892971761 scopus 로고    scopus 로고
    • MFIBlocks: An effective blocking algorithm for entity resolution
    • B. Kenig and A. Gal, "MFIBlocks: An effective blocking algorithm for entity resolution," Inf. Syst., vol. 38, no. 6, pp. 908-926, 2013.
    • (2013) Inf. Syst. , vol.38 , Issue.6 , pp. 908-926
    • Kenig, B.1    Gal, A.2
  • 34
    • 33244462877 scopus 로고    scopus 로고
    • Entropy based nearest neighbor search in high dimensions
    • R. Panigrahy, "Entropy based nearest neighbor search in high dimensions," in Proc. Annu. ACM-SIAM Symp. Discrete Algorithm, 2006, pp. 1186-1195.
    • (2006) Proc. Annu. ACM-SIAM Symp. Discrete Algorithm , pp. 1186-1195
    • Panigrahy, R.1
  • 37
    • 84904650785 scopus 로고    scopus 로고
    • Metablocking: Taking entity resolution to the next level
    • Aug.
    • G. Papadakis, G. Koutrika, T. Palpanas, and W. Nejdl, "Metablocking: Taking entity resolution to the next level," IEEE Trans. Knowl. Data Eng., vol. 26, no. 8, pp. 1946-1960, Aug. 2014.
    • (2014) IEEE Trans. Knowl. Data Eng. , vol.26 , Issue.8 , pp. 1946-1960
    • Papadakis, G.1    Koutrika, G.2    Palpanas, T.3    Nejdl, W.4
  • 38
    • 0003033112 scopus 로고
    • Using information content to evaluate semantic similarity in a taxonomy
    • P. Resnik, "Using information content to evaluate semantic similarity in a taxonomy," in Proc. Int. Joint Conf. Artif. Intell., 1995, pp. 448-453.
    • (1995) Proc. Int. Joint Conf. Artif. Intell. , pp. 448-453
    • Resnik, P.1
  • 40
    • 84863154010 scopus 로고    scopus 로고
    • Towards a probabilistic taxonomy of many concepts
    • WA, USA, Tech. Rep. MSR-TR-2011-25
    • W. Wu, H. Li, H. Wang, and K. Zhu, "Towards a probabilistic taxonomy of many concepts," Microsoft Res. Redmond, WA, USA, Tech. Rep. MSR-TR-2011-25, 2011.
    • (2011) Microsoft Res. Redmond
    • Wu, W.1    Li, H.2    Wang, H.3    Zhu, K.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.