메뉴 건너뛰기




Volumn , Issue , 2012, Pages 431-440

A framework for entity resolution with efficient blocking

Author keywords

[No Author keywords available]

Indexed keywords

ATTRIBUTE VALUES; BLOCKING ALGORITHMS; CONTEXT SENSITIVE; DATA OBJECTS; DATA SETS; DATA-SOURCES; GENERIC FRAMEWORKS; MARKOV BLANKETS; MATCHING ALGORITHM; MATCHING FUNCTIONS; NAIVE BAYES CLASSIFIERS; RELATIONAL DATA; WEB DATA INTEGRATION;

EID: 84868328932     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IRI.2012.6303041     Document Type: Conference Paper
Times cited : (3)

References (33)
  • 4
    • 33745448357 scopus 로고    scopus 로고
    • A latent Dirichlet model for unsupervised entity resolution
    • I. Bhattacharya and L. Getoor. A latent Dirichlet model for unsupervised entity resolution. In SDM, 2006.
    • (2006) SDM
    • Bhattacharya, I.1    Getoor, L.2
  • 5
    • 77952372966 scopus 로고    scopus 로고
    • Adaptive duplicate detection using learnable string similarity measures
    • M. Bilenko and R. J. Mooney. Adaptive duplicate detection using learnable string similarity measures. In KDD, 2003.
    • (2003) KDD
    • Bilenko, M.1    Mooney, R.J.2
  • 8
    • 0242540438 scopus 로고    scopus 로고
    • Learning to match and cluster large high-dimensional data sets for data integration
    • W. Cohen and J. Richman. Learning to match and cluster large high-dimensional data sets for data integration. In KDD, 2002.
    • (2002) KDD
    • Cohen, W.1    Richman, J.2
  • 11
    • 84865086832 scopus 로고    scopus 로고
    • Reasoning about record matching rules
    • W. Fan, X. Jia, J. Li, and S. Ma. Reasoning about record matching rules. In VLDB, 2009.
    • (2009) VLDB
    • Fan, W.1    Jia, X.2    Li, J.3    Ma, S.4
  • 13
    • 84880467474 scopus 로고    scopus 로고
    • Text joins in an RDBMS for web data integration
    • L. Gravano, P. G. Ipeirotis, N. Koudas, and D. Srivastava. Text joins in an RDBMS for web data integration. In WWW, pages 90-101, 2003.
    • (2003) WWW , pp. 90-101
    • Gravano, L.1    Ipeirotis, P.G.2    Koudas, N.3    Srivastava, D.4
  • 14
    • 79953162324 scopus 로고    scopus 로고
    • Merging the results of approximate match operations
    • S. Guha, N. Koudas, A. Marathe, and D. Srivastava. Merging the results of approximate match operations. In VLDB, 2004.
    • (2004) VLDB
    • Guha, S.1    Koudas, N.2    Marathe, A.3    Srivastava, D.4
  • 15
    • 72649086387 scopus 로고    scopus 로고
    • Framework for evaluating clustering algorithms in duplicate detection
    • O. Hassanzadeh, F. Chiang, H. C. Lee, and R. J. Miller. Framework for evaluating clustering algorithms in duplicate detection. In VLDB, 2009.
    • (2009) VLDB
    • Hassanzadeh, O.1    Chiang, F.2    Lee, H.C.3    Miller, R.J.4
  • 17
    • 0001116877 scopus 로고
    • Binary codes capable of correcting deletions, insertions, and reversals
    • V. I. Levenshtein. Binary codes capable of correcting deletions, insertions, and reversals. Soviet Physics Doklady, 10(8):707-710, 1966.
    • (1966) Soviet Physics Doklady , vol.10 , Issue.8 , pp. 707-710
    • Levenshtein, V.I.1
  • 19
    • 0034592784 scopus 로고    scopus 로고
    • Efficient clustering of high-dimensional data sets with application to reference matching
    • A. McCallum, K. Nigam, and L. Ungar. Efficient clustering of high-dimensional data sets with application to reference matching. In KDD, pages 169-178, 2000.
    • (2000) KDD , pp. 169-178
    • McCallum, A.1    Nigam, K.2    Ungar, L.3
  • 21
    • 37649028224 scopus 로고    scopus 로고
    • Finding and evaluating community structure in networks
    • Feb
    • M. E. J. Newman and M. Girvan. Finding and evaluating community structure in networks. Physical Review E, 69(2):026113+, Feb 2004.
    • (2004) Physical Review E , vol.69 , Issue.2 , pp. 026113
    • Newman, M.E.J.1    Girvan, M.2
  • 24
    • 85166310944 scopus 로고    scopus 로고
    • Methods for linking and mining massive heterogeneous databases
    • J. C. Pinheiro and D. X. Sun. Methods for linking and mining massive heterogeneous databases. In KDD, 1998.
    • (1998) KDD
    • Pinheiro, J.C.1    Sun, D.X.2
  • 25
    • 0016572913 scopus 로고
    • A vector space model for automatic indexing
    • G. Salton, A. Wong, and C. S. Yang. A vector space model for automatic indexing. Communications of the ACM, 18(11):613-620, 1975.
    • (1975) Communications of the ACM , vol.18 , Issue.11 , pp. 613-620
    • Salton, G.1    Wong, A.2    Yang, C.S.3
  • 26
    • 0242456811 scopus 로고    scopus 로고
    • Interactive deduplication using active learning
    • S. Sarawagi and A. Bhamidipaty. Interactive deduplication using active learning. In KDD, pages 269-278, 2002.
    • (2002) KDD , pp. 269-278
    • Sarawagi, S.1    Bhamidipaty, A.2
  • 27
    • 79957852084 scopus 로고    scopus 로고
    • Efficient spectral neighborhood blocking for entity resolution
    • Hannover, Germany, April
    • L. Shu, A. Chen, M. Xiong, and W. Meng. Efficient spectral neighborhood blocking for entity resolution. In ICDE, pages 1067-1078, Hannover, Germany, April 2011.
    • (2011) ICDE , pp. 1067-1078
    • Shu, L.1    Chen, A.2    Xiong, M.3    Meng, W.4
  • 28
    • 79957866123 scopus 로고    scopus 로고
    • A latent topic model for complete entity resolution
    • L. Shu, B. Long, and W. Meng. A latent topic model for complete entity resolution. In ICDE, 2009.
    • (2009) ICDE
    • Shu, L.1    Long, B.2    Meng, W.3
  • 29
    • 0027113212 scopus 로고
    • Approximate string matching with q-grams and maximal matches
    • E. Ukkonen. Approximate string matching with q-grams and maximal matches. Theoretical Computer Science, 92(1):191-211, 1992.
    • (1992) Theoretical Computer Science , vol.92 , Issue.1 , pp. 191-211
    • Ukkonen, E.1
  • 30
    • 0034228352 scopus 로고    scopus 로고
    • Automating the approximate record matching process
    • V. S. Verykios and A. K. Elmagarmid. Automating the approximate record matching process. Information Sciences, 126:83-98, 1999.
    • (1999) Information Sciences , vol.126 , pp. 83-98
    • Verykios, V.S.1    Elmagarmid, A.K.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.