Volume 6184 LNCS, 2010, Pages 595-607

Efficient duplicate record detection based on similarity estimation

Author keywords

Duplicate Detection; Heterogeneous Records; Record Similarity; Similarity Estimation

Indexed keywords

BASIC IDEA; BI-PARTITE GRAPH MATCHING; DATA PROCESSING AND ANALYSIS; DATA SOURCE; DUPLICATE DETECTION; DUPLICATE RECORD DETECTION; EFFICIENT METHOD; INFORMATION INTEGRATION SYSTEM; LOW RATES; SIMILARITY ESTIMATION

EID: 77955023483     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-642-14246-8_58     Document Type: Conference Paper
Times cited: 7

References (16)
  • 3
    • 0002719797
    • Kuhn, H.W.: The Hungarian method for the assignment problem. Naval Res. Logist. Quart. (1955)
  • 4
    • 0002223982
    • Munkres, J.: Algorithms for the assignment and transportation problems. J. Soc. Indust. Appl. Math. (1957)
  • 5
    • 77952372966
    • Bilenko, M., Mooney, R.J.: Adaptive duplicate detection using learnable string similarity measures. In: SIGKDD, pp. 39-48 (August 2003)
  • 6
    • 35448984015
    • Chandel, Hassanzadeh, O., Koudas, N., et al.: Benchmarking declarative approximate selection predicates. In: SIGMOD, pp. 353-364 (June 2007)
  • 7
    • 1142279457
    • Chaudhuri, S., Ganjam, K., Ganti, V., Motwani, R.: Robust and efficient fuzzy match for online data cleaning. In: SIGMOD, pp. 313-324 (June 2003)
  • 8
    • 0000666461
    • Cohen, W.W.: Data integration using similarity joins and a word-based information representation language. ACM Trans. on Information Systems 18(3), 288-321 (2000)
  • 9
    • 0034832365
    • Borkar, V.R., Deshmukh, K., Sarawagi, S.: Automatic segmentation of text into structured records. In: SIGMOD, pp. 175-186 (May 2001)
  • 10
    • 34047192804
    • Sarawagi, S., Cohen, W.W.: Semi-Markov conditional random fields for information extraction. In: NIPS (December 2004)
  • 11
    • 84885677547
    • Viola, P.A., Narasimhan, M.: Learning to extract information from semi-structured text using a discriminative context free grammar. In: SIGIR, pp. 330-337 (August 2005)
  • 12
    • 12244290581
    • Cohen, W.W., Sarawagi, S.: Exploiting dictionaries in named entity extraction: Combining semi-Markov extraction processes and data integration methods. In: SIGKDD, pp. 89-98 (August 2004)
  • 13
    • 52649137537
    • Arasu, Chaudhuri, S., Kaushik, R.: Transformation-based framework for record matching. In: ICDE, pp. 40-49 (April 2008)
  • 14
    • 70849095483
    • Arasu, Kaushik, R.: A Grammar-based Entity Representation Framework for Data Cleaning. In: SIGMOD, pp. 233-244 (June 2009)
  • 15
    • 77955027339
    • Mohan, L., Hongzhi, W., Jianzhong, L., Hong, G.: Duplicate Record Detection Method Based on Optimal Bipartite Graph Matching. In: NDBC (October 2009)


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.