메뉴 건너뛰기




Volumn 87, Issue , 2008, Pages 51-60

Towards scalable real-time entity resolution using a similarity-aware inverted index approach

Author keywords

Approximate string comparisons; Data matching; Record linkage; Scalability; Similarity measures.

Indexed keywords

APPROXIMATE MATCHING; DATA MATCHING; DATA SETS; DOCUMENT COLLECTION; INVERTED INDEXING; INVERTED INDICES; MATCHING SPEED; PERSONAL INFORMATION; REAL WORLD DATA; RECORD LINKAGE; RESOLUTION TECHNIQUES; SIMILARITY MEASURE; STANDARD BLOCKING; STRING COMPARISON; STRUCTURED DATABASE; VERY LARGE DATABASE;

EID: 67650216370     PISSN: 14451336     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Article
Times cited : (19)

References (27)
  • 1
    • 0034592763 scopus 로고    scopus 로고
    • The IGrid index: Reversing the dimensionality curse for similarity in-dexing in high dimensional space
    • (SIGKDD00), Boston
    • Aggarwal, C.C. & Yu, P.S. (2000), The IGrid index: Reversing the dimensionality curse for similarity in-dexing in high dimensional space, in 'ACM Inter-national Conference on Knowledge Discovery and Data Mining' (SIGKDD'00), Boston, pp. 119-129.
    • (2000) ACM Inter-national Conference on Knowledge Discovery and Data Mining , pp. 119-129
    • Aggarwal, C.C.1    Yu, P.S.2
  • 2
    • 33845363891 scopus 로고    scopus 로고
    • A fast linkage de-tection scheme for multi-source information inte-gration
    • (WIRI05), Tokyo
    • Aizawa, A. & Oyama, K. (2005), A fast linkage de-tection scheme for multi-source information inte-gration, in 'Web Information Retrieval and Inte-gration' (WIRI'05), Tokyo, pp. 30-39.
    • (2005) Web Information Retrieval and Inte-gration , pp. 30-39
    • Aizawa, A.1    Oyama, K.2
  • 7
    • 78449293191 scopus 로고    scopus 로고
    • A comparison of personal name matching: Techniques and practical issues
    • (MCD06), held at IEEE ICDM06, Hong Kong
    • Christen, P. (2006), A comparison of personal name matching: Techniques and practical issues, in 'Workshop on Mining Complex Data' (MCD'06), held at IEEE ICDM'06, Hong Kong.
    • (2006) Workshop on Mining Complex Data
    • Christen, P.1
  • 8
    • 65449178105 scopus 로고    scopus 로고
    • Febrl - An open source data cleaning, deduplication and record linkage system with a graphical user interface
    • (SIGKDD08), Las Vegas
    • Christen, P. (2008), Febrl - An open source data cleaning, deduplication and record linkage system with a graphical user interface, in 'ACM Inter-national Conference on Knowledge Discovery and Data Mining' (SIGKDD'08), Las Vegas, pp. 1065- 1068.
    • (2008) ACM Inter-national Conference on Knowledge Discovery and Data Mining , pp. 1065-1068
    • Christen, P.1
  • 11
    • 0242540438 scopus 로고    scopus 로고
    • Learning to match and cluster large high-dimensional data sets for data integration
    • (SIGKDD02), Edmonton
    • Cohen, W.W. & Richman, J. (2002), Learning to match and cluster large high-dimensional data sets for data integration, in 'ACM International Con-ference on Knowledge Discovery and Data Mining' (SIGKDD'02), Edmonton, pp. 475-480.
    • (2002) ACM International Con-ference on Knowledge Discovery and Data Mining , pp. 475-480
    • Cohen, W.W.1    Richman, J.2
  • 12
    • 12244271239 scopus 로고    scopus 로고
    • On-line duplicate detection: Signature reliability in a dynamic retrieval environment
    • (CIKM03), New Orleans
    • Conrad, J.G., Guo, X.S. & Schriber, C.P. (2003), On-line duplicate detection: Signature reliability in a dynamic retrieval environment, in 'ACM Confer-ence on Information and Knowledge Management' (CIKM'03), New Orleans, pp. 443-452.
    • (2003) ACM Confer-ence on Information and Knowledge Management , pp. 443-452
    • Conrad, J.G.1    Guo, X.S.2    Schriber, C.P.3
  • 18
    • 37149056535 scopus 로고    scopus 로고
    • Decision models for record linkage
    • Springer LNCS 3755
    • Gu, L. & Baxter, R. (2006), Decision models for record linkage, in 'Selected Papers from AusDM', Springer LNCS 3755, pp. 146-160.
    • (2006) Selected Papers from AusDM , pp. 146-160
    • Gu, L.1    Baxter, R.2
  • 20
    • 33745266392 scopus 로고    scopus 로고
    • Domain-independent data cleaning via analysis of entity-relationship graph
    • Kalashnikov, D.V. & Mehrotra, S. (2006), 'Domain-independent data cleaning via analysis of entity-relationship graph', ACM Transactions on Database Systems (TODS), 31(2), 716-767.
    • (2006) ACM Transactions on Database Systems (TODS) , vol.31 , Issue.2 , pp. 716-767
    • Kalashnikov, D.V.1    Mehrotra, S.2
  • 26
    • 84893853717 scopus 로고    scopus 로고
    • LinkClus: Effi-cient clustering via heterogeneous semantic links
    • (VLDB06), Seoul
    • Yin, X., Han, J. & Yu, P.S. (2006), LinkClus: Effi-cient clustering via heterogeneous semantic links, in 'International Conference on Very Large Data Bases' (VLDB'06), Seoul, pp. 427-438.
    • (2006) International Conference on Very Large Data Bases , pp. 427-438
    • Yin, X.1    Han, J.2    Yu, P.S.3
  • 27
    • 33747729581 scopus 로고    scopus 로고
    • Inverted files for text search engines
    • (CSUR)
    • Zobel, J. & Moffat, A. (2006), 'Inverted files for text search engines', ACM Computing Surveys (CSUR), 38(2).
    • (2006) ACM Computing Surveys , vol.38 , Issue.2
    • Zobel, J.1    Moffat, A.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.