메뉴 건너뛰기




Volumn , Issue , 2007, Pages 37-46

D-Swoosh: A family of algorithms for generic, distributed entity resolution

Author keywords

Data cleaning; Entity resolution; Information integration

Indexed keywords

DATA REDUCTION; FUNCTIONS; INFORMATION USE; PROGRAM PROCESSORS;

EID: 34848900466     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICDCS.2007.96     Document Type: Conference Paper
Times cited : (32)

References (23)
  • 2
    • 34848924479 scopus 로고    scopus 로고
    • D-Swoosh: A Family of Algorithms for Generic, Distributed Entity Resolution (Extended version)
    • Technical report, Stanford University, Available at
    • O. Benjelloun, H. Garcia-Molina, H. Gong, H. Kawai, T. E. Larson, D. Menestrina, and S. Thavisomboon. D-Swoosh: A Family of Algorithms for Generic, Distributed Entity Resolution (Extended version). Technical report, Stanford University, 2006. Available at http://dbpubs.stanford.edu/pub/2006-8.
    • (2006)
    • Benjelloun, O.1    Garcia-Molina, H.2    Gong, H.3    Kawai, H.4    Larson, T.E.5    Menestrina, D.6    Thavisomboon, S.7
  • 5
    • 33746054079 scopus 로고    scopus 로고
    • Adaptive Product Normalization: Using Online Learning for Record Linkage in Comparison Shopping
    • Houston, Texas
    • M. Bilenko, S. Basu, and M. Sahami. Adaptive Product Normalization: Using Online Learning for Record Linkage in Comparison Shopping. In Proc. of IEEE Int. Conf on Data Mining, Houston, Texas, 2005.
    • (2005) Proc. of IEEE Int. Conf on Data Mining
    • Bilenko, M.1    Basu, S.2    Sahami, M.3
  • 6
    • 0022020346 scopus 로고
    • Distributed snapshots: Determining global states of distributed systems
    • K. M. Chandy and L. Lamport. Distributed snapshots: determining global states of distributed systems. ACM Trans. Comput. Syst., 3(1):63-75, 1985.
    • (1985) ACM Trans. Comput. Syst , vol.3 , Issue.1 , pp. 63-75
    • Chandy, K.M.1    Lamport, L.2
  • 7
    • 26444550791 scopus 로고    scopus 로고
    • Robust identification of fuzzy duplicates
    • Tokyo, Japan
    • S. Chaudhuri, V. Ganti, and R. Motwani. Robust identification of fuzzy duplicates. In Proc. of ICDE, Tokyo, Japan, 2005.
    • (2005) Proc. of ICDE
    • Chaudhuri, S.1    Ganti, V.2    Motwani, R.3
  • 8
    • 84945709358 scopus 로고
    • Solution of a problem in concurrent programming control
    • E. W. Dijkstra. Solution of a problem in concurrent programming control. Commun. ACM, 8(9):569, 1965.
    • (1965) Commun. ACM , vol.8 , Issue.9 , pp. 569
    • Dijkstra, E.W.1
  • 11
    • 0022145769 scopus 로고
    • How to assign votes in a distributed system
    • H. Garcia-Molina and D. Barbara. How to assign votes in a distributed system. J. ACM, 32(4):841-860, 1985.
    • (1985) J. ACM , vol.32 , Issue.4 , pp. 841-860
    • Garcia-Molina, H.1    Barbara, D.2
  • 12
    • 84976856849 scopus 로고
    • The merge/purge problem for large databases
    • M. A. Hernández and S. J. Stolfo. The merge/purge problem for large databases. In Proc. of ACM SIGMOD, pages 127-138, 1995.
    • (1995) Proc. of ACM SIGMOD , pp. 127-138
    • Hernández, M.A.1    Stolfo, S.J.2
  • 13
    • 0001089514 scopus 로고
    • Optimal coteries for rings and related networks
    • T Ibaraki, H. Nagamochi, and T. Kameda. Optimal coteries for rings and related networks. Distrib. Comput., 8(4):191-201, 1995.
    • (1995) Distrib. Comput , vol.8 , Issue.4 , pp. 191-201
    • Ibaraki, T.1    Nagamochi, H.2    Kameda, T.3
  • 14
    • 0022069122 scopus 로고
    • A √N algorithm for mutual exclusion in decentralized systems
    • M. Maekawa. A √N algorithm for mutual exclusion in decentralized systems. ACM Trans. Comput. Syst., 3(2):145-159, 1985.
    • (1985) ACM Trans. Comput. Syst , vol.3 , Issue.2 , pp. 145-159
    • Maekawa, M.1
  • 15
    • 0034592784 scopus 로고    scopus 로고
    • Efficient clustering of high-dimensional data sets with application to reference matching
    • Boston, MA
    • A. K. McCallum, K. Nigam, and L. Ungar. Efficient clustering of high-dimensional data sets with application to reference matching. In Proc. of KDD, pages 169-178, Boston, MA, 2000.
    • (2000) Proc. of KDD , pp. 169-178
    • McCallum, A.K.1    Nigam, K.2    Ungar, L.3
  • 19
    • 0242456811 scopus 로고    scopus 로고
    • Interactive deduplication using active learning
    • Edmonton, Alberta
    • S. Sarawagi and A. Bhamidipaty. Interactive deduplication using active learning. In Proc. of ACM SIGKDD, Edmonton, Alberta, 2002.
    • (2002) Proc. of ACM SIGKDD
    • Sarawagi, S.1    Bhamidipaty, A.2
  • 20
    • 0035545848 scopus 로고    scopus 로고
    • Learning object identification rules for information integration
    • S. Tejada, C. A. Knoblock, and S. Minton. Learning object identification rules for information integration. Information Systems Journal, 26(8):635-656, 2001.
    • (2001) Information Systems Journal , vol.26 , Issue.8 , pp. 635-656
    • Tejada, S.1    Knoblock, C.A.2    Minton, S.3
  • 21
    • 0018480001 scopus 로고
    • A majority consensus approach to concurrency control for multiple copy databases
    • R. H. Thomas. A majority consensus approach to concurrency control for multiple copy databases. ACM Trans. Database Syst., 4(2):180-209, 1979.
    • (1979) ACM Trans. Database Syst , vol.4 , Issue.2 , pp. 180-209
    • Thomas, R.H.1
  • 22
    • 0038208065 scopus 로고    scopus 로고
    • A bayesian decision model for cost optimal record matching
    • V. S. Verykios, G. V. Moustakides, and M. G. Elfeky. A bayesian decision model for cost optimal record matching. The VLDB Journal, 12(1):28-10, 2003.
    • (2003) The VLDB Journal , vol.12 , Issue.1 , pp. 28-10
    • Verykios, V.S.1    Moustakides, G.V.2    Elfeky, M.G.3
  • 23
    • 34848923528 scopus 로고    scopus 로고
    • W. Winkler. The state of record linkage and current research problems. Technical report, Statistical Research Division, U.S. Bureau of the Census, Washington, DC, 1999.
    • W. Winkler. The state of record linkage and current research problems. Technical report, Statistical Research Division, U.S. Bureau of the Census, Washington, DC, 1999.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.