메뉴 건너뛰기




Volumn , Issue , 2009, Pages 450-461

Efficient Top-K count queries over imprecise duplicates

Author keywords

[No Author keywords available]

Indexed keywords

DATA SETS; DEDUPLICATION; DUPLICATE ELIMINATION; EXPONENTIAL TIME ALGORITHM; LINEAR EMBEDDING; NONLOCAL; NOVEL METHODS; NP-HARD; ON THE FLIES; ORDER OF MAGNITUDE; POLYNOMIAL-TIME ALGORITHMS; RUNNING TIME;

EID: 70349162329     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1516360.1516413     Document Type: Conference Paper
Times cited : (4)

References (37)
  • 1
    • 2342576574 scopus 로고    scopus 로고
    • Eliminating fuzzy duplicates in data warehouses
    • R. Ananthakrishna, S. chaudhuri, and V. Ganti. Eliminating fuzzy duplicates in data warehouses. In VLDB, 2002.
    • (2002) VLDB
    • Ananthakrishna, R.1    chaudhuri, S.2    Ganti, V.3
  • 2
    • 33749588820 scopus 로고    scopus 로고
    • Clean answers over dirty databases: A probabilistic approach
    • P. Andritsos, A. Fuxman, and R. J. Miller. Clean answers over dirty databases: A probabilistic approach. In ICDE, 2006.
    • (2006) ICDE
    • Andritsos, P.1    Fuxman, A.2    Miller, R.J.3
  • 3
    • 85104914015 scopus 로고    scopus 로고
    • Efficient exact set-similarity joins
    • A. Arasu, V. Ganti, and R. Kaushik. Efficient exact set-similarity joins. In VLDB, 2006.
    • (2006) VLDB
    • Arasu, A.1    Ganti, V.2    Kaushik, R.3
  • 5
    • 34248229658 scopus 로고    scopus 로고
    • Collective entity resolution in relational data
    • I. Bhattacharya and L. Getoor. Collective entity resolution in relational data. TKDD, 1(1), 2007.
    • (2007) TKDD , vol.1 , Issue.1
    • Bhattacharya, I.1    Getoor, L.2
  • 7
    • 33746054079 scopus 로고    scopus 로고
    • Adaptive product normalization: Using online learning for record linkage in comparison shopping
    • M. Bilenko, S. Basu, and M. Sahami. Adaptive product normalization: Using online learning for record linkage in comparison shopping. In ICDM, 2005.
    • (2005) ICDM
    • Bilenko, M.1    Basu, S.2    Sahami, M.3
  • 8
    • 84878049861 scopus 로고    scopus 로고
    • Adaptive blocking: Learning to scale up record linkage
    • M. Bilenko, B. Kamath, and R. J. Mooney. Adaptive blocking: Learning to scale up record linkage. In ICDM, 2006.
    • (2006) ICDM
    • Bilenko, M.1    Kamath, B.2    Mooney, R.J.3
  • 10
    • 24644456480 scopus 로고    scopus 로고
    • Clustering with qualitative information
    • M. Charikar, V. Guruswami, and A. Wirth. Clustering with qualitative information. J. Comput. Syst. Sci., 71(3):360-383, 2005.
    • (2005) J. Comput. Syst. Sci , vol.71 , Issue.3 , pp. 360-383
    • Charikar, M.1    Guruswami, V.2    Wirth, A.3
  • 11
    • 85011029434 scopus 로고    scopus 로고
    • Example-driven design of efficient record matching queries
    • S. Chaudhuri, B.-C. Chen, V. Ganti, and R. Kaushik. Example-driven design of efficient record matching queries. In VLDB, pages 327-338, 2007.
    • (2007) VLDB , pp. 327-338
    • Chaudhuri, S.1    Chen, B.-C.2    Ganti, V.3    Kaushik, R.4
  • 12
    • 1142279457 scopus 로고    scopus 로고
    • Robust and efficient fuzzy match for online data cleaning
    • S. Chaudhuri, K. Ganjam, V. Ganti, and R. Motwani. Robust and efficient fuzzy match for online data cleaning. In SIGMOD, 2003.
    • (2003) SIGMOD
    • Chaudhuri, S.1    Ganjam, K.2    Ganti, V.3    Motwani, R.4
  • 13
    • 26444550791 scopus 로고    scopus 로고
    • Robust identification of fuzzy duplicates
    • S. Chaudhuri, V. Ganti, and R. Motwani. Robust identification of fuzzy duplicates. In ICDE, 2005.
    • (2005) ICDE
    • Chaudhuri, S.1    Ganti, V.2    Motwani, R.3
  • 16
    • 0000666461 scopus 로고    scopus 로고
    • Data integration using similarity joins and a word-based information representation language
    • July
    • W. W. Cohen. Data integration using similarity joins and a word-based information representation language. ACM Transactions on Information Systems, 18(3):288-321, July 2000.
    • (2000) ACM Transactions on Information Systems , vol.18 , Issue.3 , pp. 288-321
    • Cohen, W.W.1
  • 22
    • 63449083945 scopus 로고    scopus 로고
    • J. Ko, T. Mitamura, and E. Nyberg. Language-independent probabilistic answer ranking for question answering. In ACL, 2007.
    • J. Ko, T. Mitamura, and E. Nyberg. Language-independent probabilistic answer ranking for question answering. In ACL, 2007.
  • 24
    • 84888516789 scopus 로고    scopus 로고
    • Y. Koren and D. Harel. A multi-scale algorithm for the linear arrangement problem. In WG, 2002.
    • Y. Koren and D. Harel. A multi-scale algorithm for the linear arrangement problem. In WG, 2002.
  • 26
    • 0034592784 scopus 로고    scopus 로고
    • Efficient clustering of high-dimensional data sets with application to reference matching
    • A. McCallum, K. Nigam, and L. H. Ungar. Efficient clustering of high-dimensional data sets with application to reference matching. In Knowledge Discovery and Data Mining, pages 169-178, 2000.
    • (2000) Knowledge Discovery and Data Mining , pp. 169-178
    • McCallum, A.1    Nigam, K.2    Ungar, L.H.3
  • 36
    • 65449139953 scopus 로고    scopus 로고
    • M. L. Wick, K. Rohanimanesh, K. Schultz, and A. McCallum. A unified approach for schema matching, coreference and canonicalization. In KDD, 2008.
    • M. L. Wick, K. Rohanimanesh, K. Schultz, and A. McCallum. A unified approach for schema matching, coreference and canonicalization. In KDD, 2008.
  • 37
    • 57149130672 scopus 로고    scopus 로고
    • Cost-based variable-length-gram selection for string collections to support approximate queries efficiently
    • X. Yang, B. Wang, and C. Li. Cost-based variable-length-gram selection for string collections to support approximate queries efficiently. In SIGMOD Conference, pages 353-364, 2008.
    • (2008) SIGMOD Conference , pp. 353-364
    • Yang, X.1    Wang, B.2    Li, C.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.