메뉴 건너뛰기




Volumn , Issue , 2009, Pages 182-193

A cluster-based approach to XML similarity joins

Author keywords

Clustering; Entity resolution; Similarity joins; Similarity measures; XML; xml databases

Indexed keywords

CLUSTERING; ENTITY RESOLUTION; SIMILARITY JOINS; SIMILARITY MEASURES; XML DATABASES;

EID: 70350637296     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1620432.1620451     Document Type: Conference Paper
Times cited : (5)

References (37)
  • 1
    • 36849071950 scopus 로고    scopus 로고
    • Xproj: A framework for projected structural clustering of xml documents
    • C. C. Aggarwal, N. Ta, J. Wang, J. Feng, and M. J. Zaki. Xproj: a framework for projected structural clustering of xml documents. In Proc. KDD Conf., pages 46-55, 2007.
    • (2007) Proc. KDD Conf , pp. 46-55
    • Aggarwal, C.C.1    Ta, N.2    Wang, J.3    Feng, J.4    Zaki, M.J.5
  • 3
    • 33745628382 scopus 로고    scopus 로고
    • Approximate matching of hierarchical data using pq-grams
    • N. Augsten, M. H. Böhlen, and J. Gamper. Approximate matching of hierarchical data using pq-grams. In Proc. VLDB Conf., pages 301-312, 2005.
    • (2005) Proc. VLDB Conf , pp. 301-312
    • Augsten, N.1    Böhlen, M.H.2    Gamper, J.3
  • 4
    • 35348849154 scopus 로고    scopus 로고
    • Scaling up all pairs similarity search
    • R. J. Bayardo, Y. Ma, and R. Srikant. Scaling up all pairs similarity search. In Proc. WWW Conf., pages 131-140, 2007.
    • (2007) , pp. 131-140
    • Bayardo, R.J.1    Ma, Y.2    Srikant, R.3
  • 6
    • 12744279236 scopus 로고    scopus 로고
    • A short survey of document structure similarity algorithms
    • D. Buttler. A short survey of document structure similarity algorithms. In Proc. Intl. Conf. on Internet Computing, pages 3-9, 2004.
    • (2004) Proc. Intl. Conf. on Internet Computing , pp. 3-9
    • Buttler, D.1
  • 9
    • 33749597967 scopus 로고    scopus 로고
    • A primitive operator for similarity joins in data cleaning
    • S. Chaudhuri, V. Ganti, and R. Kaushik. A primitive operator for similarity joins in data cleaning. In Proc. ICDE Conf., page 5, 2006.
    • (2006) Proc. ICDE Conf , pp. 5
    • Chaudhuri, S.1    Ganti, V.2    Kaushik, R.3
  • 10
    • 0002818648 scopus 로고    scopus 로고
    • Combining approaches to information retrieval
    • W. B. Croft. Combining approaches to information retrieval. Advances in information retrieval, 7:1-36, 2000.
    • (2000) Advances in information retrieval , vol.7 , pp. 1-36
    • Croft, W.B.1
  • 11
    • 70350639186 scopus 로고    scopus 로고
    • Overview of the inex 2008 xml mining track
    • S. Geva, J. Kamps, and A. Trotman, editors, Proc. INEX 2008
    • L. Denoyer and P. Gallinari. Overview of the inex 2008 xml mining track. In S. Geva, J. Kamps, and A. Trotman, editors, Proc. INEX 2008, LNCS, 2009.
    • (2009) LNCS
    • Denoyer, L.1    Gallinari, P.2
  • 12
  • 13
    • 33745218927 scopus 로고    scopus 로고
    • Integrating xml data sources using approximate joins
    • S. Guha, H. V. Jagadish, N. Koudas, D. Srivastava, and T. Yu. Integrating xml data sources using approximate joins. TODS, 31(1):161-207, 2006.
    • (2006) TODS , vol.31 , Issue.1 , pp. 161-207
    • Guha, S.1    Jagadish, H.V.2    Koudas, N.3    Srivastava, D.4    Yu, T.5
  • 15
    • 47949084700 scopus 로고    scopus 로고
    • Comparison of complete and elementless native storage of xml documents
    • T. Härder, C. Mathis, and K. Schmidt. Comparison of complete and elementless native storage of xml documents. In Proc. IDEAS Conf., pages 102-113, 2007.
    • (2007) Proc. IDEAS Conf , pp. 102-113
    • Härder, T.1    Mathis, C.2    Schmidt, K.3
  • 16
    • 34147139436 scopus 로고    scopus 로고
    • An efficient infrastructure for native transactional xml processing
    • M. P. Haustein and T. Härder. An efficient infrastructure for native transactional xml processing. DKE, 61(3):500-523, 2007.
    • (2007) DKE , vol.61 , Issue.3 , pp. 500-523
    • Haustein, M.P.1    Härder, T.2
  • 17
    • 84912150847 scopus 로고    scopus 로고
    • Measuring the structural similarity of semistructured documents using entropy
    • S. Helmer. Measuring the structural similarity of semistructured documents using entropy. In Proc. VLDB Conf., pages 1022-1032, 2007.
    • (2007) Proc. VLDB Conf , pp. 1022-1032
    • Helmer, S.1
  • 19
    • 2442561063 scopus 로고    scopus 로고
    • A bag of paths model for measuring structural similarity in web documents
    • S. Joshi, N. Agrawal, R. Krishnapuram, and S. Negi. A bag of paths model for measuring structural similarity in web documents. In Proc. KDD Conf., pages 577-582, 2003.
    • (2003) Proc. KDD Conf , pp. 577-582
    • Joshi, S.1    Agrawal, N.2    Krishnapuram, R.3    Negi, S.4
  • 20
    • 33845514928 scopus 로고    scopus 로고
    • Articulating information needs in xml query languages
    • J. Kamps, M. Marx, M. de Rijke, and B. Sigurbjörnsson. Articulating information needs in xml query languages. TOIS, 24(4):407-436, 2006.
    • (2006) TOIS , vol.24 , Issue.4 , pp. 407-436
    • Kamps, J.1    Marx, M.2    de Rijke, M.3    Sigurbjörnsson, B.4
  • 21
    • 34250670467 scopus 로고    scopus 로고
    • Record linkage: Similarity measures and algorithms
    • N. Koudas, S. Sarawagi, and D. Srivastava. Record linkage: similarity measures and algorithms. In Proc. SIGMOD Conf., pages 802-803, 2006.
    • (2006) Proc. SIGMOD Conf , pp. 802-803
    • Koudas, N.1    Sarawagi, S.2    Srivastava, D.3
  • 22
    • 0037481024 scopus 로고    scopus 로고
    • Xclust: Clustering xml schemas for effective integration
    • M.-L. Lee, L. H. Yang, W. Hsu, and X. Yang. Xclust: clustering xml schemas for effective integration. In Proc. CIKM Conf., pages 292-299, 2002.
    • (2002) Proc. CIKM Conf , pp. 292-299
    • Lee, M.-L.1    Yang, L.H.2    Hsu, W.3    Yang, X.4
  • 23
    • 67649647573 scopus 로고    scopus 로고
    • A decade of xml data management: An industrial experience report from oracle
    • Z. H. Liu and R. Murthy. A decade of xml data management: An industrial experience report from oracle. In Proc. ICDE Conf., pages 1351-1362, 2009.
    • (2009) Proc. ICDE Conf , pp. 1351-1362
    • Liu, Z.H.1    Murthy, R.2
  • 24
    • 0001906874 scopus 로고    scopus 로고
    • Index structures for path expressions
    • T. Milo and D. Suciu. Index structures for path expressions. In Proc. ICDT Conf., pages 277-295, 1999.
    • (1999) Proc. ICDT Conf , pp. 277-295
    • Milo, T.1    Suciu, D.2
  • 25
    • 14644393851 scopus 로고    scopus 로고
    • Evaluating structural similarity in xml documents
    • A. Nierman and H. V. Jagadish. Evaluating structural similarity in xml documents. In Proc. WebDB Workshop, pages 61-66, 2002.
    • (2002) Proc. WebDB Workshop , pp. 61-66
    • Nierman, A.1    Jagadish, H.V.2
  • 26
    • 1542287497 scopus 로고    scopus 로고
    • Combining document representations for known-item search
    • P. Ogilvie and J. P. Callan. Combining document representations for known-item search. In Proc. SIGIR Conf., pages 143-150, 2003.
    • (2003) Proc. SIGIR Conf , pp. 143-150
    • Ogilvie, P.1    Callan, J.P.2
  • 28
    • 54249160182 scopus 로고    scopus 로고
    • Evaluating performance and quality of xml-based similarity joins
    • L. A. Ribeiro and T. Härder. Evaluating performance and quality of xml-based similarity joins. In Proc. ADBIS Conf., pages 246-261, 2008.
    • (2008) Proc. ADBIS Conf , pp. 246-261
    • Ribeiro, L.A.1    Härder, T.2
  • 31
    • 3142777876 scopus 로고    scopus 로고
    • Efficient set joins on similarity predicates
    • S. Sarawagi and A. Kirpal. Efficient set joins on similarity predicates. In Proc. SIGMOD Conf., pages 743-754, 2004.
    • (2004) Proc. SIGMOD Conf , pp. 743-754
    • Sarawagi, S.1    Kirpal, A.2
  • 32
    • 0002442796 scopus 로고    scopus 로고
    • Machine learning in automated text categorization
    • F. Sebastiani. Machine learning in automated text categorization. CSUR, 34(1):1-47, 2002.
    • (2002) CSUR , vol.34 , Issue.1 , pp. 1-47
    • Sebastiani, F.1
  • 33
    • 0018491659 scopus 로고
    • The tree-to-tree correction problem
    • K.-C. Tai. The tree-to-tree correction problem. JACM, 26(3):422-433, 1979.
    • (1979) JACM , vol.26 , Issue.3 , pp. 422-433
    • Tai, K.-C.1
  • 34
    • 0001467848 scopus 로고
    • Query evaluation: Strategies and optimizations
    • H. R. Turtle and J. Flood. Query evaluation: Strategies and optimizations. Information Processing Management, 31(6):831-850, 1995.
    • (1995) Information Processing Management , vol.31 , Issue.6 , pp. 831-850
    • Turtle, H.R.1    Flood, J.2
  • 35
    • 29844441371 scopus 로고    scopus 로고
    • Dogmatix tracks down duplicates in xml
    • M. Weis and F. Naumann. Dogmatix tracks down duplicates in xml. In Proc. SIGMOD Conf., pages 431-442, 2005.
    • (2005) Proc. SIGMOD Conf , pp. 431-442
    • Weis, M.1    Naumann, F.2
  • 36
    • 70350645042 scopus 로고    scopus 로고
    • W. Winkler. Overview of record linkage and current research directions. Technical report, Statistical Research Division, U.S. Bureau of the Census, 2006.
    • W. Winkler. Overview of record linkage and current research directions. Technical report, Statistical Research Division, U.S. Bureau of the Census, 2006.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.