메뉴 건너뛰기




Volumn 3, Issue 3, 2009, Pages 151-173

An overview on XML similarity: Background, current trends and future directions

Author keywords

[No Author keywords available]

Indexed keywords

CLASSIFICATION/CLUSTERING; COMPLEX DATA; FUTURE RESEARCH DIRECTIONS; MULTIMEDIA OBJECT; XML QUERYING; XML STANDARDS;

EID: 68549107726     PISSN: 15740137     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.cosrev.2009.03.001     Document Type: Article
Times cited : (94)

References (83)
  • 1
    • 0018491659 scopus 로고
    • The tree-to-tree correction problem
    • Tai K.C. The tree-to-tree correction problem. Journal of the ACM 26 (1979) 422-433
    • (1979) Journal of the ACM , vol.26 , pp. 422-433
    • Tai, K.C.1
  • 2
    • 0024889169 scopus 로고
    • Simple fast algorithms for the editing distance between trees and related problems
    • Zhang K., and Shasha D. Simple fast algorithms for the editing distance between trees and related problems. SIAM Journal of Computing 18 6 (1989) 1245-1262
    • (1989) SIAM Journal of Computing , vol.18 , Issue.6 , pp. 1245-1262
    • Zhang, K.1    Shasha, D.2
  • 6
    • 0002816102 scopus 로고    scopus 로고
    • Comparing hierarchical data in external memory
    • S. Chawathe, Comparing hierarchical data in external memory, in: Proceedings of the VLDB Conference, 1999, pp. 90-101
    • (1999) Proceedings of the VLDB Conference , pp. 90-101
    • Chawathe, S.1
  • 8
    • 29144484106 scopus 로고    scopus 로고
    • A methodology for clustering XML documents by structure
    • Dalamagas T., Cheng T., Winkel K., and Sellis T. A methodology for clustering XML documents by structure. Information Systems 31 3 (2006) 187-228
    • (2006) Information Systems , vol.31 , Issue.3 , pp. 187-228
    • Dalamagas, T.1    Cheng, T.2    Winkel, K.3    Sellis, T.4
  • 9
    • 38349081267 scopus 로고    scopus 로고
    • Structural similarity evaluation between XML documents and DTDs
    • Proceedings of the 8th International Conference on Web Information Systems Engineering, WISE'07. Nancy, France, Springer-Verlag, Berlin Heidelberg
    • Tekli J., Chbeir R., and Yetongnon K. Structural similarity evaluation between XML documents and DTDs. Proceedings of the 8th International Conference on Web Information Systems Engineering, WISE'07. Nancy, France. LNCS vol. 4831 (2007), Springer-Verlag, Berlin Heidelberg 196-201
    • (2007) LNCS , vol.4831 , pp. 196-201
    • Tekli, J.1    Chbeir, R.2    Yetongnon, K.3
  • 10
    • 0034785011 scopus 로고    scopus 로고
    • A Query Language for Information Retrieval
    • XIRQL:, New Orleans
    • N. Fuhr, K. Großjohann, XIRQL: A Query Language for Information Retrieval. In: Proceedings of ACM-SIGIR, New Orleans, 2001, pp. 172-180
    • (2001) Proceedings of ACM-SIGIR , pp. 172-180
    • Fuhr, N.1    Großjohann, K.2
  • 16
    • 68549121765 scopus 로고    scopus 로고
    • A matrix model for XML data
    • Databases and information systems. Barzdins J., and Caplinskas A. (Eds). (Selected Papers from the Sixth International Baltic Conference DB&IS'2004), IOS Press
    • Pokorny J., and Rejlek V. A matrix model for XML data. In: Barzdins J., and Caplinskas A. (Eds). Databases and information systems. (Selected Papers from the Sixth International Baltic Conference DB&IS'2004). Frontiers in Artificial Intelligence and Applications vol. 118 (2005), IOS Press 53-64
    • (2005) Frontiers in Artificial Intelligence and Applications , vol.118 , pp. 53-64
    • Pokorny, J.1    Rejlek, V.2
  • 23
    • 84912150847 scopus 로고    scopus 로고
    • Measuring the structural similarity of semistructured documents using entropy
    • S. Helmer, Measuring the structural similarity of semistructured documents using entropy, in: Proceedings of the VLDB'07 Conference, 2007, pp. 1022-1032
    • (2007) Proceedings of the VLDB'07 Conference , pp. 1022-1032
    • Helmer, S.1
  • 24
    • 27244432105 scopus 로고    scopus 로고
    • Approximate subtree identification in heterogeneous XML documents collections
    • I. Sanz, M. Mesiti, G. Guerrini, R. Berlanga Lavori, Approximate subtree identification in heterogeneous XML documents collections, in: XML Symposium, 2005, pp. 192-206
    • (2005) XML Symposium , pp. 192-206
    • Sanz, I.1    Mesiti, M.2    Guerrini, G.3    Berlanga Lavori, R.4
  • 25
    • 0242578166 scopus 로고    scopus 로고
    • A matching algorithm for measuring the structural similarity between an XML documents and a DTD and its applications
    • Bertino E., Guerrini G., and Mesiti M. A matching algorithm for measuring the structural similarity between an XML documents and a DTD and its applications. Elsevier Computer Science 29 (2004) 23-46
    • (2004) Elsevier Computer Science , vol.29 , pp. 23-46
    • Bertino, E.1    Guerrini, G.2    Mesiti, M.3
  • 26
    • 26444434845 scopus 로고    scopus 로고
    • LAX: An efficient approximate XML join based on clustered leaf nodes for XML data integration
    • Proceedings of BNCOD'05, Springer
    • Liang W., and Yokota H. LAX: An efficient approximate XML join based on clustered leaf nodes for XML data integration. Proceedings of BNCOD'05. LNCS vol. 3567 (2005), Springer 82-97
    • (2005) LNCS , vol.3567 , pp. 82-97
    • Liang, W.1    Yokota, H.2
  • 31
    • 84876713613 scopus 로고    scopus 로고
    • D. Fallside, P. Walmsley, XML Schema part 0: Primer second edition W3C, October 2004. http://www.w3.org/TR/xmlschema-0/
    • D. Fallside, P. Walmsley, XML Schema part 0: Primer second edition W3C, October 2004. http://www.w3.org/TR/xmlschema-0/
  • 36
    • 0001116877 scopus 로고
    • Binary codes capable of correcting deletions, insertions and reversals
    • Levenshtein V. Binary codes capable of correcting deletions, insertions and reversals. Soviet Physics Doklady 6 (1966) 707-710
    • (1966) Soviet Physics Doklady , vol.6 , pp. 707-710
    • Levenshtein, V.1
  • 40
    • 33745128489 scopus 로고
    • An O(ND) difference algorithm and its variations
    • Myers E. An O(ND) difference algorithm and its variations. Algorithmica 1 2 (1986) 251-266
    • (1986) Algorithmica , vol.1 , Issue.2 , pp. 251-266
    • Myers, E.1
  • 42
    • 84976791819 scopus 로고
    • Bounds on the complexity of the longest common subsequence problem
    • Aho A., Hirschberg D., and Ullman J. Bounds on the complexity of the longest common subsequence problem. Association for Computing Machinery 23 1 (1976) 1-12
    • (1976) Association for Computing Machinery , vol.23 , Issue.1 , pp. 1-12
    • Aho, A.1    Hirschberg, D.2    Ullman, J.3
  • 49
    • 85023916139 scopus 로고
    • Properties of extended Boolean models in information retrieval
    • Springer-Verlag, New York
    • Lee J.H. Properties of extended Boolean models in information retrieval. Proceedings of the ACM SIGIR Conference (1994), Springer-Verlag, New York 182-190
    • (1994) Proceedings of the ACM SIGIR Conference , pp. 182-190
    • Lee, J.H.1
  • 50
    • 77957175435 scopus 로고
    • Probabilistic models in information retrieval
    • Fuhr N. Probabilistic models in information retrieval. The Computer Journal 35 3 (1992) 243-255
    • (1992) The Computer Journal , vol.35 , Issue.3 , pp. 243-255
    • Fuhr, N.1
  • 52
    • 33750388702 scopus 로고    scopus 로고
    • Probabilistic models of information retrieval based on measuring the divergence from randomness
    • Amati G., and Van Rijsbergen C.J. Probabilistic models of information retrieval based on measuring the divergence from randomness. ACM Transactions on Information Systems 20 4 (2002) 357-389
    • (2002) ACM Transactions on Information Systems , vol.20 , Issue.4 , pp. 357-389
    • Amati, G.1    Van Rijsbergen, C.J.2
  • 53
    • 0032091575 scopus 로고    scopus 로고
    • Integration of heterogeneous databases without common domains using queries based on textual similarity
    • W. Cohen, Integration of heterogeneous databases without common domains using queries based on textual similarity, in: Proceedings of ACM SIGMOD, 1998, pp. 291-211
    • (1998) Proceedings of ACM SIGMOD , pp. 291-211
    • Cohen, W.1
  • 59
    • 0016518550 scopus 로고
    • A linear space algorithm for computing maximal common subsequences
    • Hirschberg D.S. A linear space algorithm for computing maximal common subsequences. Communications of the ACM 18 6 (1975) 341-343
    • (1975) Communications of the ACM , vol.18 , Issue.6 , pp. 341-343
    • Hirschberg, D.S.1
  • 61
    • 84994092452 scopus 로고    scopus 로고
    • DataGuides: Enabling query formulation and optimization in semistructured databases
    • R. Goldman, J. Widom, DataGuides: Enabling query formulation and optimization in semistructured databases, in: Proceedings of the VLDB Conference, 1997, pp. 436-445
    • (1997) Proceedings of the VLDB Conference , pp. 436-445
    • Goldman, R.1    Widom, J.2
  • 64
    • 84876726002 scopus 로고    scopus 로고
    • R. Quinlan, Data mining tools see5 and c5.0, 2004
    • R. Quinlan, Data mining tools see5 and c5.0, 2004
  • 65
    • 26944472379 scopus 로고    scopus 로고
    • SSC: Statistical clustering
    • 4th International Conference on Machine Learning and Data Mining in Pattern Recognition. Perner P., and Imiya A. (Eds)
    • Candillier L., Tellier I., Torre F., and Bouquet O. SSC: Statistical clustering. In: Perner P., and Imiya A. (Eds). 4th International Conference on Machine Learning and Data Mining in Pattern Recognition. LNCS vol. LNAI 3587 (2005) 100-109
    • (2005) LNCS , vol.LNAI 3587 , pp. 100-109
    • Candillier, L.1    Tellier, I.2    Torre, F.3    Bouquet, O.4
  • 73
    • 68549095225 scopus 로고    scopus 로고
    • SLAX: An improved leaf-clustering based approximate XML join algorithm for integrating XML data at subtree classes
    • Liang W., and Yokota H. SLAX: An improved leaf-clustering based approximate XML join algorithm for integrating XML data at subtree classes. Transactions of Information Processing Society of Japan 47 (2006) 47-57
    • (2006) Transactions of Information Processing Society of Japan , vol.47 , pp. 47-57
    • Liang, W.1    Yokota, H.2
  • 77
    • 0034172470 scopus 로고    scopus 로고
    • WHIRL: A word-based information representation language
    • Cohen W. WHIRL: A word-based information representation language. Journal of Artificial Intelligence 118 (2000) 163-196
    • (2000) Journal of Artificial Intelligence , vol.118 , pp. 163-196
    • Cohen, W.1
  • 81
    • 17444376834 scopus 로고    scopus 로고
    • Semantic similarity search on semistructured data with the XXL search engine
    • Schenkel R., Theobald A., and Weikum G. Semantic similarity search on semistructured data with the XXL search engine. Information Retrieval 8 (2005) 521-545
    • (2005) Information Retrieval , vol.8 , pp. 521-545
    • Schenkel, R.1    Theobald, A.2    Weikum, G.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.