메뉴 건너뛰기




Volumn 20, Issue 4, 2007, Pages 336-349

XML schema clustering with semantic and hierarchical similarity measures

Author keywords

Clustering; Data mining; Document mining; Schema matching; Semantic similarity; Semi structured data; Structural similarity; XML

Indexed keywords

COMPUTER PROGRAMMING LANGUAGES; DATA MINING; LINGUISTICS; SEMANTICS; XML;

EID: 34047253932     PISSN: 09507051     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.knosys.2006.08.006     Document Type: Article
Times cited : (54)

References (36)
  • 2
    • 34047250854 scopus 로고    scopus 로고
    • R. Agrawal, R. Srikant, 1996, Mining Sequential Patterns: Generalizations and Performance Improvements. Paper presented at the fifth International Conference on Extending Database Technology (EDBT'96), France.
  • 3
    • 34047271816 scopus 로고    scopus 로고
    • P. Berkhin, 2002. Survey of Clustering Data Mining Techniques: Technical Report, Accrue Software, San Jose, CA.
  • 4
    • 0242578166 scopus 로고    scopus 로고
    • A matching algorithm for measuring the structural similarity between an XML document and a DTD and its applications
    • Bertino E., Guerrini G., and Mesiti M. A matching algorithm for measuring the structural similarity between an XML document and a DTD and its applications. Information Systems 29 1 (2004) 23-46
    • (2004) Information Systems , vol.29 , Issue.1 , pp. 23-46
    • Bertino, E.1    Guerrini, G.2    Mesiti, M.3
  • 5
    • 34047273246 scopus 로고    scopus 로고
    • S. Boag, D. Chamberlin, M. Fernández, D. Florescu, J. Robie, J. Siméon, XQuery 1.0: An XML query language. Retrieved September, 2005, .
  • 6
    • 33745910578 scopus 로고    scopus 로고
    • A. Boukottaya, C. Vanoirbeek, 2005, November 02-04, Schema matching for transforming structured documents. Paper presented at the The 2005 ACM Symposium on Document engineering, Bristol, United Kingdom.
  • 8
    • 34047268456 scopus 로고    scopus 로고
    • H.H. Do, E. Rahm, 2002 August, COMA - a system for flexible combination of schema matching approaches. Paper presented at the 28th VLDB, Hong Kong, China.
  • 9
    • 0034825478 scopus 로고    scopus 로고
    • A. Doan, R. Domingos, A.Y. Halevy, 2001, Reconciling schemas of disparate sources: a machine-learning approach. Paper presented at the ACM SIGMOD, Santa Barbara, California, United States.
  • 12
    • 34047252215 scopus 로고    scopus 로고
    • G. Guardalben, Integrating XML and relational database technologies: a position paper. Retrieved May 1st, 2005, < http://www.hitsw.com/products_services/whitepapers/integrating_xml_rdb/integrating_xml_white_paper.pdf>, 2004.
  • 13
    • 34047255946 scopus 로고    scopus 로고
    • Introduction to XML Schema by Rrefsnes data, , 2005, April 25.
  • 14
    • 0035755685 scopus 로고    scopus 로고
    • E. Jeong, C.-N. Hsu, 2001, Induction of integrated view for XML data with heterogeneous DTDs. Paper presented at the 10th International Conference on Information and Knowledge Management, Atlanta, Georgia, USA.
  • 15
    • 24344448287 scopus 로고    scopus 로고
    • Peer-to-peer management of XML data: issues and research challenges
    • Koloniari G., and Pitoura E. Peer-to-peer management of XML data: issues and research challenges. SIGMOD Record 34 2 (2005) 6-17
    • (2005) SIGMOD Record , vol.34 , Issue.2 , pp. 6-17
    • Koloniari, G.1    Pitoura, E.2
  • 16
    • 34047248097 scopus 로고    scopus 로고
    • L. Kurgan, W. Swiercz, K. Cios, 2002, Semantic mapping of XML tags using inductive machine learning. Paper presented at the ICMLA.
  • 17
    • 34047274483 scopus 로고    scopus 로고
    • J.W.Lee, S.S. Park, 2004, October 20-24. Finding maximal similar paths between XML documents using sequential patterns. Paper presented at the ADVIS, Izmir, Turkey.
  • 18
    • 0037481024 scopus 로고    scopus 로고
    • L.M. Lee, L.H. Yang, W. Hsu, X. Yang, 2002, November, XClust: clustering XML schemas for effective integration. Paper presented at the 11th ACM International Conference on Information and Knowledge Management (CIKM'02), Virginia.
  • 19
    • 23844475451 scopus 로고    scopus 로고
    • On the use of hierarchical information in sequential mining-based XML document similarity computation
    • Leung H.-p., Chung F.-l., and Chan S.C.-f. On the use of hierarchical information in sequential mining-based XML document similarity computation. Knowledge and Information Systems 7 4 (2005) 476-498
    • (2005) Knowledge and Information Systems , vol.7 , Issue.4 , pp. 476-498
    • Leung, H.-p.1    Chung, F.-l.2    Chan, S.C.-f.3
  • 20
    • 84944328057 scopus 로고    scopus 로고
    • J. Madhavan, P.A. Bernstein, E. Rahm, 2001, Generic schema matching with cupid. Paper presented at the 27th VLDB, Roma, Italy.
  • 21
    • 34047247419 scopus 로고    scopus 로고
    • W. Meier, 2002, eXist: an open source native XML database. Paper presented at the Web, Web-services, and database systems.
  • 22
    • 34047274482 scopus 로고    scopus 로고
    • S. Melnik, H. Garcia-Molina, E. Rahm, 2002. Similarity flooding: a versatile graph matching algorithm. Paper presented at the ICDE.
  • 23
    • 34047262386 scopus 로고    scopus 로고
    • R. Nayak, R. Witt, A. Tonev, Data mining and XML documents. Paper presented at the The 2002 International Workshop on the Web and Database (WebDB 2002), June 24-27, 2002.
  • 24
    • 34047247070 scopus 로고    scopus 로고
    • R. Nayak, S. Xu, XCLS: a fast and effective clustering algorithm for heterogenous XML documents. Paper presented at the The 10th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), Singapore, 2006.
  • 25
    • 84874345870 scopus 로고    scopus 로고
    • Nayak R., and Zaki M. (Eds), Springer-Verlag, Heidelberg
    • In: Nayak R., and Zaki M. (Eds). Knowledge discovery from XML documents: PAKDD 2006 workshop proceedings. Lecture Notes in Computer Science vol. 3915 (2006), Springer-Verlag, Heidelberg
    • (2006) Lecture Notes in Computer Science , vol.3915
  • 26
    • 34047247761 scopus 로고    scopus 로고
    • A. Nierman, H.V. Jagadish, 2002, December, Evaluating structural similarity in XML documents. Paper presented at the fifth International Conference on Computational Science (ICCS'05), Wisconsin, USA.
  • 27
    • 0004734437 scopus 로고    scopus 로고
    • Classes of cost functions for string edit distance
    • Rice S.V., Bunke H., and Nartker T.A. Classes of cost functions for string edit distance. Algorithmica 18 2 (1997) 271-280
    • (1997) Algorithmica , vol.18 , Issue.2 , pp. 271-280
    • Rice, S.V.1    Bunke, H.2    Nartker, T.A.3
  • 28
    • 33644539174 scopus 로고    scopus 로고
    • N. Suzuki, Finding an optimum edit script between an XML document and a DTD. Paper presented at the Proceedings of the 2005 ACM symposium on Applied computing, Santa Fe, New Mexico, March 13-17, 2005.
  • 29
    • 34047266248 scopus 로고    scopus 로고
    • A. Theobald, G. Wiekum, Adding relevance to XML. Paper presented at the The third International Workshop on the Web and Databases (WebDB'00), Dallas, 2000.
  • 30
    • 0344927764 scopus 로고    scopus 로고
    • Y. Wang, D.J. DeWitt, J.Y. Cai, X-diff: an effective change detection algorithm for XML documents. Paper presented at the The 19th IEEE ICDE, 2003.
  • 31
    • 34047276504 scopus 로고    scopus 로고
    • wCluto: Web Interface for CLustering TOolKit, Retrieved July 25, 2005, , 2003.
  • 32
    • 34047248257 scopus 로고    scopus 로고
    • XML Schema. .
  • 33
    • 0034860127 scopus 로고    scopus 로고
    • L. Xylem, Xylem: a dynamic warehouse for XML data of the web. Paper presented at the IDEAS, 2001.
  • 34
    • 34047270284 scopus 로고    scopus 로고
    • F. Yergeau, T. Bray, J. Paoli, C.M. Sperberg-McQueen, E. Maler, Extensible Markup Language (XML) 1.0 (Third Edition) W3C Recommendation. Retrieved February, 2004, , 2004.
  • 35
    • 0024889169 scopus 로고
    • Simple fast algorithms for the editing distance between trees and related problems
    • Zhang K., and Shasha D. Simple fast algorithms for the editing distance between trees and related problems. SIAM Journal Computing 18 6 (1989) 1245-1262
    • (1989) SIAM Journal Computing , vol.18 , Issue.6 , pp. 1245-1262
    • Zhang, K.1    Shasha, D.2
  • 36
    • 34047272379 scopus 로고    scopus 로고
    • Y. Zhao, G. Karypis, Evaluation of hierarchical clustering algorithms for document datasets. Paper presented at the The 2002 ACM CIKM, Virginia, USA, 2002.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.