메뉴 건너뛰기




Volumn 32, Issue 1, 2012, Pages 109-139

Using structural similarity for clustering XML documents

Author keywords

Clustering; Context; Node; Similarity; Structural classification; Threshold; Tree

Indexed keywords


EID: 84862668106     PISSN: 02191377     EISSN: 02193116     Source Type: Journal    
DOI: 10.1007/s10115-011-0421-5     Document Type: Article
Times cited : (19)

References (54)
  • 4
    • 0242578166 scopus 로고    scopus 로고
    • A matching algorithm for measuring the structural similarity between an XML documents and a DTD and its applications
    • Bertino E, Guerrini G, Mesiti M (2004) A matching algorithm for measuring the structural similarity between an XML documents and a DTD and its applications. Inf Syst 29(1): 23-46.
    • (2004) Inf Syst , vol.29 , Issue.1 , pp. 23-46
    • Bertino, E.1    Guerrini, G.2    Mesiti, M.3
  • 5
    • 0002816102 scopus 로고    scopus 로고
    • Comparing hierarchical data in external memory
    • VLDB 1999, Edinburgh, Scotland, UK, 7-10 September 1999
    • Chawathe S (1999) Comparing hierarchical data in external memory. In: Proceedings of the 25th international conference on very large data bases, VLDB 1999, Edinburgh, Scotland, UK, 7-10 September 1999, pp 90-101.
    • (1999) Proceedings of the 25th international conference on very large data bases , pp. 90-101
    • Chawathe, S.1
  • 11
    • 79955055308 scopus 로고    scopus 로고
    • TagClus: a random walk-based method for tag clustering
    • Cui J, Liu H, He J, Li P, Du X et al (2011) TagClus: a random walk-based method for tag clustering. Knowl Inf Syst 27(2): 193-225.
    • (2011) Knowl Inf Syst , vol.27 , Issue.2 , pp. 193-225
    • Cui, J.1    Liu, H.2    He, J.3    Li, P.4    Du, X.5
  • 12
    • 29144484106 scopus 로고    scopus 로고
    • A methodology for clustering XML documents by structure
    • Dalamagas T, Cheng T, Winkel K-J, Sellis TK (2006) A methodology for clustering XML documents by structure. Inf Syst 31(3): 187-228.
    • (2006) Inf Syst , vol.31 , Issue.3 , pp. 187-228
    • Dalamagas, T.1    Cheng, T.2    Winkel, K.-J.3    Sellis, T.K.4
  • 22
    • 62449164226 scopus 로고    scopus 로고
    • Clustering XML documents by combining content and structure
    • ISISE 2008, Shanghai, China, 20-22 December 2008, published by IEEE Computer Society Washington, DC, USA 2008
    • Guo Y, Chen D, Le J (2008) Clustering XML documents by combining content and structure. In: Proceedings of the 2008 international symposium on information science and engineering, ISISE 2008, Shanghai, China, 20-22 December 2008, published by IEEE Computer Society Washington, DC, USA 2008, pp 583-587.
    • (2008) Proceedings of the 2008 international symposium on information science and engineering , pp. 583-587
    • Guo, Y.1    Chen, D.2    Le, J.3
  • 24
    • 84862694937 scopus 로고    scopus 로고
    • Querying and indexing XML documents
    • Publisher IOS Press, 2005
    • Hu G, Hammad R (2005) Querying and indexing XML documents. Computational methods in science and enginnering. Publisher IOS Press, 2005, 5(1): 219-233.
    • (2005) Computational methods in science and enginnering , vol.5 , Issue.1 , pp. 219-233
    • Hu, G.1    Hammad, R.2
  • 25
    • 0344065589 scopus 로고    scopus 로고
    • XR-Tree: Indexing XML data for efficient structural joins
    • ICDE 2003, Bangalore, India, 5-8 March 2003, published by IEEE Computer Society
    • Jiang H, Lu H, Wang W, Ooi BC (2003) XR-Tree: Indexing XML data for efficient structural joins. In: Proceedings of the 19th international conference on data engineering, ICDE 2003, Bangalore, India, 5-8 March 2003, published by IEEE Computer Society, pp 253-263.
    • (2003) Proceedings of the 19th international conference on data engineering , pp. 253-263
    • Jiang, H.1    Lu, H.2    Wang, W.3    Ooi, B.C.4
  • 27
    • 84896693236 scopus 로고    scopus 로고
    • Computing the edit-distance between unrooted ordered trees
    • Venice, Italy, 24-26 August 1998, published by Springer-Verlag London, UK 1998
    • Klein PN (1998) Computing the edit-distance between unrooted ordered trees. In: Proceedings of the 6th annual European symposium on algorithms ESA 98, Venice, Italy, 24-26 August 1998, published by Springer-Verlag London, UK 1998, pp 91-102.
    • (1998) Proceedings of the 6th annual European symposium on algorithms ESA 98 , pp. 91-102
    • Klein, P.N.1
  • 28
  • 31
    • 9744254146 scopus 로고    scopus 로고
    • An open source native XML database
    • In: Chaudhri A. B, Jeckle M, Rahm R, Unland R (eds), Erfurt, Germany, 7-10 October 2002, LNCS 2593, Springer 2003, available at
    • Meier W (2002) An open source native XML database. In: Chaudhri A. B, Jeckle M, Rahm R, Unland R (eds) Web, web-services, and database systems, NODe 2002 web and database-related workshops, Erfurt, Germany, 7-10 October 2002, LNCS 2593, Springer 2003, pp 169-183, available at http://exist-db. org/webdb. pdf.
    • (2002) Web, web-services, and database systems, NODe 2002 web and database-related workshops , pp. 169-183
    • Meier, W.1
  • 33
    • 38649116960 scopus 로고    scopus 로고
    • Fast and effective clustering of XML data using structural information
    • Nayak R (2008) Fast and effective clustering of XML data using structural information. Knowl Inf Syst 14(2): 197-215.
    • (2008) Knowl Inf Syst , vol.14 , Issue.2 , pp. 197-215
    • Nayak, R.1
  • 35
    • 56749160639 scopus 로고    scopus 로고
    • RRSi: indexing XML data for proximity twig queries
    • Ng PKL, Ng VTY (2009) RRSi: indexing XML data for proximity twig queries. Knowl Inf Syst 17(2): 193-216.
    • (2009) Knowl Inf Syst , vol.17 , Issue.2 , pp. 193-216
    • Ng, P.K.L.1    Ng, V.T.Y.2
  • 36
    • 14644393851 scopus 로고    scopus 로고
    • Evaluating structural similarity in XML documents
    • Madison, Wisconsin, USA, 6-7 June 2002, Publisher: Citeseer, Available from
    • Nierman A, Jagadish HV (2002) Evaluating structural similarity in XML documents. In: Proceedings of the fifth international workshop on the web and databases WebDB 2002, Madison, Wisconsin, USA, 6-7 June 2002, Publisher: Citeseer, pp 61-66, Available from http://citeseerx. ist. psu. edu.
    • (2002) Proceedings of the fifth international workshop on the web and databases WebDB 2002 , pp. 61-66
    • Nierman, A.1    Jagadish, H.V.2
  • 40
    • 0001122858 scopus 로고
    • The tree-to-tree editing problem
    • Selkow S (1977) The tree-to-tree editing problem. Inf Process Lett 6(6): 184-186.
    • (1977) Inf Process Lett , vol.6 , Issue.6 , pp. 184-186
    • Selkow, S.1
  • 41
    • 84974696102 scopus 로고    scopus 로고
    • Identification of syntactically similar DTD elements in schema matching across DTDs
    • WAIM 2001, Xi'an, China, 9-11 July 2001, LNCS Springer 2118
    • Su H, Padmanabhan S (2001) Identification of syntactically similar DTD elements in schema matching across DTDs. In: Proceedings of the 2th international conference on web-age information management, WAIM 2001, Xi'an, China, 9-11 July 2001, LNCS Springer 2118, 2001, pp 145-159.
    • (2001) Proceedings of the 2th international conference on web-age information management , vol.2001 , pp. 145-159
    • Su, H.1    Padmanabhan, S.2
  • 42
    • 0018491659 scopus 로고
    • The tree-to-tree correction problem
    • Tai KC (1979) The tree-to-tree correction problem. J ACM (JACM) 26(3): 422-433.
    • (1979) J ACM (JACM) , vol.26 , Issue.3 , pp. 422-433
    • Tai, K.C.1
  • 43
    • 77955045199 scopus 로고    scopus 로고
    • Efficient XML Structural similarity detection using sub-tree commonalities
    • SBBD 2007, Joao Pessoa, Paraiba, Brasil, 15-19 October 2007
    • Tekli J, Chbeir R, Yetongnon K (2007) Efficient XML Structural similarity detection using sub-tree commonalities. In: Proceedings of the 22nd Brazilian symposium on databases, SBBD 2007, Joao Pessoa, Paraiba, Brasil, 15-19 October 2007, pp 116-130.
    • (2007) Proceedings of the 22nd Brazilian symposium on databases , pp. 116-130
    • Tekli, J.1    Chbeir, R.2    Yetongnon, K.3
  • 44
    • 3042780922 scopus 로고    scopus 로고
    • Tree finder: a first step towards XML data mining
    • ICDM 2002, 9-12 December 2002, Maebashi City, Japan, published by IEEE Computer Society 2002
    • Termier A, Rousset MC, Sebag M (2002) Tree finder: a first step towards XML data mining. In: Proceedings of the 2002 IEEE international conference on data mining, ICDM 2002, 9-12 December 2002, Maebashi City, Japan, published by IEEE Computer Society 2002, pp 450-457.
    • (2002) Proceedings of the 2002 IEEE international conference on data mining , pp. 450-457
    • Termier, A.1    Rousset, M.C.2    Sebag, M.3
  • 47
    • 33744927170 scopus 로고    scopus 로고
    • Practical indexing XML documents for twig query
    • Data management on the web, Kunming, China, 7-9 December 2005, LNCS 3818 Springer 2005,
    • Wang H, Wang W, Li J, Lin X, Wong R (2005) Practical indexing XML documents for twig query. In: Proceedings of the 10th Asian computing science conference, ASIAN 2005, Data management on the web, Kunming, China, 7-9 December 2005, LNCS 3818 Springer 2005, pp 208-222.
    • (2005) Proceedings of the 10th Asian computing science conference, ASIAN 2005 , pp. 208-222
    • Wang, H.1    Wang, W.2    Li, J.3    Lin, X.4    Wong, R.5
  • 49
    • 84880424902 scopus 로고    scopus 로고
    • Classification automatique de documents structurés: Application au corpus d'arbres étiquetés de type XML
    • CORIA 2005, Grenoble, France, 9-11 march 2005
    • Wisniewsky G, Denoyer L, Gallinari P (2005) Classification automatique de documents structurés: Application au corpus d'arbres étiquetés de type XML. In: Proceedings of the 2nd French information retrieval conference, CORIA 2005, Grenoble, France, 9-11 march 2005, pp 167-184.
    • (2005) Proceedings of the 2nd French information retrieval conference , pp. 167-184
    • Wisniewsky, G.1    Denoyer, L.2    Gallinari, P.3
  • 50
    • 63649131887 scopus 로고    scopus 로고
    • Learning element similarity matrix for semi-structured document analysis
    • Yang J, Cheung WK, Chen X (2009) Learning element similarity matrix for semi-structured document analysis. Knowl Inf Syst 19(1): 53-78.
    • (2009) Knowl Inf Syst , vol.19 , Issue.1 , pp. 53-78
    • Yang, J.1    Cheung, W.K.2    Chen, X.3
  • 54
    • 0037869164 scopus 로고    scopus 로고
    • Technical report, 01-40, Department of computer science, University of Minnesota, Minneapolis, MN 55455, Availlable at
    • Zhao Y, Karypis G (2001) Criterion functions for document clustering: experiments and analysis. Technical report, 01-40, Department of computer science, University of Minnesota, Minneapolis, MN 55455, Availlable at http://glaros. dtc. umn. edu/gkhome/node/165.
    • (2001) Criterion functions for document clustering: Experiments and analysis
    • Zhao, Y.1    Karypis, G.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.