메뉴 건너뛰기




Volumn , Issue , 2008, Pages 825-834

Learning deterministic regular expressions for the inference of schemas from XML data

Author keywords

Regular expressions; Schema inference; XML

Indexed keywords

CODES (SYMBOLS); INTERNET; MARKUP LANGUAGES; MINERALS; ORES; TURBULENT FLOW; WORLD WIDE WEB; XML;

EID: 56649124711     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1367497.1367609     Document Type: Conference Paper
Times cited : (39)

References (51)
  • 1
    • 57349137884 scopus 로고    scopus 로고
    • Castor. www.castor.org.
    • Castor
  • 2
    • 57349097845 scopus 로고    scopus 로고
    • SUN Microsystems JAXB. java.sun.com/webservices/jaxb
    • SUN Microsystems JAXB. java.sun.com/webservices/jaxb.
  • 4
    • 0010250803 scopus 로고    scopus 로고
    • Generating Grammars for Structured Documents using Grammatical Inference Methods
    • Report A-1996-4, Dept. of Comp. Sci, Univ. of Finland
    • H. Ahonen. Generating Grammars for Structured Documents using Grammatical Inference Methods. Report A-1996-4, Dept. of Comp. Sci., Univ. of Finland, 1996.
    • (1996)
    • Ahonen, H.1
  • 5
    • 0020815483 scopus 로고
    • Inductive Inference: Theory and Methods
    • D. Angluin and C. Smith. Inductive Inference: Theory and Methods. ACM Computing Surveys, 15(3):237-269, 1983.
    • (1983) ACM Computing Surveys , vol.15 , Issue.3 , pp. 237-269
    • Angluin, D.1    Smith, C.2
  • 6
    • 29344459396 scopus 로고    scopus 로고
    • Studying the XML Web: Gathering Statistics from an XML Sample
    • D. Barbosa, L. Mignet, and P. Veltri. Studying the XML Web: Gathering Statistics from an XML Sample. World Wide Web, 8(4):413-438, 2005.
    • (2005) World Wide Web , vol.8 , Issue.4 , pp. 413-438
    • Barbosa, D.1    Mignet, L.2    Veltri, P.3
  • 7
    • 33244464022 scopus 로고    scopus 로고
    • XPath Satisfiability in the Presence of DTDs
    • M. Benedikt, W. Fan, and F. Geerts. XPath Satisfiability in the Presence of DTDs. In PODS, pages 25-36, 2005.
    • (2005) PODS , pp. 25-36
    • Benedikt, M.1    Fan, W.2    Geerts, F.3
  • 8
    • 27644431505 scopus 로고    scopus 로고
    • Applying Model Management to Classical Meta Data Problems
    • P. A. Bernstein. Applying Model Management to Classical Meta Data Problems. In CIDR, 2003.
    • (2003) CIDR
    • Bernstein, P.A.1
  • 11
    • 77954428731 scopus 로고    scopus 로고
    • DTDs versus XML Schema: A Practical Study
    • G. J. Bex, F. Neven, and J. Van den Bussche. DTDs versus XML Schema: a Practical Study. In WebDB, pages 79-84, 2004.
    • (2004) WebDB , pp. 79-84
    • Bex, G.J.1    Neven, F.2    Van den Bussche, J.3
  • 12
    • 84882718707 scopus 로고    scopus 로고
    • Inferring XML Schema Definitions from XML data
    • G. J. Bex, F. Neven, and S. Vansummeren. Inferring XML Schema Definitions from XML data. In VLDB, pages 998-1009, 2007.
    • (2007) VLDB , pp. 998-1009
    • Bex, G.J.1    Neven, F.2    Vansummeren, S.3
  • 13
    • 0027846518 scopus 로고
    • Efficient Identification of Regular Expressions from Representative Examples
    • ACM Press
    • A. Brāzma. Efficient Identification of Regular Expressions from Representative Examples. In COLT, pages 236-242. ACM Press, 1993.
    • (1993) COLT , pp. 236-242
    • Brāzma, A.1
  • 14
    • 0027694328 scopus 로고
    • Regular Expressions into Finite Automata
    • A. Brüggeman-Klein. Regular Expressions into Finite Automata. Theor. Comput. Sci., 120(2):197-213, 1993.
    • (1993) Theor. Comput. Sci , vol.120 , Issue.2 , pp. 197-213
    • Brüggeman-Klein, A.1
  • 17
    • 0242296053 scopus 로고    scopus 로고
    • Characterization of Glushkov Automata
    • P. Caron and D. Ziadi. Characterization of Glushkov Automata. Theo. Comp. Sc., 233(1-2):75-90, 2000.
    • (2000) Theo. Comp. Sc , vol.233 , Issue.1-2 , pp. 75-90
    • Caron, P.1    Ziadi, D.2
  • 18
    • 35348840678 scopus 로고    scopus 로고
    • Query Optimization in XML Structured-Document Databases
    • D. Che, K. Aberer, and M. T. Ozsu. Query Optimization in XML Structured-Document Databases. VLDB J., 15(3):263-289, 2006.
    • (2006) VLDB J , vol.15 , Issue.3 , pp. 263-289
    • Che, D.1    Aberer, K.2    Ozsu, M.T.3
  • 19
    • 34948856796 scopus 로고    scopus 로고
    • Schema Extraction from XML: A Grammatical Inference Approach
    • B. Chidlovskii. Schema Extraction from XML: a Grammatical Inference Approach. In KRDB, 2001.
    • (2001) KRDB
    • Chidlovskii, B.1
  • 21
    • 0008595474 scopus 로고    scopus 로고
    • RELAX NG Specification
    • December
    • J. Clark and M. Murata. RELAX NG Specification. OASIS, December 2001.
    • (2001) OASIS
    • Clark, J.1    Murata, M.2
  • 22
    • 57349114249 scopus 로고    scopus 로고
    • R. Cover. The Cover Pages, http://xml.coverpages.org/, 2003.
    • (2003) The Cover
    • Cover, R.1
  • 23
    • 85136080342 scopus 로고    scopus 로고
    • ShreX: Managing XML Documents in Relational Databases
    • J. F. Fang Du, Sihem Amer-Yahia. ShreX: Managing XML Documents in Relational Databases. In VLDB, pages 1297-1300, 2004.
    • (2004) VLDB , pp. 1297-1300
    • Fang Du, J.F.1    Amer-Yahia, S.2
  • 24
    • 33646519420 scopus 로고    scopus 로고
    • Algorithms for Learning Regular Expressions
    • H. Fernau. Algorithms for Learning Regular Expressions. In ALT, pages 297-311, 2005.
    • (2005) ALT , pp. 297-311
    • Fernau, H.1
  • 26
    • 34250754626 scopus 로고    scopus 로고
    • Managing Semi-structured Data
    • October
    • D. Florescu. Managing Semi-structured Data. ACM Queue, 3(8), October 2005.
    • (2005) ACM Queue , vol.3 , Issue.8
    • Florescu, D.1
  • 27
    • 77958023331 scopus 로고    scopus 로고
    • April 2006
    • J.-M. François. Jahmm. http://www.run.montefiore.ulg. ac.be/̃francois/software/jahmm/, April 2006.
    • Jahmm
    • François, J.-M.1
  • 29
    • 84943738891 scopus 로고    scopus 로고
    • Information Extraction with HMM Structures Learned by Stochastic Optimization
    • D. Freitag and A. McCallum. Information Extraction with HMM Structures Learned by Stochastic Optimization. In AAAI/IAAI, pages 584-589, 2000.
    • (2000) AAAI/IAAI , pp. 584-589
    • Freitag, D.1    McCallum, A.2
  • 30
    • 0025484018 scopus 로고
    • Inference of k-Testable Languages in the Strict Sense and Application to Syntactic Pattern Recognition
    • P. Garcia and E. Vidal. Inference of k-Testable Languages in the Strict Sense and Application to Syntactic Pattern Recognition. IEEE Trans. Pattern Anal. Mach. Intell., 12(9):920-925, 1990.
    • (1990) IEEE Trans. Pattern Anal. Mach. Intell , vol.12 , Issue.9 , pp. 920-925
    • Garcia, P.1    Vidal, E.2
  • 32
    • 49949150022 scopus 로고
    • Language Identification in the Limit
    • May
    • E. Gold. Language Identification in the Limit. Information and Control, 10(5):447-474, May 1967.
    • (1967) Information and Control , vol.10 , Issue.5 , pp. 447-474
    • Gold, E.1
  • 33
    • 84994092452 scopus 로고    scopus 로고
    • DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases
    • R. Goldman and J. Widom. DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases. In VLDB, pages 436-445, 1997.
    • (1997) VLDB , pp. 436-445
    • Goldman, R.1    Widom, J.2
  • 34
    • 84899045808 scopus 로고    scopus 로고
    • XStruct: Efficient Schema Extraction from Multiple and Large XML Documents
    • J. Hegewald, F. Naumann, and M. Weis. XStruct: Efficient Schema Extraction from Multiple and Large XML Documents. In IGDE Workshops, page 81, 2006.
    • (2006) IGDE Workshops , pp. 81
    • Hegewald, J.1    Naumann, F.2    Weis, M.3
  • 36
    • 77954439407 scopus 로고    scopus 로고
    • Schema-based Scheduling of Event Processors and Buffer Minimization for Queries on Structured Data Streams
    • C. Koch, S. Scherzinger, N. Schweikardt, and B. Stegmaier. Schema-based Scheduling of Event Processors and Buffer Minimization for Queries on Structured Data Streams. In VLDB, pages 228-239, 2004.
    • (2004) VLDB , pp. 228-239
    • Koch, C.1    Scherzinger, S.2    Schweikardt, N.3    Stegmaier, B.4
  • 37
    • 84933178526 scopus 로고    scopus 로고
    • Answering XML Queries on Heterogeneous Data Sources
    • I. Manolescu, D. Florescu, and D. Kossmann. Answering XML Queries on Heterogeneous Data Sources. In VLDB, pages 241-250, 2001.
    • (2001) VLDB , pp. 241-250
    • Manolescu, I.1    Florescu, D.2    Kossmann, D.3
  • 38
  • 39
    • 84255215195 scopus 로고    scopus 로고
    • The XML Web: A First Study
    • L. Mignet, D. Barbosa, and P. Veltri. The XML Web: A First Study. In WWW, pages 500-510, 2003.
    • (2003) , pp. 500-510
    • Mignet, L.1    Barbosa, D.2    Veltri, P.3
  • 40
    • 33644699458 scopus 로고    scopus 로고
    • M. Murata, D. Lee, M. Mani, and K. Kawaguchi. Taxonomy of xml schema languages using formal language theory. ACM Trans. Internet Techn., 5(4):660-704, 2005.
    • M. Murata, D. Lee, M. Mani, and K. Kawaguchi. Taxonomy of xml schema languages using formal language theory. ACM Trans. Internet Techn., 5(4):660-704, 2005.
  • 41
    • 0032094142 scopus 로고    scopus 로고
    • Extracting Schema from Semistructured Data
    • S. Nestorov, S. Abiteboul, and R. Motwani. Extracting Schema from Semistructured Data. In ICDM, pages 295-306. 1998.
    • (1998) ICDM , pp. 295-306
    • Nestorov, S.1    Abiteboul, S.2    Motwani, R.3
  • 42
    • 84948579632 scopus 로고    scopus 로고
    • On the Complexity of XPath Containment in the Presence of Disjunction, DTDs, and Variables
    • F. Neven and T. Schwentick. On the Complexity of XPath Containment in the Presence of Disjunction, DTDs, and Variables. Logical Methods in Computer Science, 2(3), 2006.
    • (2006) Logical Methods in Computer Science , vol.2 , Issue.3
    • Neven, F.1    Schwentick, T.2
  • 43
    • 85037549689 scopus 로고
    • Inductive Inference, DFAs, and Computational Complexity
    • L. Pitt. Inductive Inference, DFAs, and Computational Complexity. In All, pages 18-44, 1989.
    • (1989) All , pp. 18-44
    • Pitt, L.1
  • 44
    • 0030157207 scopus 로고    scopus 로고
    • LORE: A Lightweight Object REpository for Semistructured Data
    • D. Quass, J. Widom, R. Goldman, et al. LORE: a Lightweight Object REpository for Semistructured Data. In SIGMOD, page 549, 1996.
    • (1996) SIGMOD , pp. 549
    • Quass, D.1    Widom, J.2    Goldman, R.3
  • 45
    • 0024610919 scopus 로고
    • A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition
    • L. Rabiner. A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition. Proc. IEEE, 77(2):257-286, 1989.
    • (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.1
  • 46
    • 0035657983 scopus 로고    scopus 로고
    • E. Rahm and P. A. Bernstein. A Survey of Approaches to Automatic Schema Matching. VLDB J., 10(4):334-350, 2001.
    • E. Rahm and P. A. Bernstein. A Survey of Approaches to Automatic Schema Matching. VLDB J., 10(4):334-350, 2001.
  • 47
    • 11344284020 scopus 로고    scopus 로고
    • Everything You Ever Wanted to Know about DTDs, but Were Afraid to Ask
    • A. Sahuguet. Everything You Ever Wanted to Know about DTDs, but Were Afraid to Ask. In WebDB, 2000.
    • (2000) WebDB
    • Sahuguet, A.1
  • 48
    • 0031249196 scopus 로고    scopus 로고
    • Recent Advances of Grammatical Inference
    • Y. Sakakibara. Recent Advances of Grammatical Inference. Theor. Comput. Sci., 185(1):15-45, 1997.
    • (1997) Theor. Comput. Sci , vol.185 , Issue.1 , pp. 15-45
    • Sakakibara, Y.1
  • 49
    • 0035751913 scopus 로고    scopus 로고
    • Structural Inference for Semistructured Data
    • J. Sankey and R. K. Wong. Structural Inference for Semistructured Data. In CIKM, pages 159-166. 2001.
    • (2001) CIKM , pp. 159-166
    • Sankey, J.1    Wong, R.K.2
  • 51
    • 0034247207 scopus 로고    scopus 로고
    • Stochastic Grammatical Inference of Text Database Structure
    • M. Young-Lai and F. W. Tompa. Stochastic Grammatical Inference of Text Database Structure. Machine Learning, 40(2):111-137, 2000.
    • (2000) Machine Learning , vol.40 , Issue.2 , pp. 111-137
    • Young-Lai, M.1    Tompa, F.W.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.