메뉴 건너뛰기




Volumn 48, Issue 3, 2004, Pages 297-333

Element matching across data-oriented XML sources using a multi-strategy clustering model

Author keywords

Element matching; Information integration; Object clustering; Reconciliation; XML

Indexed keywords

DATA STRUCTURES; DATABASE SYSTEMS; DIGITAL LIBRARIES; ELECTRONIC COMMERCE; ELECTRONIC MAIL; INFORMATION TECHNOLOGY; WEB BROWSERS; WEBSITES; XML;

EID: 1142288175     PISSN: 0169023X     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.datak.2003.06.001     Document Type: Article
Times cited : (13)

References (68)
  • 2
  • 3
    • 1142307822 scopus 로고    scopus 로고
    • Altova, XSPY, Web site: http://www.xmlspy.com/download_spy_enterprise.html, 2003.
    • (2003)
  • 7
    • 0023023948 scopus 로고
    • A comparative analysis of methodologies for database schema integration
    • Batini C., Lenzerini M., Navathe S.B. A comparative analysis of methodologies for database schema integration. ACM Computing Surveys. 18(4):1986;323-364.
    • (1986) ACM Computing Surveys , vol.18 , Issue.4 , pp. 323-364
    • Batini, C.1    Lenzerini, M.2    Navathe, S.B.3
  • 8
    • 0001802606 scopus 로고    scopus 로고
    • The X-tree: An index structure for high-dimensional data
    • Mumbai (Bombay), India
    • S. Berchtold, D.A. Keim, H.P. Kriegel, The X-tree: an index structure for high-dimensional data, in: Proceedings of the VLDB'96, Mumbai (Bombay), India, 1996.
    • (1996) Proceedings of the VLDB'96
    • Berchtold, S.1    Keim, D.A.2    Kriegel, H.P.3
  • 9
    • 0013193318 scopus 로고    scopus 로고
    • Semantic integration of semistructured and structured data sources
    • Bergamaschi S., Castano S., Vincini M. Semantic integration of semistructured and structured data sources. SIGMOD Record. 28(1):1999;54-59.
    • (1999) SIGMOD Record , vol.28 , Issue.1 , pp. 54-59
    • Bergamaschi, S.1    Castano, S.2    Vincini, M.3
  • 10
    • 0031162001 scopus 로고    scopus 로고
    • Distance-based indexing for high-dimensional metric spaces
    • Tucson, Arizona
    • T. Bozkaya, M. Ozsoyoglu, Distance-based indexing for high-dimensional metric spaces, in: Proceedings of the SIGMOD'97, Tucson, Arizona, 1997.
    • (1997) Proceedings of the SIGMOD'97
    • Bozkaya, T.1    Ozsoyoglu, M.2
  • 14
    • 84993661659 scopus 로고    scopus 로고
    • M-tree: An efficient access method for similarity search in metric spaces
    • Athens, Greece
    • P. Ciaccia, M. Patella, P. Zezula, M-tree: an efficient access method for similarity search in metric spaces, in: Proceedings of the VLDB'97, Athens, Greece, 1997.
    • (1997) Proceedings of the VLDB'97
    • Ciaccia, P.1    Patella, M.2    Zezula, P.3
  • 15
    • 0010355394 scopus 로고    scopus 로고
    • The WHIRL approach to data integration
    • Cohen W.W. The WHIRL approach to data integration. IEEE Intelligent Systems. 13(3):1998;20-24.
    • (1998) IEEE Intelligent Systems , vol.13 , Issue.3 , pp. 20-24
    • Cohen, W.W.1
  • 16
    • 0021519434 scopus 로고
    • View definition and generalization for database integration in multibase: A system for heterogeneous distributed databases
    • Dayal U., Hwang H. View definition and generalization for database integration in multibase: a system for heterogeneous distributed databases. The ACM Transactions on Software Engineering - TOSE. 10(6):1984;628-644.
    • (1984) The ACM Transactions on Software Engineering - TOSE , vol.10 , Issue.6 , pp. 628-644
    • Dayal, U.1    Hwang, H.2
  • 17
    • 0036565014 scopus 로고    scopus 로고
    • A distance-based approach to entity reconciliation in heterogeneous databases
    • Dey D., Sarkar S., De P. A distance-based approach to entity reconciliation in heterogeneous databases. IEEE Transactions on Knowledge and Data Engineering. 14(3):2002;567-582.
    • (2002) IEEE Transactions on Knowledge and Data Engineering , vol.14 , Issue.3 , pp. 567-582
    • Dey, D.1    Sarkar, S.2    De, P.3
  • 19
    • 0032182242 scopus 로고    scopus 로고
    • A probabilistic decision model for entity matching in heterogeneous databases
    • Dey D., Sarkar S., De P. A probabilistic decision model for entity matching in heterogeneous databases. Management Science. 44(10):1998;1379-1395.
    • (1998) Management Science , vol.44 , Issue.10 , pp. 1379-1395
    • Dey, D.1    Sarkar, S.2    De, P.3
  • 22
    • 0003857169 scopus 로고
    • Packed R-trees Using Fractals
    • University of Maryland Institute for Advanced Computer Studies, Department of Computer Science, University of Maryland, College Park, Maryland, December
    • C. Faloutsos, I. Kamel, Packed R-trees Using Fractals, Technical Report CS-TR-3009, University of Maryland Institute for Advanced Computer Studies, Department of Computer Science, University of Maryland, College Park, Maryland, December 1993.
    • (1993) Technical Report , vol.CS-TR-3009
    • Faloutsos, C.1    Kamel, I.2
  • 27
    • 1142283607 scopus 로고    scopus 로고
    • Master's thesis, Computer and Information Science and Engineering Department, University of Florida, Gainesville
    • H. Gu, Designing and implementing a DTD inference engine for the IWIZ project, Master's thesis, Computer and Information Science and Engineering Department, University of Florida, Gainesville, 2000.
    • (2000) Designing and Implementing a DTD Inference Engine for the IWIZ Project
    • Gu, H.1
  • 28
    • 0021615874 scopus 로고
    • R-Trees: A dynamic index structure for spatial searching
    • Boston, MA
    • A. Guttman, R-Trees: a dynamic index structure for spatial searching, in: Proceedings of the SIGMOD'84, Boston, MA, 1984.
    • (1984) Proceedings of the SIGMOD'84
    • Guttman, A.1
  • 29
    • 1142283613 scopus 로고    scopus 로고
    • University of Washington
    • A. Halevy, "Tukwila," University of Washington, http://data.cs.washington.edu/integration/tukwila/, 2000.
    • (2000) Tukwila
    • Halevy, A.1
  • 30
    • 1142295642 scopus 로고    scopus 로고
    • University of Florida, Gainesville, FL, Project Description TR99-019, October
    • J. Hammer, The Information Integration Wizard (IWiz) Project, University of Florida, Gainesville, FL, Project Description TR99-019, October 1999.
    • (1999) The Information Integration Wizard (IWiz) Project
    • Hammer, J.1
  • 36
    • 0031162081 scopus 로고    scopus 로고
    • The SR-tree: An index structure for high-dimensional nearest neighbor queries
    • Tucson, Arizona
    • N. Katayama, S.I. Satoh, The SR-tree: an index structure for high-dimensional nearest neighbor queries, in: Proceedings of the SIGMOD'97, Tucson, Arizona, 1997.
    • (1997) Proceedings of the SIGMOD'97
    • Katayama, N.1    Satoh, S.I.2
  • 38
    • 0000514558 scopus 로고
    • Solving domain mismatch and schema mismatch problems with an object-oriented database programming language
    • Barcelona, Spain
    • W. Kent, Solving domain mismatch and schema mismatch problems with an object-oriented database programming language, in: Proceedings of the VLDB'91, Barcelona, Spain, 1991.
    • (1991) Proceedings of the VLDB'91
    • Kent, W.1
  • 39
    • 0002719797 scopus 로고
    • The Hungarian method for the assignment algorithm
    • Kuhn H.W. The Hungarian method for the assignment algorithm. Naval Research Logistics Quarterly. 1:1955;83-97.
    • (1955) Naval Research Logistics Quarterly , vol.1 , pp. 83-97
    • Kuhn, H.W.1
  • 41
    • 0002095423 scopus 로고    scopus 로고
    • SchemaSQL - A language for interoperability in relational multi-database systems
    • Mumbai, India
    • L.V.S. Lakshmanan, F. Sadri, I.N. Subramanian, SchemaSQL - a language for interoperability in relational multi-database systems, in: Proceedings of the VLDB'96, Mumbai, India, 1996.
    • (1996) Proceedings of the VLDB'96
    • Lakshmanan, L.V.S.1    Sadri, F.2    Subramanian, I.N.3
  • 42
    • 0002486924 scopus 로고    scopus 로고
    • The information manifold approach to data integration
    • Levy A. The information manifold approach to data integration. IEEE Intelligent Systems. 13(3):1998;12-16.
    • (1998) IEEE Intelligent Systems , vol.13 , Issue.3 , pp. 12-16
    • Levy, A.1
  • 43
    • 0034173996 scopus 로고    scopus 로고
    • SEMINT: A tool for identifying attribute correspondences in heterogeneous databases using neural networks
    • Li W.S., Clifton C. SEMINT: a tool for identifying attribute correspondences in heterogeneous databases using neural networks. Data and Knowledge Engineering. 33(1):2000;49-84.
    • (2000) Data and Knowledge Engineering , vol.33 , Issue.1 , pp. 49-84
    • Li, W.S.1    Clifton, C.2
  • 44
    • 34249762939 scopus 로고
    • The TV-Tree: An index structure for high-dimensional data
    • Lin K.I., Jagadish H.V., Faloutsos C. The TV-Tree: an index structure for high-dimensional data. VLDBJ. (3):1994;517-542.
    • (1994) VLDBJ , Issue.3 , pp. 517-542
    • Lin, K.I.1    Jagadish, H.V.2    Faloutsos, C.3
  • 45
    • 0032091574 scopus 로고    scopus 로고
    • Using schematically heterogeneous structures
    • Seattle, Washington
    • R.J. Miller, Using schematically heterogeneous structures, in: Proceedings of the SIGMOD'98, Seattle, Washington, 1998.
    • (1998) Proceedings of the SIGMOD'98
    • Miller, R.J.1
  • 46
    • 0020848951 scopus 로고
    • A survey of recent advances in hierarchical clustering algorithms
    • Murtagh F. A survey of recent advances in hierarchical clustering algorithms. The Computer Journal. 26(4):1983;354-359.
    • (1983) The Computer Journal , vol.26 , Issue.4 , pp. 354-359
    • Murtagh, F.1
  • 51
    • 26344439404 scopus 로고    scopus 로고
    • A classification scheme for semantic and schematic heterogeneities in XML data sources
    • Computer and Information Science and Engineer, University of Florida, Gainesville, FL, September
    • C. Pluempitiwiriyawej, J. Hammer, A classification scheme for semantic and schematic heterogeneities in XML data sources, Technical Report TR00-004, Computer and Information Science and Engineer, University of Florida, Gainesville, FL, September 2000.
    • (2000) Technical Report , vol.TR00-004
    • Pluempitiwiriyawej, C.1    Hammer, J.2
  • 52
    • 1142283612 scopus 로고    scopus 로고
    • Master's thesis, Computer and Information Science and Engineering Department, University of Florida, Gainesville
    • R. Ramani, A toolkit for managing XML data with a relational database management system, Master's thesis, Computer and Information Science and Engineering Department, University of Florida, Gainesville, 2001, p. 54.
    • (2001) A Toolkit for Managing XML Data with a Relational Database Management System , pp. 54
    • Ramani, R.1
  • 53
    • 0000019005 scopus 로고
    • Clustering algorithms
    • W.B. Frakes, & R. Baeza-Yates. Englewood, NJ: Prentice Hall
    • Rasmussen E. Clustering algorithms. Frakes W.B., Baeza-Yates R. Information Retrieval: Data Structure and Algorithms. 1992;419-442 Prentice Hall, Englewood, NJ.
    • (1992) Information Retrieval: Data Structure and Algorithms , pp. 419-442
    • Rasmussen, E.1
  • 55
    • 0036495698 scopus 로고    scopus 로고
    • RACHET: An efficient cover-based merging of clustering hierarchies from distributed datasets
    • Samatova N.F., Ostrouchov G., Geist A., Melechko A.V. RACHET: an efficient cover-based merging of clustering hierarchies from distributed datasets. Distributed and Parallel Databases. 11(2):2002;157-180.
    • (2002) Distributed and Parallel Databases , vol.11 , Issue.2 , pp. 157-180
    • Samatova, N.F.1    Ostrouchov, G.2    Geist, A.3    Melechko, A.V.4
  • 56
    • 0028446067 scopus 로고
    • Using semantic values to facilitate interoperability among heterogeneous information systems
    • Sciore E., Siegel M., Rosenthal A. Using semantic values to facilitate interoperability among heterogeneous information systems. The ACM Transactions on Database Systems - TODS. 19(2):1994;254-290.
    • (1994) The ACM Transactions on Database Systems - TODS , vol.19 , Issue.2 , pp. 254-290
    • Sciore, E.1    Siegel, M.2    Rosenthal, A.3
  • 58
    • 0025489261 scopus 로고
    • Federated database systems for managing distributed, heterogeneous and autonomous databases
    • Sheth A.P., Larson J.A. Federated database systems for managing distributed, heterogeneous and autonomous databases. ACM Computing Surveys. 22(3):1990;183-236.
    • (1990) ACM Computing Surveys , vol.22 , Issue.3 , pp. 183-236
    • Sheth, A.P.1    Larson, J.A.2
  • 59
    • 0039141773 scopus 로고
    • A metadata approach to resolving semantic conflicts
    • Barcelona, Spain
    • M. Siegel, S.E. Madnick, A metadata approach to resolving semantic conflicts, in: Proceedings of the VLDB'91, Barcelona, Spain, 1991.
    • (1991) Proceedings of the VLDB'91
    • Siegel, M.1    Madnick, S.E.2
  • 61
    • 1142307813 scopus 로고    scopus 로고
    • Master's thesis, Computer and Information Science and Engineering Department, University of Florida, Gainesville, FL
    • A. Teterovskaya, Conflict detection and resolution during restructuring of XML data, Master's thesis, Computer and Information Science and Engineering Department, University of Florida, Gainesville, FL, 2000.
    • (2000) Conflict Detection and Resolution during Restructuring of XML Data
    • Teterovskaya, A.1
  • 62
    • 84994181860 scopus 로고    scopus 로고
    • Don't scrap it, wrap it! a wrapper architecture for legacy data sources
    • Athens, Greece
    • M. Tork Roth, P.M. Schwarz, Don't scrap it, wrap it! a wrapper architecture for legacy data sources, in: Proceedings of the VLDB'97, Athens, Greece, 1997, pp. 266-275.
    • (1997) Proceedings of the VLDB'97 , pp. 266-275
    • Tork Roth, M.1    Schwarz, P.M.2
  • 63
    • 0026256261 scopus 로고
    • Satisfying general proximity/similarity queries with metric trees
    • Uhlmann J.K. Satisfying general proximity/similarity queries with metric trees. Information Processing Letter. 40(4):1991;175-179.
    • (1991) Information Processing Letter , vol.40 , Issue.4 , pp. 175-179
    • Uhlmann, J.K.1
  • 68
    • 0002848777 scopus 로고    scopus 로고
    • Exploring the similarity space
    • Zobel J., Moffat A. Exploring the similarity space. SIGIR Forum. 32(1):1998;18-34.
    • (1998) SIGIR Forum , vol.32 , Issue.1 , pp. 18-34
    • Zobel, J.1    Moffat, A.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.