메뉴 건너뛰기




Volumn , Issue , 2010, Pages 63-74

Sampling dirty data for matching attributes

Author keywords

database integration; sampling; schema matching

Indexed keywords

DATABASE INTEGRATION; DIRTY DATA; EFFICIENT ALGORITHM; REAL WORLD DATA; RELATIONAL DATABASE; SCHEMA MATCHING; SIMILARITY COMPUTATION; SIMILARITY MEASURE; TEST RESULTS; TWO STAGE; VALUE SETS;

EID: 77954738593     PISSN: 07308078     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1807167.1807177     Document Type: Conference Paper
Times cited : (20)

References (27)
  • 1
    • 35448951563 scopus 로고    scopus 로고
    • Data integration: The teenage years
    • A. Y. Halevy, A. Rajaraman, and J. J. Ordille, "Data integration: The teenage years," in VLDB, 2006, pp. 9-16.
    • (2006) VLDB , pp. 9-16
    • Halevy, A.Y.1    Rajaraman, A.2    Ordille, J.J.3
  • 2
    • 0035657983 scopus 로고    scopus 로고
    • A survey of approaches to automatic schema matching
    • E. Rahm and P. A. Bernstein, "A survey of approaches to automatic schema matching," VLDB J., vol. 10, no. 4, pp. 334-350, 2001.
    • (2001) VLDB J. , vol.10 , Issue.4 , pp. 334-350
    • Rahm, E.1    Bernstein, P.A.2
  • 3
    • 31444453796 scopus 로고    scopus 로고
    • From databases to dataspaces: A new abstraction for information management
    • M. J. Franklin, A. Y. Halevy, and D. Maier, "From databases to dataspaces: a new abstraction for information management," SIGMOD Record, vol. 34, no. 4, pp. 27-33, 2005.
    • (2005) SIGMOD Record , vol.34 , Issue.4 , pp. 27-33
    • Franklin, M.J.1    Halevy, A.Y.2    Maier, D.3
  • 4
    • 34250660624 scopus 로고    scopus 로고
    • Principles of dataspace systems
    • A. Y. Halevy, M. J. Franklin, and D. Maier, "Principles of dataspace systems," in PODS, 2006, pp. 1-9.
    • (2006) PODS , pp. 1-9
    • Halevy, A.Y.1    Franklin, M.J.2    Maier, D.3
  • 5
    • 0036366837 scopus 로고    scopus 로고
    • Mining database structure; or, how to build a data quality browser
    • T. Dasu, T. Johnson, S. Muthukrishnan, and V. Shkapenyuk, "Mining database structure; or, how to build a data quality browser," in SIGMOD, 2002, pp. 240-251.
    • (2002) SIGMOD , pp. 240-251
    • Dasu, T.1    Johnson, T.2    Muthukrishnan, S.3    Shkapenyuk, V.4
  • 8
    • 85043988965 scopus 로고
    • Finding similar files in a large file system
    • U. Manber, "Finding similar files in a large file system," in USENIX Winter, 1994, pp. 1-10.
    • (1994) USENIX Winter , pp. 1-10
    • Manber, U.1
  • 9
    • 79956075292 scopus 로고    scopus 로고
    • Identifying and filtering near-duplicate documents
    • A. Z. Broder, "Identifying and filtering near-duplicate documents," in CPM, 2000, pp. 1-10.
    • (2000) CPM , pp. 1-10
    • Broder, A.Z.1
  • 11
    • 85011032600 scopus 로고    scopus 로고
    • Vgram: Improving performance of approximate queries on string collections using variable-length grams
    • C. Li, B. Wang, and X. Yang, "Vgram: Improving performance of approximate queries on string collections using variable-length grams," in VLDB, 2007, pp. 303-314.
    • (2007) VLDB , pp. 303-314
    • Li, C.1    Wang, B.2    Yang, X.3
  • 12
    • 34548738941 scopus 로고    scopus 로고
    • Efficiently detecting inclusion dependencies
    • J. Bauckmann, U. Leser, F. Naumann, and V. Tietz, "Efficiently detecting inclusion dependencies," in ICDE, 2007, pp. 1448-1450.
    • (2007) ICDE , pp. 1448-1450
    • Bauckmann, J.1    Leser, U.2    Naumann, F.3    Tietz, V.4
  • 15
    • 0002513261 scopus 로고
    • Random sampling from databases - A survey
    • F. Olken and D. Rotem, "Random sampling from databases - a survey," Statistics and Computing, vol. 5, pp. 25-42, 1994.
    • (1994) Statistics and Computing , vol.5 , pp. 25-42
    • Olken, F.1    Rotem, D.2
  • 16
    • 0003229927 scopus 로고    scopus 로고
    • Schema mapping as query discovery
    • R. J. Miller, L. M. Haas, and M. A. Hernández, "Schema mapping as query discovery," in VLDB, 2000, pp. 77-88.
    • (2000) VLDB , pp. 77-88
    • Miller, R.J.1    Haas, L.M.2    Hernández, M.A.3
  • 17
    • 3142720555 scopus 로고    scopus 로고
    • iMAP: Discovering complex mappings between database schemas
    • R. Dhamankar, Y. Lee, A. Doan, A. Y. Halevy, and P. Domingos, "iMAP: Discovering complex mappings between database schemas," in SIGMOD, 2004, pp. 383-394.
    • (2004) SIGMOD , pp. 383-394
    • Dhamankar, R.1    Lee, Y.2    Doan, A.3    Halevy, A.Y.4    Domingos, P.5
  • 19
    • 0032091575 scopus 로고    scopus 로고
    • Integration of heterogeneous databases without common domains using queries based on textual similarity
    • W. W. Cohen, "Integration of heterogeneous databases without common domains using queries based on textual similarity," in SIGMOD, 1998, pp. 201-212.
    • (1998) SIGMOD , pp. 201-212
    • Cohen, W.W.1
  • 21
    • 0022821574 scopus 로고
    • Simple random sampling from relational databases
    • F. Olken and D. Rotem, "Simple random sampling from relational databases," in VLDB, 1986, pp. 160-169.
    • (1986) VLDB , pp. 160-169
    • Olken, F.1    Rotem, D.2
  • 22
    • 0030157210 scopus 로고    scopus 로고
    • Bifocal sampling for skew-resistant join size estimation
    • S. Ganguly, P. B. Gibbons, Y. Matias, and A. Silberschatz, "Bifocal sampling for skew-resistant join size estimation," in SIGMOD, 1996, pp. 271-281.
    • (1996) SIGMOD , pp. 271-281
    • Ganguly, S.1    Gibbons, P.B.2    Matias, Y.3    Silberschatz, A.4
  • 24
    • 0040885649 scopus 로고    scopus 로고
    • Congressional samples for approximate answering of group-by queries
    • S. Acharya, P. B. Gibbons, and V. Poosala, "Congressional samples for approximate answering of group-by queries," in SIGMOD, 2000, pp. 487-498.
    • (2000) SIGMOD , pp. 487-498
    • Acharya, S.1    Gibbons, P.B.2    Poosala, V.3
  • 25
    • 3142697062 scopus 로고    scopus 로고
    • Effective use of block-level sampling in statistics estimation
    • S. Chaudhuri, G. Das, and U. Srivastava, "Effective use of block-level sampling in statistics estimation," in SIGMOD Conf., 2004, pp. 287-298.
    • SIGMOD Conf., 2004 , pp. 287-298
    • Chaudhuri, S.1    Das, G.2    Srivastava, U.3
  • 26
    • 3142745395 scopus 로고    scopus 로고
    • A bi-level Bernoulli scheme for database sampling
    • P. J. Haas and C. Koenig, "A bi-level Bernoulli scheme for database sampling," in SIGMOD, 2004, pp. 275-286.
    • (2004) SIGMOD , pp. 275-286
    • Haas, P.J.1    Koenig, C.2
  • 27
    • 3142748410 scopus 로고    scopus 로고
    • Query sampling in DB2 universal database
    • J. Gryz, J. Guo, L. Liu, and C. Zuzarte, "Query sampling in DB2 universal database," in SIGMOD, 2004, pp. 839-843.
    • (2004) SIGMOD , pp. 839-843
    • Gryz, J.1    Guo, J.2    Liu, L.3    Zuzarte, C.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.