-
1
-
-
35448951563
-
Data integration: The teenage years
-
A. Y. Halevy, A. Rajaraman, and J. J. Ordille, "Data integration: The teenage years," in VLDB, 2006, pp. 9-16.
-
(2006)
VLDB
, pp. 9-16
-
-
Halevy, A.Y.1
Rajaraman, A.2
Ordille, J.J.3
-
2
-
-
0035657983
-
A survey of approaches to automatic schema matching
-
E. Rahm and P. A. Bernstein, "A survey of approaches to automatic schema matching," VLDB J., vol. 10, no. 4, pp. 334-350, 2001.
-
(2001)
VLDB J.
, vol.10
, Issue.4
, pp. 334-350
-
-
Rahm, E.1
Bernstein, P.A.2
-
3
-
-
31444453796
-
From databases to dataspaces: A new abstraction for information management
-
M. J. Franklin, A. Y. Halevy, and D. Maier, "From databases to dataspaces: a new abstraction for information management," SIGMOD Record, vol. 34, no. 4, pp. 27-33, 2005.
-
(2005)
SIGMOD Record
, vol.34
, Issue.4
, pp. 27-33
-
-
Franklin, M.J.1
Halevy, A.Y.2
Maier, D.3
-
4
-
-
34250660624
-
Principles of dataspace systems
-
A. Y. Halevy, M. J. Franklin, and D. Maier, "Principles of dataspace systems," in PODS, 2006, pp. 1-9.
-
(2006)
PODS
, pp. 1-9
-
-
Halevy, A.Y.1
Franklin, M.J.2
Maier, D.3
-
5
-
-
0036366837
-
Mining database structure; or, how to build a data quality browser
-
T. Dasu, T. Johnson, S. Muthukrishnan, and V. Shkapenyuk, "Mining database structure; or, how to build a data quality browser," in SIGMOD, 2002, pp. 240-251.
-
(2002)
SIGMOD
, pp. 240-251
-
-
Dasu, T.1
Johnson, T.2
Muthukrishnan, S.3
Shkapenyuk, V.4
-
7
-
-
0010362121
-
Syntactic clustering of the web
-
A. Z. Broder, S. C. Glassman, M. S. Manasse, and G. Zweig, "Syntactic clustering of the web," Computer Networks, vol. 29, no. 8-13, pp. 1157-1166, 1997.
-
(1997)
Computer Networks
, vol.29
, Issue.8-13
, pp. 1157-1166
-
-
Broder, A.Z.1
Glassman, S.C.2
Manasse, M.S.3
Zweig, G.4
-
8
-
-
85043988965
-
Finding similar files in a large file system
-
U. Manber, "Finding similar files in a large file system," in USENIX Winter, 1994, pp. 1-10.
-
(1994)
USENIX Winter
, pp. 1-10
-
-
Manber, U.1
-
9
-
-
79956075292
-
Identifying and filtering near-duplicate documents
-
A. Z. Broder, "Identifying and filtering near-duplicate documents," in CPM, 2000, pp. 1-10.
-
(2000)
CPM
, pp. 1-10
-
-
Broder, A.Z.1
-
11
-
-
85011032600
-
Vgram: Improving performance of approximate queries on string collections using variable-length grams
-
C. Li, B. Wang, and X. Yang, "Vgram: Improving performance of approximate queries on string collections using variable-length grams," in VLDB, 2007, pp. 303-314.
-
(2007)
VLDB
, pp. 303-314
-
-
Li, C.1
Wang, B.2
Yang, X.3
-
12
-
-
34548738941
-
Efficiently detecting inclusion dependencies
-
J. Bauckmann, U. Leser, F. Naumann, and V. Tietz, "Efficiently detecting inclusion dependencies," in ICDE, 2007, pp. 1448-1450.
-
(2007)
ICDE
, pp. 1448-1450
-
-
Bauckmann, J.1
Leser, U.2
Naumann, F.3
Tietz, V.4
-
13
-
-
33845667955
-
Duplicate record detection: A survey
-
A. K. Elmagarmid, P. G. Ipeirotis, and V. S. Verykios, "Duplicate record detection: A survey," IEEE Trans. Knowl. Data Eng., vol. 19, no. 1, pp. 1-16, 2007.
-
(2007)
IEEE Trans. Knowl. Data Eng.
, vol.19
, Issue.1
, pp. 1-16
-
-
Elmagarmid, A.K.1
Ipeirotis, P.G.2
Verykios, V.S.3
-
14
-
-
0002368671
-
The New Jersey data reduction report
-
D. Barbar'a, W. Dumouchel, C. Faloutsos, P. J. Haas, J. M. Hellerstein, Y. Ioannidis, H. V. Jagadish, T. Johnson, R. Ng, V. Poosala, K. A. Ross, and K. C. Sevcik, "The New Jersey data reduction report," IEEE Data Engineering Bulletin, vol. 20, pp. 3-45, 1997.
-
(1997)
IEEE Data Engineering Bulletin
, vol.20
, pp. 3-45
-
-
Barbar'a, D.1
Dumouchel, W.2
Faloutsos, C.3
Haas, P.J.4
Hellerstein, J.M.5
Ioannidis, Y.6
Jagadish, H.V.7
Johnson, T.8
Ng, R.9
Poosala, V.10
Ross, K.A.11
Sevcik, K.C.12
-
15
-
-
0002513261
-
Random sampling from databases - A survey
-
F. Olken and D. Rotem, "Random sampling from databases - a survey," Statistics and Computing, vol. 5, pp. 25-42, 1994.
-
(1994)
Statistics and Computing
, vol.5
, pp. 25-42
-
-
Olken, F.1
Rotem, D.2
-
16
-
-
0003229927
-
Schema mapping as query discovery
-
R. J. Miller, L. M. Haas, and M. A. Hernández, "Schema mapping as query discovery," in VLDB, 2000, pp. 77-88.
-
(2000)
VLDB
, pp. 77-88
-
-
Miller, R.J.1
Haas, L.M.2
Hernández, M.A.3
-
17
-
-
3142720555
-
iMAP: Discovering complex mappings between database schemas
-
R. Dhamankar, Y. Lee, A. Doan, A. Y. Halevy, and P. Domingos, "iMAP: Discovering complex mappings between database schemas," in SIGMOD, 2004, pp. 383-394.
-
(2004)
SIGMOD
, pp. 383-394
-
-
Dhamankar, R.1
Lee, Y.2
Doan, A.3
Halevy, A.Y.4
Domingos, P.5
-
18
-
-
52749083110
-
Validating multi-column schema matchings by type
-
B. T. Dai, N. Koudas, D. Srivastava, A. K. H. Tung, and S. Venkatasubramanian, "Validating multi-column schema matchings by type," in ICDE, 2008, pp. 120-129.
-
(2008)
ICDE
, pp. 120-129
-
-
Dai, B.T.1
Koudas, N.2
Srivastava, D.3
Tung, A.K.H.4
Venkatasubramanian, S.5
-
19
-
-
0032091575
-
Integration of heterogeneous databases without common domains using queries based on textual similarity
-
W. W. Cohen, "Integration of heterogeneous databases without common domains using queries based on textual similarity," in SIGMOD, 1998, pp. 201-212.
-
(1998)
SIGMOD
, pp. 201-212
-
-
Cohen, W.W.1
-
20
-
-
84944318804
-
Approximate string joins in a database (almost) for free
-
L. Gravano, P. G. Ipeirotis, H. V. Jagadish, N. Koudas, S. Muthukrishnan, and D. Srivastava, "Approximate string joins in a database (almost) for free," in VLDB, 2001, pp. 491-500.
-
(2001)
VLDB
, pp. 491-500
-
-
Gravano, L.1
Ipeirotis, P.G.2
Jagadish, H.V.3
Koudas, N.4
Muthukrishnan, S.5
Srivastava, D.6
-
21
-
-
0022821574
-
Simple random sampling from relational databases
-
F. Olken and D. Rotem, "Simple random sampling from relational databases," in VLDB, 1986, pp. 160-169.
-
(1986)
VLDB
, pp. 160-169
-
-
Olken, F.1
Rotem, D.2
-
22
-
-
0030157210
-
Bifocal sampling for skew-resistant join size estimation
-
S. Ganguly, P. B. Gibbons, Y. Matias, and A. Silberschatz, "Bifocal sampling for skew-resistant join size estimation," in SIGMOD, 1996, pp. 271-281.
-
(1996)
SIGMOD
, pp. 271-281
-
-
Ganguly, S.1
Gibbons, P.B.2
Matias, Y.3
Silberschatz, A.4
-
23
-
-
0347761807
-
On random sampling over joins
-
S. Chaudhuri, R. Motwani, and V. R. Narasayya, "On random sampling over joins," in SIGMOD, 1999, pp. 263-274.
-
(1999)
SIGMOD
, pp. 263-274
-
-
Chaudhuri, S.1
Motwani, R.2
Narasayya, V.R.3
-
24
-
-
0040885649
-
Congressional samples for approximate answering of group-by queries
-
S. Acharya, P. B. Gibbons, and V. Poosala, "Congressional samples for approximate answering of group-by queries," in SIGMOD, 2000, pp. 487-498.
-
(2000)
SIGMOD
, pp. 487-498
-
-
Acharya, S.1
Gibbons, P.B.2
Poosala, V.3
-
25
-
-
3142697062
-
Effective use of block-level sampling in statistics estimation
-
S. Chaudhuri, G. Das, and U. Srivastava, "Effective use of block-level sampling in statistics estimation," in SIGMOD Conf., 2004, pp. 287-298.
-
SIGMOD Conf., 2004
, pp. 287-298
-
-
Chaudhuri, S.1
Das, G.2
Srivastava, U.3
-
26
-
-
3142745395
-
A bi-level Bernoulli scheme for database sampling
-
P. J. Haas and C. Koenig, "A bi-level Bernoulli scheme for database sampling," in SIGMOD, 2004, pp. 275-286.
-
(2004)
SIGMOD
, pp. 275-286
-
-
Haas, P.J.1
Koenig, C.2
-
27
-
-
3142748410
-
Query sampling in DB2 universal database
-
J. Gryz, J. Guo, L. Liu, and C. Zuzarte, "Query sampling in DB2 universal database," in SIGMOD, 2004, pp. 839-843.
-
(2004)
SIGMOD
, pp. 839-843
-
-
Gryz, J.1
Guo, J.2
Liu, L.3
Zuzarte, C.4
|