-
3
-
-
26244461684
-
Clustering with bregman divergences
-
A. Banerjee, S. Merugu, I. S. Dhillon, and J. Ghosh. Clustering with bregman divergences. Journal of Machine Learning Research, 6:1705-1749, 2005.
-
(2005)
Journal of Machine Learning Research
, vol.6
, pp. 1705-1749
-
-
Banerjee, A.1
Merugu, S.2
Dhillon, I.S.3
Ghosh, J.4
-
4
-
-
84944042118
-
Database schema matching using machine learning with feature selection
-
J. Berlin and A. Motro. Database schema matching using machine learning with feature selection. In CAiSE, pages 452-466, 2002.
-
(2002)
CAiSE
, pp. 452-466
-
-
Berlin, J.1
Motro, A.2
-
5
-
-
84859197607
-
Webtables: Exploring the power of tables on the web
-
M. J. Cafarella, A. Y. Halevy, D. Z. Wang, E. Wu, and Y. Zhang. Webtables: exploring the power of tables on the web. PVLDB, 1(1):538-549, 2008.
-
(2008)
PVLDB
, vol.1
, Issue.1
, pp. 538-549
-
-
Cafarella, M.J.1
Halevy, A.Y.2
Wang, D.Z.3
Wu, E.4
Zhang, Y.5
-
6
-
-
47749140025
-
Bigtable: A distributed storage system for structured data
-
F. Chang, J. Dean, S. Ghemawat, W. C. Hsieh, D. A. Wallach, M. Burrows, T. Chandra, A. Fikes, and R. Gruber. Bigtable: A distributed storage system for structured data. In OSDI, pages 205-218, 2006.
-
(2006)
OSDI
, pp. 205-218
-
-
Chang, F.1
Dean, J.2
Ghemawat, S.3
Hsieh, W.C.4
Wallach, D.A.5
Burrows, M.6
Chandra, T.7
Fikes, A.8
Gruber, R.9
-
7
-
-
35448971511
-
The case for a wide-table approach to manage sparse relational data sets
-
E. Chu, J. L. Beckmann, and J. F. Naughton. The case for a wide-table approach to manage sparse relational data sets. In SIGMOD Conference, pages 821-832, 2007.
-
(2007)
SIGMOD Conference
, pp. 821-832
-
-
Chu, E.1
Beckmann, J.L.2
Naughton, J.F.3
-
8
-
-
52649168832
-
Rapid identification of column heterogeneity
-
B. T. Dai, N. Koudas, B. C. Ooi, D. Srivastava, and S. Venkatasubramanian. Rapid identification of column heterogeneity. In ICDM, pages 159-170, 2006.
-
(2006)
ICDM
, pp. 159-170
-
-
Dai, B.T.1
Koudas, N.2
Ooi, B.C.3
Srivastava, D.4
Venkatasubramanian, S.5
-
9
-
-
52749083110
-
Validating multi-column schema matchings by type
-
B. T. Dai, N. Koudas, D. Srivastava, A. K. H. Tung, and S. Venkatasubramanian. Validating multi-column schema matchings by type. In ICDE, pages 120-129, 2008.
-
(2008)
ICDE
, pp. 120-129
-
-
Dai, B.T.1
Koudas, N.2
Srivastava, D.3
Tung, A.K.H.4
Venkatasubramanian, S.5
-
10
-
-
0034825478
-
Reconciling schemas of disparate data sources: A machine-learning approach
-
A. Doan, P. Domingos, and A. Y. Halevy. Reconciling schemas of disparate data sources: A machine-learning approach. In SIGMOD Conference, pages 509-520, 2001.
-
(2001)
SIGMOD Conference
, pp. 509-520
-
-
Doan, A.1
Domingos, P.2
Halevy, A.Y.3
-
11
-
-
77954093076
-
Malleable schemas: A preliminary report
-
X. Dong and A. Y. Halevy. Malleable schemas: A preliminary report. In WebDB, pages 139-144, 2005.
-
(2005)
WebDB
, pp. 139-144
-
-
Dong, X.1
Halevy, A.Y.2
-
12
-
-
85011051649
-
Data integration with uncertainty
-
X. L. Dong, A. Y. Halevy, and C. Yu. Data integration with uncertainty. In VLDB, pages 687-698, 2007.
-
(2007)
VLDB
, pp. 687-698
-
-
Dong, X.L.1
Halevy, A.Y.2
Yu, C.3
-
13
-
-
0034819889
-
Optimal aggregation algorithms for middleware
-
R. Fagin, A. Lotem, and M. Naor. Optimal aggregation algorithms for middleware. In PODS, 2001.
-
(2001)
PODS
-
-
Fagin, R.1
Lotem, A.2
Naor, M.3
-
14
-
-
70849108163
-
Top-k queries on uncertain data: On score distribution and typical answers
-
T. Ge, S. B. Zdonik, and S. Madden. Top-k queries on uncertain data: on score distribution and typical answers. In SIGMOD Conference, pages 375-388, 2009.
-
(2009)
SIGMOD Conference
, pp. 375-388
-
-
Ge, T.1
Zdonik, S.B.2
Madden, S.3
-
15
-
-
57349126701
-
The structure of collaborative tagging systems
-
S. A. Golder and B. A. Huberman. The structure of collaborative tagging systems. CoRR, 2005.
-
(2005)
CoRR
-
-
Golder, S.A.1
Huberman, B.A.2
-
16
-
-
77954904501
-
Google fusion tables: Data management, integration and collaboration in the cloud
-
ACM
-
H. Gonzalez, A. Halevy, C. Jensen, A. Langen, J. Madhavan, R. Shapley, and W. Shen. Google fusion tables: data management, integration and collaboration in the cloud. In Proceedings of the 1st ACM symposium on Cloud computing, pages 175-180. ACM, 2010.
-
(2010)
Proceedings of the 1st ACM Symposium on Cloud Computing
, pp. 175-180
-
-
Gonzalez, H.1
Halevy, A.2
Jensen, C.3
Langen, A.4
Madhavan, J.5
Shapley, R.6
Shen, W.7
-
17
-
-
79959927816
-
On-the-fly entity-aware query processing in the presence of linkage
-
E. Ioannou, W. Nejdl, C. Niederée, and Y. Velegrakis. On-the-fly entity-aware query processing in the presence of linkage. PVLDB, 3(1):429-438, 2010.
-
(2010)
PVLDB
, vol.3
, Issue.1
, pp. 429-438
-
-
Ioannou, E.1
Nejdl, W.2
Niederée, C.3
Velegrakis, Y.4
-
19
-
-
57149125295
-
Ease: An effective 3-in-1 keyword search method for unstructured, semi-structured and structured data
-
G. Li, B. C. Ooi, J. Feng, J. Wang, and L. Zhou. Ease: an effective 3-in-1 keyword search method for unstructured, semi-structured and structured data. In SIGMOD Conference, pages 903-914, 2008.
-
(2008)
SIGMOD Conference
, pp. 903-914
-
-
Li, G.1
Ooi, B.C.2
Feng, J.3
Wang, J.4
Zhou, L.5
-
20
-
-
0001768982
-
Semantic integration in heterogeneous databases using neural networks
-
W.-S. Li and C. Clifton. Semantic integration in heterogeneous databases using neural networks. In VLDB, pages 1-12, 1994.
-
(1994)
VLDB
, pp. 1-12
-
-
Li, W.-S.1
Clifton, C.2
-
21
-
-
84865656858
-
Evaluating similarity measures for emergent semantics of social tagging
-
B. Markines, C. Cattuto, F. Menczer, D. Benz, A. Hotho, and G. Stumme. Evaluating similarity measures for emergent semantics of social tagging. In WWW, pages 641-650, 2009.
-
(2009)
WWW
, pp. 641-650
-
-
Markines, B.1
Cattuto, C.2
Menczer, F.3
Benz, D.4
Hotho, A.5
Stumme, G.6
-
22
-
-
84859172690
-
Rdf-3x: A risc-style engine for rdf
-
T. Neumann and G. Weikum. Rdf-3x: a risc-style engine for rdf. PVLDB, 1(1):647-659, 2008.
-
(2008)
PVLDB
, vol.1
, Issue.1
, pp. 647-659
-
-
Neumann, T.1
Weikum, G.2
-
23
-
-
0035657983
-
A survey of approaches to automatic schema matching
-
DOI 10.1007/s007780100057
-
E. Rahm and P. A. Bernstein. A survey of approaches to automatic schema matching. The VLDB Journal, 10(4):334-350, 2001. (Pubitemid 33570972)
-
(2001)
VLDB Journal
, vol.10
, Issue.4
, pp. 334-350
-
-
Rahm, E.1
Bernstein, P.A.2
-
24
-
-
79952758877
-
Approximate lineage for probabilistic databases
-
C. Ré and D. Suciu. Approximate lineage for probabilistic databases. PVLDB, 1(1):797-808, 2008.
-
(2008)
PVLDB
, vol.1
, Issue.1
, pp. 797-808
-
-
Ré, C.1
Suciu, D.2
-
25
-
-
57149128190
-
Bootstrapping pay-as-you-go data integration systems
-
A. D. Sarma, X. Dong, and A. Y. Halevy. Bootstrapping pay-as-you-go data integration systems. In SIGMOD Conference, pages 861-874, 2008.
-
(2008)
SIGMOD Conference
, pp. 861-874
-
-
Sarma, A.D.1
Dong, X.2
Halevy, A.Y.3
-
26
-
-
77954722588
-
Openii: An open source information integration toolkit
-
L. Seligman, P. Mork, A. Halevy, K. Smith, M. J. Carey, K. Chen, C. Wolf, J. Madhavan, A. Kannan, and D. Burdick. Openii: an open source information integration toolkit. In SIGMOD '10, pages 1057-1060, 2010.
-
(2010)
SIGMOD '10
, pp. 1057-1060
-
-
Seligman, L.1
Mork, P.2
Halevy, A.3
Smith, K.4
Carey, M.J.5
Chen, K.6
Wolf, C.7
Madhavan, J.8
Kannan, A.9
Burdick, D.10
-
27
-
-
34548724406
-
Top-k query processing in uncertain databases
-
M. A. Soliman, I. F. Ilyas, and K. C.-C. Chang. Top-k query processing in uncertain databases. In ICDE, pages 896-905, 2007.
-
(2007)
ICDE
, pp. 896-905
-
-
Soliman, M.A.1
Ilyas, I.F.2
Chang, K.C.-C.3
-
29
-
-
84858835658
-
Trio: A system for integrated management of data, accuracy, and lineage
-
J. Widom. Trio: A system for integrated management of data, accuracy, and lineage. In CIDR, pages 262-276, 2005.
-
(2005)
CIDR
, pp. 262-276
-
-
Widom, J.1
-
30
-
-
35448953024
-
Indexing dataspaces
-
D. Xin and H. Alon. Indexing dataspaces. In ACM SIGMOD, pages 43-54, 2007.
-
(2007)
ACM SIGMOD
, pp. 43-54
-
-
Xin, D.1
Alon, H.2
-
31
-
-
35448999370
-
Effective keyword-based selection of relational databases
-
B. Yu, G. Li, K. R. Sollins, and A. K. H. Tung. Effective keyword-based selection of relational databases. In SIGMOD Conference, pages 139-150, 2007.
-
(2007)
SIGMOD Conference
, pp. 139-150
-
-
Yu, B.1
Li, G.2
Sollins, K.R.3
Tung, A.K.H.4
-
32
-
-
77956960464
-
Similarity search on bregman divergence: Towards non-metric indexing
-
Z. Zhang, B. Ooi, S. Parthasarathy, and A. Tung. Similarity search on bregman divergence: Towards non-metric indexing. Proceedings of the VLDB Endowment, 2(1):13-24, 2009.
-
(2009)
Proceedings of the VLDB Endowment
, vol.2
, Issue.1
, pp. 13-24
-
-
Zhang, Z.1
Ooi, B.2
Parthasarathy, S.3
Tung, A.4
-
33
-
-
35448995724
-
Query relaxation using malleable schemas
-
X. Zhou, J. Gaugaz, W.-T. Balke, and W. Nejdl. Query relaxation using malleable schemas. In SIGMOD Conference, pages 545-556, 2007.
-
(2007)
SIGMOD Conference
, pp. 545-556
-
-
Zhou, X.1
Gaugaz, J.2
Balke, W.-T.3
Nejdl, W.4
|