-
1
-
-
84875168076
-
-
Apache Oozie. http://oozie.apache.org.
-
Apache Oozie
-
-
-
2
-
-
84904332358
-
-
Apache Storm. http://storm.incubator.apache.org.
-
Apache Storm
-
-
-
4
-
-
85080699869
-
-
Greenplum. http://bit.ly/1oL4Srq.
-
Greenplum
-
-
-
5
-
-
85080697851
-
-
Greenplum. http://basho.com/riak/.
-
Greenplum
-
-
-
6
-
-
85080671150
-
-
Netezza. http://www.ibm.com/software/data/netezza/.
-
-
-
-
7
-
-
85080700912
-
-
Vertica. http://www.vertica.com/.
-
-
-
-
8
-
-
85076640925
-
Reoptimizing data parallel computing
-
San Jose, CA, USENIX
-
S. Agarwal, S. Kandula, N. Bruno, M.-C. Wu, I. Stoica, and J. Zhou. Reoptimizing data parallel computing. In Presented as part of the 9th USENIX Symposium on Networked Systems Design and Implementation (NSDI 12), pages 281–294, San Jose, CA, 2012. USENIX.
-
(2012)
Presented as Part of the 9th USENIX Symposium on Networked Systems Design and Implementation (NSDI 12)
, pp. 281-294
-
-
Agarwal, S.1
Kandula, S.2
Bruno, N.3
Wu, M.-C.4
Stoica, I.5
Zhou, J.6
-
9
-
-
84877703682
-
BlinkDB: Queries with bounded errors and bounded response times on very large data
-
New York, NY, USA, ACM
-
S. Agarwal, B. Mozafari, A. Panda, H. Milner, S. Madden, and I. Stoica. Blinkdb: Queries with bounded errors and bounded response times on very large data. In Proceedings of the 8th ACM European Conference on Computer Systems, EuroSys’13, pages 29–42, New York, NY, USA, 2013. ACM.
-
(2013)
Proceedings of the 8th ACM European Conference on Computer Systems, EuroSys’13
, pp. 29-42
-
-
Agarwal, S.1
Mozafari, B.2
Panda, A.3
Milner, H.4
Madden, S.5
Stoica, I.6
-
10
-
-
84883775660
-
Efficient olap query processing in distributed data warehouses
-
London, UK, UK, Springer-Verlag
-
M. O. Akinde, M. H. Böhlen, T. Johnson, L. V. S. Lakshmanan, and D. Srivastava. Efficient olap query processing in distributed data warehouses. In Proceedings of the 8th International Conference on Extending Database Technology: Advances in Database Technology, EDBT’02, pages 336–353, London, UK, UK, 2002. Springer-Verlag.
-
(2002)
Proceedings of the 8th International Conference on Extending Database Technology: Advances in Database Technology, EDBT ’02
, pp. 336-353
-
-
Akinde, M.O.1
Böhlen, M.H.2
Johnson, T.3
Lakshmanan, L.V.S.4
Srivastava, D.5
-
11
-
-
0030685998
-
Distributed data management in workflow environments
-
G. Alonso, B. Reinwald, and C. Mohan. Distributed data management in workflow environments. In RIDE, 1997.
-
(1997)
RIDE
-
-
Alonso, G.1
Reinwald, B.2
Mohan, C.3
-
12
-
-
84864232092
-
Data infrastructure at linkedin
-
IEEE
-
A. Auradkar, C. Botev, S. Das, D. De Maagd, A. Feinberg, P. Ganti, L. Gao, B. Ghosh, K. Gopalakrishna, and B. Harris. Data infrastructure at linkedin. In 2012 IEEE 28th International Conference on Data Engineering, pages 1370–1381. IEEE, 2012.
-
(2012)
2012 IEEE 28th International Conference on Data Engineering
, pp. 1370-1381
-
-
Auradkar, A.1
Botev, C.2
Das, S.3
de Maagd, D.4
Feinberg, A.5
Ganti, P.6
Gao, L.7
Ghosh, B.8
Gopalakrishna, K.9
Harris, B.10
-
14
-
-
0344811122
-
An overview of data warehousing and OLAP technology
-
S. Chaudhuri and U. Dayal. An overview of data warehousing and OLAP technology. SIGMOD Rec., 1997.
-
(1997)
SIGMOD Rec
-
-
Chaudhuri, S.1
Dayal, U.2
-
15
-
-
47249142395
-
Data placement for scientific applications in distributed environments
-
Washington, DC, USA, IEEE Computer Society
-
A. Chervenak, E. Deelman, M. Livny, M.-H. Su, R. Schuler, S. Bharathi, G. Mehta, and K. Vahi. Data placement for scientific applications in distributed environments. In Proceedings of the 8th IEEE/ACM International Conference on Grid Computing, GRID ’07, pages 267–274, Washington, DC, USA, 2007. IEEE Computer Society.
-
(2007)
Proceedings of the 8th IEEE/ACM International Conference on Grid Computing, GRID ’07
, pp. 267-274
-
-
Chervenak, A.1
Deelman, E.2
Livny, M.3
Su, M.-H.4
Schuler, R.5
Bharathi, S.6
Mehta, G.7
Vahi, K.8
-
16
-
-
85080648835
-
The mixed workload CH-benCHmark
-
R. Cole, F. Funke, L. Giakoumakis, W. Guy, A. Kemper, S. Krompass, H. Kuno, R. Nambiar, T. Neumann, M. Poess, K.-U. Sattler, M. Seibold, E. Simon, and F. Waas. The mixed workload CH-benCHmark. In DBTest’11.
-
DBTest’11
-
-
Cole, R.1
Funke, F.2
Giakoumakis, L.3
Guy, W.4
Kemper, A.5
Krompass, S.6
Kuno, H.7
Nambiar, R.8
Neumann, T.9
Poess, M.10
Sattler, K.-U.11
Seibold, M.12
Simon, E.13
Waas, F.14
-
17
-
-
84867112010
-
Pnuts: Yahoo!’s hosted data serving platform
-
Aug
-
B. F. Cooper, R. Ramakrishnan, U. Srivastava, A. Silberstein, P. Bohannon, H.-A. Jacobsen, N. Puz, D. Weaver, and R. Yerneni. Pnuts: Yahoo!’s hosted data serving platform. Proc. VLDB Endow., 1(2):1277–1288, Aug. 2008.
-
(2008)
Proc. VLDB Endow.
, vol.1
, Issue.2
, pp. 1277-1288
-
-
Cooper, B.F.1
Ramakrishnan, R.2
Srivastava, U.3
Silberstein, A.4
Bohannon, P.5
Jacobsen, H.-A.6
Puz, N.7
Weaver, D.8
Yerneni, R.9
-
18
-
-
85065170765
-
Spanner: Google’s globally-distributed database
-
Hollywood, CA, Oct. USENIX Association
-
J. C. Corbett, J. Dean, M. Epstein, A. Fikes, C. Frost, J. Furman, S. Ghemawat, A. Gubarev, C. Heiser, P. Hochschild, W. Hsieh, S. Kanthak, E. Kogan, H. Li, A. Lloyd, S. Melnik, D. Mwaura, D. Nagle, S. Quinlan, R. Rao, L. Rolig, Y. Saito, M. Szymaniak, C. Taylor, R. Wang, and D. Woodford. Spanner: Google’s globally-distributed database. In 10th USENIX Symposium on Operating Systems Design and Implementation (OSDI 12), pages 261–264, Hollywood, CA, Oct. 2012. USENIX Association.
-
(2012)
10th USENIX Symposium on Operating Systems Design and Implementation (OSDI 12)
, pp. 261-264
-
-
Corbett, J.C.1
Dean, J.2
Epstein, M.3
Fikes, A.4
Frost, C.5
Furman, J.6
Ghemawat, S.7
Gubarev, A.8
Heiser, C.9
Hochschild, P.10
Hsieh, W.11
Kanthak, S.12
Kogan, E.13
Li, H.14
Lloyd, A.15
Melnik, S.16
Mwaura, D.17
Nagle, D.18
Quinlan, S.19
Rao, R.20
Rolig, L.21
Saito, Y.22
Szymaniak, M.23
Taylor, C.24
Wang, R.25
Woodford, D.26
more..
-
19
-
-
29644434815
-
Pegasus: A framework for mapping complex scientific workflows onto distributed systems
-
July
-
E. Deelman, G. Singh, M.-H. Su, J. Blythe, Y. Gil, C. Kesselman, G. Mehta, K. Vahi, G. B. Berriman, J. Good, A. Laity, J. C. Jacob, and D. S. Katz. Pegasus: A framework for mapping complex scientific workflows onto distributed systems. Sci. Program., 13(3):219–237, July 2005.
-
(2005)
Sci. Program.
, vol.13
, Issue.3
, pp. 219-237
-
-
Deelman, E.1
Singh, G.2
Su, M.-H.3
Blythe, J.4
Gil, Y.5
Kesselman, C.6
Mehta, G.7
Vahi, K.8
Berriman, G.B.9
Good, J.10
Laity, A.11
Jacob, J.C.12
Katz, D.S.13
-
21
-
-
84880569459
-
BigBench: Towards an industry standard benchmark for big data analytics
-
New York, NY, USA, ACM
-
A. Ghazal, T. Rabl, M. Hu, F. Raab, M. Poess, A. Crolotte, and H.-A. Jacobsen. Bigbench: Towards an industry standard benchmark for big data analytics. In Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, SIGMOD’13, pages 1197–1208, New York, NY, USA, 2013. ACM.
-
(2013)
Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, SIGMOD’13
, pp. 1197-1208
-
-
Ghazal, A.1
Rabl, T.2
Hu, M.3
Raab, F.4
Poess, M.5
Crolotte, A.6
Jacobsen, H.-A.7
-
22
-
-
84896839521
-
Shared workload optimization
-
G. Giannikis, D. Makreshanski, G. Alonso, and D. Kossmann. Shared workload optimization. Proceedings of the VLDB Endowment, 7(6), 2014.
-
(2014)
Proceedings of the VLDB Endowment
, vol.7
, Issue.6
-
-
Giannikis, G.1
Makreshanski, D.2
Alonso, G.3
Kossmann, D.4
-
23
-
-
84905096068
-
MESA: Geo-replicated, near real-time, scalable data warehousing
-
A. Gupta, F. Yang, J. Govig, A. Kirsch, K. Chan, K. Lai, S. Wu, S. Dhoot, A. Kumar, A. Agiwal, S. Bhansali, M. Hong, J. Cameron, M. Siddiqi, D. Jones, J. Shute, A. Gubarev, S. Venkataraman, and D. Agrawal. Mesa: Geo-replicated, near real-time, scalable data warehousing. In VLDB, 2014.
-
(2014)
VLDB
-
-
Gupta, A.1
Yang, F.2
Govig, J.3
Kirsch, A.4
Chan, K.5
Lai, K.6
Wu, S.7
Dhoot, S.8
Kumar, A.9
Agiwal, A.10
Bhansali, S.11
Hong, M.12
Cameron, J.13
Siddiqi, M.14
Jones, D.15
Shute, J.16
Gubarev, A.17
Venkataraman, S.18
Agrawal, D.19
-
24
-
-
82155174846
-
Profiling, what-if analysis, and cost-based optimization of mapreduce programs
-
H. Herodotou and S. Babu. Profiling, what-if analysis, and cost-based optimization of mapreduce programs. PVLDB, 2011.
-
(2011)
PVLDB
-
-
Herodotou, H.1
Babu, S.2
-
25
-
-
34548724425
-
A cooperative, self-configuring high-availability solution for stream processing
-
April
-
J.-H. Hwang, Y. Xing, U. Cetintemel, and S. Zdonik. A cooperative, self-configuring high-availability solution for stream processing. In Data Engineering, 2007. ICDE 2007. IEEE 23rd International Conference on, pages 176–185, April 2007.
-
(2007)
Data Engineering, 2007. ICDE 2007. IEEE 23rd International Conference on
, pp. 176-185
-
-
Hwang, J.-H.1
Xing, Y.2
Cetintemel, U.3
Zdonik, S.4
-
26
-
-
84904284766
-
Dynamically optimizing queries over large scale data platforms
-
New York, NY, USA, ACM
-
K. Karanasos, A. Balmin, M. Kutsch, F. Ozcan, V. Ercegovac, C. Xia, and J. Jackson. Dynamically optimizing queries over large scale data platforms. In Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, SIGMOD’14, pages 943–954, New York, NY, USA, 2014. ACM.
-
(2014)
Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, SIGMOD’14
, pp. 943-954
-
-
Karanasos, K.1
Balmin, A.2
Kutsch, M.3
Ozcan, F.4
Ercegovac, V.5
Xia, C.6
Jackson, J.7
-
27
-
-
0000500922
-
The state of the art in distributed query processing
-
D. Kossmann. The state of the art in distributed query processing. ACM Comput. Surv., 2000.
-
(2000)
ACM Comput. Surv.
-
-
Kossmann, D.1
-
28
-
-
84877720388
-
MDCC: Multi-data center consistency
-
T. Kraska, G. Pang, M. J. Franklin, S. Madden, and A. Fekete. MDCC: Multi-data center consistency. In EuroSys, 2013.
-
(2013)
EuroSys
-
-
Kraska, T.1
Pang, G.2
Franklin, M.J.3
Madden, S.4
Fekete, A.5
-
29
-
-
84873206169
-
The unified logging infrastructure for data analytics at twitter
-
G. Lee, J. Lin, C. Liu, A. Lorek, and D. Ryaboy. The unified logging infrastructure for data analytics at Twitter. PVLDB, 2012.
-
(2012)
PVLDB
-
-
Lee, G.1
Lin, J.2
Liu, C.3
Lorek, A.4
Ryaboy, D.5
-
30
-
-
84873155673
-
Stubby: A transformation-based optimizer for mapreduce workflows
-
July
-
H. Lim, H. Herodotou, and S. Babu. Stubby: A transformation-based optimizer for mapreduce workflows. Proc. VLDB Endow., 5(11):1196–1207, July 2012.
-
(2012)
Proc. VLDB Endow.
, vol.5
, Issue.11
, pp. 1196-1207
-
-
Lim, H.1
Herodotou, H.2
Babu, S.3
-
31
-
-
0022821509
-
R* optimizer validation and performance evaluation for distributed queries
-
L. F. Mackert and G. M. Lohman. R* optimizer validation and performance evaluation for distributed queries. In VLDB, 1986.
-
(1986)
VLDB
-
-
Mackert, L.F.1
Lohman, G.M.2
-
32
-
-
77958152063
-
Database abstractions for managing sensor network data
-
S. Madden. Database abstractions for managing sensor network data. Proc. of the IEEE, 98(11):1879–1886, 2010.
-
(2010)
Proc. of the IEEE
, vol.98
, Issue.11
, pp. 1879-1886
-
-
Madden, S.1
-
33
-
-
70849116921
-
Privacy integrated queries: An extensible platform for privacy-preserving data analysis
-
F. D. McSherry. Privacy integrated queries: an extensible platform for privacy-preserving data analysis. In SIGMOD, 2009.
-
(2009)
SIGMOD
-
-
McSherry, F.D.1
-
34
-
-
55349148888
-
Pig latin: A not-so-foreign language for data processing
-
New York, NY, USA, ACM
-
C. Olston, B. Reed, U. Srivastava, R. Kumar, and A. Tomkins. Pig latin: A not-so-foreign language for data processing. In Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, SIGMOD’08, pages 1099–1110, New York, NY, USA, 2008. ACM.
-
(2008)
Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, SIGMOD ’08
, pp. 1099-1110
-
-
Olston, C.1
Reed, B.2
Srivastava, U.3
Kumar, R.4
Tomkins, A.5
-
35
-
-
85076894949
-
Aggregation and degradation in jetstream: Streaming analytics in the wide area
-
Seattle, WA, Apr. USENIX Association
-
A. Rabkin, M. Arye, S. Sen, V. S. Pai, and M. J. Freedman. Aggregation and degradation in jetstream: Streaming analytics in the wide area. In 11th USENIX Symposium on Networked Systems Design and Implementation (NSDI 14), pages 275–288, Seattle, WA, Apr. 2014. USENIX Association.
-
(2014)
11th USENIX Symposium on Networked Systems Design and Implementation (NSDI 14)
, pp. 275-288
-
-
Rabkin, A.1
Arye, M.2
Sen, S.3
Pai, V.S.4
Freedman, M.J.5
-
36
-
-
85009705529
-
Appinsight: Mobile app performance monitoring in the wild
-
Berkeley, CA, USA, USENIX Association
-
L. Ravindranath, J. Padhye, S. Agarwal, R. Mahajan, I. Obermiller, and S. Shayandeh. Appinsight: Mobile app performance monitoring in the wild. In Proceedings of the 10th USENIX Conference on Operating Systems Design and Implementation, OSDI’12, pages 107–120, Berkeley, CA, USA, 2012. USENIX Association.
-
(2012)
Proceedings of the 10th USENIX Conference on Operating Systems Design and Implementation, OSDI’12
, pp. 107-120
-
-
Ravindranath, L.1
Padhye, J.2
Agarwal, S.3
Mahajan, R.4
Obermiller, I.5
Shayandeh, S.6
-
37
-
-
84880515345
-
SciDB: A database management system for applications with complex analytics
-
May
-
M. Stonebraker, P. Brown, D. Zhang, and J. Becla. Scidb: A database management system for applications with complex analytics. Computing in Science Engineering, 15(3):54–62, May 2013.
-
(2013)
Computing in Science Engineering
, vol.15
, Issue.3
, pp. 54-62
-
-
Stonebraker, M.1
Brown, P.2
Zhang, D.3
Becla, J.4
-
38
-
-
77954709174
-
Data warehousing and analytics infrastructure at facebook
-
New York, NY, USA, ACM
-
A. Thusoo, Z. Shao, S. Anthony, D. Borthakur, N. Jain, J. Sen Sarma, R. Murthy, and H. Liu. Data warehousing and analytics infrastructure at facebook. In Proceedings of the 2010 ACM SIGMOD International Conference on Management of Data, SIGMOD’10, pages 1013–1020, New York, NY, USA, 2010. ACM.
-
(2010)
Proceedings of the 2010 ACM SIGMOD International Conference on Management of Data, SIGMOD’10
, pp. 1013-1020
-
-
Thusoo, A.1
Shao, Z.2
Anthony, S.3
Borthakur, D.4
Jain, N.5
Sen Sarma, J.6
Murthy, R.7
Liu, H.8
-
39
-
-
84880533620
-
Shark: Sql and rich analytics at scale
-
New York, NY, USA, ACM
-
R. S. Xin, J. Rosen, M. Zaharia, M. J. Franklin, S. Shenker, and I. Stoica. Shark: Sql and rich analytics at scale. In Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, SIGMOD’13, pages 13–24, New York, NY, USA, 2013. ACM.
-
(2013)
Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, SIGMOD’13
, pp. 13-24
-
-
Xin, R.S.1
Rosen, J.2
Zaharia, M.3
Franklin, M.J.4
Shenker, S.5
Stoica, I.6
-
40
-
-
85040175609
-
Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing
-
USENIX Association
-
M. Zaharia, M. Chowdhury, T. Das, A. Dave, J. Ma, M. McCauley, M. J. Franklin, S. Shenker, and I. Stoica. Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing. In Proceedings of the 9th USENIX conference on Networked Systems Design and Implementation, pages 2–2. USENIX Association, 2012.
-
(2012)
Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation
, pp. 2
-
-
Zaharia, M.1
Chowdhury, M.2
Das, T.3
Dave, A.4
Ma, J.5
McCauley, M.6
Franklin, M.J.7
Shenker, S.8
Stoica, I.9
|