-
12
-
-
84957589502
-
-
https://issues.apache.org/jira/browse/HDFS-5851
-
A. Agarwal. https://issues.apache.org/jira/browse/hdfs-5851, 2014. https://issues.apache.org/jira/browse/HDFS-5851.
-
(2014)
-
-
Agarwal, A.1
-
13
-
-
77954942463
-
Towards automatic optimization of mapreduce programs
-
New York, NY, USA, ACM
-
S. Babu. Towards automatic optimization of mapreduce programs. In Proceedings of the 1st ACM Symposium on Cloud Computing, SoCC '10, pages 137-142, New York, NY, USA, 2010. ACM.
-
(2010)
Proceedings of the 1st ACM Symposium on Cloud Computing, SoCC '10
, pp. 137-142
-
-
Babu, S.1
-
14
-
-
77954948422
-
Nephele/PACTs: A programming model and execution framework for web-scale analytical processing
-
D. Battré, S. Ewen, F. Hueske, O. Kao, V. Markl, and D. Warneke. Nephele/PACTs: A programming model and execution framework for web-scale analytical processing. In SOCC, 2010.
-
(2010)
SOCC
-
-
Battré, D.1
Ewen, S.2
Hueske, F.3
Kao, O.4
Markl, V.5
Warneke, D.6
-
15
-
-
79957872898
-
Hyracks: A flexible and extensible foundation for data-intensive computing
-
V. Borkar, M. Carey, R. Grover, N. Onose, and R. Vernica. Hyracks: A flexible and extensible foundation for data-intensive computing. In ICDE, 2011.
-
(2011)
ICDE
-
-
Borkar, V.1
Carey, M.2
Grover, R.3
Onose, N.4
Vernica, R.5
-
16
-
-
84860687282
-
-
D. Borthakur. Hdfs architecture guide. HADOOP APACHE PROJECT http://hadoop. apache.org/common/docs/current/hdfs design. pdf, 2008.
-
(2008)
Hdfs Architecture Guide
-
-
Borthakur, D.1
-
18
-
-
84881332659
-
Tajo: A distributed data warehouse system on large clusters
-
IEEE
-
H. Choi, J. Son, H. Yang, H. Ryu, B. Lim, S. Kim, and Y. D. Chung. Tajo: A distributed data warehouse system on large clusters. In Data Engineering (ICDE), 2013 IEEE 29th International Conference on, pages 1320-1323. IEEE, 2013.
-
(2013)
Data Engineering (ICDE), 2013 IEEE 29th International Conference On
, pp. 1320-1323
-
-
Choi, H.1
Son, J.2
Yang, H.3
Ryu, H.4
Lim, B.5
Kim, S.6
Chung, Y.D.7
-
19
-
-
84891121890
-
Reef: Retainable evaluator execution framework
-
B.-G. Chun, T. Condie, C. Curino, C. Douglas, S. Matusevych, B. Myers, S. Narayanamurthy, R. Ramakrishnan, S. Rao, J. Rosen, et al. Reef: Retainable evaluator execution framework. Proceedings of the VLDB Endowment, 6(12):1370-1373, 2013.
-
(2013)
Proceedings of the VLDB Endowment
, vol.6
, Issue.12
, pp. 1370-1373
-
-
Chun, B.-G.1
Condie, T.2
Curino, C.3
Douglas, C.4
Matusevych, S.5
Myers, B.6
Narayanamurthy, S.7
Ramakrishnan, R.8
Rao, S.9
Rosen, J.10
-
21
-
-
37549003336
-
MapReduce: Simplified data processing on large clusters
-
J. Dean and S. Ghemawat. MapReduce: simplified data processing on large clusters. Commun. ACM, 51, 2008.
-
(2008)
Commun. ACM
, vol.51
-
-
Dean, J.1
Ghemawat, S.2
-
23
-
-
84991773782
-
Apache drill: Interactive ad-hoc analysis at scale
-
M. Hausenblas and J. Nadeau. Apache drill: interactive ad-hoc analysis at scale. Big Data, 1(2):100-104, 2013.
-
(2013)
Big Data
, vol.1
, Issue.2
, pp. 100-104
-
-
Hausenblas, M.1
Nadeau, J.2
-
24
-
-
35448961922
-
Dryad: Distributed data-parallel programs from sequential building blocks
-
M. Isard, M. Budiu, Y. Yu, A. Birrell, and D. Fetterly. Dryad: distributed data-parallel programs from sequential building blocks. SIGOPS, 2007.
-
(2007)
SIGOPS
-
-
Isard, M.1
Budiu, M.2
Yu, Y.3
Birrell, A.4
Fetterly, D.5
-
25
-
-
85084011754
-
Impala: A modern, open-source SQL engine for hadoop
-
M. Kornacker, A. Behm, V. Bittorf, and et al. Impala: A modern, open-source sql engine for hadoop. In CIDR, 2015.
-
(2015)
CIDR
-
-
Kornacker, M.1
Behm, A.2
Bittorf, V.3
-
26
-
-
79958258284
-
Dremel: Interactive analysis of web-scale datasets
-
Sept.
-
S. Melnik, A. Gubarev, J. J. Long, G. Romer, S. Shivakumar, M. Tolton, and T. Vassilakis. Dremel: Interactive analysis of web-scale datasets. Proc. VLDB Endow., 3(1-2):330-339, Sept. 2010.
-
(2010)
Proc. VLDB Endow.
, vol.3
, Issue.1-2
, pp. 330-339
-
-
Melnik, S.1
Gubarev, A.2
Long, J.J.3
Romer, G.4
Shivakumar, S.5
Tolton, M.6
Vassilakis, T.7
-
27
-
-
84875605548
-
-
Technical report, Tech. rep., Apache Hadoop
-
A. C. Murthy, C. Douglas, M. Konar, O. O'Malley, S. Radia, S. Agarwal, and V. KV. Architecture of next generation apache hadoop mapreduce framework. Technical report, Tech. rep., Apache Hadoop, 2011.
-
(2011)
Architecture of Next Generation Apache Hadoop Mapreduce Framework
-
-
Murthy, A.C.1
Douglas, C.2
Konar, M.3
O'Malley, O.4
Radia, S.5
Agarwal, S.6
Vi, K.V.7
-
28
-
-
55349148888
-
Pig Latin: A not-so-foreign language for data processing
-
New York, NY, USA, ACM
-
C. Olston, B. Reed, U. Srivastava, R. Kumar, and A. Tomkins. Pig latin: a not-so-foreign language for data processing. In Proceedings of the 2008 ACM SIGMOD international conference on Management of data, SIGMOD '08, pages 1099-1110, New York, NY, USA, 2008. ACM.
-
(2008)
Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, SIGMOD '08
, pp. 1099-1110
-
-
Olston, C.1
Reed, B.2
Srivastava, U.3
Kumar, R.4
Tomkins, A.5
-
29
-
-
73649141347
-
Mapreduce and parallel dbmss: Friends or foes?
-
Jan.
-
M. Stonebraker, D. Abadi, D. J. DeWitt, S. Madden, E. Paulson, A. Pavlo, and A. Rasin. Mapreduce and parallel dbmss: Friends or foes? Commun. ACM, 53(1):64-71, Jan. 2010.
-
(2010)
Commun. ACM
, vol.53
, Issue.1
, pp. 64-71
-
-
Stonebraker, M.1
Abadi, D.2
DeWitt, D.J.3
Madden, S.4
Paulson, E.5
Pavlo, A.6
Rasin, A.7
-
30
-
-
84868325513
-
Hive - A warehousing solution over a map-reduce framework
-
A. Thusoo, J. S. Sarma, N. Jain, Z. Shao, P. Chakka, S. Anthony, H. Liu, P. Wyckoff, and R. Murthy. Hive - a warehousing solution over a map-reduce framework. In PVLDB, 2009.
-
(2009)
PVLDB
-
-
Thusoo, A.1
Sarma, J.S.2
Jain, N.3
Shao, Z.4
Chakka, P.5
Anthony, S.6
Liu, H.7
Wyckoff, P.8
Murthy, R.9
-
32
-
-
84893249524
-
Apache hadoop yarn: Yet another resource negotiator
-
V. K. Vavilapalli, A. C. Murthy, C. Douglas, S. Agarwal, M. Konar, R. Evans, T. Graves, J. Lowe, H. Shah, S. Seth, B. Saha, C. Curino, O. O'Malley, S. Radia, B. Reed, and E. Baldeschwieler. Apache hadoop yarn: Yet another resource negotiator. In SOCC, 2013.
-
(2013)
SOCC
-
-
Vavilapalli, V.K.1
Murthy, A.C.2
Douglas, C.3
Agarwal, S.4
Konar, M.5
Evans, R.6
Graves, T.7
Lowe, J.8
Shah, H.9
Seth, S.10
Saha, B.11
Curino, C.12
O'Malley, O.13
Radia, S.14
Reed, B.15
Baldeschwieler, E.16
-
34
-
-
85084012477
-
Wanalytics: Analytics for a geo-distributed data-intensive world
-
A. Vulimiri, C. Curino, B. Godfrey, K. Karanasos, and G. Varghese. Wanalytics: Analytics for a geo-distributed data-intensive world. In CIDR, 2015.
-
(2015)
CIDR
-
-
Vulimiri, A.1
Curino, C.2
Godfrey, B.3
Karanasos, K.4
Varghese, G.5
-
35
-
-
84957558213
-
-
Yahoo Hadoop Platforms Team. Yahoo Betting on Apache Hive, Tez, and YARN, 2014. http://yahoodevelopers.tumblr.com/post/85930551108/yahoo-betting-on-apache-hive-tez-and-yarn.
-
(2014)
Yahoo Betting on Apache Hive, Tez, and YARN
-
-
-
36
-
-
77954636142
-
Delay scheduling: A simple technique for achieving locality and fairness in cluster scheduling
-
ACM
-
M. Zaharia, D. Borthakur, J. Sen Sarma, K. Elmeleegy, S. Shenker, and I. Stoica. Delay scheduling: a simple technique for achieving locality and fairness in cluster scheduling. In Proceedings of the 5th European conference on Computer systems, pages 265-278. ACM, 2010.
-
(2010)
Proceedings of the 5th European Conference on Computer Systems
, pp. 265-278
-
-
Zaharia, M.1
Borthakur, D.2
Sen Sarma, J.3
Elmeleegy, K.4
Shenker, S.5
Stoica, I.6
-
37
-
-
85040175609
-
Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing
-
USENIX Association
-
M. Zaharia, M. Chowdhury, T. Das, A. Dave, J. Ma, M. McCauley, M. J. Franklin, S. Shenker, and I. Stoica. Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing. In Proceedings of the 9th USENIX conference on Networked Systems Design and Implementation, pages 2-2. USENIX Association, 2012.
-
(2012)
Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation
, pp. 2
-
-
Zaharia, M.1
Chowdhury, M.2
Das, T.3
Dave, A.4
Ma, J.5
McCauley, M.6
Franklin, M.J.7
Shenker, S.8
Stoica, I.9
-
38
-
-
85085251984
-
Spark: Cluster computing with working sets
-
Berkeley, CA, USA, USENIX Association
-
M. Zaharia, M. Chowdhury, M. J. Franklin, S. Shenker, and I. Stoica. Spark: Cluster computing with working sets. In Proceedings of the 2Nd USENIX Conference on Hot Topics in Cloud Computing, HotCloud'10, pages 10-10, Berkeley, CA, USA, 2010. USENIX Association.
-
(2010)
Proceedings of the 2Nd USENIX Conference on Hot Topics in Cloud Computing, HotCloud'10
, pp. 10
-
-
Zaharia, M.1
Chowdhury, M.2
Franklin, M.J.3
Shenker, S.4
Stoica, I.5
|