-
1
-
-
84863526924
-
-
Apache Hadoop, http://hadoop.apache.org.
-
-
-
-
2
-
-
77951634416
-
Building a cloud for Yahoo!
-
B. F. Cooper et al., "Building a cloud for Yahoo!" IEEE Data Eng. Bull., vol. 32, no. 1, pp. 36-43, 2009.
-
(2009)
IEEE Data Eng. Bull.
, vol.32
, Issue.1
, pp. 36-43
-
-
Cooper, B.F.1
-
3
-
-
73649114265
-
MapReduce: A flexible data processing tool
-
J. Dean and S. Ghemawat, "MapReduce: a flexible data processing tool," Commun. ACM, vol. 53, no. 1, pp. 72-77, 2010.
-
(2010)
Commun. ACM
, vol.53
, Issue.1
, pp. 72-77
-
-
Dean, J.1
Ghemawat, S.2
-
4
-
-
77952278077
-
Building a highlevel dataflow system on top of MapReduce: The Pig experience
-
A. Gates et al., "Building a highlevel dataflow system on top of MapReduce: the Pig experience," PVLDB, vol. 2, no. 2, pp. 1414-1425, 2009.
-
(2009)
PVLDB
, vol.2
, Issue.2
, pp. 1414-1425
-
-
Gates, A.1
-
5
-
-
77952775707
-
Hive - A petabyte scale data warehouse using Hadoop
-
A. Thusoo et al., "Hive - a petabyte scale data warehouse using Hadoop," in ICDE, 2010, pp. 996-1005.
-
(2010)
ICDE
, pp. 996-1005
-
-
Thusoo, A.1
-
6
-
-
79956351190
-
Haloop: Efficient iterative data processing on large clusters
-
Y. Bu and et al., "Haloop: Efficient iterative data processing on large clusters," PVLDB, vol. 3, no. 1, pp. 285-296, 2010.
-
(2010)
PVLDB
, vol.3
, Issue.1
, pp. 285-296
-
-
Bu, Y.1
-
7
-
-
85076771850
-
MapReduce online
-
T. Condie et al., "MapReduce online," in NSDI, 2010, pp. 313-328.
-
(2010)
NSDI
, pp. 313-328
-
-
Condie, T.1
-
8
-
-
80053521271
-
Hadoop++: Making a yellow elephant run like a cheetah (without it even noticing)
-
J. Dittrich et al., "Hadoop++: Making a yellow elephant run like a cheetah (without it even noticing)," PVLDB, vol. 3, no. 1, pp. 518-529, 2010.
-
(2010)
PVLDB
, vol.3
, Issue.1
, pp. 518-529
-
-
Dittrich, J.1
-
9
-
-
81055143288
-
The performance of MapReduce: An in-depth study
-
D. Jiang et al., "The performance of MapReduce: an in-depth study," PVLDB, vol. 3, no. 1, pp. 472-483, 2010.
-
(2010)
PVLDB
, vol.3
, Issue.1
, pp. 472-483
-
-
Jiang, D.1
-
10
-
-
84859260019
-
MRShare: Sharing across multiple queries in MapReduce
-
T. Nykiel et al., "MRShare: sharing across multiple queries in MapReduce," PVLDB, vol. 3, no. 1, pp. 494-505, 2010.
-
(2010)
PVLDB
, vol.3
, Issue.1
, pp. 494-505
-
-
Nykiel, T.1
-
11
-
-
34548041192
-
Dryad: Distributed data-parallel programs from sequential building blocks
-
M. Isard, M. Budiu, Y. Yu, A. Birrell, and D. Fetterly, "Dryad: distributed data-parallel programs from sequential building blocks," in EuroSys, 2007, pp. 59-72.
-
(2007)
EuroSys
, pp. 59-72
-
-
Isard, M.1
Budiu, M.2
Yu, Y.3
Birrell, A.4
Fetterly, D.5
-
12
-
-
79957872898
-
Hyracks: A flexible and extensible foundation for data-intensive computing
-
V. R. Borkar, M. J. Carey, R. Grover, N. Onose, and R. Vernica, "Hyracks: A flexible and extensible foundation for data-intensive computing," in ICDE, 2011.
-
(2011)
ICDE
-
-
Borkar, V.R.1
Carey, M.J.2
Grover, R.3
Onose, N.4
Vernica, R.5
-
14
-
-
77954948422
-
Nephele/PACTs: A programming model and execution framework for web-scale analytical processing
-
D. Battré et al., "Nephele/PACTs: a programming model and execution framework for web-scale analytical processing," in SoCC, 2010, pp. 119-130.
-
(2010)
SoCC
, pp. 119-130
-
-
Battré, D.1
-
15
-
-
85049119901
-
Ciel: A universal execution engine for distributed data-flow computing
-
D. G. Murray, M. Schwarzkopf, C. Smowton, S. Smith, A. Madhavapeddy, and S. Hand, "Ciel: a universal execution engine for distributed data-flow computing," in NSDI, 2011.
-
(2011)
NSDI
-
-
Murray, D.G.1
Schwarzkopf, M.2
Smowton, C.3
Smith, S.4
Madhavapeddy, A.5
Hand, S.6
-
16
-
-
84863526919
-
-
Jaql, http://www.jaql.org.
-
-
-
-
17
-
-
77954942463
-
Towards automatic optimization of MapReduce programs
-
S. Babu, "Towards automatic optimization of MapReduce programs," in SoCC, 2010, pp. 137-142.
-
(2010)
SoCC
, pp. 137-142
-
-
Babu, S.1
-
18
-
-
84976820332
-
Adaptive parallel aggregation algorithms
-
A. Shatdal et al., "Adaptive parallel aggregation algorithms," in SIGMOD Conf., 1995, pp. 104-114.
-
(1995)
SIGMOD Conf.
, pp. 104-114
-
-
Shatdal, A.1
-
19
-
-
0003385843
-
Parallel sorting on a shared-nothing architecture using probabilistic splitting
-
D. J. DeWitt, J. F. Naughton, and D. A. Schneider, "Parallel sorting on a shared-nothing architecture using probabilistic splitting," in PDIS, 1991, pp. 280-291.
-
(1991)
PDIS
, pp. 280-291
-
-
DeWitt, D.J.1
Naughton, J.F.2
Schneider, D.A.3
-
20
-
-
84976736061
-
A performance evaluation of four parallel join algorithms in a shared-nothing multiprocessor environment
-
D. A. Schneider et al., "A performance evaluation of four parallel join algorithms in a shared-nothing multiprocessor environment," in SIGMOD Conf., 1989, pp. 110-121.
-
(1989)
SIGMOD Conf.
, pp. 110-121
-
-
Schneider, D.A.1
-
21
-
-
84863526922
-
-
Apache ZooKeeper http://hadoop.apache.org/zookeeper.
-
-
-
-
22
-
-
79951761350
-
ZooKeeper: Wait-free coordination for internet-scale systems
-
P. Hunt et al., "ZooKeeper: wait-free coordination for internet-scale systems," in USENIX Conf., 2010.
-
USENIX Conf., 2010.
-
-
Hunt, P.1
-
23
-
-
77954636142
-
Delay scheduling: A simple technique for achieving locality and fairness in cluster scheduling
-
M. Zaharia et al., "Delay scheduling: a simple technique for achieving locality and fairness in cluster scheduling," in EuroSys, 2010, pp. 265-278.
-
(2010)
EuroSys
, pp. 265-278
-
-
Zaharia, M.1
-
24
-
-
84860383003
-
FLEX: A slot allocation scheduling optimizer for MapReduce workloads
-
to appear
-
J. Wolf et al., "FLEX: a slot allocation scheduling optimizer for MapReduce workloads," in Middleware, to appear, 2010.
-
(2010)
Middleware
-
-
Wolf, J.1
-
25
-
-
85011093340
-
Adaptive aggregation on chip multiprocessors
-
J. Cieslewicz and K. A. Ross, "Adaptive aggregation on chip multiprocessors," in VLDB, 2007, pp. 339-350.
-
(2007)
VLDB
, pp. 339-350
-
-
Cieslewicz, J.1
Ross, K.A.2
-
26
-
-
77954700016
-
A comparison of join algorithms for log processing in MapReduce
-
S. Blanas et al., "A comparison of join algorithms for log processing in MapReduce," in SIGMOD Conf., 2010, pp. 975-986.
-
SIGMOD Conf., 2010
, pp. 975-986
-
-
Blanas, S.1
-
27
-
-
77954744650
-
Efficient parallel set-similarity joins using MapReduce
-
R. Vernica et al., "Efficient parallel set-similarity joins using MapReduce," in SIGMOD Conf., 2010.
-
SIGMOD Conf., 2010
-
-
Vernica, R.1
-
29
-
-
84863513686
-
-
"Skewed join in Pig,"http://wiki.apache.org/pig/ PigSkewedJoinSpec.
-
Skewed Join in Pig
-
-
-
30
-
-
79957809015
-
HadoopDB: An architectural hybrid of MapReduce and dbms technologies for analytical workloads
-
A. Abouzeid et al., "HadoopDB: an architectural hybrid of MapReduce and dbms technologies for analytical workloads," PVLDB, vol. 2, no. 1, pp. 922-933, 2009.
-
(2009)
PVLDB
, vol.2
, Issue.1
, pp. 922-933
-
-
Abouzeid, A.1
-
31
-
-
70350512695
-
A comparison of approaches to large-scale data analysis
-
A. Pavlo et al., "A comparison of approaches to large-scale data analysis," in SIGMOD Conf., 2009, pp. 165-178.
-
SIGMOD Conf., 2009
, pp. 165-178
-
-
Pavlo, A.1
-
32
-
-
70349547303
-
Automatic optimization of parallel dataflow programs
-
C. Olston et al., "Automatic optimization of parallel dataflow programs," in USENIX Conf., 2008, pp. 267-273.
-
USENIX Conf., 2008
, pp. 267-273
-
-
Olston, C.1
-
33
-
-
79959939881
-
A platform for scalable one-pass analytics using mapreduce
-
B. Li, E. Mazur, Y. Diao, A. McGregor, and P. Shenoy, "A platform for scalable one-pass analytics using mapreduce,"sIGMOD 2011.
-
(2011)
SIGMOD
-
-
Li, B.1
Mazur, E.2
Diao, Y.3
McGregor, A.4
Shenoy, P.5
-
34
-
-
65749311706
-
Application of hash to data base machine and its architecture
-
M. Kitsuregawa et al., "Application of hash to data base machine and its architecture," New Generation Comput., vol. 1, no. 1, pp. 63-74, 1983.
-
(1983)
New Generation Comput.
, vol.1
, Issue.1
, pp. 63-74
-
-
Kitsuregawa, M.1
-
35
-
-
0345638022
-
Practical skew handling in parallel joins
-
D. J. DeWitt, J. F. Naughton, D. A. Schneider, and S. Seshadri, "Practical skew handling in parallel joins," in VLDB, 1992, pp. 27-40.
-
(1992)
VLDB
, pp. 27-40
-
-
DeWitt, D.J.1
Naughton, J.F.2
Schneider, D.A.3
Seshadri, S.4
-
36
-
-
0028698894
-
New algorithms for parallelizing relational database joins in the presence of data skew
-
J. L. Wolf et al., "New algorithms for parallelizing relational database joins in the presence of data skew," IEEE Trans. Knowl. Data Eng., vol. 6, no. 6, pp. 990-997, 1994.
-
(1994)
IEEE Trans. Knowl. Data Eng.
, vol.6
, Issue.6
, pp. 990-997
-
-
Wolf, J.L.1
|