-
1
-
-
77954727236
-
FlumeJava: Easy, Efficient Data-Parallel Pipelines
-
C. Chambers, A. Raniwala, F. Perry, S. Adams, R. R. Henry, R. Bradshaw, and N. Weizenbaum. FlumeJava: Easy, Efficient Data-Parallel Pipelines. In PLDI, pages 363-375, 2010.
-
(2010)
PLDI
, pp. 363-375
-
-
Chambers, C.1
Raniwala, A.2
Perry, F.3
Adams, S.4
Henry, R.R.5
Bradshaw, R.6
Weizenbaum, N.7
-
2
-
-
0006015648
-
Querying Multiple Features of Groups in Relational Databases
-
D. Chatziantoniou and K. A. Ross. Querying Multiple Features of Groups in Relational Databases. In VLDB, pages 295-306, 1996.
-
(1996)
VLDB
, pp. 295-306
-
-
Chatziantoniou, D.1
Ross, K.A.2
-
3
-
-
84873168544
-
-
Cloudera: 7 Tips for Improving MapReduce Performance. cloudera. com/blog/2009/12/7-tips-for-improving-mapreduce-performance
-
Cloudera: 7 Tips for Improving MapReduce Performance. cloudera. com/blog/2009/12/7-tips-for-improving-mapreduce-performance.
-
-
-
-
4
-
-
85030321143
-
MapReduce: Simplified Data Processing on Large Clusters
-
J. Dean and S. Ghemawat. MapReduce: Simplified Data Processing on Large Clusters. In OSDI, pages 137-150, 2004.
-
(2004)
OSDI
, pp. 137-150
-
-
Dean, J.1
Ghemawat, S.2
-
5
-
-
77952278077
-
Building a High-Level Dataflow System on top of Map-Reduce: The Pig Experience
-
A. Gates, O. Natkovich, S. Chopra, P. Kamath, S. Narayanam, C. Olston, B. Reed, S. Srinivasan, and U. Srivastava. Building a High-Level Dataflow System on top of Map-Reduce: The Pig Experience. PVLDB, 2(2):1414-1425, 2009.
-
(2009)
PVLDB
, vol.2
, Issue.2
, pp. 1414-1425
-
-
Gates, A.1
Natkovich, O.2
Chopra, S.3
Kamath, P.4
Narayanam, S.5
Olston, C.6
Reed, B.7
Srinivasan, S.8
Srivastava, U.9
-
6
-
-
84976698894
-
The EXODUS Optimizer Generator
-
G. Graefe and D. J. DeWitt. The EXODUS Optimizer Generator. In SIGMOD, pages 160-172, 1987.
-
(1987)
SIGMOD
, pp. 160-172
-
-
Graefe, G.1
DeWitt, D.J.2
-
7
-
-
84873128036
-
-
Apache Hadoop. http://hadoop.apache.org/.
-
-
-
Hadoop, A.1
-
8
-
-
82155174846
-
Profiling, What-if Analysis, and Cost-based Optimization of MapReduce Programs
-
H. Herodotou and S. Babu. Profiling, What-if Analysis, and Cost-based Optimization of MapReduce Programs. PVLDB, 4(11):1111-1122, 2011.
-
(2011)
PVLDB
, vol.4
, Issue.11
, pp. 1111-1122
-
-
Herodotou, H.1
Babu, S.2
-
9
-
-
84873104343
-
-
Apache Hive. http://hive.apache.org/.
-
-
-
Hive, A.1
-
10
-
-
84863535860
-
Automatic Optimization for MapReduce Programs
-
E. Jahani, M. J. Cafarella, and C. Ŕe. Automatic Optimization for MapReduce Programs. PVLDB, 4(6):385-396, 2011.
-
PVLDB
, vol.4
, Issue.6
, pp. 385-396
-
-
Jahani, E.1
Cafarella, M.J.2
Ŕe, C.3
-
11
-
-
80051874596
-
YSmart: Yet Another SQL-to-MapReduce Translator
-
R. Lee, T. Luo, Y. Huai, F. Wang, Y. He, and X. Zhang. YSmart: Yet Another SQL-to-MapReduce Translator. In ICDCS, pages 25-36, 2011.
-
ICDCS
, pp. 25-36
-
-
Lee, R.1
Luo, T.2
Huai, Y.3
Wang, F.4
He, Y.5
Zhang, X.6
-
13
-
-
84859260019
-
MRShare: Sharing Across Multiple Queries in MapReduce
-
T. Nykiel, M. Potamias, C. Mishra, G. Kollios, and N. Koudas. MRShare: Sharing Across Multiple Queries in MapReduce. PVLDB, 3(1):494-505, 2010.
-
(2010)
PVLDB
, vol.3
, Issue.1
, pp. 494-505
-
-
Nykiel, T.1
Potamias, M.2
Mishra, C.3
Kollios, G.4
Koudas, N.5
-
14
-
-
70349547303
-
Automatic Optimization of Parallel Dataflow Programs
-
C. Olston, B. Reed, A. Silberstein, and U. Srivastava. Automatic Optimization of Parallel Dataflow Programs. In USENIX Annual Technical Conference, pages 267-273, 2008.
-
(2008)
USENIX Annual Technical Conference
, pp. 267-273
-
-
Olston, C.1
Reed, B.2
Silberstein, A.3
Srivastava, U.4
-
15
-
-
85039666809
-
-
Oozie: Workflow Engine for Hadoop
-
Oozie: Workflow Engine for Hadoop. http://yahoo.github.com/oozie/.
-
-
-
-
16
-
-
0003780986
-
The PageRank Citation Ranking: Bringing Order to the Web
-
Technical Report 1999-66, Stanford Info Lab, November
-
L. Page, S. Brin, R. Motwani, and T. Winograd. The PageRank Citation Ranking: Bringing Order to the Web. Technical Report 1999-66, Stanford Info Lab, November 1999.
-
(1999)
-
-
Page, L.1
Brin, S.2
Motwani, R.3
Winograd, T.4
-
17
-
-
70350512695
-
A Comparison of Approaches to Large-Scale Data Analysis
-
A. Pavlo, E. Paulson, A. Rasin, D. J. Abadi, D. J. DeWitt, S. Madden, and M. Stonebraker. A Comparison of Approaches to Large-Scale Data Analysis. In SIGMOD, pages 165-178, 2009.
-
(2009)
SIGMOD
, pp. 165-178
-
-
Pavlo, A.1
Paulson, E.2
Rasin, A.3
Abadi, D.J.4
DeWitt, D.J.5
Madden, S.6
Stonebraker, M.7
-
18
-
-
84873179078
-
-
Apache Pig. http://pig.apache.org/.
-
-
-
Pig, A.1
-
19
-
-
27644552868
-
Optimizing ETL Processes in Data Warehouses
-
A. Simitsis, P. Vassiliadis, and T. K. Sellis. Optimizing ETL Processes in Data Warehouses. In ICDE, pages 564-575, 2005.
-
(2005)
ICDE
, pp. 564-575
-
-
Simitsis, A.1
Vassiliadis, P.2
Sellis, T.K.3
-
20
-
-
84873160383
-
-
TPC-H Benchmark Specification
-
TPC-H Benchmark Specification. http://www.tpc.org/tpch/.
-
-
-
-
21
-
-
79959941077
-
A Latency and Fault-Tolerance Optimizer for Online Parallel Query Plans
-
P. Upadhyaya, Y. Kwon, and M. Balazinska. A Latency and Fault-Tolerance Optimizer for Online Parallel Query Plans. In SIGMOD ACM, pages 241-252. , 2011.
-
(2011)
SIGMOD ACM
, pp. 241-252
-
-
Upadhyaya, P.1
Kwon, Y.2
Balazinska, M.3
-
23
-
-
82155187171
-
Query Optimization for Massively Parallel Data Processing
-
S. Wu, F. Li, S. Mehrotra, and B. C. Ooi. Query Optimization for Massively Parallel Data Processing. In SOCC, 2011.
-
(2011)
SOCC
-
-
Wu, S.1
Li, F.2
Mehrotra, S.3
Ooi, B.C.4
-
24
-
-
26444446303
-
A Recursive Random Search Algorithm for Large-Scale Network Parameter Configuration
-
T. Ye and S. Kalyanaraman. A Recursive Random Search Algorithm for Large-Scale Network Parameter Configuration. SIGMETRICS, pages 196-205, 2003.
-
(2003)
SIGMETRICS
, pp. 196-205
-
-
Ye, T.1
Kalyanaraman, S.2
-
25
-
-
72249089011
-
Distributed Aggregation for Data-Parallel Computing: Interfaces and Implementations
-
Y. Yu, P. K. Gunda, and M. Isard. Distributed Aggregation for Data-Parallel Computing: Interfaces and Implementations. In SOSP, pages 247-260, 2009.
-
(2009)
SOSP
, pp. 247-260
-
-
Yu, Y.1
Gunda, P.K.2
Isard, M.3
-
26
-
-
85076882757
-
DryadLINQ: A System for General-Purpose Distributed Data-Parallel Computing Using a High-Level Language
-
Y. Yu, M. Isard, D. Fetterly, M. Budiu, U. Erlingsson, P. K. Gunda, and J. Currey. DryadLINQ: A System for General-Purpose Distributed Data-Parallel Computing Using a High-Level Language. In OSDI, pages 1-14, 2008.
-
(2008)
OSDI
, pp. 1-14
-
-
Yu, Y.1
Isard, M.2
Fetterly, D.3
Budiu, M.4
Erlingsson, U.5
Gunda, P.K.6
Currey, J.7
-
27
-
-
77952771965
-
Incorporating Partitioning and Parallel Plans into the SCOPE Optimizer
-
J. Zhou, P.-A. Larson, and R. Chaiken. Incorporating Partitioning and Parallel Plans into the SCOPE Optimizer. In ICDE, pages 1060-1071, 2010..
-
(2010)
ICDE
, pp. 1060-1071
-
-
Zhou, J.1
Larson, P.-A.2
Chaiken, R.3
|