-
1
-
-
84873959583
-
Setting the direction for big data benchmark standards
-
Springer Berlin, Heidelberg
-
[1] Baru, Chaitanya, Bhandarkar, Milind, Nambiar, Raghunath, Poess, Meikel, Rabl, Tilmann, Setting the direction for big data benchmark standards. Selected Topics in Performance Evaluation and Benchmarking, 2013, Springer, Berlin, Heidelberg, 197–208.
-
(2013)
Selected Topics in Performance Evaluation and Benchmarking
, pp. 197-208
-
-
Baru, C.1
Bhandarkar, M.2
Nambiar, R.3
Poess, M.4
Rabl, T.5
-
2
-
-
84976338252
-
-
Apache Hive vs MySQL—What are the key differences? Matthew Rathbone. (posted 12.8.15, accessed 12.14.15).
-
[2] Apache Hive vs MySQL—What are the key differences? Matthew Rathbone. http://blog.matthewrathbone.com/2015/12/08/hive-vs-mysql.html (posted 12.8.15, accessed 12.14.15).
-
-
-
-
3
-
-
84976322138
-
-
Hive vs. RDMS, Siva, (posted 8.1.14, accessed 12.14.15).
-
[3] Hive vs. RDMS, Siva, http://hadooptutorial.info/hive-vs-rdbms/ (posted 8.1.14, accessed 12.14.15).
-
-
-
-
4
-
-
84976260914
-
-
Schema-on-Read vs Schema-on-Write, Joe Pasqua, posted 2.17.15, (accessed 12.14.15).
-
[4] Schema-on-Read vs Schema-on-Write, Joe Pasqua, posted 2.17.15, http://www.marklogic.com/blog/schema-on-read-vs-schema-on-write/ (accessed 12.14.15).
-
-
-
-
5
-
-
84976260911
-
-
TPC-DS benchmarking standard.
-
[5] TPC-DS benchmarking standard. http://www.tpc.org/tpcds/spec/tpcds_1.1.0.pdf.
-
-
-
-
6
-
-
84976322144
-
-
Using MySQL as a Hive backend database, (accessed 12.14.15).
-
[6] Using MySQL as a Hive backend database, http://mapredit.blogspot.com/2011/08/using-mysql-as-hive-backend-database.html (accessed 12.14.15).
-
-
-
-
7
-
-
84976329107
-
-
AdminManual MetastoreAdmin. (modified 11.11.15, accessed 12.14.15).
-
[7] AdminManual MetastoreAdmin. https://cwiki.apache.org/confluence/display/Hive/AdminManual+MetastoreAdmin (modified 11.11.15, accessed 12.14.15).
-
-
-
-
8
-
-
37549003336
-
MapReduce: simplified data processing on large clusters
-
[8] Dean, Jeffrey, Ghemawat, Sanjay, MapReduce: simplified data processing on large clusters. Commun. ACM 51:1 (2008), 107–113.
-
(2008)
Commun. ACM
, vol.51
, Issue.1
, pp. 107-113
-
-
Dean, J.1
Ghemawat, S.2
-
9
-
-
84868325513
-
Hive: a warehousing solution over a map-reduce framework
-
[9] Thusoo, Ashish, Sarma, Joydeep Sen, Jain, Namit, Shao, Zheng, Chakka, Prasad, Anthony, Suresh, Liu, Hao, Wyckoff, Pete, Murthy, Raghotham, Hive: a warehousing solution over a map-reduce framework. Proc. VLDB Endow. 2:2 (2009), 1626–1629.
-
(2009)
Proc. VLDB Endow.
, vol.2
, Issue.2
, pp. 1626-1629
-
-
Thusoo, A.1
Sarma, J.S.2
Jain, N.3
Shao, Z.4
Chakka, P.5
Anthony, S.6
Liu, H.7
Wyckoff, P.8
Murthy, R.9
-
10
-
-
84976318018
-
-
Hortonworks HDP 1.3.2,.
-
[10] Hortonworks HDP 1.3.2, http://hortonworks.com/products/hdp/hdp-1-3/#overview.
-
-
-
-
11
-
-
84976295394
-
-
Sort program Available in Hadoop source distribution:.
-
[11] Sort program Available in Hadoop source distribution: https://github.com/facebookarchive/hadoop-20/blob/master/src/examples/org/apache/hadoop/examples/Sort.java.
-
-
-
-
12
-
-
84976295407
-
-
Hadoop TeraSort program. Available in Hadoop source distribution since 0.19 version: src/examples/org/apache/hadoop/examples/terasort.
-
[12] Hadoop TeraSort program. Available in Hadoop source distribution since 0.19 version: src/examples/org/apache/hadoop/examples/terasort.
-
-
-
-
13
-
-
84976295406
-
-
GridMix program. Available in Hadoop source distribution: src/benchmarks/gridmix.
-
[13] GridMix program. Available in Hadoop source distribution: src/benchmarks/gridmix.
-
-
-
-
14
-
-
77952721751
-
The HiBench benchmark suite: Characterization of the MapReduce-based data analysis
-
IEEE
-
[14] Huang, Shengsheng, Huang, Jie, Dai, Jinquan, Xie, Tao, Huang, Bo, The HiBench benchmark suite: Characterization of the MapReduce-based data analysis. Proceedings of the Data Engineering Workshops (ICDEW), 2010, IEEE, 41–51.
-
(2010)
Proceedings of the Data Engineering Workshops (ICDEW)
, pp. 41-51
-
-
Huang, S.1
Huang, J.2
Dai, J.3
Xie, T.4
Huang, B.5
-
15
-
-
84880569459
-
BigBench: towards an industry standard benchmark for big data analytics
-
ACM
-
[15] Ghazal, Ahmad, Rabl, Tilmann, Hu, Minqing, Raab, Francois, Poess, Meikel, Crolotte, Alain, Jacobsen, Hans-Arno, BigBench: towards an industry standard benchmark for big data analytics. Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, 2013, ACM, 1197–1208.
-
(2013)
Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
, pp. 1197-1208
-
-
Ghazal, A.1
Rabl, T.2
Hu, M.3
Raab, F.4
Poess, M.5
Crolotte, A.6
Jacobsen, H.-A.7
-
16
-
-
84976297342
-
-
Hive Performance Benchmark. Available at:.
-
[16] Hive Performance Benchmark. Available at: https://issues.apache.org/jira/browse/hive-396.
-
-
-
-
17
-
-
84976295417
-
-
Hortonworks Stinger Initiative,.
-
[17] Hortonworks Stinger Initiative, http://hortonworks.com/labs/stinger/.
-
-
-
-
18
-
-
84976322108
-
-
DSGen v1.1.0, data generation tool for TPC-DS,.
-
[18] DSGen v1.1.0, data generation tool for TPC-DS, http://www.tpc.org/tpcds/.
-
-
-
-
19
-
-
84976327877
-
-
Tom. White, Hadoop: the Definitive Guide. O'Reilly, 2012.
-
[19] Tom. White, Hadoop: the Definitive Guide. O'Reilly, 2012.
-
-
-
-
20
-
-
85134435537
-
The making of TPC-DS
-
VLDB Endowment
-
[20] Nambiar, Raghunath Othayoth, Poess, Meikel, The making of TPC-DS. Proceedings of the 32nd International Conference on Very Large Data Bases, 2006, VLDB Endowment, 1049–1058.
-
(2006)
Proceedings of the 32nd International Conference on Very Large Data Bases
, pp. 1049-1058
-
-
Nambiar, R.O.1
Poess, M.2
-
21
-
-
78651324767
-
Benchmarking cloud-based data management systems
-
ACM
-
[21] Shi, Yingjie, Meng, Xiaofeng, Zhao, Jing, Hu, Xiangmei, Liu, Bingbing, Wang, Haiping, Benchmarking cloud-based data management systems. Proceedings of the Second International Workshop on Cloud Data Management, 2010, ACM, 47–54.
-
(2010)
Proceedings of the Second International Workshop on Cloud Data Management
, pp. 47-54
-
-
Shi, Y.1
Meng, X.2
Zhao, J.3
Hu, X.4
Liu, B.5
Wang, H.6
-
22
-
-
70350512695
-
A comparison of approaches to large-scale data analysis
-
ACM
-
[22] Pavlo, Andrew, Paulson, Erik, Rasin, Alexander, Abadi, Daniel J., DeWitt, David J., Madden, Samuel, Stonebraker, Michael, A comparison of approaches to large-scale data analysis. Proceedings of the 2009 ACM SIGMOD International Conference on Management of Data, 2009, ACM, 165–178.
-
(2009)
Proceedings of the 2009 ACM SIGMOD International Conference on Management of Data
, pp. 165-178
-
-
Pavlo, A.1
Paulson, E.2
Rasin, A.3
Abadi, D.J.4
DeWitt, D.J.5
Madden, S.6
Stonebraker, M.7
-
23
-
-
84976261825
-
-
Yuntao Jia, Running the TPC-H benchmark on Hive,.
-
[23] Yuntao Jia, Running the TPC-H benchmark on Hive, https://issues.apache.org/jira/secure/attachment/12416257/TPC-H_on_Hive_2009-08-11.pdf.
-
-
-
-
24
-
-
84976322114
-
-
Niketan Pansare, Zhuhua Cai, Using Hive to perform medium-scale data analysis, 2010.
-
[24] Niketan Pansare, Zhuhua Cai, Using Hive to perform medium-scale data analysis, 2010. http://www.cs.rice.edu/~np6/Papers/HiveProjectReport.pdf.
-
-
-
-
25
-
-
84910653308
-
-
Benchmarking performance for migrating a relational application to a parallel implementation, in: Proceedings of the Modeling and Management of Big Data Workshop, MoBiD, 33nd International Conference on Conceptual Modeling, ER, 2014, pp. 55–64.
-
[25] K.K. Gadiraju, P.G. Talaga, K.C. Davis, Benchmarking performance for migrating a relational application to a parallel implementation, in: Proceedings of the Modeling and Management of Big Data Workshop, MoBiD, 33nd International Conference on Conceptual Modeling, ER, 2014, pp. 55–64.
-
-
-
Gadiraju, K.K.1
Talaga, P.G.2
Davis, K.C.3
-
26
-
-
84976330057
-
-
Hortonworks Stinger Initiative, Apache Hive 0.13 Performance Benchmarks Query Times in Hive 0.13 v. Hive 0.10, June 2014.
-
[26] Hortonworks Stinger Initiative, Apache Hive 0.13 Performance Benchmarks Query Times in Hive 0.13 v. Hive 0.10, June 2014.
-
-
-
-
27
-
-
84855313448
-
Query optimization using column statistics in hive
-
ACM
-
[27] Gruenheid, Anja, Omiecinski, Edward, Mark, Leo, Query optimization using column statistics in hive. Proceedings of the 15th Symposium on International Database Engineering & Applications, 2011, ACM, 97–105.
-
(2011)
Proceedings of the 15th Symposium on International Database Engineering & Applications
, pp. 97-105
-
-
Gruenheid, A.1
Omiecinski, E.2
Mark, L.3
-
28
-
-
80053500227
-
-
Herodotos Herodotou, Harold Lim, Gang Luo, Nedyalko Borisov, Liang Dong, Fatma Bilgen Cetin, Shivnath Babu, Starfish: a self-tuning system for big data analytics, in: Proceedings of the Fifth Biennial Conference on Innovative Data Systems Research, CIDR, 2011, pp. 261–272.
-
[28] Herodotos Herodotou, Harold Lim, Gang Luo, Nedyalko Borisov, Liang Dong, Fatma Bilgen Cetin, Shivnath Babu, Starfish: a self-tuning system for big data analytics, in: Proceedings of the Fifth Biennial Conference on Innovative Data Systems Research, CIDR, 2011, pp. 261–272.
-
-
-
|