-
1
-
-
85066345979
-
-
AsterixDB. https://asterixdb.ics.uci.edu/.
-
-
-
-
2
-
-
85066347720
-
-
Cascading. http://www.cascading.org.
-
-
-
-
3
-
-
84893559421
-
-
Hadoop YARN. http://hadoop.apache.org/docs/current/hadoop-yarn/hadoop-yarn-site/.
-
Hadoop YARN
-
-
-
4
-
-
85066346967
-
-
Hyracks. http://hyracks.org/.
-
-
-
-
18
-
-
84937454662
-
-
Tuning Spark. http://spark.apache.org/docs/latest/tuning.html.
-
Tuning Spark
-
-
-
19
-
-
85077444173
-
ShuffleWatcher: Shuffle-aware scheduling in multi-tenant MapReduce clusters
-
AHMAD, F., CHAKRADHAR, S. T., RAGHUNATHAN, A., and VIJAYKUMAR, T. N. ShuffleWatcher: Shuffle-aware scheduling in multi-tenant MapReduce clusters. In USENIX ATC (2014), pp. 1-12.
-
(2014)
USENIX ATC
, pp. 1-12
-
-
Ahmad, F.1
Chakradhar, S.T.2
Raghunathan, A.3
Vijaykumar, T.N.4
-
20
-
-
84938096686
-
AsterixDB: A scalable, open source BDMS
-
ALSUBAIEE, S., ALTOWIM, Y., ALTWAIJRY, H., BEHM, A., BORKAR, V. R., BU, Y., CAREY, M. J., CETINDIL, I., CHEELANGI, M., FARAAZ, K., GABRIELOVA, E., GROVER, R., HEILBRON, Z., KIM, Y., LI, C., LI, G., OK, J. M., ONOSE, N., PIRZADEH, P., TSOTRAS, V. J., VERNICA, R., WEN, J., and WESTMANN, T. AsterixDB: A scalable, open source BDMS. PVLDB 7, 14 (2014), 1905-1916.
-
(2014)
PVLDB
, vol.7
, Issue.14
, pp. 1905-1916
-
-
Alsubaiee, S.1
Altowim, Y.2
Altwaijry, H.3
Behm, A.4
Borkar, V.R.5
Bu, Y.6
Carey, M.J.7
Cetindil, I.8
Cheelangi, M.9
Faraaz, K.10
Gabrielova, E.11
Grover, R.12
Heilbron, Z.13
Kim, Y.14
Li, C.15
Li, G.16
Ok, J.M.17
Onose, N.18
Pirzadeh, P.19
Tsotras, V.J.20
Vernica, R.21
Wen, J.22
Westmann, T.23
-
22
-
-
79958269648
-
ASTERIX: Towards a scalable, semistructured data platform for evolving-world models
-
BEHM, A., BORKAR, V. R., CAREY, M. J., GROVER, R., LI, C., ONOSE, N., VERNICA, R., DEUTSCH, A., PAPAKONSTANTINOU, Y., and TSOTRAS, V. J. ASTERIX: Towards a scalable, semistructured data platform for evolving-world models. Distributed and Parallel Databases 29 (2011), 185-216.
-
(2011)
Distributed and Parallel Databases
, vol.29
, pp. 185-216
-
-
Behm, A.1
Borkar, V.R.2
Carey, M.J.3
Grover, R.4
Li, C.5
Onose, N.6
Vernica, R.7
Deutsch, A.8
Papakonstantinou, Y.9
Tsotras, V.J.10
-
24
-
-
79957872898
-
Hyracks: A flexible and extensible foundation for data-intensive computing
-
BORKAR, V. R., CAREY, M. J., GROVER, R., ONOSE, N., and VERNICA, R. Hyracks: A flexible and extensible foundation for data-intensive computing. In ICDE (2011), pp. 1151-1162.
-
(2011)
ICDE
, pp. 1151-1162
-
-
Borkar, V.R.1
Carey, M.J.2
Grover, R.3
Onose, N.4
Vernica, R.5
-
25
-
-
84863527887
-
Inside "Big data management": Ogres, onions, or parfaits?
-
BORKAR, V. R., CAREY, M. J., and LI, C. Inside "Big Data Management": Ogres, Onions, or Parfaits? In EDBT (2012), pp. 3-14.
-
(2012)
EDBT
, pp. 3-14
-
-
Borkar, V.R.1
Carey, M.J.2
Li, C.3
-
26
-
-
84890498888
-
A bloat-aware design for big data applications
-
BU, Y., BORKAR, V., XU, G., and CAREY, M. J. A bloat-aware design for big data applications. In ISMM (2013), pp. 119-130.
-
(2013)
ISMM
, pp. 119-130
-
-
Bu, Y.1
Borkar, V.2
Xu, G.3
Carey, M.J.4
-
27
-
-
84938089087
-
Pregelix: Big(ger) graph analytics on a dataflow engine
-
BU, Y., BORKAR, V. R., JIA, J., CAREY, M. J., and CONDIE, T. Pregelix: Big(ger) graph analytics on a dataflow engine. PVLDB 8, 2 (2014), 161-172.
-
(2014)
PVLDB
, vol.8
, Issue.2
, pp. 161-172
-
-
Bu, Y.1
Borkar, V.R.2
Jia, J.3
Carey, M.J.4
Condie, T.5
-
28
-
-
84860560293
-
SCOPE: Easy and efficient parallel processing of massive data sets
-
CHAIKEN, R., JENKINS, B., LARSON, P., RAMSEY, B., SHAKIB, D., WEAVER, S., and ZHOU, J. SCOPE: Easy and efficient parallel processing of massive data sets. PVLDB 1, 2 (2008), 1265-1276.
-
(2008)
PVLDB
, vol.1
, Issue.2
, pp. 1265-1276
-
-
Chaiken, R.1
Jenkins, B.2
Larson, P.3
Ramsey, B.4
Shakib, D.5
Weaver, S.6
Zhou, J.7
-
29
-
-
77954727236
-
FlumeJava: Easy, efficient data-parallel pipelines
-
CHAMBERS, C., RANIWALA, A., PERRY, F., ADAMS, S., HENRY, R. R., BRADSHAW, R., and WEIZENBAUM, N. FlumeJava: Easy, efficient data-parallel pipelines. In PLDI (2010), pp. 363-375.
-
(2010)
PLDI
, pp. 363-375
-
-
Chambers, C.1
Raniwala, A.2
Perry, F.3
Adams, S.4
Henry, R.R.5
Bradshaw, R.6
Weizenbaum, N.7
-
30
-
-
47749140025
-
Bigtable: A distributed storage system for structured data
-
CHANG, F., DEAN, J., GHEMAWAT, S., HSIEH, W. C., WALLACH, D. A., BURROWS, M., CHANDRA, T., FIKES, A., and GRUBER, R. E. Bigtable: A distributed storage system for structured data. ACM Trans. Comput. Syst. 26, 2 (2008), 4:1-4:26.
-
(2008)
ACM Trans. Comput. Syst.
, vol.26
, Issue.2
, pp. 4:1-4:26
-
-
Chang, F.1
Dean, J.2
Ghemawat, S.3
Hsieh, W.C.4
Wallach, D.A.5
Burrows, M.6
Chandra, T.7
Fikes, A.8
Gruber, R.E.9
-
31
-
-
56049109090
-
Map-reduce for machine learning on multicore
-
CHU, C. T., KIM, S. K., LIN, Y. A., YU, Y., BRADSKI, G. R., NG, A. Y., and OLUKOTUN, K. Map-reduce for machine learning on multicore. In NIPS (2006), pp. 281-288.
-
(2006)
NIPS
, pp. 281-288
-
-
Chu, C.T.1
Kim, S.K.2
Lin, Y.A.3
Yu, Y.4
Bradski, G.R.5
Ng, A.Y.6
Olukotun, K.7
-
32
-
-
85076771850
-
MapReduce online
-
CONDIE, T., CONWAY, N., ALVARO, P., HELLERSTEIN, J. M., ELMELEEGY, K., and SEARS, R. MapReduce online. In NSDI (2010), pp. 313-328.
-
(2010)
NSDI
, pp. 313-328
-
-
Condie, T.1
Conway, N.2
Alvaro, P.3
Hellerstein, J.M.4
Elmeleegy, K.5
Sears, R.6
-
33
-
-
85030321143
-
MapReduce: Simplified data processing on large clusters
-
DEAN, J., and GHEMAWAT, S. MapReduce: Simplified data processing on large clusters. In OSDI (2004), pp. 137-150.
-
(2004)
OSDI
, pp. 137-150
-
-
Dean, J.1
Ghemawat, S.2
-
34
-
-
85092655306
-
Broom: Sweeping out garbage collection from big data systems
-
GOG, I., GICEVA, J., SCHWARZKOPF, M., VASWANI, K., VYTINIOTIS, D., RAMALINGAM, G., COSTA, M., MURRAY, D. G., HAND, S., and ISARD, M. Broom: Sweeping out garbage collection from big data systems. In HotOS (2015).
-
(2015)
HotOS
-
-
Gog, I.1
Giceva, J.2
Schwarzkopf, M.3
Vaswani, K.4
Vytiniotis, D.5
Ramalingam, G.6
Costa, M.7
Murray, D.G.8
Hand, S.9
Isard, M.10
-
35
-
-
84871175805
-
Spotting code optimizations in data-parallel pipelines through PeriSCOPE
-
GUO, Z., FAN, X., CHEN, R., ZHANG, J., ZHOU, H., MCDIRMID, S., LIU, C., LIN, W., ZHOU, J., and ZHOU, L. Spotting code optimizations in data-parallel pipelines through PeriSCOPE. In OSDI (2012), pp. 121-133.
-
(2012)
OSDI
, pp. 121-133
-
-
Guo, Z.1
Fan, X.2
Chen, R.3
Zhang, J.4
Zhou, H.5
McDirmid, S.6
Liu, C.7
Lin, W.8
Zhou, J.9
Zhou, L.10
-
36
-
-
80053500227
-
Starfish: A self-tuning system for big data analytics
-
HERODOTOU, H., LIM, H., LUO, G., BORISOV, N., DONG, L., CETIN, F. B., and BABU, S. Starfish: A self-tuning system for big data analytics. In CIDR (2011), pp. 261-272.
-
(2011)
CIDR
, pp. 261-272
-
-
Herodotou, H.1
Lim, H.2
Luo, G.3
Borisov, N.4
Dong, L.5
Cetin, F.B.6
Babu, S.7
-
37
-
-
84893305113
-
Mesos: A platform for fine-grained resource sharing in the data center
-
HINDMAN, B., KONWINSKI, A., ZAHARIA, M., GHODSI, A., JOSEPH, A. D., KATZ, R., SHENKER, S., and STOICA, I. Mesos: A platform for fine-grained resource sharing in the data center. In NSDI (2011), pp. 295-308.
-
(2011)
NSDI
, pp. 295-308
-
-
Hindman, B.1
Konwinski, A.2
Zaharia, M.3
Ghodsi, A.4
Joseph, A.D.5
Katz, R.6
Shenker, S.7
Stoica, I.8
-
38
-
-
34548041192
-
Dryad: Distributed data-parallel programs from sequential building blocks
-
ISARD, M., BUDIU, M., YU, Y., BIRRELL, A., and FETTERLY, D. Dryad: Distributed data-parallel programs from sequential building blocks. In EuroSys (2007), pp. 59-72.
-
(2007)
EuroSys
, pp. 59-72
-
-
Isard, M.1
Budiu, M.2
Yu, Y.3
Birrell, A.4
Fetterly, D.5
-
39
-
-
85066341881
-
-
Kryo. https://github.com/EsotericSoftware/kryo.
-
Kryo
-
-
-
40
-
-
84879933533
-
Managing skew in Hadoop
-
KWON, Y., REN, K., BALAZINSKA, M., and HOWE, B. Managing skew in Hadoop. IEEE Data Eng. Bull. 36, 1 (2013), 24-33.
-
(2013)
IEEE Data Eng. Bull.
, vol.36
, Issue.1
, pp. 24-33
-
-
Kwon, Y.1
Ren, K.2
Balazinska, M.3
Howe, B.4
-
41
-
-
85042632297
-
GraphChi: Large-scale graph computation on just a PC
-
KYROLA, A., BLELLOCH, G., and GUESTRIN, C. GraphChi: Large-Scale Graph Computation on Just a PC. In OSDI (2012), pp. 31-46.
-
(2012)
OSDI
, pp. 31-46
-
-
Kyrola, A.1
Blelloch, G.2
Guestrin, C.3
-
42
-
-
84863470460
-
Panacea: Towards holistic optimization of MapReduce applications
-
LIU, J., RAVI, N., CHAKRADHAR, S., and KANDEMIR, M. Panacea: Towards holistic optimization of MapReduce applications. In CGO (2012), pp. 33-43.
-
(2012)
CGO
, pp. 33-43
-
-
Liu, J.1
Ravi, N.2
Chakradhar, S.3
Kandemir, M.4
-
43
-
-
84863735533
-
Distributed GraphLab: A framework for machine learning in the cloud
-
LOW, Y., GONZALEZ, J., KYROLA, A., BICKSON, D., GUESTRIN, C., and HELLERSTEIN, J. M. Distributed GraphLab: A framework for machine learning in the cloud. PVLDB 5, 8 (2012), 716-727.
-
(2012)
PVLDB
, vol.5
, Issue.8
, pp. 716-727
-
-
Low, Y.1
Gonzalez, J.2
Kyrola, A.3
Bickson, D.4
Guestrin, C.5
Hellerstein, J.M.6
-
44
-
-
75149134776
-
Four trends leading to Java runtime bloat
-
MITCHELL, N., SCHONBERG, E., and SEVITSKY, G. Four trends leading to Java runtime bloat. IEEE Software 27, 1 (2010), 56-63.
-
(2010)
IEEE Software
, vol.27
, Issue.1
, pp. 56-63
-
-
Mitchell, N.1
Schonberg, E.2
Sevitsky, G.3
-
45
-
-
42149169980
-
The causes of bloat, the limits of health
-
MITCHELL, N., and SEVITSKY, G. The causes of bloat, the limits of health. In OOPSLA (2007), pp. 245-260.
-
(2007)
OOPSLA
, pp. 245-260
-
-
Mitchell, N.1
Sevitsky, G.2
-
46
-
-
79959903863
-
Steno: Automatic optimization of declarative queries
-
MURRAY, D. G., ISARD, M., and YU, Y. Steno: Automatic optimization of declarative queries. In PLDI (2011), pp. 121-131.
-
(2011)
PLDI
, pp. 121-131
-
-
Murray, D.G.1
Isard, M.2
Yu, Y.3
-
47
-
-
84939193057
-
FACADE: A compiler and runtime for (almost) object-bounded big data applications
-
NGUYEN, K., WANG, K., BU, Y., FANG, L., HU, J., and XU, G. FACADE: A compiler and runtime for (almost) object-bounded big data applications. In ASPLOS (2015), pp. 675-690.
-
(2015)
ASPLOS
, pp. 675-690
-
-
Nguyen, K.1
Wang, K.2
Bu, Y.3
Fang, L.4
Hu, J.5
Xu, G.6
-
48
-
-
55349148888
-
Pig Latin: A not-so-foreign language for data processing
-
OLSTON, C., REED, B., SRIVASTAVA, U., KUMAR, R., and TOMKINS, A. Pig Latin: A not-so-foreign language for data processing. In SIGMOD (2008), pp. 1099-1110.
-
(2008)
SIGMOD
, pp. 1099-1110
-
-
Olston, C.1
Reed, B.2
Srivastava, U.3
Kumar, R.4
Tomkins, A.5
-
49
-
-
30344452311
-
Interpreting the data: Parallel analysis with Sawzall
-
PIKE, R., DORWARD, S., GRIESEMER, R., and QUINLAN, S. Interpreting the data: Parallel analysis with Sawzall. Scientific Programming 13, 4 (2005), 277-298.
-
(2005)
Scientific Programming
, vol.13
, Issue.4
, pp. 277-298
-
-
Pike, R.1
Dorward, S.2
Griesemer, R.3
Quinlan, S.4
-
50
-
-
63549096374
-
LeakSurvivor: Towards safely tolerating memory leaks for garbage-collected languages
-
TANG, Y., GAO, Q., and QIN, F. LeakSurvivor: Towards safely tolerating memory leaks for garbage-collected languages. In USENIX ATC (2008), pp. 307-320.
-
(2008)
USENIX ATC
, pp. 307-320
-
-
Tang, Y.1
Gao, Q.2
Qin, F.3
-
51
-
-
84868325513
-
Hive - A warehousing solution over a map-reduce framework
-
THUSOO, A., SARMA, J. S., JAIN, N., SHAO, Z., CHAKKA, P., ANTHONY, S., LIU, H., WYCKOFF, P., and MURTHY, R. Hive - A warehousing solution over a map-reduce framework. PVLDB 2, 2 (2009), 1626-1629.
-
(2009)
PVLDB
, vol.2
, Issue.2
, pp. 1626-1629
-
-
Thusoo, A.1
Sarma, J.S.2
Jain, N.3
Shao, Z.4
Chakka, P.5
Anthony, S.6
Liu, H.7
Wyckoff, P.8
Murthy, R.9
-
53
-
-
79951599209
-
Software bloat analysis: Finding, removing, and preventing performance problems in modern large-scale object-oriented applications
-
XU, G., MITCHELL, N., ARNOLD, M., ROUNTEV, A., and SEVITSKY, G. Software bloat analysis: Finding, removing, and preventing performance problems in modern large-scale object-oriented applications. In FoSER (2010), pp. 421-426.
-
(2010)
FoSER
, pp. 421-426
-
-
Xu, G.1
Mitchell, N.2
Arnold, M.3
Rountev, A.4
Sevitsky, G.5
-
54
-
-
84901596638
-
Scalable runtime bloat detection using abstract dynamic slicing
-
XU, G. H., MITCHELL, N., ARNOLD, M., ROUNTEV, A., SCHONBERG, E., and SEVITSKY, G. Scalable runtime bloat detection using abstract dynamic slicing. TOSEM 23, 3 (2014), 23.
-
(2014)
TOSEM
, vol.23
, Issue.3
, pp. 23
-
-
Xu, G.H.1
Mitchell, N.2
Arnold, M.3
Rountev, A.4
Schonberg, E.5
Sevitsky, G.6
-
56
-
-
35448944021
-
Map-reduce-merge: Simplified relational data processing on large clusters
-
YANG, H.-C., DASDAN, A., HSIAO, R.-L., and PARKER, D. S. Map-reduce-merge: Simplified relational data processing on large clusters. In SIGMOD (2007), pp. 1029-1040.
-
(2007)
SIGMOD
, pp. 1029-1040
-
-
Yang, H.-C.1
Dasdan, A.2
Hsiao, R.-L.3
Parker, D.S.4
-
57
-
-
72249089011
-
Distributed aggregation for data-parallel computing: Interfaces and implementations
-
YU, Y., GUNDA, P. K., and ISARD, M. Distributed aggregation for data-parallel computing: Interfaces and implementations. In SOSP (2009), pp. 247-260.
-
(2009)
SOSP
, pp. 247-260
-
-
Yu, Y.1
Gunda, P.K.2
Isard, M.3
-
58
-
-
85076882757
-
DryadLINQ: A system for general-purpose distributed data-parallel computing using a high-level language
-
YU, Y., ISARD, M., FETTERLY, D., BUDIU, M., ERLINGSSON, U., GUNDA, P. K., and CURREY, J. DryadLINQ: A system for general-purpose distributed data-parallel computing using a high-level language. In OSDI (2008), pp. 1-14.
-
(2008)
OSDI
, pp. 1-14
-
-
Yu, Y.1
Isard, M.2
Fetterly, D.3
Budiu, M.4
Erlingsson, U.5
Gunda, P.K.6
Currey, J.7
-
59
-
-
85040175609
-
Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing
-
ZAHARIA, M., CHOWDHURY, M., DAS, T., DAVE, A., MA, J., MCCAULEY, M., FRANKLIN, M. J., SHENKER, S., and STOICA, I. Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing. In NSDI (2012), pp. 15-28.
-
(2012)
NSDI
, pp. 15-28
-
-
Zaharia, M.1
Chowdhury, M.2
Das, T.3
Dave, A.4
Ma, J.5
McCauley, M.6
Franklin, M.J.7
Shenker, S.8
Stoica, I.9
-
60
-
-
85085251984
-
Spark: Cluster computing with working sets
-
ZAHARIA, M., CHOWDHURY, M., FRANKLIN, M. J., SHENKER, S., and STOICA, I. Spark: Cluster computing with working sets. In HotCloud (2010).
-
(2010)
HotCloud
-
-
Zaharia, M.1
Chowdhury, M.2
Franklin, M.J.3
Shenker, S.4
Stoica, I.5
-
61
-
-
85076643377
-
Optimizing data shuffling in data-parallel computation by understanding user-defined functions
-
ZHANG, J., ZHOU, H., CHEN, R., FAN, X., GUO, Z., LIN, H., LI, J. Y., LIN, W., ZHOU, J., and ZHOU, L. Optimizing data shuffling in data-parallel computation by understanding user-defined functions. In NSDI (2012), pp. 22-22.
-
(2012)
NSDI
, pp. 22
-
-
Zhang, J.1
Zhou, H.2
Chen, R.3
Fan, X.4
Guo, Z.5
Lin, H.6
Li, J.Y.7
Lin, W.8
Zhou, J.9
Zhou, L.10
-
62
-
-
77952771965
-
Incorporating partitioning and parallel plans into the SCOPE optimizer
-
ZHOU, J., LARSON, P.-Å., and CHAIKEN, R. Incorporating partitioning and parallel plans into the SCOPE optimizer. In ICDE (2010), pp. 1060-1071.
-
(2010)
ICDE
, pp. 1060-1071
-
-
Zhou, J.1
Larson, P.-A.2
Chaiken, R.3