-
2
-
-
84919827070
-
Pacman: Coordinated memory caching for parallel jobs
-
G. Ananthanarayanan, A. Ghodsi, A. Wang, D. Borthakur, S. Kandula, S. Shenker, and I. Stoica. Pacman: Coordinated memory caching for parallel jobs. In Proc. USENIX NSDI, 2012.
-
(2012)
Proc. USENIX NSDI
-
-
Ananthanarayanan, G.1
Ghodsi, A.2
Wang, A.3
Borthakur, D.4
Kandula, S.5
Shenker, S.6
Stoica, I.7
-
3
-
-
84937840715
-
-
Apache Software Foundation. Hadoop, 2011. http://hadoop.apache.org/core/.
-
(2011)
Hadoop
-
-
-
4
-
-
33751119203
-
Task scheduling strategies for workflow-based applications in grids
-
J. Blythe, S. Jain, E. Deelman, Y. Gil, K. Vahi, A. Mandal, and K. Kennedy. Task scheduling strategies for workflow-based applications in grids. In Proc. IEEE CCGrid, 2005.
-
(2005)
Proc. IEEE CCGrid
-
-
Blythe, J.1
Jain, S.2
Deelman, E.3
Gil, Y.4
Vahi, K.5
Mandal, A.6
Kennedy, K.7
-
5
-
-
70450136675
-
The hadoop distributed file system: Architecture and design
-
D. Borthakur. The hadoop distributed file system: Architecture and design. Hadoop Project Website, 2007.
-
(2007)
Hadoop Project Website
-
-
Borthakur, D.1
-
7
-
-
79960018131
-
Apache hadoop goes realtime at Facebook
-
D. Borthakur et al. Apache hadoop goes realtime at Facebook. Proc. ACM SIGMOD, 2011.
-
(2011)
Proc. ACM SIGMOD
-
-
Borthakur, D.1
-
9
-
-
84873134968
-
Interactive analytical processing in big data systems: A cross-industry study of mapreduce workloads
-
Y. Chen, S. Alspaugh, and R. Katz. Interactive analytical processing in big data systems: A cross-industry study of mapreduce workloads. Proc. VLDB Endowment, 5(12):1802-1813, 2012.
-
(2012)
Proc. VLDB Endowment
, vol.5
, Issue.12
, pp. 1802-1813
-
-
Chen, Y.1
Alspaugh, S.2
Katz, R.3
-
11
-
-
85047027780
-
-
F. Corona. Facebook, 2012. https://gigaom.com/2012/11/08/facebookopen-sources-corona-a-better-way-to-do-webscale-hadoop.
-
(2012)
Facebook
-
-
Corona, F.1
-
12
-
-
37549003336
-
Mapreduce: Simplified data processing on large clusters
-
J. Dean and S. Ghemawat. Mapreduce: Simplified data processing on large clusters. Communications of the ACM, 51(1):107-113, 2008.
-
(2008)
Communications of the ACM
, vol.51
, Issue.1
, pp. 107-113
-
-
Dean, J.1
Ghemawat, S.2
-
13
-
-
84859198862
-
Clustera: An integrated computation and data management system
-
D. J. DeWitt, E. Paulson, E. Robinson, J. Naughton, J. Royalty, S. Shankar, and A. Krioukov. Clustera: An integrated computation and data management system. Proc. VLDB Endowment, 1(1):28-41, 2008.
-
(2008)
Proc. VLDB Endowment
, vol.1
, Issue.1
, pp. 28-41
-
-
DeWitt, D.J.1
Paulson, E.2
Robinson, E.3
Naughton, J.4
Royalty, J.5
Shankar, S.6
Krioukov, A.7
-
15
-
-
34548712852
-
Pfas: A resource-performance-fluctuation-aware workflow scheduling algorithm for grid computing
-
F. Dong and S. G. Akl. Pfas: A resource-performance-fluctuation-aware workflow scheduling algorithm for grid computing. Proc. IEEE IPDPS, 2007.
-
(2007)
Proc. IEEE IPDPS
-
-
Dong, F.1
Akl, S.G.2
-
16
-
-
84863554438
-
Cohadoop: Flexible data placement and its exploitation in hadoop
-
M. Y. Eltabakh, Y. Tian, F. Özcan, R. Gemulla, A. Krettek, and J. McPherson. Cohadoop: Flexible data placement and its exploitation in hadoop. Proc. VLDB Endowment, 4(9):575-585, 2011.
-
(2011)
Proc. VLDB Endowment
, vol.4
, Issue.9
, pp. 575-585
-
-
Eltabakh, M.Y.1
Tian, Y.2
Özcan, F.3
Gemulla, R.4
Krettek, A.5
McPherson, J.6
-
17
-
-
84872201547
-
Using mix-ins with python
-
C. Esterbrook. Using mix-ins with python. Linux Journal, 2001(84es):7, 2001.
-
(2001)
Linux Journal
, vol.2001
, Issue.84es
, pp. 7
-
-
Esterbrook, C.1
-
18
-
-
80053500227
-
Starfish: A self-tuning system for big data analytics
-
H. Herodotou, H. Lim, G. Luo, N. Borisov, L. Dong, F. B. Cetin, and S. Babu. Starfish: A self-tuning system for big data analytics. In Proc. CIDR, 2011.
-
(2011)
Proc. CIDR
-
-
Herodotou, H.1
Lim, H.2
Luo, G.3
Borisov, N.4
Dong, L.5
Cetin, F.B.6
Babu, S.7
-
19
-
-
84904556956
-
Hibench: A representative and comprehensive hadoop benchmark suite
-
S. Huang, J. Huang, Y. Liu, L. Yi, and J. Dai. Hibench: A representative and comprehensive hadoop benchmark suite. In Proc. ICDE Workshops, 2010.
-
(2010)
Proc. ICDE Workshops
-
-
Huang, S.1
Huang, J.2
Liu, Y.3
Yi, L.4
Dai, J.5
-
20
-
-
84893272150
-
Oozie: Towards a scalable workflow management system for hadoop
-
M. Islam, A. K. Huang, M. Battisha, M. Chiang, S. Srinivasan, C. Peters, A. Neumann, and A. Abdelnur. Oozie: Towards a scalable workflow management system for hadoop. In Proc. ACM SIGMOD Workshop on Scalable Workflow Execution Engines and Technologies, 2012.
-
(2012)
Proc. ACM SIGMOD Workshop on Scalable Workflow Execution Engines and Technologies
-
-
Islam, M.1
Huang, A.K.2
Battisha, M.3
Chiang, M.4
Srinivasan, S.5
Peters, C.6
Neumann, A.7
Abdelnur, A.8
-
22
-
-
81155161014
-
Dynamic energy efficient data placement and cluster reconfiguration algorithm for mapreduce framework
-
N. Maheshwari, R. Nanduri, and V. Varma. Dynamic energy efficient data placement and cluster reconfiguration algorithm for mapreduce framework. Future Generation Computer Systems, 28(1):119-127, 2012.
-
(2012)
Future Generation Computer Systems
, vol.28
, Issue.1
, pp. 119-127
-
-
Maheshwari, N.1
Nanduri, R.2
Varma, V.3
-
24
-
-
79961219599
-
Scdp: Scalable, cost-effective, distributed and parallel computing model for academics
-
R. Mantri, R. Ingle, and P. Patil. Scdp: Scalable, cost-effective, distributed and parallel computing model for academics. In Proc. ICECT, 2011.
-
(2011)
Proc. ICECT
-
-
Mantri, R.1
Ingle, R.2
Patil, P.3
-
26
-
-
84937882316
-
-
A. Nutch. Nutch, 2010. http://nutch.apache.org.
-
(2010)
Nutch
-
-
Nutch, A.1
-
27
-
-
79959962180
-
Nova: Continuous pig/hadoop workflows
-
C. Olston, G. Chiou, L. Chitnis, F. Liu, Y. Han, M. Larsson, A. Neumann, V. B. Rao, V. Sankarasubramanian, S. Seth, et al. Nova: Continuous pig/hadoop workflows. In Proc. ACM SIGMOD, 2011.
-
(2011)
Proc. ACM SIGMOD
-
-
Olston, C.1
Chiou, G.2
Chitnis, L.3
Liu, F.4
Han, Y.5
Larsson, M.6
Neumann, A.7
Rao, V.B.8
Sankarasubramanian, V.9
Seth, S.10
-
28
-
-
85076901170
-
Large-scale incremental processing using distributed transactions and notifications
-
D. Peng and F. Dabek. Large-scale incremental processing using distributed transactions and notifications. In Proc. USENIX OSDI, 2010.
-
(2010)
Proc. USENIX OSDI
-
-
Peng, D.1
Dabek, F.2
-
29
-
-
84937919016
-
Scaling hdfs cluster using namenode federation
-
August
-
S. Radia and S. Srinivas. Scaling hdfs cluster using namenode federation. HDFS-1052, August, 2010.
-
(2010)
HDFS-1052
-
-
Radia, S.1
Srinivas, S.2
-
30
-
-
84881442580
-
Hotrod: Managing grid storage with on-demand replication
-
S. Rao, B. Reed, and A. Silberstein. Hotrod: Managing grid storage with on-demand replication. In Proc. IEEE ICDEW, 2013.
-
(2013)
Proc. IEEE ICDEW
-
-
Rao, S.1
Reed, B.2
Silberstein, A.3
-
31
-
-
84868325513
-
Hive: A warehousing solution over a mapreduce framework
-
A. Thusoo, J. S. Sarma, N. Jain, Z. Shao, P. Chakka, S. Anthony, H. Liu, P. Wyckoff, and R. Murthy. Hive: A warehousing solution over a mapreduce framework. Proc. VLDB Endowment, 2(2):1626-1629, 2009.
-
(2009)
Proc. VLDB Endowment
, vol.2
, Issue.2
, pp. 1626-1629
-
-
Thusoo, A.1
Sarma, J.S.2
Jain, N.3
Shao, Z.4
Chakka, P.5
Anthony, S.6
Liu, H.7
Wyckoff, P.8
Murthy, R.9
-
32
-
-
84923264395
-
Big data genome sequencing on zynq based clusters
-
C. Wang, X. Li, X. Zhou, Y. Chen, and R. C. Cheung. Big data genome sequencing on zynq based clusters. In Proc. ACM SIGDA, 2014.
-
(2014)
Proc. ACM SIGDA
-
-
Wang, C.1
Li, X.2
Zhou, X.3
Chen, Y.4
Cheung, R.C.5
-
33
-
-
74049123821
-
Kepler + hadoop: A general architecture facilitating data-intensive applications in scientific workflow systems
-
J. Wang, D. Crawl, and I. Altintas. Kepler + hadoop: A general architecture facilitating data-intensive applications in scientific workflow systems. Proc. WORKS, 2009.
-
(2009)
Proc. WORKS
-
-
Wang, J.1
Crawl, D.2
Altintas, I.3
-
34
-
-
78649459815
-
Cdrm: A cost-effective dynamic replication management scheme for cloud storage cluster
-
Q. Wei, B. Veeravalli, B. Gong, L. Zeng, and D. Feng. Cdrm: A cost-effective dynamic replication management scheme for cloud storage cluster. In Proc. IEEE CLUSTER, 2010.
-
(2010)
Proc. IEEE CLUSTER
-
-
Wei, Q.1
Veeravalli, B.2
Gong, B.3
Zeng, L.4
Feng, D.5
-
37
-
-
77954042005
-
Improving mapreduce performance through data placement in heterogeneous hadoop clusters
-
J. Xie, S. Yin, X. Ruan, Z. Ding, Y. Tian, J. Majors, A. Manzanares, and X. Qin. Improving mapreduce performance through data placement in heterogeneous hadoop clusters. In Proc. IEEE IPDPSW, 2010.
-
(2010)
Proc. IEEE IPDPSW
-
-
Xie, J.1
Yin, S.2
Ruan, X.3
Ding, Z.4
Tian, Y.5
Majors, J.6
Manzanares, A.7
Qin, X.8
-
38
-
-
33244454775
-
A taxonomy of workflow management systems for grid computing
-
J. Yu and R. Buyya. A taxonomy of workflow management systems for grid computing. Journal of Grid Computing, 3(3-4):171-200, 2005.
-
(2005)
Journal of Grid Computing
, vol.3
, Issue.3-4
, pp. 171-200
-
-
Yu, J.1
Buyya, R.2
-
39
-
-
85076883048
-
Improving mapreduce performance in heterogeneous environments
-
M. Zaharia, A. Konwinski, A. D. Joseph, R. H. Katz, and I. Stoica. Improving mapreduce performance in heterogeneous environments. In Proc. USENIX OSDI, 2008.
-
(2008)
Proc. USENIX OSDI
-
-
Zaharia, M.1
Konwinski, A.2
Joseph, A.D.3
Katz, R.H.4
Stoica, I.5
-
40
-
-
71749083423
-
Cloudwf: A computational workflow system for clouds based on hadoop
-
Springer
-
C. Zhang and H. De Sterck. Cloudwf: A computational workflow system for clouds based on hadoop. In Cloud Computing, pages 393-404. Springer, 2009.
-
(2009)
Cloud Computing
, pp. 393-404
-
-
Zhang, C.1
De Sterck, H.2