-
1
-
-
85041206604
-
Data warehousing and analytics infrastructure at facebook
-
D. Borthakur, N. Jain, J. S. Sarma, R. Murthy, H. Liu. “Data warehousing and analytics infrastructure at facebook”. The 36th ACM SIGMOD International Conference on Management of Data, 2010.
-
(2010)
The 36th ACM SIGMOD International Conference on Management of Data
-
-
Borthakur, D.1
Jain, N.2
Sarma, J.S.3
Murthy, R.4
Liu, H.5
-
3
-
-
85080763570
-
-
Hadoop. http://hadoop.apache.org/
-
-
-
-
4
-
-
34548041192
-
Dryad: Distributed data-parallel programs from sequential building blocks
-
M. Isard, M. Budiu, Y. Yu, A. Birrell, D. Fetterly. “Dryad: Distributed Data-Parallel Programs from Sequential Building Blocks”. The 2nd European Conference on Computer Systems, 2007.
-
(2007)
The 2nd European Conference on Computer Systems
-
-
Isard, M.1
Budiu, M.2
Yu, Y.3
Birrell, A.4
Fetterly, D.5
-
5
-
-
55349148888
-
Pig Latin: A not-so-foreign language for data processing
-
C. Olston, B. Reed, U. Srivastava, R. Kumar, A. Tomkins. “Pig latin: a not-so-foreign language for data processing”. The 34th ACM SIGMOD international conference on Management of data, 2008.
-
(2008)
The 34th ACM SIGMOD International Conference on Management of Data
-
-
Olston, C.1
Reed, B.2
Srivastava, U.3
Kumar, R.4
Tomkins, A.5
-
6
-
-
85077028670
-
HIVE - A petabyte scale data warehousing using Hadoop
-
A. Thusoo, R. Murthy, J. S. Sarma, Z. Shao, N. Jain, P. Chakka, S. Anthony, H. Liu, N. Zhang. “Hive - A Petabyte Scale Data Warehousing Using Hadoop”. The 26th IEEE International Conference on Data Engineering, 2010.
-
(2010)
The 26th IEEE International Conference on Data Engineering
-
-
Thusoo, A.1
Murthy, R.2
Sarma, J.S.3
Shao, Z.4
Jain, N.5
Chakka, P.6
Anthony, S.7
Liu, H.8
Zhang, N.9
-
7
-
-
85080636064
-
-
Dryad. http://research.microsoft.com/en-us/projects/Dryad/
-
-
-
-
9
-
-
70350512695
-
A comparison of approaches to large-scale data analysis
-
A. Pavlo, E. Paulson, A. Rasin, D. J. Abadi, D. J. DeWitt, S. Madden, M. Stonebraker. “A comparison of approaches to large-scale data analysis”. The 35th SIGMOD international conference on Management of data, 2009.
-
(2009)
The 35th SIGMOD International Conference on Management of Data
-
-
Pavlo, A.1
Paulson, E.2
Rasin, A.3
Abadi, D.J.4
DeWitt, D.J.5
Madden, S.6
Stonebraker, M.7
-
12
-
-
81255181927
-
Chukwa: A system for reliable large-scale log collection
-
UC Berkeley
-
A. Rabkin, R. H. Katz. “Chukwa: A system for reliable large-scale log collection”, Technical Report UCB/EECS-2010-25, UC Berkeley, 2010.
-
(2010)
Technical Report UCB/EECS-2010-25
-
-
Rabkin, A.1
Katz, R.H.2
-
13
-
-
85080782072
-
-
Scribe. http://github.com/facebook/scribe
-
-
-
-
14
-
-
85080652603
-
-
Flume. https://github.com/cloudera/flume
-
-
-
-
15
-
-
85080778697
-
-
Java Instrumentation. http://download.oracle.com/javase/6/docs/api/java/lang/instrument/packagesummary.html
-
-
-
-
16
-
-
85080654954
-
-
Sort benchmark. http://sortbenchmark.org/
-
-
-
-
17
-
-
77952721751
-
The HiBench Benchmark suite: Characterization of the MapReduce-based data analysis
-
S. Huang, J. Huang, J. Dai, T. Xie, B. Huang. “The HiBench Benchmark suite: Characterization of the MapReduce-Based Data Analysis”. IEEE 26th International Conference on Data Engineering Workshops, 2010.
-
(2010)
IEEE 26th International Conference on Data Engineering Workshops
-
-
Huang, S.1
Huang, J.2
Dai, J.3
Xie, T.4
Huang, B.5
-
21
-
-
85080758421
-
-
Ganglia. http://ganglia.sourceforge.net/
-
-
-
-
23
-
-
77951466017
-
Job scheduling for multi-user MapReduce clusters
-
UC Berkeley
-
M. Zaharia, D. Borthakur, J. S. Sarma, K. Elmeleegy, S. Shenker, I. Stoica. “Job Scheduling for Multi-User MapReduce Clusters”. Technical Report UCB/EECS-2009-55, UC Berkeley, 2009.
-
(2009)
Technical Report UCB/EECS-2009-55
-
-
Zaharia, M.1
Borthakur, D.2
Sarma, J.S.3
Elmeleegy, K.4
Shenker, S.5
Stoica, I.6
-
24
-
-
77957790278
-
-
Hadoop Fair Scheduler Design Document. https://svn.apache.org/repos/asf/hadoop/mapreduce/trunk/src/contrib/fairscheduler/designdoc/fair_sche duler_design_doc.pdf
-
Hadoop Fair Scheduler Design Document
-
-
-
25
-
-
85080745884
-
-
patch
-
Hadoop LZO patch. http://github.com/kevinweil/ hadoop-lzo
-
-
-
-
28
-
-
80052989662
-
X-trace: A pervasive network tracing framework
-
R. Fonseca, G. Porter, R. H. Katz, S. Shenker, I. Stoica. “X-trace: A pervasive network tracing framework”. The 4th USENIX Symposium on Networked Systems Design & Implementation, 2007.
-
(2007)
The 4th USENIX Symposium on Networked Systems Design & Implementation
-
-
Fonseca, R.1
Porter, G.2
Katz, R.H.3
Shenker, S.4
Stoica, I.5
-
29
-
-
78650479309
-
Dapper, a large-scale distributed systems tracing infrastructure
-
B. H. Sigelman, L. A. Barroso, M. Burrows, P. Stephenson, M. Plakal, D. Beaver, S. Jaspan, C. Shanbhag. “Dapper, a Large-Scale Distributed Systems Tracing Infrastructure”. Google Research, 2010.
-
(2010)
Google Research
-
-
Sigelman, B.H.1
Barroso, L.A.2
Burrows, M.3
Stephenson, P.4
Plakal, M.5
Beaver, D.6
Jaspan, S.7
Shanbhag, C.8
-
30
-
-
74549159058
-
D3: Declarative distributed debugging
-
UC Berkeley
-
B. Chun, K. Chen, G. Lee, R. Katz, S. Shenker. “D3: Declarative Distributed Debugging”. Technical Report UCB/EECS-2008-27, UC Berkeley, 208.
-
Technical Report UCB/EECS-2008-27
, pp. 208
-
-
Chun, B.1
Chen, K.2
Lee, G.3
Katz, R.4
Shenker, S.5
-
31
-
-
77956067021
-
Google-wide profiling: A continuous profiling infrastructure for data centers
-
G. Ren, E. Tune, T. Moseley, Y. Shi, S. Rus, R. Hundt. “Google-Wide Profiling: A Continuous Profiling Infrastructure for Data Centers”. IEEE Micro (2010), pp. 65-79.
-
(2010)
IEEE Micro
, pp. 65-79
-
-
Ren, G.1
Tune, E.2
Moseley, T.3
Shi, Y.4
Rus, S.5
Hundt, R.6
-
33
-
-
85080661871
-
-
Nagios. http://www.nagios.org/
-
-
-
-
34
-
-
85080691878
-
-
Cacti. http://www.cacti.net/
-
-
-
-
36
-
-
77957761115
-
Kahuna: Problem diagnosis for MapReduce-based cloud computing environments
-
J. Tan, X. Pan, S. Kavulya, R. Gandhi, P. Narasimhan. “Kahuna: Problem Diagnosis for MapReduce-Based Cloud Computing Environments”. IEEE/IFIP Network Operations and Management Symposium (NOMS), 2010.
-
(2010)
IEEE/IFIP Network Operations and Management Symposium (NOMS)
-
-
Tan, J.1
Pan, X.2
Kavulya, S.3
Gandhi, R.4
Narasimhan, P.5
|