-
1
-
-
30344488259
-
MapReduce: Simplified Data Processing on Large Clusters
-
December
-
J. Dean and S. Ghemawat, "MapReduce: Simplified Data Processing on Large Clusters," USENIX OSDI, December, 2004.
-
(2004)
USENIX OSDI
-
-
Dean, J.1
Ghemawat, S.2
-
2
-
-
77952688268
-
-
homepage
-
Hadoop homepage. http://hadoop.apache.org/
-
-
-
-
3
-
-
77952687950
-
-
Pig homepage
-
Pig homepage. http://hadoop.apache.org/pig/
-
-
-
-
4
-
-
77952727561
-
-
Hive homepage
-
Hive homepage. http://hadoop.apache.org/hive/
-
-
-
-
5
-
-
77952685098
-
-
Mahout homepage
-
Mahout homepage. http://lucene.apache.org/mahout/
-
-
-
-
6
-
-
77952738404
-
-
HBase homepage
-
HBase homepage. http://hadoop.apache.org/hbase/
-
-
-
-
7
-
-
77952710781
-
-
GridMix program. Available in Hadoop source distribution: src/benchmarks/gridmix
-
GridMix program. Available in Hadoop source distribution: src/benchmarks/gridmix.
-
-
-
-
8
-
-
84859240903
-
A Comparison of Approaches to Large-Scale Data Analysis
-
June
-
A. Pavlo, A. Rasin, S. Madden, M. Stonebraker, D. DeWitt, E. Paulson, L. Shrinivas, and D. J. Abadi. "A Comparison of Approaches to Large-Scale Data Analysis", SIGMOD, June, 2009
-
(2009)
SIGMOD
-
-
Pavlo, A.1
Rasin, A.2
Madden, S.3
Stonebraker, M.4
DeWitt, D.5
Paulson, E.6
Shrinivas, L.7
Abadi, D.J.8
-
10
-
-
77952703919
-
-
Sort program. Available in Hadoop source distribution: src/examples/org/apache/hadoop/examples/sort
-
Sort program. Available in Hadoop source distribution: src/examples/org/apache/hadoop/examples/sort
-
-
-
-
11
-
-
70350582319
-
Improving MapReduce Performance in Heterogeneous Environments
-
December
-
Matei Zaharia, Andy Konwinski, Anthony D. Joseph, Randy Katz, and Ion Stoica. "Improving MapReduce Performance in Heterogeneous Environments", OSDI'08, December, 2008.
-
(2008)
OSDI'08
-
-
Zaharia, M.1
Konwinski, A.2
Joseph, A.D.3
Katz, R.4
Stoica, I.5
-
12
-
-
77952696330
-
-
HDFS homepage
-
HDFS homepage. http://hadoop.apache.org/hdfs/
-
-
-
-
14
-
-
77952715612
-
-
TeraSort
-
TeraSort. http://sortbenchmark.org/
-
-
-
-
15
-
-
77952687265
-
-
Hadoop TeraSort program. Available in Hadoop source distribution since 0.19 version: src/examples/org/apache/hadoop/examples/terasort
-
Hadoop TeraSort program. Available in Hadoop source distribution since 0.19 version: src/examples/org/apache/hadoop/examples/terasort
-
-
-
-
16
-
-
77952717465
-
-
TeraGen program. Available in Hadoop source distribution since 0.19 version: src/examples/org/apache/hadoop/examples/terasort/TeraGen
-
TeraGen program. Available in Hadoop source distribution since 0.19 version: src/examples/org/apache/hadoop/examples/terasort/TeraGen
-
-
-
-
18
-
-
69949113350
-
-
Available
-
"Sorting 1PB with MapReduce", Available: http://googleblog. blogspot.com/2008/11/sorting-1pb-with-mapreduce.html
-
Sorting 1PB with MapReduce
-
-
-
19
-
-
77952729932
-
-
DFSIO program. Available in Hadoop source distribution: src/test/org/apache/hadoop/fs/TestDFSIO
-
DFSIO program. Available in Hadoop source distribution: src/test/org/apache/hadoop/fs/TestDFSIO
-
-
-
-
20
-
-
77952676777
-
-
Nutch homepage
-
Nutch homepage. http://lucene.apache.org/nutch/
-
-
-
-
21
-
-
77952689407
-
-
WordCount program. Available in Hadoop source distribution: src/examples/org/apache/hadoop/ examples/WordCount
-
WordCount program. Available in Hadoop source distribution: src/examples/org/apache/hadoop/ examples/WordCount
-
-
-
-
22
-
-
77952718479
-
-
homepage
-
Lucene homepage. http://lucene.apache.org
-
-
-
-
23
-
-
77952710129
-
-
homepage
-
SmarFrog project homepage. http://www.smartfrog.org
-
-
-
-
24
-
-
77952710423
-
-
Hadoop User Group UK talk. Available
-
P. Castagna, "Having fun with PageRank and MapReduce," Hadoop User Group UK talk. Available: http://static.last.fm/johan/huguk-20090414/paolo- castagna-pagerank.pdf
-
Having Fun with PageRank and MapReduce
-
-
Castagna, P.1
-
26
-
-
77952702590
-
-
Mahout Naïve Bayesian
-
Mahout Naïve Bayesian. http://cwiki.apache.org/MAHOUT/naivebayes. html
-
-
-
-
27
-
-
77952711114
-
-
N-Gram
-
N-Gram. http://en.wikipedia.org/wiki/N-gram
-
-
-
-
28
-
-
77952690118
-
-
Tf-Idf
-
Tf-Idf. http://en.wikipedia.org/wiki/Tf%E2%80%93idf
-
-
-
-
29
-
-
77952679642
-
-
Wikipedia Dump. http://en.wikipedia.org/wiki/index.php?curid=68321
-
-
-
-
30
-
-
77952718148
-
-
Mahout K-means
-
Mahout K-means. http://cwiki.apache.org/MAHOUT/k-means.html
-
-
-
-
31
-
-
77952690801
-
-
Available
-
Yahoo! distribution of Hadoop. Avai lable: http://developer.yahoo.com/ hadoop/distribution/
-
Distribution of Hadoop
-
-
-
33
-
-
77952715956
-
-
Available
-
Hadoop-5191. Available: http://issues.apache.org/jira/browse/HADOOP-5191.
-
Hadoop-5191
-
-
|