-
2
-
-
84870452716
-
-
"Apache Hadoop, " http://hadoop.apache.org/.
-
Apache Hadoop
-
-
-
3
-
-
37549003336
-
MapReduce: Simplified data processing on large clusters
-
J. Dean and S. Ghemawat, "MapReduce: Simplified Data Processing on Large Clusters, " Com ACM, pp. 107-113, 2008.
-
(2008)
Com ACM
, pp. 107-113
-
-
Dean, J.1
Ghemawat, S.2
-
4
-
-
84900451613
-
-
"Apache Spark, " https://spark.apache.org/.
-
Apache Spark
-
-
-
5
-
-
84887447695
-
The family of mapreduce and large-scale data processing systems
-
S. Sakr, A. Liu, and A. G. Fayoumi, "The Family of Mapreduce and Large-scale Data Processing Systems, " ACM Comput. Surv., vol. 46, no. 1, pp. 11:1-11:44, 2013.
-
(2013)
ACM Comput. Surv
, vol.46
, Issue.1
, pp. 1101-1144
-
-
Sakr, S.1
Liu, A.2
Fayoumi, A.G.3
-
6
-
-
85026965461
-
-
"SparkR, " http://amplab-extras.github.io/SparkR-pkg/.
-
SparkR
-
-
-
7
-
-
79957859069
-
SystemML: Declarative machine learning on mapreduce
-
A. Ghoting, R. Krishnamurthy, E. Pednault, B. Reinwald, V. Sindhwani, S. Tatikonda, Y. Tian, and S. Vaithyanathan, "SystemML: Declarative Machine Learning on MapReduce, " in Interntl Conf. on Data Engineering, 2011, pp. 231-242.
-
(2011)
Interntl Conf. on Data Engineering
, pp. 231-242
-
-
Ghoting, A.1
Krishnamurthy, R.2
Pednault, E.3
Reinwald, B.4
Sindhwani, V.5
Tatikonda, S.6
Tian, Y.7
Vaithyanathan, S.8
-
8
-
-
76749092270
-
The WEKA data mining software: An update
-
M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, and I. H. Witten, "The WEKA Data Mining Software: An Update, " SIGKDD Explor. Newsl., pp. 10-18, 2009.
-
(2009)
SIGKDD Explor. Newsl
, pp. 10-18
-
-
Hall, M.1
Frank, E.2
Holmes, G.3
Pfahringer, B.4
Reutemann, P.5
Witten, I.H.6
-
10
-
-
77951181190
-
Toolkit-based high-performance data mining of large data on mapreduce clusters
-
D. Wegener, M. Mock, D. Adranale, and S. Wrobel, "Toolkit-Based High-Performance Data Mining of Large Data on MapReduce Clusters, " in ICDM, 2009, pp. 296-301.
-
(2009)
ICDM
, pp. 296-301
-
-
Wegener, D.1
Mock, M.2
Adranale, D.3
Wrobel, S.4
-
11
-
-
85040175609
-
Resilient distributed datasets: A fault-Tolerant abstraction for inmemory cluster Computing
-
M. Zaharia, M. Chowdhury, T. Das, A. Dave, J. Ma, M. Mc-Cauley, M. J. Franklin, S. Shenker, and I. Stoica, "Resilient Distributed Datasets: A Fault-Tolerant Abstraction for Inmemory Cluster Computing, " in NSDI, 2012.
-
(2012)
NSDI
-
-
Zaharia, M.1
Chowdhury, M.2
Das, T.3
Dave, A.4
Ma, J.5
Mc-Cauley, M.6
Franklin, M.J.7
Shenker, S.8
Stoica, I.9
-
12
-
-
0002433547
-
From data mining to knowledge discovery: An overview
-
U. M. Fayyad, G. Piatetsky-Shapiro, and P. Smyth, "From Data Mining to Knowledge Discovery: An Overview, " in Advances in KDDM, 1996, pp. 1-34.
-
(1996)
Advances in KDDM
, pp. 1-34
-
-
Fayyad, U.M.1
Piatetsky-Shapiro, G.2
Smyth, P.3
-
14
-
-
84953912432
-
-
M. Hall, "Weka and Hadoop, " http://markahall.blogspot.co.uk/2013/10/weka-And-hadooppart-1.html/.
-
Weka and Hadoop
-
-
Hall, M.1
-
15
-
-
0030403087
-
Parallel mining of association rules
-
R. Agrawal and J. C. Shafer, "Parallel Mining of Association Rules, " Knowl. and Data Eng., pp. 962-969, 1996.
-
(1996)
Knowl. and Data Eng
, pp. 962-969
-
-
Agrawal, R.1
Shafer, J.C.2
-
16
-
-
84858620646
-
Disk-locality in datacenter computing considered irrelevant
-
G. Ananthanarayanan, A. Ghodsi, S. Shenker, and I. Stoica, "Disk-locality in Datacenter Computing Considered Irrelevant, " in Conf. on Hot Topics in Operating Systems, 2011.
-
(2011)
Conf. on Hot Topics in Operating Systems
-
-
Ananthanarayanan, G.1
Ghodsi, A.2
Shenker, S.3
Stoica, I.4
-
17
-
-
84893331360
-
Scale-up vs scale-out for hadoop: Time to rethink?
-
R. Appuswamy, C. Gkantsidis, D. Narayanan, O. Hodson, and A. Rowstron, "Scale-up vs Scale-out for Hadoop: Time to Rethink?" in Cloud Computing, 2013, pp. 20:1-20:13.
-
(2013)
Cloud Computing
, pp. 2001-2013
-
-
Appuswamy, R.1
Gkantsidis, C.2
Narayanan, D.3
Hodson, O.4
Rowstron, A.5
-
19
-
-
84870749286
-
-
"Apache Mahout, " http://mahout.apache.org/.
-
Apache Mahout
-
-
-
20
-
-
84894647945
-
MLI: An API for distributed machine learning
-
E. R. Sparks, A. Talwalkar, V. Smith, J. Kottalam, X. Pan, J. E. Gonzalez, M. J. Franklin, M. I. Jordan, and T. Kraska, "MLI: An API for distributed machine learning, " ICDM, 2013.
-
(2013)
ICDM
-
-
Sparks, E.R.1
Talwalkar, A.2
Smith, V.3
Kottalam, J.4
Pan, X.5
Gonzalez, J.E.6
Franklin, M.J.7
Jordan, M.I.8
Kraska, T.9
-
22
-
-
77954751910
-
Ricardo: Integrating R and Hadoop
-
S. Das, Y. Sismanis, K. S. Beyer, R. Gemulla, P. J. Haas, and J. McPherson, "Ricardo: Integrating R and Hadoop, " in Intl Conf. on Management of Data, 2010, pp. 987-998.
-
(2011)
Intl Conf. on Management of Data
, pp. 987-998
-
-
Das, S.1
Sismanis, Y.2
Beyer, K.S.3
Gemulla, R.4
Haas, P.J.5
McPherson, J.6
-
23
-
-
84959538188
-
-
accessed 2015-03-03
-
"RHIPE, " https://www.datadr.org/, accessed: 2015-03-03.
-
RHIPE
-
-
-
24
-
-
84923930001
-
RABID: A distributed parallel R for large datasets
-
H. Lin, S. Yang, and S. Midkiff, "RABID: A distributed parallel R for large datasets, " in Congress on Big Data, 2014, pp. 725-732.
-
(2014)
Congress on Big Data
, pp. 725-732
-
-
Lin, H.1
Yang, S.2
Midkiff, S.3
-
25
-
-
85026967969
-
-
"MLib, " https://spark.apache.org/mllib/.
-
MLib
-
-
-
26
-
-
26944487959
-
Adapting the weka data mining toolkit to a grid based environment
-
M. Prez, A. Snchez, P. Herrero, V. Robles, and J. Pea, "Adapting the Weka Data Mining Toolkit to a Grid Based Environment, " Web Intelligence, pp. 492-497, 2005.
-
(2005)
Web Intelligence
, pp. 492-497
-
-
Prez, M.1
Snchez, A.2
Herrero, P.3
Robles, V.4
Pea, J.5
-
28
-
-
33646429469
-
Weka4WS: A WSRFenabled weka toolkit for distributed data mining on grids
-
D. Talia, P. Trunfio, and O. Verta, "Weka4WS: A WSRFEnabled Weka Toolkit for Distributed Data Mining on Grids, " Knowledge Discovery in Databases, pp. 309-320, 2005.
-
(2005)
Knowledge Discovery in Databases
, pp. 309-320
-
-
Talia, D.1
Trunfio, P.2
Verta, O.3
-
29
-
-
0034592784
-
Efficient clustering of high-dimensional data sets with application to reference Matching
-
A. McCallum, K. Nigam, and L. H. Ungar, "Efficient Clustering of High-dimensional Data Sets with Application to Reference Matching, " in KDD, 2000, pp. 169-178.
-
(2000)
KDD
, pp. 169-178
-
-
McCallum, A.1
Nigam, K.2
Ungar, L.H.3
|