-
1
-
-
80052664667
-
-
Hadoop. http://hadoop.apache.org.
-
Hadoop
-
-
-
2
-
-
80052658870
-
-
HBase. http://hadoop.apache.org/hbase.
-
HBase
-
-
-
3
-
-
80052669821
-
-
Hive. http://hadoop.apache.org/hive.
-
Hive
-
-
-
6
-
-
80052652238
-
-
JAQL. http://www.jaql.org.
-
JAQL
-
-
-
7
-
-
80052649443
-
-
Mahout. http://lucene.apache.org/mahout/.
-
Mahout
-
-
-
8
-
-
80052682652
-
-
MPI. http://www.mpi-forum.org.
-
MPI
-
-
-
9
-
-
80052662960
-
-
OpenMP. http://www.openmp.org.
-
OpenMP
-
-
-
10
-
-
80052674249
-
-
PThreads. https://computing.llnl.gov/tutorials/pthreads.
-
PThreads
-
-
-
11
-
-
0027621699
-
Mining association rules between sets of items in large databases
-
R. Agrawal et al. Mining association rules between sets of items in large databases. ACM SIGMOD, 22(2), 1993.
-
(1993)
ACM SIGMOD
, vol.22
, Issue.2
-
-
Agrawal, R.1
-
12
-
-
0030211964
-
Bagging predictors
-
L. Breiman. Bagging predictors. Machine Learning, 24(2), 1996.
-
(1996)
Machine Learning
, vol.24
, Issue.2
-
-
Breiman, L.1
-
14
-
-
56049109090
-
Map-reduce for machine learning on multicore
-
C. Chu et al. Map-reduce for machine learning on multicore. In NIPS, 2007.
-
(2007)
NIPS
-
-
Chu, C.1
-
15
-
-
33749568964
-
A general framework for accurate and fast regression by data summarization in random decision trees
-
W. Fan et al. A general framework for accurate and fast regression by data summarization in random decision trees. In ACM SIGKDD, 2006.
-
(2006)
ACM SIGKDD
-
-
Fan, W.1
-
16
-
-
42749086305
-
Fast mining of distance-based outliers in high-dimensional datasets
-
A. Ghoting et al. Fast mining of distance-based outliers in high-dimensional datasets. DMKD, 16(3), 2008.
-
(2008)
DMKD
, vol.16
, Issue.3
-
-
Ghoting, A.1
-
17
-
-
34548041192
-
Dryad: Distributed data-parallel programs from sequential building blocks
-
M. Isard et al. Dryad: distributed data-parallel programs from sequential building blocks. In SIGOPS Operating System Review, 2007.
-
(2007)
SIGOPS Operating System Review
-
-
Isard, M.1
-
18
-
-
80052673077
-
Shared memory parallelization of data mining algorithms: Techniques, programming interface, and performance
-
R. Jin and G. Agrawal. Shared Memory Parallelization of Data Mining Algorithms: Techniques, Programming Interface, and Performance. In SDM, 2002.
-
(2002)
SDM
-
-
Jin, R.1
Agrawal, G.2
-
19
-
-
74049087889
-
PFunc: Modern task parallelism for modern high performance computing
-
P. Kambadur et al. PFunc: Modern Task Parallelism For Modern High Performance Computing. In SC, 2009.
-
(2009)
SC
-
-
Kambadur, P.1
-
20
-
-
15344347807
-
Gradient-based learning applied to document recognition
-
Y. LeCun et al. Gradient-based learning applied to document recognition. In Intelligent Signal Processing, 2001.
-
(2001)
Intelligent Signal Processing
-
-
Lecun, Y.1
-
21
-
-
63449087382
-
Pfp: Parallel fp-growth for query recommendation
-
H. Li et al. Pfp: parallel fp-growth for query recommendation. In ACM RecSys, 2008.
-
(2008)
ACM RecSys
-
-
Li, H.1
-
22
-
-
55349148888
-
Pig latin: A not-so-foreign language for data processing
-
C. Olston et al. Pig latin: a not-so-foreign language for data processing. In ACM SIGMOD, 2008.
-
(2008)
ACM SIGMOD
-
-
Olston, C.1
-
23
-
-
77955032649
-
PLANET: Massively parallel learning of tree ensembles with MapReduce
-
B. Panda et al. PLANET: massively parallel learning of tree ensembles with MapReduce. Proceedings of the VLDB Endowment, 2(2), 2009.
-
(2009)
Proceedings of the VLDB Endowment
, vol.2
, Issue.2
-
-
Panda, B.1
-
24
-
-
70350591395
-
DryadLINQ: A system for general-purpose distributed data-parallel computing using a high-level language
-
Y. Yu et al. DryadLINQ: A System for General-Purpose Distributed Data-Parallel Computing Using a High-Level Language. In OSDI, 2008.
-
(2008)
OSDI
-
-
Yu, Y.1
|