-
3
-
-
37549003336
-
MapReduce: Simplified data processing on large clusters
-
J. Dean and S. Ghemawat, "MapReduce: Simplified data processing on large clusters," Commun ACM, 51(1), pp. 107-113, 2008.
-
(2008)
Commun ACM
, vol.51
, Issue.1
, pp. 107-113
-
-
Dean, J.1
Ghemawat, S.2
-
4
-
-
84870452716
-
-
Apache Hadoop, http://hadoop.apache.org.
-
Apache Hadoop
-
-
-
5
-
-
84887530145
-
Distributed data management using MapReduce
-
F. Li, B. C. Ooi, M. T. Özsu and S. Wu, "Distributed data management using MapReduce," ACM Computing Surveys, 46(3), pp. 1-42, 2014.
-
(2014)
ACM Computing Surveys
, vol.46
, Issue.3
, pp. 1-42
-
-
Li, F.1
Ooi, B.C.2
Özsu, M.T.3
Wu, S.4
-
6
-
-
84887843977
-
A survey of large-scale analytical query processing in MapReduce
-
C. Doulkeridis and K. Nørvåg, "A survey of large-scale analytical query processing in MapReduce," The VLDB Journal, pp. 1-26, 2013.
-
(2013)
The VLDB Journal
, pp. 1-26
-
-
Doulkeridis, C.1
Nørvåg, K.2
-
7
-
-
84887447695
-
The family of mapreduce and large-scale data processing systems
-
S. Sakr, A. Liu and A. Fayoumi, "The family of mapreduce and large-scale data processing systems," ACM Computing Surveys, 46(1), pp. 1-44, 2013.
-
(2013)
ACM Computing Surveys
, vol.46
, Issue.1
, pp. 1-44
-
-
Sakr, S.1
Liu, A.2
Fayoumi, A.3
-
8
-
-
84926426301
-
Data management in cloud environments: NoSQL and NewSQL data stores
-
K. Grolinger, W. A. Higashino, A. Tiwari and M. A. Capretz, "Data management in cloud environments: NoSQL and NewSQL data stores," Journal of Cloud Computing: Advances, Systems and Application, 2, 2013.
-
(2013)
Journal of Cloud Computing: Advances, Systems and Application
, vol.2
-
-
Grolinger, K.1
Higashino, W.A.2
Tiwari, A.3
Capretz, M.A.4
-
10
-
-
84868307166
-
MAD skills: New analysis practices for big data
-
J. Cohen, B. Dolan, M. Dunlap, J. M. Hellerstein and C. Welton, "MAD skills: New analysis practices for Big Data," VLDB Endowment, 2(2), pp. 1481-1492, 2009.
-
(2009)
VLDB Endowment
, vol.2
, Issue.2
, pp. 1481-1492
-
-
Cohen, J.1
Dolan, B.2
Dunlap, M.3
Hellerstein, J.M.4
Welton, C.5
-
11
-
-
85113862134
-
-
Apache Cassandra, http://www.datastax.com/docs.
-
-
-
-
12
-
-
79951992905
-
-
Sebastopol, CA, USA: O'Reilly Media
-
J. C. Anderson, J. Lehnardt and N. Slater, CouchDB: The Definitive Guide, Sebastopol, CA, USA: O'Reilly Media, 2010.
-
(2010)
CouchDB: The Definitive Guide
-
-
Anderson, J.C.1
Lehnardt, J.2
Slater, N.3
-
13
-
-
84868325513
-
HIVe: A warehousing solution over a map-reduce framework
-
A. Thusoo, J. S. Sarma, N. Jain, Z. Shao, P. Chakka, S. Anthony, H. Liu, P. Wyckoff and R. Murthy, "Hive: A warehousing solution over a map-reduce framework," Proc. of the VLDB Endowment, 2(2), pp. 1626-1629, 2009.
-
(2009)
Proc. Of the VLDB Endowment
, vol.2
, Issue.2
, pp. 1626-1629
-
-
Thusoo, A.1
Sarma, J.S.2
Jain, N.3
Shao, Z.4
Chakka, P.5
Anthony, S.6
Liu, H.7
Wyckoff, P.8
Murthy, R.9
-
14
-
-
85113839422
-
-
Apache Mahout, https://mahout.apache.org/.
-
-
-
-
15
-
-
84862703502
-
-
Oracle Big Data connectors, http://www.oracle.com/us/products/database/big-data-connectors/overview/index.html.
-
Oracle Big Data Connectors
-
-
-
16
-
-
84891115556
-
Hone: Scaling down hadoop on shared-memory systems
-
K. A. Kumar, J. Gluck, A. Deshpande and J. Lin, "Hone: Scaling down hadoop on shared-memory systems," Proc. of the VLDB Endowment, 6(12), pp. 1354-1357, 2013.
-
(2013)
Proc. Of the VLDB Endowment
, vol.6
, Issue.12
, pp. 1354-1357
-
-
Kumar, K.A.1
Gluck, J.2
Deshpande, A.3
Lin, J.4
-
17
-
-
84969140151
-
Unexpected challenges in large scale machine learning
-
C. Parker, "Unexpected challenges in large scale machine learning," Proc. of the 1st International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications, 2012.
-
(2012)
Proc. Of the 1st International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications
-
-
Parker, C.1
-
18
-
-
84991756964
-
Mapreduce is good enough? If all you have is a hammer, throw away everything that's not a nail!
-
J. Lin, "Mapreduce is good enough? if all you have is a hammer, throw away everything that's not a nail!" Big Data, 1(1), pp. 28-37, 2013.
-
(2013)
Big Data
, vol.1
, Issue.1
, pp. 28-37
-
-
Lin, J.1
-
19
-
-
77954723629
-
Pregel: A system for large-scale graph processing
-
G. Malewicz, M. H. Austern, A. J. C. Bik, J. C. Dehnert, I. Horn, N. Leiser and G. Czajkowski, "Pregel: A system for large-scale graph processing," Proc. of the 2010 ACM SIGMOD International Conference on Management of Data, 2010.
-
(2010)
Proc. Of the 2010 ACM SIGMOD International Conference on Management of Data
-
-
Malewicz, G.1
Austern, M.H.2
Bik, A.J.C.3
Dehnert, J.C.4
Horn, I.5
Leiser, N.6
Czajkowski, G.7
-
20
-
-
85113878382
-
-
Apache Giraph, https://giraph.apache.org/.
-
-
-
-
21
-
-
84900451613
-
-
Apache Spark, https://spark.incubator.apache.org/.
-
Apache Spark
-
-
-
22
-
-
79956351190
-
Haloop: Efficient iterative data processing on large clusters
-
Y. Bu, B. Howe, M. Balazinska and M. D. Ernst, "HaLoop: Efficient iterative data processing on large clusters," Proc.VLDB Endow., 3(1-2), pp. 285-296, 2010.
-
(2010)
Proc.VLDB Endow.
, vol.3
, Issue.1-2
, pp. 285-296
-
-
Bu, Y.1
Howe, B.2
Balazinska, M.3
Ernst, M.D.4
-
23
-
-
78650003594
-
Twister: A runtime for iterative MapReduce
-
J. Ekanayake, H. Li, B. Zhang, T. Gunarathne, S. Bae, J. Qiu and G. Fox, "Twister: A runtime for iterative MapReduce," Proc. of the 19th ACM International Symposium on High Performance Distributed Computing, 2010.
-
(2010)
Proc. Of the 19th ACM International Symposium on High Performance Distributed Computing
-
-
Ekanayake, J.1
Li, H.2
Zhang, B.3
Gunarathne, T.4
Bae, S.5
Qiu, J.6
Fox, G.7
-
24
-
-
60649115578
-
Data preprocessing for supervised learning
-
S. B. Kotsiantis, D. Kanellopoulos and P. Pintelas, "Data preprocessing for supervised learning," International Journal of Computer Science, 1(2), pp. 111, 2006.
-
(2006)
International Journal of Computer Science
, vol.1
, Issue.2
, pp. 111
-
-
Kotsiantis, S.B.1
Kanellopoulos, D.2
Pintelas, P.3
-
27
-
-
84991746870
-
Bring the noise: Embracing randomness is the key to scaling up machine learning algorithms
-
B. Dalessandro, "Bring the noise: Embracing randomness is the key to scaling up machine learning algorithms," Big Data, 1(2), pp. 110-112, 2013.
-
(2013)
Big Data
, vol.1
, Issue.2
, pp. 110-112
-
-
Dalessandro, B.1
-
28
-
-
85076916744
-
Reining in the outliers in map-reduce clusters using Mantri
-
G. Ananthanarayanan, S. Kandula, A. Greenberg, I. Stoica, Y. Lu, B. Saha and E. Harris, "Reining in the outliers in map-reduce clusters using Mantri," Proc. of the 9th USENIX Conference on Operating Systems Design and Implementation, 2010.
-
(2010)
Proc. Of the 9th USENIX Conference on Operating Systems Design and Implementation
-
-
Ananthanarayanan, G.1
Kandula, S.2
Greenberg, A.3
Stoica, I.4
Lu, Y.5
Saha, B.6
Harris, E.7
-
32
-
-
84890813536
-
Interactive analysis of big data
-
J. Heer and S. Kandel, "Interactive analysis of Big Data," XRDS: Crossroads, the ACM Magazine for Students, 19(1), pp. 50-54, 2012.
-
(2012)
XRDS: Crossroads, the ACM Magazine for Students
, vol.19
, Issue.1
, pp. 50-54
-
-
Heer, J.1
Kandel, S.2
-
33
-
-
82155187187
-
InCOOP: MapReduce for incremental computations
-
P. Bhatotia, A. Wieder, R. Rodrigues, U. A. Acar and R. Pasquin, "Incoop: MapReduce for incremental computations," Proc. of the 2nd ACM Symposium on Cloud Computing, 2011.
-
(2011)
Proc. Of the 2nd ACM Symposium on Cloud Computing
-
-
Bhatotia, P.1
Wieder, A.2
Rodrigues, R.3
Acar, U.A.4
Pasquin, R.5
-
34
-
-
84873134968
-
Interactive analytical processing in big data systems: A cross-industry study of MapReduce workloads
-
Y. Chen, S. Alspaugh and R. Katz, "Interactive analytical processing in Big Data systems: A cross-industry study of MapReduce workloads," Proc. of the VLDB Endowment, 5(12), pp. 1802-1813, 2012.
-
(2012)
Proc. Of the VLDB Endowment
, vol.5
, Issue.12
, pp. 1802-1813
-
-
Chen, Y.1
Alspaugh, S.2
Katz, R.3
-
35
-
-
79958258284
-
Dremel: Interactive analysis of Web-scale datasets
-
S. Melnik, A. Gubarev, J. J. Long, G. Romer, S. Shivakumar, M. Tolton and T. Vassilakis, "Dremel: Interactive analysis of Web-scale datasets," Proc. of the VLDB Endowment, 3(1-2), pp. 330-339, 2010.
-
(2010)
Proc. Of the VLDB Endowment
, vol.3
, Issue.1-2
, pp. 330-339
-
-
Melnik, S.1
Gubarev, A.2
Long, J.J.3
Romer, G.4
Shivakumar, S.5
Tolton, M.6
Vassilakis, T.7
-
36
-
-
84873173544
-
Processing a trillion cells per mouse click
-
A. Hall, O. Bachmann, R. Büssow, S. Gănceanu and M. Nunkesser, "Processing a trillion cells per mouse click," Proc. of the VLDB Endowment, 5(11), pp. 1436-1446, 2012.
-
(2012)
Proc. Of the VLDB Endowment
, vol.5
, Issue.11
, pp. 1436-1446
-
-
Hall, A.1
Bachmann, O.2
Büssow, R.3
Gănceanu, S.4
Nunkesser, M.5
-
37
-
-
84877703682
-
BlinkDB: Queries with bounded errors and bounded response times on very large data
-
S. Agarwal, B. Mozafari, A. Panda, H. Milner, S. Madden and I. Stoica, "BlinkDB: Queries with bounded errors and bounded response times on very large data," Proc. of the Eight ACM European Conference on Computer Systems, 2013.
-
(2013)
Proc. Of the Eight ACM European Conference on Computer Systems
-
-
Agarwal, S.1
Mozafari, B.2
Panda, A.3
Milner, H.4
Madden, S.5
Stoica, I.6
-
38
-
-
84055184196
-
Parallel visualization on large clusters using MapReduce
-
H. T. Vo, J. Bronson, B. Summa, J. L. Comba, J. Freire, B. Howe, V. Pascucci and C. T. Silva, "Parallel visualization on large clusters using MapReduce," IEEE Symposium on Large Data Analysis and Visualization, 2011.
-
(2011)
IEEE Symposium on Large Data Analysis and Visualization
-
-
Vo, H.T.1
Bronson, J.2
Summa, B.3
Comba, J.L.4
Freire, J.5
Howe, B.6
Pascucci, V.7
Silva, C.T.8
-
39
-
-
84872409772
-
Muppet: MapReduce-style processing of fast data
-
W. Lam, L. Liu, S. Prasad, A. Rajaraman, Z. Vacheri and A. Doan, "Muppet: MapReduce-style processing of fast data," Proc.VLDB Endow., 5(12), pp. 1814-1825, 2012.
-
(2012)
Proc.VLDB Endow.
, vol.5
, Issue.12
, pp. 1814-1825
-
-
Lam, W.1
Liu, L.2
Prasad, S.3
Rajaraman, A.4
Vacheri, Z.5
Doan, A.6
-
41
-
-
85076771850
-
MapReduce online
-
T. Condie, N. Conway, P. Alvaro, J. M. Hellerstein, K. Elmeleegy and R. Sears, "MapReduce online," Proc. of the 7th USENIX Conference on Networked Systems Design and Implementation, 2010.
-
(2010)
Proc. Of the 7th USENIX Conference on Networked Systems Design and Implementation
-
-
Condie, T.1
Conway, N.2
Alvaro, P.3
Hellerstein, J.M.4
Elmeleegy, K.5
Sears, R.6
-
42
-
-
70349310834
-
Ad-hoc data processing in the cloud
-
D. Logothetis and K. Yocum, "Ad-hoc data processing in the cloud," Proc. of the VLDB Endowment, 1(2), pp. 1472-1475, 2008.
-
(2008)
Proc. Of the VLDB Endowment
, vol.1
, Issue.2
, pp. 1472-1475
-
-
Logothetis, D.1
Yocum, K.2
-
43
-
-
84857167165
-
Scalable and low-latency data processing with stream MapReduce
-
A. Brito, A. Martin, T. Knauth, S. Creutz, D. Becker, S. Weigert and C. Fetzer, "Scalable and low-latency data processing with stream MapReduce," IEEE Third International Conference on Cloud Computing Technology and Science, pp. 48-58, 2011.
-
(2011)
IEEE Third International Conference on Cloud Computing Technology and Science
, pp. 48-58
-
-
Brito, A.1
Martin, A.2
Knauth, T.3
Creutz, S.4
Becker, D.5
Weigert, S.6
Fetzer, C.7
-
46
-
-
84962598306
-
Discretized streams: An efficient and fault-tolerant model for stream processing on large clusters
-
M. Zaharia, T. Das, H. Li, S. Shenker and I. Stoica, "Discretized streams: An efficient and fault-tolerant model for stream processing on large clusters," Proc. of the 4th USENIX Conference on Hot Topics in Cloud Computing, 2012.
-
(2012)
Proc. Of the 4th USENIX Conference on Hot Topics in Cloud Computing
-
-
Zaharia, M.1
Das, T.2
Li, H.3
Shenker, S.4
Stoica, I.5
-
47
-
-
84883165397
-
Achieving accountable MapReduce in cloud computing
-
Z. Xiao and Y. Xiao, "Achieving accountable MapReduce in cloud computing," Future Generation Computer Systems, 30,pp. 1-13, 2014.
-
(2014)
Future Generation Computer Systems
, vol.30
, pp. 1-13
-
-
Xiao, Z.1
Xiao, Y.2
-
51
-
-
85015649873
-
Airavat: Security and privacy for MapReduce
-
I. Roy, S. T. Setty, A. Kilzer, V. Shmatikov and E. Witchel, "Airavat: Security and privacy for MapReduce." Proc. of the 7th Usenix Symposium on Networked Systems Design and Implementation, 2010.
-
(2010)
Proc. Of the 7th Usenix Symposium on Networked Systems Design and Implementation
-
-
Roy, I.1
Setty, S.T.2
Kilzer, A.3
Shmatikov, V.4
Witchel, E.5
|