-
1
-
-
78649359388
-
Big data: Science in the peta byte era
-
"Big data: science in the peta byte era," Nature 455 (7209): 1, 2008.
-
(2008)
Nature
, vol.455
, Issue.7209
, pp. 1
-
-
-
3
-
-
77954696736
-
An evaluation of alternative architectures for transaction processing in the cloud
-
D. Kossmann, T. Kraska, and S. Loesing, "An evaluation of alternative architectures for transaction processing in the cloud," in Proceedings of the 2010 international conference on Management of data. ACM, 2010, pp. 579-590.
-
(2010)
Proceedings of the 2010 International Conference on Management of Data. ACM
, pp. 579-590
-
-
Kossmann, D.1
Kraska, T.2
Loesing, S.3
-
4
-
-
21644437974
-
The google file system
-
ACM
-
S. Ghemawat, H. Gobioff, and S. Leung, "The google file system," in ACM SIGOPS Operating Systems Review, vol. 37, no. 5. ACM, 2003, pp. 29-43.
-
(2003)
ACM SIGOPS Operating Systems Review
, vol.37
, Issue.5
, pp. 29-43
-
-
Ghemawat, S.1
Gobioff, H.2
Leung, S.3
-
5
-
-
37549003336
-
Mapreduce: Simplified data processing on large clusters
-
J. Dean and S. Ghemawat, "Mapreduce: simplified data processing on large clusters," Communications of the ACM, vol. 51, no. 1, pp. 107-113, 2008.
-
(2008)
Communications of the ACM
, vol.51
, Issue.1
, pp. 107-113
-
-
Dean, J.1
Ghemawat, S.2
-
6
-
-
70450136675
-
The hadoop distributed file system: Architecture and design
-
D. Borthakur, "The hadoop distributed file system: Architecture and design," Hadoop Project Website, vol. 11, 2007.
-
(2007)
Hadoop Project Website
, vol.11
-
-
Borthakur, D.1
-
8
-
-
80053256327
-
A survey of large scale data management approaches in cloud environments
-
S. Sakr, A. Liu, D. Batista, and M. Alomari, "A survey of large scale data management approaches in cloud environments," Communications Surveys & Tutorials, IEEE, vol. 13, no. 3, pp. 311-336, 2011.
-
(2011)
Communications Surveys & Tutorials, IEEE
, vol.13
, Issue.3
, pp. 311-336
-
-
Sakr, S.1
Liu, A.2
Batista, D.3
Alomari, M.4
-
9
-
-
79957815245
-
Es2: A cloud data storage system for supporting both oltp and olap
-
Y. Cao, C. Chen, F. Guo, D. Jiang, Y. Lin, B. Ooi, H. Vo, S. Wu, and Q. Xu, "Es2: A cloud data storage system for supporting both oltp and olap," in Data Engineering (ICDE), 2011 IEEE 27th International Conference on. IEEE, 2011, pp. 291-302.
-
(2011)
Data Engineering (ICDE), 2011 IEEE 27th International Conference On. IEEE
, pp. 291-302
-
-
Cao, Y.1
Chen, C.2
Guo, F.3
Jiang, D.4
Lin, Y.5
Ooi, B.6
Vo, H.7
Wu, S.8
Xu, Q.9
-
10
-
-
81355137361
-
Bigtable: A distributed structured data storage system
-
F. Chang, J. Dean, S. Ghemawat, W. Hsieh, D. Wallach, M. Burrows, T. Chandra, A. Fikes, and R. Gruber, "Bigtable: A distributed structured data storage system," in 7th OSDI, 2006, pp. 305-314.
-
(2006)
7th OSDI
, pp. 305-314
-
-
Chang, F.1
Dean, J.2
Ghemawat, S.3
Hsieh, W.4
Wallach, D.5
Burrows, M.6
Chandra, T.7
Fikes, A.8
Gruber, R.9
-
11
-
-
84867112010
-
Pnuts: Yahoo!'s hosted data serving platform
-
B. Cooper, R. Ramakrishnan, U. Srivastava, A. Silberstein, P. Bohannon, H. Jacobsen, N. Puz, D. Weaver, and R. Yerneni, "Pnuts: Yahoo!'s hosted data serving platform," Proceedings of the VLDB Endowment, vol. 1, no. 2, pp. 1277-1288, 2008.
-
(2008)
Proceedings of the VLDB Endowment
, vol.1
, Issue.2
, pp. 1277-1288
-
-
Cooper, B.1
Ramakrishnan, R.2
Srivastava, U.3
Silberstein, A.4
Bohannon, P.5
Jacobsen, H.6
Puz, N.7
Weaver, D.8
Yerneni, R.9
-
12
-
-
70450064414
-
Dynamo: Amazon's highly available key value store
-
ACM
-
G. DeCandia, D. Hastorun, M. Jampani, G. Kakulapati, A. Lakshman, A. Pilchin, S. Sivasubramanian, P. Vosshall, and W. Vogels, "Dynamo: amazon's highly available keyvalue store," in ACM SIGOPS Operating Systems Review, vol. 41, no. 6. ACM, 2007, pp. 205-220.
-
(2007)
ACM SIGOPS Operating Systems Review
, vol.41
, Issue.6
, pp. 205-220
-
-
Decandia, G.1
Hastorun, D.2
Jampani, M.3
Kakulapati, G.4
Lakshman, A.5
Pilchin, A.6
Sivasubramanian, S.7
Vosshall, P.8
Vogels, W.9
-
13
-
-
79959945877
-
Llama: Leveraging columnar storage for scalable join processing in the map reduce framework
-
Y. Lin, D. Agrawal, C. Chen, B. Ooi, and S. Wu, "Llama: leveraging columnar storage for scalable join processing in the mapreduce framework," in Proceedings of the 2011 international conference on Management of data. ACM, 2011, pp. 961-972.
-
(2011)
Proceedings of the 2011 International Conference on Management of Data. ACM
, pp. 961-972
-
-
Lin, Y.1
Agrawal, D.2
Chen, C.3
Ooi, B.4
Wu, S.5
-
14
-
-
70349750047
-
The eucalyptus open-source cloud-computing system
-
D. Nurmi, R. Wolski, C. Grzegorczyk, G. Obertelli, S. Soman, L. Youseff, and D. Zagorodnov, "The eucalyptus open-source cloud-computing system," in Cluster computing and the Grid, 2009. CCGRID'09. 9th IEEE/ACM International Symposium on. IEEE, 2009, pp. 124-131.
-
(2009)
Cluster Computing and the Grid, 2009. CCGRID'09. 9th IEEE/ACM International Symposium On. IEEE
, pp. 124-131
-
-
Nurmi, D.1
Wolski, R.2
Grzegorczyk, C.3
Obertelli, G.4
Soman, S.5
Youseff, L.6
Zagorodnov, D.7
-
15
-
-
79952437126
-
A comparison and critique of eucalyptus, opennebula and nimbus
-
P. Sempolinski and D. Thain, "A comparison and critique of eucalyptus, opennebula and nimbus," in Cloud computing Technology and Science (CloudCom), 2010 IEEE Second International Conference on. Ieee, 2010, pp. 417-426.
-
(2010)
Cloud Computing Technology and Science (CloudCom), 2010 IEEE Second International Conference On. Ieee
, pp. 417-426
-
-
Sempolinski, P.1
Thain, D.2
-
17
-
-
73649141347
-
Mapreduce and parallel dbmss: Friends or foes
-
M. Stonebraker, D. Abadi, D. DeWitt, S. Madden, E. Paulson, A. Pavlo, and A. Rasin, "Mapreduce and parallel dbmss: friends or foes," Communications of the ACM, vol. 53, no. 1, pp. 64-71, 2010.
-
(2010)
Communications of the ACM
, vol.53
, Issue.1
, pp. 64-71
-
-
Stonebraker, M.1
Abadi, D.2
Dewitt, D.3
Madden, S.4
Paulson, E.5
Pavlo, A.6
Rasin, A.7
-
18
-
-
79957809015
-
Hadoopdb: An architectural hybrid of mapreduce and dbms technologies for analytical workloads
-
A. Abouzeid, K. Bajda-Pawlikowski, D. Abadi, A. Silberschatz, and A. Rasin, "Hadoopdb: an architectural hybrid of mapreduce and dbms technologies for analytical workloads," Proceedings of the VLDB Endowment, vol. 2, no. 1, pp. 922-933, 2009.
-
(2009)
Proceedings of the VLDB Endowment
, vol.2
, Issue.1
, pp. 922-933
-
-
Abouzeid, A.1
Bajda-Pawlikowski, K.2
Abadi, D.3
Silberschatz, A.4
Rasin, A.5
-
19
-
-
77954696367
-
Integrating hadoop and parallel dbms
-
Y. Xu, P. Kostamaa, and L. Gao, "Integrating hadoop and parallel dbms," in Proceedings of the 2010 international conference on Management of data. ACM, 2010, pp. 969-974.
-
(2010)
Proceedings of the 2010 International Conference on Management of Data. ACM
, pp. 969-974
-
-
Xu, Y.1
Kostamaa, P.2
Gao, L.3
-
20
-
-
80053521271
-
Hadoop++: Making a yellow elephant run like a cheetah (without it even noticing)
-
J. Dittrich, J. Quiane-Ruiz, A. Jindal, Y. Kargin, V. Setty, and J. Schad, "Hadoop++: Making a yellow elephant run like a cheetah (without it even noticing)," Proceedings of the VLDB Endowment, vol. 3, no. 1-2, pp. 515-529, 2010.
-
(2010)
Proceedings of the VLDB Endowment
, vol.3
, Issue.1-2
, pp. 515-529
-
-
Dittrich, J.1
Quiane-Ruiz, J.2
Jindal, A.3
Kargin, Y.4
Setty, V.5
Schad, J.6
-
21
-
-
70349310834
-
Ad-hoc data processing in the cloud
-
D. Logothetis and K. Yocum, "Ad-hoc data processing in the cloud," Proceedings of the VLDB Endowment, vol. 1, no. 2, pp. 1472-1475, 2008.
-
(2008)
Proceedings of the VLDB Endowment
, vol.1
, Issue.2
, pp. 1472-1475
-
-
Logothetis, D.1
Yocum, K.2
-
22
-
-
84870834119
-
Inverted grid-based knn query processing with mapreduce
-
C. Ji, T. Dong, Y. Li, Y. Shen, K. Li, W. Qiu, W. Qu, and M. Guo, "Inverted grid-based knn query processing with mapreduce," in ChinaGrid, 2012 Seventh ChinaGrid Annual Conference on. IEEE, 2012, pp. 25-33.
-
(2012)
ChinaGrid, 2012 Seventh ChinaGrid Annual Conference On. IEEE
, pp. 25-33
-
-
Ji, C.1
Dong, T.2
Li, Y.3
Shen, Y.4
Li, K.5
Qiu, W.6
Qu, W.7
Guo, M.8
-
23
-
-
77954751910
-
Ricardo: Integrating r and hadoop
-
S. Das, Y. Sismanis, K. Beyer, R. Gemulla, P. Haas, and J. McPherson, "Ricardo: integrating r and hadoop," in Proceedings of the 2010 international conference on Management of data. ACM, 2010, pp. 987-998.
-
(2010)
Proceedings of the 2010 International Conference on Management of Data. ACM
, pp. 987-998
-
-
Das, S.1
Sismanis, Y.2
Beyer, K.3
Gemulla, R.4
Haas, P.5
McPherson, J.6
-
24
-
-
84888857591
-
Rankreduce-processing k-nearest neighbor queries on top of mapreduce
-
A. Stupar, S. Michel, and R. Schenkel, "Rankreduce-processing k-nearest neighbor queries on top of mapreduce," in Proceedings of the 8th Workshop on Large-Scale Distributed Systems for Information Retrieval, 2010, pp. 13-18.
-
(2010)
Proceedings of the 8th Workshop on Large-Scale Distributed Systems for Information Retrieval
, pp. 13-18
-
-
Stupar, A.1
Michel, S.2
Schenkel, R.3
-
25
-
-
80052686089
-
Clustering very large multi-dimensional datasets with mapreduce
-
R. Ferreira Cordeiro, C. Traina Junior, A. Machado Traina, J. Lopez, U. Kang, and C. Faloutsos, "Clustering very large multi-dimensional datasets with mapreduce," in Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 2011, pp. 690-698.
-
(2011)
Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM
, pp. 690-698
-
-
Cordeiro, R.F.1
Junior, C.T.2
Traina, A.M.3
Lopez, J.4
Kang, U.5
Faloutsos, C.6
-
26
-
-
77954716614
-
Mapdupreducer: Detecting near duplicates over massive datasets
-
C. Wang, J. Wang, X. Lin, W. Wang, H. Wang, H. Li, W. Tian, J. Xu, and R. Li, "Mapdupreducer: detecting near duplicates over massive datasets," in Proceedings of the 2010 international conference on Management of data. ACM, 2010, pp. 1119-1122.
-
(2010)
Proceedings of the 2010 International Conference on Management of Data. ACM
, pp. 1119-1122
-
-
Wang, C.1
Wang, J.2
Lin, X.3
Wang, W.4
Wang, H.5
Li, H.6
Tian, W.7
Xu, J.8
Li, R.9
-
27
-
-
34547679939
-
Evaluating mapreduce for multi-core and multiprocessor systems
-
C. Ranger, R. Raghuraman, A. Penmetsa, G. Bradski, and C. Kozyrakis, "Evaluating mapreduce for multi-core and multiprocessor systems," in High Performance Computer Architecture, 2007. HPCA 2007. IEEE 13th International Symposium on. IEEE, 2007, pp. 13-24.
-
(2007)
High Performance Computer Architecture, 2007. HPCA 2007. IEEE 13th International Symposium On. IEEE
, pp. 13-24
-
-
Ranger, C.1
Raghuraman, R.2
Penmetsa, A.3
Bradski, G.4
Kozyrakis, C.5
-
28
-
-
63549097654
-
Mars: A mapreduce framework on graphics processors
-
B. He, W. Fang, Q. Luo, N. Govindaraju, and T. Wang, "Mars: a mapreduce framework on graphics processors," in Proceedings of the 17th international conference on Parallel architectures and compilation techniques. ACM, 2008, pp. 260-269.
-
(2008)
Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques. ACM
, pp. 260-269
-
-
He, B.1
Fang, W.2
Luo, Q.3
Govindaraju, N.4
Wang, T.5
-
29
-
-
35448944021
-
Map-reducemerge: Simplified relational data processing on large clusters
-
H. Yang, A. Dasdan, R. Hsiao, and D. Parker, "Map-reducemerge: simplified relational data processing on large clusters," in Proceedings of the 2007 ACM SIGMOD international conference on Management of data. ACM, 2007, pp. 1029-1040.
-
(2007)
Proceedings of the 2007 ACM SIGMOD International Conference on Management of Data. ACM
, pp. 1029-1040
-
-
Yang, H.1
Dasdan, A.2
Hsiao, R.3
Parker, D.4
-
30
-
-
79960904226
-
Map-Join-Reduce: Toward scalable and efficient data analysis on large clusters
-
D. Jiang, A. Tung, and G. Chen, "Map-Join-Reduce: Toward scalable and efficient data analysis on large clusters," Knowledge and Data Engineering, IEEE Transactions on, vol. 23, no. 9, pp. 1299-1311, 2011.
-
(2011)
Knowledge and Data Engineering, IEEE Transactions on
, vol.23
, Issue.9
, pp. 1299-1311
-
-
Jiang, D.1
Tung, A.2
Chen, G.3
-
31
-
-
84859260019
-
Mrshare: Sharing across multiple queries in mapreduce
-
T. Nykiel, M. Potamias, C. Mishra, G. Kollios, and N. Koudas, "Mrshare: Sharing across multiple queries in mapreduce," Proceedings of the VLDB Endowment, vol. 3, no. 1-2, pp. 494-505, 2010.
-
(2010)
Proceedings of the VLDB Endowment
, vol.3
, Issue.1-2
, pp. 494-505
-
-
Nykiel, T.1
Potamias, M.2
Mishra, C.3
Kollios, G.4
Koudas, N.5
-
32
-
-
84870823927
-
Samplingbased partitioning in mapreduce for skewed data
-
Y. Xu, P. Zou, W. Qu, Z. Li, K. Li, and X. Cui, "Samplingbased partitioning in mapreduce for skewed data," in China-Grid, 2012 Seventh ChinaGrid Annual Conference on. IEEE, 2012, pp. 1-8.
-
(2012)
China-Grid, 2012 Seventh ChinaGrid Annual Conference On. IEEE
, pp. 1-8
-
-
Xu, Y.1
Zou, P.2
Qu, W.3
Li, Z.4
Li, K.5
Cui, X.6
-
33
-
-
78650003594
-
Twister: A runtime for iterative mapreduce
-
J. Ekanayake, H. Li, B. Zhang, T. Gunarathne, S. Bae, J. Qiu, and G. Fox, "Twister: a runtime for iterative mapreduce," in Proceedings of the 19th ACM International Symposium on High Performance Distributed computing. ACM, 2010, pp. 810-818.
-
(2010)
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing. ACM
, pp. 810-818
-
-
Ekanayake, J.1
Li, H.2
Zhang, B.3
Gunarathne, T.4
Bae, S.5
Qiu, J.6
Fox, G.7
-
34
-
-
79956351190
-
Haloop: Efficient iterative data processing on large clusters
-
Y. Bu, B. Howe, M. Balazinska, and M. Ernst, "Haloop: Efficient iterative data processing on large clusters," Proceedings of the VLDB Endowment, vol. 3, no. 1-2, pp. 285-296, 2010.
-
(2010)
Proceedings of the VLDB Endowment
, vol.3
, Issue.1-2
, pp. 285-296
-
-
Bu, Y.1
Howe, B.2
Balazinska, M.3
Ernst, M.4
-
35
-
-
77954723629
-
Pregel: A system for largescale graph processing
-
G. Malewicz, M. Austern, A. Bik, J. Dehnert, I. Horn, N. Leiser, and G. Czajkowski, "Pregel: a system for largescale graph processing," in Proceedings of the 2010 international conference on Management of data. ACM, 2010, pp. 135-146.
-
(2010)
Proceedings of the 2010 International Conference on Management of Data. ACM
, pp. 135-146
-
-
Malewicz, G.1
Austern, M.2
Bik, A.3
Dehnert, J.4
Horn, I.5
Leiser, N.6
Czajkowski, G.7
-
36
-
-
85076771850
-
Mapreduce online
-
T. Condie, N. Conway, P. Alvaro, J. Hellerstein, K. Elmeleegy, and R. Sears, "Mapreduce online," in Proceedings of the 7th USENIX conference on Networked systems design and implementation, 2010, pp. 21-21.
-
(2010)
Proceedings of the 7th USENIX Conference on Networked Systems Design and Implementation
, pp. 21-21
-
-
Condie, T.1
Conway, N.2
Alvaro, P.3
Hellerstein, J.4
Elmeleegy, K.5
Sears, R.6
-
37
-
-
77954799580
-
Online aggregation and continuous query support in mapreduce
-
T. Condie, N. Conway, P. Alvaro, J. Hellerstein, J. Gerth, J. Talbot, K. Elmeleegy, and R. Sears, "Online aggregation and continuous query support in mapreduce," in ACM SIGMOD, 2010, pp. 1115-1118.
-
(2010)
ACM SIGMOD
, pp. 1115-1118
-
-
Condie, T.1
Conway, N.2
Alvaro, P.3
Hellerstein, J.4
Gerth, J.5
Talbot, J.6
Elmeleegy, K.7
Sears, R.8
-
38
-
-
81055143288
-
The performance of mapreduce: An in-depth study
-
D. Jiang, B. Ooi, L. Shi, and S. Wu, "The performance of mapreduce: An in-depth study," Proceedings of the VLDB Endowment, vol. 3, no. 1-2, pp. 472-483, 2010.
-
(2010)
Proceedings of the VLDB Endowment
, vol.3
, Issue.1-2
, pp. 472-483
-
-
Jiang, D.1
Ooi, B.2
Shi, L.3
Wu, S.4
-
39
-
-
77954744650
-
Efficient parallel setsimilarity joins using mapreduce
-
Citeseer
-
R. Vernica, M. Carey, and C. Li, "Efficient parallel setsimilarity joins using mapreduce," in SIGMOD conference. Citeseer, 2010, pp. 495-506.
-
(2010)
SIGMOD Conference
, pp. 495-506
-
-
Vernica, R.1
Carey, M.2
Li, C.3
-
40
-
-
84863510705
-
Efficient parallel knn joins for large data in mapreduce
-
C. Zhang, F. Li, and J. Jestes, "Efficient parallel knn joins for large data in mapreduce," in Proceedings of the 15th International Conference on Extending Database Technology. ACM, 2012, pp. 38-49.
-
(2012)
Proceedings of the 15th International Conference on Extending Database Technology. ACM
, pp. 38-49
-
-
Zhang, C.1
Li, F.2
Jestes, J.3
-
41
-
-
84883008817
-
Big data challenge in the management perspective
-
X. Zhou, J. Lu, C. Li, and X. Du, "Big data challenge in the management perspective," Communications of the CCF, vol. 8, pp. 16-20, 2012.
-
(2012)
Communications of the CCF
, vol.8
, pp. 16-20
-
-
Zhou, X.1
Lu, J.2
Li, C.3
Du, X.4
|