-
1
-
-
78649359388
-
Big data: Science in the petabyte era
-
(2008) Big data: Science in the petabyte era. Nature 455 (7209): 1.
-
(2008)
Nature
, vol.455
, Issue.7209
, pp. 1
-
-
-
3
-
-
84874615486
-
Big data processing in cloud computing environments
-
2012 12th International Symposium on IEEE
-
Ji, C., Li, Y., Qiu, W., Awada, U., and Li, K. (2012) Big data processing in cloud computing environments. Pervasive Systems, Algorithms and Networks (ISPAN), 2012 12th International Symposium on, pp. 17-23, IEEE.
-
(2012)
Pervasive Systems, Algorithms and Networks (ISPAN)
, pp. 17-23
-
-
Ji, C.1
Li, Y.2
Qiu, W.3
Awada, U.4
Li, K.5
-
4
-
-
77954696736
-
An evaluation of alternative architectures for transaction processing in the cloud
-
ACM
-
Kossmann, D., Kraska, T., and Loesing, S. (2010) An evaluation of alternative architectures for transaction processing in the cloud. Proceedings of the 2010 international conference on Management of data, pp. 579-590, ACM.
-
(2010)
Proceedings of the 2010 International Conference on Management of Data
, pp. 579-590
-
-
Kossmann, D.1
Kraska, T.2
Loesing, S.3
-
5
-
-
84876553525
-
Big data platforms as a service: Challenges and approach
-
USENIX Association
-
Horey, J., Begoli, E., Gunasekaran, R., Lim, S., and Nutaro, J. (2012) Big data platforms as a service: Challenges and approach. Proceedings of the 4th USENIX conference on Hot Topics in Cloud Computing, pp. 16-16, USENIX Association.
-
(2012)
Proceedings of the 4th USENIX Conference on Hot Topics in Cloud Computing
, pp. 16-16
-
-
Horey, J.1
Begoli, E.2
Gunasekaran, R.3
Lim, S.4
Nutaro, J.5
-
6
-
-
21644437974
-
The google file system
-
ACM
-
Ghemawat, S., Gobioff, H., and Leung, S. (2003) The google file system. ACM SIGOPS Operating Systems Review, vol. 37, pp. 29-43, ACM.
-
(2003)
ACM SIGOPS Operating Systems Review
, vol.37
, pp. 29-43
-
-
Ghemawat, S.1
Gobioff, H.2
Leung, S.3
-
7
-
-
37549003336
-
Mapreduce: Simplified data processing on large clusters
-
Dean, J. and Ghemawat, S. (2008) Mapreduce: Simplified data processing on large clusters. Communications of the ACM, 51, 107-113.
-
(2008)
Communications of the ACM
, vol.51
, pp. 107-113
-
-
Dean, J.1
Ghemawat, S.2
-
8
-
-
70450136675
-
The hadoop distributed file system: Architecture and design
-
Borthakur, D. (2007) The hadoop distributed file system: Architecture and design. Hadoop Project Website, 11.
-
(2007)
Hadoop Project Website
, vol.11
-
-
Borthakur, D.1
-
10
-
-
80053256327
-
A survey of large scale data management approaches in cloud environments
-
Sakr, S., Liu, A., Batista, D., and Alomari, M. (2011) A survey of large scale data management approaches in cloud environments. Communications Surveys & Tutorials, IEEE, 13, 311-336.
-
(2011)
Communications Surveys & Tutorials IEEE
, vol.13
, pp. 311-336
-
-
Sakr, S.1
Liu, A.2
Batista, D.3
Alomari, M.4
-
11
-
-
79957815245
-
Es2: A cloud data storage system for supporting both oltp and olap
-
IEEE
-
Cao, Y., Chen, C., Guo, F., Jiang, D., Lin, Y., Ooi, B., Vo, H., Wu, S., and Xu, Q. (2011) Es2: A cloud data storage system for supporting both oltp and olap. Data Engineering (ICDE), 2011 IEEE 27th International Conference on, pp. 291-302, IEEE.
-
(2011)
Data Engineering (ICDE), 2011 IEEE 27th International Conference on
, pp. 291-302
-
-
Cao, Y.1
Chen, C.2
Guo, F.3
Jiang, D.4
Lin, Y.5
Ooi, B.6
Vo, H.7
Wu, S.8
Xu, Q.9
-
12
-
-
81355137361
-
-
Chang, F., Dean, J., Ghemawat, S., Hsieh, W., Wallach, D., Burrows, M., Chandra, T., Fikes, A., and Gruber, R. (2006) Bigtable: A distributed structured data storage system. 7th OSDI, pp. 305-314.
-
(2006)
Bigtable: A Distributed Structured Data Storage System. 7th OSDI
, pp. 305-314
-
-
Chang, F.1
Dean, J.2
Ghemawat, S.3
Hsieh, W.4
Wallach, D.5
Burrows, M.6
Chandra, T.7
Fikes, A.8
Gruber, R.9
-
13
-
-
84867112010
-
Pnuts: Yahoo!'s hosted data serving platform
-
Cooper, B., Ramakrishnan, R., Srivastava, U., Silberstein, A., Bohannon, P., Jacobsen, H., Puz, N., Weaver, D., and Yerneni, R. (2008) Pnuts: Yahoo!'s hosted data serving platform. Proceedings of the VLDB Endowment, 1, 1277-1288.
-
(2008)
Proceedings of the VLDB Endowment
, vol.1
, pp. 1277-1288
-
-
Cooper, B.1
Ramakrishnan, R.2
Srivastava, U.3
Silberstein, A.4
Bohannon, P.5
Jacobsen, H.6
Puz, N.7
Weaver, D.8
Yerneni, R.9
-
14
-
-
41149092147
-
Dynamo: Amazon's highly available key-value store
-
DOI 10.1145/1294261.1294281, SOSP'07: Proceedings of the 21st ACM Symposium on Operating Systems Principles
-
DeCandia, G., Hastorun, D., Jampani, M., Kakulapati, G., Lakshman, A., Pilchin, A., Sivasubramanian, S., Vosshall, P., and Vogels, W. (2007) Dynamo: Amazon's highly available key-value store. ACM SIGOPS Operating Systems Review, vol. 41, pp. 205-220, ACM. (Pubitemid 351429377)
-
(2007)
Operating Systems Review (ACM)
, pp. 205-220
-
-
DeCandia, G.1
Hastorun, D.2
Jampani, M.3
Kakulapati, G.4
Lakshman, A.5
Pilchin, A.6
Sivasubramanian, S.7
Vosshall, P.8
Vogels, W.9
-
15
-
-
79959945877
-
Llama: Leveraging columnar storage for scalable join processing in the mapreduce framework
-
ACM
-
Lin, Y., Agrawal, D., Chen, C., Ooi, B., and Wu, S. (2011) Llama: Leveraging columnar storage for scalable join processing in the mapreduce framework. Proceedings of the 2011 international conference on Management of data, pp. 961-972, ACM.
-
(2011)
Proceedings of the 2011 International Conference on Management of Data
, pp. 961-972
-
-
Lin, Y.1
Agrawal, D.2
Chen, C.3
Ooi, B.4
Wu, S.5
-
16
-
-
70349750047
-
The eucalyptus open-source cloud-computing system
-
2009 CCGRID'09. 9th IEEE/ACM International Symposium on IEEE
-
Nurmi, D., Wolski, R., Grzegorczyk, C., Obertelli, G., Soman, S., Youseff, L., and Zagorodnov, D. (2009) The eucalyptus open-source cloud-computing system. Cluster Computing and the Grid, 2009. CCGRID'09. 9th IEEE/ACM International Symposium on, pp. 124-131, IEEE.
-
(2009)
Cluster Computing and the Grid
, pp. 124-131
-
-
Nurmi, D.1
Wolski, R.2
Grzegorczyk, C.3
Obertelli, G.4
Soman, S.5
Youseff, L.6
Zagorodnov, D.7
-
17
-
-
79952437126
-
A comparison and critique of eucalyptus, open-nebula and nimbus
-
2010 IEEE Second International Conference on, IEEE
-
Sempolinski, P. and Thain, D. (2010) A comparison and critique of eucalyptus, open-nebula and nimbus. Cloud Computing Technology and Science (CloudCom), 2010 IEEE Second International Conference on, pp. 417-426, IEEE.
-
(2010)
Cloud Computing Technology and Science (CloudCom)
, pp. 417-426
-
-
Sempolinski, P.1
Thain, D.2
-
19
-
-
73649141347
-
Mapreduce and parallel dbmss: Friends or foes
-
Stonebraker, M., Abadi, D., DeWitt, D., Madden, S., Paulson, E., Pavlo, A., and Rasin, A. (2010) Mapreduce and parallel dbmss: friends or foes. Communications of the ACM, 53, 64-71.
-
(2010)
Communications of the ACM
, vol.53
, pp. 64-71
-
-
Stonebraker, M.1
Abadi, D.2
DeWitt, D.3
Madden, S.4
Paulson, E.5
Pavlo, A.6
Rasin, A.7
-
20
-
-
79957809015
-
Hadoopdb: An architectural hybrid of mapreduce and dbms technologies for analytical workloads
-
Abouzeid, A., Bajda-Pawlikowski, K., Abadi, D., Silberschatz, A., and Rasin, A. (2009) Hadoopdb: An architectural hybrid of mapreduce and dbms technologies for analytical workloads. Proceedings of the VLDB Endowment, 2, 922-933.
-
(2009)
Proceedings of the VLDB Endowment
, vol.2
, pp. 922-933
-
-
Abouzeid, A.1
Bajda-Pawlikowski, K.2
Abadi, D.3
Silberschatz, A.4
Rasin, A.5
-
21
-
-
77954696367
-
Integrating hadoop and parallel dbms
-
ACM
-
Xu, Y., Kostamaa, P., and Gao, L. (2010) Integrating hadoop and parallel dbms. Proceedings of the 2010 international conference on Management of data, pp. 969-974, ACM.
-
(2010)
Proceedings of the 2010 International Conference on Management of Data
, pp. 969-974
-
-
Xu, Y.1
Kostamaa, P.2
Gao, L.3
-
22
-
-
80053521271
-
Hadoop++: Making a yellow elephant run like a cheetah (without it even noticing)
-
Dittrich, J., Quiané-Ruiz, J., Jindal, A., Kargin, Y., Setty, V., and Schad, J. (2010) Hadoop++: Making a yellow elephant run like a cheetah (without it even noticing). Proceedings of the VLDB Endowment, 3, 515-529.
-
(2010)
Proceedings of the VLDB Endowment
, vol.3
, pp. 515-529
-
-
Dittrich, J.1
Quiané-Ruiz, J.2
Jindal, A.3
Kargin, Y.4
Setty, V.5
Schad, J.6
-
23
-
-
77954751910
-
Ricardo: Integrating r and hadoop
-
ACM
-
Das, S., Sismanis, Y., Beyer, K., Gemulla, R., Haas, P., and McPherson, J. (2010) Ricardo: Integrating r and hadoop. Proceedings of the 2010 international conference on Management of data, pp. 987-998, ACM.
-
(2010)
Proceedings of the 2010 International Conference on Management of Data
, pp. 987-998
-
-
Das, S.1
Sismanis, Y.2
Beyer, K.3
Gemulla, R.4
Haas, P.5
McPherson, J.6
-
24
-
-
84888857591
-
Rankreduce - processing k-nearest neighbor queries on top of mapreduce
-
Stupar, A., Michel, S., and Schenkel, R. (2010) Rankreduce - processing k-nearest neighbor queries on top of mapreduce. Proceedings of the 8th Workshop on Large-Scale Distributed Systems for Information Retrieval, pp. 13-18.
-
(2010)
Proceedings of the 8th Workshop on Large-Scale Distributed Systems for Information Retrieval
, pp. 13-18
-
-
Stupar, A.1
Michel, S.2
Schenkel, R.3
-
25
-
-
80052686089
-
Clustering very large multi-dimensional datasets with mapreduce
-
ACM
-
Ferreira Cordeiro, R., Traina Junior, C., Machado Traina, A., López, J., Kang, U., and Faloutsos, C. (2011) Clustering very large multi-dimensional datasets with mapreduce. Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 690-698, ACM.
-
(2011)
Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
, pp. 690-698
-
-
Ferreira Cordeiro, R.1
Traina Junior, C.2
Machado Traina, A.3
López, J.4
Kang, U.5
Faloutsos, C.6
-
26
-
-
77954716614
-
Mapdupreducer: Detecting near duplicates over massive datasets
-
ACM
-
Wang, C., Wang, J., Lin, X., Wang, W., Wang, H., Li, H., Tian, W., Xu, J., and Li, R. (2010) Mapdupreducer: Detecting near duplicates over massive datasets. Proceedings of the 2010 international conference on Management of data, pp. 1119-1122, ACM.
-
(2010)
Proceedings of the 2010 International Conference on Management of Data
, pp. 1119-1122
-
-
Wang, C.1
Wang, J.2
Lin, X.3
Wang, W.4
Wang, H.5
Li, H.6
Tian, W.7
Xu, J.8
Li, R.9
-
27
-
-
34547679939
-
Evaluating MapReduce for multi-core and multiprocessor systems
-
DOI 10.1109/HPCA.2007.346181, 4147644, 2007 IEEE 13th Annual International Symposium on High Performance Computer Architecture, HPCA-13
-
Ranger, C., Raghuraman, R., Penmetsa, A., Bradski, G., and Kozyrakis, C. (2007) Evaluating mapreduce for multi-core and multiprocessor systems. High Performance Computer Architecture, 2007. HPCA 2007. IEEE 13th International Symposium on, pp. 13-24, IEEE. (Pubitemid 47208148)
-
(2007)
Proceedings - International Symposium on High-Performance Computer Architecture
, pp. 13-24
-
-
Ranger, C.1
Raghuraman, R.2
Penmetsa, A.3
Bradski, G.4
Kozyrakis, C.5
-
28
-
-
80052694199
-
Behavioral simulations in mapreduce
-
Wang, G., Salles, M., Sowell, B., Wang, X., Cao, T., Demers, A., Gehrke, J., and White, W. (2010) Behavioral simulations in mapreduce. Proceedings of the VLDB Endowment, 3, 952-963.
-
(2010)
Proceedings of the VLDB Endowment
, vol.3
, pp. 952-963
-
-
Wang, G.1
Salles, M.2
Sowell, B.3
Wang, X.4
Cao, T.5
Demers, A.6
Gehrke, J.7
White, W.8
-
29
-
-
63549097654
-
Mars: A mapreduce framework on graphics processors
-
ACM
-
He, B., Fang, W., Luo, Q., Govindaraju, N., and Wang, T. (2008) Mars: A mapreduce framework on graphics processors. Proceedings of the 17th international conference on Parallel architectures and compilation techniques, pp. 260-269, ACM.
-
(2008)
Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques
, pp. 260-269
-
-
He, B.1
Fang, W.2
Luo, Q.3
Govindaraju, N.4
Wang, T.5
-
30
-
-
55349148888
-
Pig latin: A not-so-foreign language for data processing
-
SIGMOD international conference on Management of data, ACM
-
Olston, C., Reed, B., Srivastava, U., Kumar, R., and Tomkins, A. (2008) Pig latin: A not-so-foreign language for data processing. Proceedings of the 2008 ACM SIGMOD international conference on Management of data, pp. 1099-1110, ACM.
-
(2008)
Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data
, pp. 1099-1110
-
-
Olston, C.1
Reed, B.2
Srivastava, U.3
Kumar, R.4
Tomkins, A.5
-
31
-
-
84868325513
-
Hive: A warehousing solution over a map-reduce framework
-
Thusoo, A., Sarma, J., Jain, N., Shao, Z., Chakka, P., Anthony, S., Liu, H., Wyckoff, P., and Murthy, R. (2009) Hive: A warehousing solution over a map-reduce framework. Proceedings of the VLDB Endowment, 2, 1626-1629.
-
(2009)
Proceedings of the VLDB Endowment
, vol.2
, pp. 1626-1629
-
-
Thusoo, A.1
Sarma, J.2
Jain, N.3
Shao, Z.4
Chakka, P.5
Anthony, S.6
Liu, H.7
Wyckoff, P.8
Murthy, R.9
-
32
-
-
85076882757
-
Dryadlinq: A system for general-purpose distributed data-parallel computing using a high-level language
-
Yu, Y., Isard, M., Fetterly, D., Budiu, M., Erlingsson, Ú., Gunda, P., and Currey, J. (2008) Dryadlinq: A system for general-purpose distributed data-parallel computing using a high-level language. Proceedings of the 8th USENIX conference on Operating systems design and implementation, pp. 1-14.
-
(2008)
Proceedings of the 8th USENIX Conference on Operating Systems Design and Implementation
, pp. 1-14
-
-
Yu, Y.1
Isard, M.2
Fetterly, D.3
Budiu, M.4
Erlingsson, Ú.5
Gunda, P.6
Currey, J.7
-
34
-
-
79958258284
-
Dremel: Interactive analysis of web-scale datasets
-
Melnik, S., Gubarev, A., Long, J., Romer, G., Shivakumar, S., Tolton, M., and Vassilakis, T. (2010) Dremel: Interactive analysis of web-scale datasets. Proceedings of the VLDB Endowment, 3, 330-339.
-
(2010)
Proceedings of the VLDB Endowment
, vol.3
, pp. 330-339
-
-
Melnik, S.1
Gubarev, A.2
Long, J.3
Romer, G.4
Shivakumar, S.5
Tolton, M.6
Vassilakis, T.7
-
35
-
-
84880548490
-
Spanner: Google's globally-distributed database
-
Corbett, J., et al. (2012) Spanner: Google's globally-distributed database. To appear in Proceedings of OSDI, p. 1.
-
(2012)
To Appear in Proceedings of OSDI
, pp. 1
-
-
Corbett, J.1
-
36
-
-
85085251984
-
Spark: Cluster computing with working sets
-
USENIX Association
-
Zaharia, M., Chowdhury, M., Franklin, M., Shenker, S., and Stoica, I. (2010) Spark: Cluster computing with working sets. Proceedings of the 2nd USENIX conference on Hot topics in cloud computing, pp. 10-10, USENIX Association.
-
(2010)
Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing
, pp. 10-10
-
-
Zaharia, M.1
Chowdhury, M.2
Franklin, M.3
Shenker, S.4
Stoica, I.5
-
37
-
-
79951736167
-
S4: Distributed stream computing platform
-
IEEE
-
Neumeyer, L., Robbins, B., Nair, A., and Kesari, A. (2010) S4: Distributed stream computing platform. Data Mining Workshops (ICDMW), 2010 IEEE International Conference on, pp. 170-177, IEEE.
-
(2010)
Data Mining Workshops (ICDMW), 2010 IEEE International Conference on
, pp. 170-177
-
-
Neumeyer, L.1
Robbins, B.2
Nair, A.3
Kesari, A.4
-
38
-
-
84871223446
-
Object-based image retrieval with kernel on adjacency matrix and local combined features
-
Qi, H., Li, K., Shen, Y., and Qu, W. (2012) Object-based image retrieval with kernel on adjacency matrix and local combined features. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP), 8, 54.
-
(2012)
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
, vol.8
, pp. 54
-
-
Qi, H.1
Li, K.2
Shen, Y.3
Qu, W.4
-
39
-
-
84863162233
-
Parallel data processing with mapreduce: A survey
-
Lee, K., Lee, Y., Choi, H., Chung, Y., and Moon, B. (2012) Parallel data processing with mapreduce: A survey. ACM SIGMOD Record, 40, 11-20.
-
(2012)
ACM SIGMOD Record
, vol.40
, pp. 11-20
-
-
Lee, K.1
Lee, Y.2
Choi, H.3
Chung, Y.4
Moon, B.5
-
40
-
-
35448944021
-
Map-reduce-merge: Simplified relational data processing on large clusters
-
DOI 10.1145/1247480.1247602, SIGMOD 2007: Proceedings of the ACM SIGMOD International Conference on Management of Data
-
Yang, H., Dasdan, A., Hsiao, R., and Parker, D. (2007) Map-reduce-merge: Simplified relational data processing on large clusters. Proceedings of the 2007 ACM SIGMOD international conference on Management of data, pp. 1029-1040, ACM. (Pubitemid 47630879)
-
(2007)
Proceedings of the ACM SIGMOD International Conference on Management of Data
, pp. 1029-1040
-
-
Yang, H.-C.1
Dasdan, A.2
Hsiao, R.-L.3
Parker, D.S.4
-
41
-
-
79960904226
-
Map-Join-Reduce: Toward scalable and efficient data analysis on large clusters
-
Jiang, D., Tung, A., and Chen, G. (2011) Map-Join-Reduce: Toward scalable and efficient data analysis on large clusters. Knowledge and Data Engineering, IEEE Transactions on, 23, 1299-1311.
-
(2011)
Knowledge and Data Engineering, IEEE Transactions on
, vol.23
, pp. 1299-1311
-
-
Jiang, D.1
Tung, A.2
Chen, G.3
-
42
-
-
84859260019
-
Mrshare: Sharing across multiple queries in mapreduce
-
Nykiel, T., Potamias, M., Mishra, C., Kollios, G., and Koudas, N. (2010) Mrshare: Sharing across multiple queries in mapreduce. Proceedings of the VLDB Endowment, 3, 494-505.
-
(2010)
Proceedings of the VLDB Endowment
, vol.3
, pp. 494-505
-
-
Nykiel, T.1
Potamias, M.2
Mishra, C.3
Kollios, G.4
Koudas, N.5
-
43
-
-
34547151535
-
Clustering billions of images with large scale nearest neighbor search
-
2007. WACV'07. IEEE Workshop on IEEE
-
Liu, T., Rosenberg, C., and Rowley, H. (2007) Clustering billions of images with large scale nearest neighbor search. Applications of Computer Vision, 2007. WACV'07. IEEE Workshop on, pp. 28-28, IEEE.
-
(2007)
Applications of Computer Vision
, pp. 28-28
-
-
Liu, T.1
Rosenberg, C.2
Rowley, H.3
-
44
-
-
79952364812
-
Voronoi-based geospatial query processing with mapreduce
-
IEEE
-
Akdogan, A., Demiryurek, U., Banaei-Kashani, F., and Shahabi, C. (2010) Voronoi-based geospatial query processing with mapreduce. Cloud Computing Technology and Science (CloudCom), 2010 IEEE Second International Conference on, pp. 9-16, IEEE.
-
(2010)
Cloud Computing Technology and Science (CloudCom), 2010 IEEE Second International Conference on
, pp. 9-16
-
-
Akdogan, A.1
Demiryurek, U.2
Banaei-Kashani, F.3
Shahabi, C.4
-
45
-
-
79961053764
-
Rapid parallel genome indexing with mapreduce
-
ACM
-
Menon, R., Bhat, G., and Schatz, M. (2011) Rapid parallel genome indexing with mapreduce. Proceedings of the second international workshop on MapReduce and its applications, pp. 51-58, ACM.
-
(2011)
Proceedings of the Second International Workshop on MapReduce and Its Applications
, pp. 51-58
-
-
Menon, R.1
Bhat, G.2
Schatz, M.3
-
47
-
-
84870834119
-
Inverted grid-based knn query processing with mapreduce
-
IEEE
-
Ji, C., Dong, T., Li, Y., Shen, Y., Li, K., Qiu, W., Qu, W., and Guo, M. (2012) Inverted grid-based knn query processing with mapreduce. ChinaGrid, 2012 Seventh ChinaGrid Annual Conference on, pp. 25-33, IEEE.
-
(2012)
ChinaGrid, 2012 Seventh ChinaGrid Annual Conference on
, pp. 25-33
-
-
Ji, C.1
Dong, T.2
Li, Y.3
Shen, Y.4
Li, K.5
Qiu, W.6
Qu, W.7
Guo, M.8
-
48
-
-
77954746347
-
Indexing multi-dimensional data in a cloud system
-
ACM
-
Wang, J., Wu, S., Gao, H., Li, J., and Ooi, B. (2010) Indexing multi-dimensional data in a cloud system. Proceedings of the 2010 international conference on Management of data, pp. 591-602, ACM.
-
(2010)
Proceedings of the 2010 International Conference on Management of Data
, pp. 591-602
-
-
Wang, J.1
Wu, S.2
Gao, H.3
Li, J.4
Ooi, B.5
-
49
-
-
78650003594
-
Twister: A runtime for iterative mapreduce
-
ACM
-
Ekanayake, J., Li, H., Zhang, B., Gunarathne, T., Bae, S., Qiu, J., and Fox, G. (2010) Twister: A runtime for iterative mapreduce. Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, pp. 810-818, ACM.
-
(2010)
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
, pp. 810-818
-
-
Ekanayake, J.1
Li, H.2
Zhang, B.3
Gunarathne, T.4
Bae, S.5
Qiu, J.6
Fox, G.7
-
50
-
-
79956351190
-
Haloop: Efficient iterative data processing on large clusters
-
Bu, Y., Howe, B., Balazinska, M., and Ernst, M. (2010) Haloop: Efficient iterative data processing on large clusters. Proceedings of the VLDB Endowment, 3, 285-296.
-
(2010)
Proceedings of the VLDB Endowment
, vol.3
, pp. 285-296
-
-
Bu, Y.1
Howe, B.2
Balazinska, M.3
Ernst, M.4
-
51
-
-
77954723629
-
Pregel: A system for large-scale graph processing
-
ACM
-
Malewicz, G., Austern, M., Bik, A., Dehnert, J., Horn, I., Leiser, N., and Czajkowski, G. (2010) Pregel: A system for large-scale graph processing. Proceedings of the 2010 international conference on Management of data, pp. 135-146, ACM.
-
(2010)
Proceedings of the 2010 International Conference on Management of Data
, pp. 135-146
-
-
Malewicz, G.1
Austern, M.2
Bik, A.3
Dehnert, J.4
Horn, I.5
Leiser, N.6
Czajkowski, G.7
-
52
-
-
85076771850
-
Mapreduce online
-
Condie, T., Conway, N., Alvaro, P., Hellerstein, J., Elmeleegy, K., and Sears, R. (2010) Mapreduce online. Proceedings of the 7th USENIX conference on Networked systems design and implementation, pp. 21-21.
-
(2010)
Proceedings of the 7th USENIX Conference on Networked Systems Design and Implementation
, pp. 21-21
-
-
Condie, T.1
Conway, N.2
Alvaro, P.3
Hellerstein, J.4
Elmeleegy, K.5
Sears, R.6
-
53
-
-
77954799580
-
Online aggregation and continuous query support in mapreduce
-
Condie, T., Conway, N., Alvaro, P., Hellerstein, J., Gerth, J., Talbot, J., Elmeleegy, K., and Sears, R. (2010) Online aggregation and continuous query support in mapreduce. ACM SIGMOD, pp. 1115-1118.
-
(2010)
ACM SIGMOD
, pp. 1115-1118
-
-
Condie, T.1
Conway, N.2
Alvaro, P.3
Hellerstein, J.4
Gerth, J.5
Talbot, J.6
Elmeleegy, K.7
Sears, R.8
-
54
-
-
81055143288
-
The performance of mapreduce: An in-depth study
-
Jiang, D., Ooi, B., Shi, L., and Wu, S. (2010) The performance of mapreduce: An in-depth study. Proceedings of the VLDB Endowment, 3, 472-483.
-
(2010)
Proceedings of the VLDB Endowment
, vol.3
, pp. 472-483
-
-
Jiang, D.1
Ooi, B.2
Shi, L.3
Wu, S.4
-
55
-
-
77954744650
-
Efficient parallel set-similarity joins using mapreduce
-
Citeseer
-
Vernica, R., Carey, M., and Li, C. (2010) Efficient parallel set-similarity joins using mapreduce. SIGMOD conference, pp. 495-506, Citeseer.
-
(2010)
SIGMOD Conference
, pp. 495-506
-
-
Vernica, R.1
Carey, M.2
Li, C.3
-
56
-
-
84863510705
-
Efficient parallel knn joins for large data in mapreduce
-
ACM
-
Zhang, C., Li, F., and Jestes, J. (2012) Efficient parallel knn joins for large data in mapreduce. Proceedings of the 15th International Conference on Extending Database Technology, pp. 38-49, ACM.
-
(2012)
Proceedings of the 15th International Conference on Extending Database Technology
, pp. 38-49
-
-
Zhang, C.1
Li, F.2
Jestes, J.3
-
57
-
-
79952402115
-
Leen: Locality/fairness-aware key partitioning for mapreduce in the cloud
-
IEEE
-
Ibrahim, S., Jin, H., Lu, L., Wu, S., He, B., and Qi, L. (2010) Leen: Locality/fairness-aware key partitioning for mapreduce in the cloud. Cloud Computing Technology and Science (CloudCom), 2010 IEEE Second International Conference on, pp. 17-24, IEEE.
-
(2010)
Cloud Computing Technology and Science (CloudCom), 2010 IEEE Second International Conference on
, pp. 17-24
-
-
Ibrahim, S.1
Jin, H.2
Lu, L.3
Wu, S.4
He, B.5
Qi, L.6
-
58
-
-
84864208878
-
Load balancing in mapreduce based on scalable cardinality estimates
-
IEEE
-
Gufler, B., Augsten, N., Reiser, A., and Kemper, A. (2012) Load balancing in mapreduce based on scalable cardinality estimates. Data Engineering (ICDE), 2012 IEEE 28th International Conference on, pp. 522-533, IEEE.
-
(2012)
Data Engineering (ICDE), 2012 IEEE 28th International Conference on
, pp. 522-533
-
-
Gufler, B.1
Augsten, N.2
Reiser, A.3
Kemper, A.4
-
59
-
-
84870823927
-
Sampling-based partitioning in mapreduce for skewed data
-
IEEE
-
Xu, Y., Zou, P., Qu, W., Li, Z., Li, K., and Cui, X. (2012) Sampling-based partitioning in mapreduce for skewed data. ChinaGrid, 2012 Seventh ChinaGrid Annual Conference on, pp. 1-8, IEEE.
-
(2012)
ChinaGrid, 2012 Seventh ChinaGrid Annual Conference on
, pp. 1-8
-
-
Xu, Y.1
Zou, P.2
Qu, W.3
Li, Z.4
Li, K.5
Cui, X.6
-
60
-
-
84864645824
-
Delay tails in mapreduce scheduling
-
ACM
-
Tan, J., Meng, X., and Zhang, L. (2012) Delay tails in mapreduce scheduling. Proceedings of the 12th ACM SIGMETRICS/PERFORMANCE joint international conference on Measurement and Modeling of Computer Systems, pp. 5-16, ACM.
-
(2012)
Proceedings of the 12th ACM SIGMETRICS/PERFORMANCE Joint International Conference on Measurement and Modeling of Computer Systems
, pp. 5-16
-
-
Tan, J.1
Meng, X.2
Zhang, L.3
-
61
-
-
79956299894
-
Flex: A slot allocation scheduling optimizer for mapreduce workloads
-
Wolf, J., Rajan, D., Hildrum, K., Khandekar, R., Kumar, V., Parekh, S., Wu, K., and Balmin, A. (2010) Flex: A slot allocation scheduling optimizer for mapreduce workloads. Middleware 2010, pp. 1-20.
-
(2010)
Middleware 2010
, pp. 1-20
-
-
Wolf, J.1
Rajan, D.2
Hildrum, K.3
Khandekar, R.4
Kumar, V.5
Parekh, S.6
Wu, K.7
Balmin, A.8
-
63
-
-
79959922374
-
Schedule optimization for data processing flows on the cloud
-
Kllapi, H., Sitaridi, E., Tsangaris, M., and Ioannidis, Y. (2011) Schedule optimization for data processing flows on the cloud. Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 289-300.
-
(2011)
Proceedings of the ACM SIGMOD International Conference on Management of Data
, pp. 289-300
-
-
Kllapi, H.1
Sitaridi, E.2
Tsangaris, M.3
Ioannidis, Y.4
-
64
-
-
84883008817
-
Big data challenge in the management perspective
-
Zhou, X., Lu, J., Li, C., and Du, X. (2012) Big data challenge in the management perspective. Communications of the CCF, 8, 16-20.
-
(2012)
Communications of the CCF
, vol.8
, pp. 16-20
-
-
Zhou, X.1
Lu, J.2
Li, C.3
Du, X.4