-
1
-
-
84924223547
-
-
Apache Software Foundation, “Hadoop”
-
Apache Software Foundation, “Hadoop”, http://hadoop.apache.org/core
-
-
-
-
2
-
-
79952149970
-
RanKloud: scalable multimedia data processing in server clusters
-
Candan, K., Kim, J.W., Nagarkar, P., Nagendra, M., Yu, R.: RanKloud: scalable multimedia data processing in server clusters. IEEE MultiMed. 18, 64–77 (2011)
-
(2011)
IEEE MultiMed.
, vol.18
, pp. 64-77
-
-
Candan, K.1
Kim, J.W.2
Nagarkar, P.3
Nagendra, M.4
Yu, R.5
-
3
-
-
85071319367
-
Bigtable: a distributed storage system for structured data
-
Chang, F., Dean, J., Ghemawat, S., Hsieh, W.C., Wallach, D.A., Burrows, M., Chandra, T., Fikes, A., Gruber, R.E.: Bigtable: a distributed storage system for structured data. In: 7th USENIX Symposium on Operating Systems Design and Implementation, pp. 205–218 (2006)
-
(2006)
7th USENIX Symposium on Operating Systems Design and Implementation
, pp. 205-218
-
-
Chang, F.1
Dean, J.2
Ghemawat, S.3
Hsieh, W.C.4
Wallach, D.A.5
Burrows, M.6
Chandra, T.7
Fikes, A.8
Gruber, R.E.9
-
4
-
-
37549003336
-
MapReduce: simplified data processing on large clusters
-
Dean, J., Ghemawat, S.: MapReduce: simplified data processing on large clusters. Commun. ACM 51, 107–113 (2008)
-
(2008)
Commun. ACM
, vol.51
, pp. 107-113
-
-
Dean, J.1
Ghemawat, S.2
-
5
-
-
79961138274
-
Delma: dynamically elastic mapreduce framework for cpu-intensive applications
-
Fadika, Z., Govindaraju, M.: Delma: dynamically elastic mapreduce framework for cpu-intensive applications. In: IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, pp. 454–463 (2011)
-
(2011)
IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing
, pp. 454-463
-
-
Fadika, Z.1
Govindaraju, M.2
-
6
-
-
21644437974
-
The google file system
-
Ghemawat, S., Gobioff, H., Leung, S.-T.: The google file system. In: ACM SIGOPS Operating Systems Review, ACM, pp. 29–43 (2003)
-
(2003)
ACM SIGOPS Operating Systems Review, ACM
, pp. 29-43
-
-
Ghemawat, S.1
Gobioff, H.2
Leung, S.-T.3
-
7
-
-
80051635126
-
Jumbo: Beyond MapReduce for Workload Balancing
-
Groot, S., Kitsuregawa, M.: Jumbo: Beyond MapReduce for Workload Balancing. In: VLDB PhD Workshop (2010)
-
(2010)
VLDB PhD Workshop
-
-
Groot, S.1
Kitsuregawa, M.2
-
8
-
-
84924221518
-
-
HBase
-
HBase, http://hadoop.apache.org/hbase/
-
-
-
-
9
-
-
0038564328
-
Burst tries: a fast, efficient data structure for string keys
-
Heinz, S., Zobel, J., Williams, H.: Burst tries: a fast, efficient data structure for string keys. Trans. Inf. Syst. (TOIS) 20(12), 192–223 (2002)
-
(2002)
Trans. Inf. Syst. (TOIS)
, vol.20
, Issue.12
, pp. 192-223
-
-
Heinz, S.1
Zobel, J.2
Williams, H.3
-
10
-
-
79961142851
-
Ex-mate: data intensive computing with large reduction objects and its application to graph mining
-
Jiang, W., Agrawal, G.: Ex-mate: data intensive computing with large reduction objects and its application to graph mining. In: 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), pp. 475–484 (2011)
-
(2011)
11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid)
, pp. 475-484
-
-
Jiang, W.1
Agrawal, G.2
-
12
-
-
77954901315
-
An analysis of traces from a production mapreduce cluster
-
Kavulya, S., Tan, J., Gandhi, R., Narasimhan, P.: An analysis of traces from a production mapreduce cluster. In: 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing (CCGrid), pp. 94–103 (2010)
-
(2010)
10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing (CCGrid)
, pp. 94-103
-
-
Kavulya, S.1
Tan, J.2
Gandhi, R.3
Narasimhan, P.4
-
13
-
-
27644464706
-
GridBLAST: a globus-based high-throughput implementation of BLAST in a Grid computing framework
-
Krishnan, A.: GridBLAST: a globus-based high-throughput implementation of BLAST in a Grid computing framework. Concurr. Comput. Pract. Exp. 17(13), 1607–1623 (2005)
-
(2005)
Concurr. Comput. Pract. Exp.
, vol.17
, Issue.13
, pp. 1607-1623
-
-
Krishnan, A.1
-
14
-
-
79961137349
-
Cloud mapreduce: a mapreduce implementation on top of a cloud operating system
-
Liu, H., Orban, D.: Cloud mapreduce: a mapreduce implementation on top of a cloud operating system. In: 11th IEEE/ACM International Symposium, pp. 464–474 (2011)
-
(2011)
11th IEEE/ACM International Symposium
, pp. 464-474
-
-
Liu, H.1
Orban, D.2
-
15
-
-
84857178178
-
Dynamic data redistribution for MapReduce joins
-
Lynden, S., Tanimura, Y., Kojima, I., Matono, A.: Dynamic data redistribution for MapReduce joins. In: IEEE International Conference on Cloud Computing Technology and Science, pp. 713–717 (2011)
-
(2011)
IEEE International Conference on Cloud Computing Technology and Science
, pp. 713-717
-
-
Lynden, S.1
Tanimura, Y.2
Kojima, I.3
Matono, A.4
-
16
-
-
84890571720
-
Programming abstractions for data intensive computing on clouds and grids
-
Matsunaga, A., Tsugawa, M., Fortes, J.: Programming abstractions for data intensive computing on clouds and grids. In: IEEE Fourth International Conference on eScience, pp. 489–493 (2008)
-
(2008)
IEEE Fourth International Conference on eScience
, pp. 489-493
-
-
Matsunaga, A.1
Tsugawa, M.2
Fortes, J.3
-
17
-
-
70349755440
-
Programming abstractions for data intensive computing on clouds and grids
-
Miceli, C., Miceli, M., Jha, S., Kaiser, H., Merzky, A.: Programming abstractions for data intensive computing on clouds and grids. In: IEEE/ACM International Symposium on Cluster Computing and the Grid, pp. 480–483 (2009)
-
(2009)
IEEE/ACM International Symposium on Cluster Computing and the Grid
, pp. 480-483
-
-
Miceli, C.1
Miceli, M.2
Jha, S.3
Kaiser, H.4
Merzky, A.5
-
18
-
-
84924221517
-
-
TeraByte Sort on Apache Hadoop
-
O’Malley, O.: TeraByte Sort on Apache Hadoop (2008)
-
-
-
-
19
-
-
77952774392
-
The model-summary problem and a solution for trees
-
Panda, B., Riedewald, M., Fink, D.: The model-summary problem and a solution for trees. In: International Conference on Data Engineering, pp. 452–455 (2010)
-
(2010)
International Conference on Data Engineering
, pp. 452-455
-
-
Panda, B.1
Riedewald, M.2
Fink, D.3
-
20
-
-
67149126890
-
Disco: distributed co-clustering with map-reduce: a case study towards petabyte-scale end-to-end mining
-
Papadimitriou, S., Sun, J.: Disco: distributed co-clustering with map-reduce: a case study towards petabyte-scale end-to-end mining. In: IEEE International Conference on Data Mining, p. 519 (2008)
-
(2008)
IEEE International Conference on Data Mining
, p. 519
-
-
Papadimitriou, S.1
Sun, J.2
-
21
-
-
77952577122
-
The Hadoop distributed filesystem: balancing portability and performance
-
Shafer, J., Rixner, S., Cox, A.L.: The Hadoop distributed filesystem: balancing portability and performance. In: IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), p. 123 (2010)
-
(2010)
IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)
, p. 123
-
-
Shafer, J.1
Rixner, S.2
Cox, A.L.3
-
22
-
-
84890563322
-
An improved partitioning mechanism for optimizing massive data analysis using MapReduce
-
Slagter, K., Hsu, C.-H., Chung, Y.-C., Zhang, D.: An improved partitioning mechanism for optimizing massive data analysis using MapReduce. J. Supercomput. 66(1), 539–555 (2013)
-
(2013)
J. Supercomput.
, vol.66
, Issue.1
, pp. 539-555
-
-
Slagter, K.1
Hsu, C.-H.2
Chung, Y.-C.3
Zhang, D.4
-
23
-
-
38449085073
-
Grid approach to embarrassingly parallel CPU-intensive bioinformatics problems
-
Stockinger, H., Pagni, M., Cerutti, L., Falquet, L.: Grid approach to embarrassingly parallel CPU-intensive bioinformatics problems. In: IEEE International Conference on e-Science and Grid Computing (2006)
-
(2006)
IEEE International Conference on e-Science and Grid Computing
-
-
Stockinger, H.1
Pagni, M.2
Cerutti, L.3
Falquet, L.4
-
24
-
-
84904441123
-
Mochi: visual log-analysis based tools for debugging Hadoop
-
Tan, J., Pan, X., Kavulya, S., Gandhi, R., Narasimhan, P.: Mochi: visual log-analysis based tools for debugging Hadoop. In: USENIX Workshop on Hot Topics in Cloud Computing (HotCloud) (2009)
-
(2009)
USENIX Workshop on Hot Topics in Cloud Computing (HotCloud)
-
-
Tan, J.1
Pan, X.2
Kavulya, S.3
Gandhi, R.4
Narasimhan, P.5
-
25
-
-
78049340525
-
Moving text analysis tools to the cloud
-
Vashishtha, H., Smit, M., Stroulia, E.: Moving text analysis tools to the cloud. In: IEEE World Congress on Services, pp. 110–112 (2010)
-
(2010)
IEEE World Congress on Services
, pp. 110-112
-
-
Vashishtha, H.1
Smit, M.2
Stroulia, E.3
-
26
-
-
77949580645
-
Scaling genetic algorithms using mapreduce
-
Verma, A., Llora, X., Goldberg, D.E., Campbell, R.H.: Scaling genetic algorithms using mapreduce. In: Intelligent Systems Design and Applications (2009)
-
(2009)
Intelligent Systems Design and Applications
-
-
Verma, A.1
Llora, X.2
Goldberg, D.E.3
Campbell, R.H.4
-
27
-
-
85016706063
-
Hadoop: The Definitive Guide, 2nd edition
-
White, T.: Hadoop: The Definitive Guide, 2nd edition. O’Reilly (2010)
-
(2010)
O’Reilly
-
-
White, T.1
-
28
-
-
72249121870
-
Detecting large-scale system problems by mining console logs
-
Xu, W., Huang, L., Fox, A., Patterson, D., Jordan, M.I.: Detecting large-scale system problems by mining console logs. In: Proceedings of the ACM SIGOPS 22nd Symposium on Operating Systems Principles (2009)
-
(2009)
Proceedings of the ACM SIGOPS 22nd Symposium on Operating Systems Principles
-
-
Xu, W.1
Huang, L.2
Fox, A.3
Patterson, D.4
Jordan, M.I.5