SCOPUS 정보 검색 플랫폼

International Journal of Parallel Programming

Volumn 43, Issue 3, 2015, Pages 489-507

An Adaptive and Memory Efficient Sampling Mechanism for Partitioning in MapReduce

(3) Slagter, Kenn a Hsu, Ching Hsien b Chung, Yeh Ching a

a NATIONAL TSING HUA UNIVERSITY (Taiwan)

b CHUNG HUA UNIVERSITY (Taiwan)

Author keywords

Cloud computing; Hadoop; Load balance; MapReduce; Partitioning; Sampling

Indexed keywords

BIG DATA; CLOUD COMPUTING; DISTRIBUTED DATABASE SYSTEMS; SAMPLING;

DISTRIBUTED ENVIRONMENTS; DISTRIBUTED PROGRAM; DISTRIBUTED SYSTEMS; HADOOP; LOAD BALANCE; MAP-REDUCE; PARTITIONING; PARTITIONING SYSTEMS;

DISTRIBUTED COMPUTER SYSTEMS;

EID: 84924221125 PISSN: 08857458 EISSN: 15737640 Source Type: Journal
DOI: 10.1007/s10766-013-0288-z Document Type: Article

Times cited : (22)

References (28)

1
- 84924223547
- Apache Software Foundation, “Hadoop”
- Apache Software Foundation, “Hadoop”, http://hadoop.apache.org/core

2
- 79952149970
- RanKloud: scalable multimedia data processing in server clusters
- Candan, K., Kim, J.W., Nagarkar, P., Nagendra, M., Yu, R.: RanKloud: scalable multimedia data processing in server clusters. IEEE MultiMed. 18, 64–77 (2011)
- (2011) IEEE MultiMed. , vol.18 , pp. 64-77
- Candan, K.¹ Kim, J.W.² Nagarkar, P.³ Nagendra, M.⁴ Yu, R.⁵

3
- 85071319367
- Bigtable: a distributed storage system for structured data
- Chang, F., Dean, J., Ghemawat, S., Hsieh, W.C., Wallach, D.A., Burrows, M., Chandra, T., Fikes, A., Gruber, R.E.: Bigtable: a distributed storage system for structured data. In: 7th USENIX Symposium on Operating Systems Design and Implementation, pp. 205–218 (2006)
- (2006) 7th USENIX Symposium on Operating Systems Design and Implementation , pp. 205-218
- Chang, F.¹ Dean, J.² Ghemawat, S.³ Hsieh, W.C.⁴ Wallach, D.A.⁵ Burrows, M.⁶ Chandra, T.⁷ Fikes, A.⁸ Gruber, R.E.⁹

4
- 37549003336
- MapReduce: simplified data processing on large clusters
- Dean, J., Ghemawat, S.: MapReduce: simplified data processing on large clusters. Commun. ACM 51, 107–113 (2008)
- (2008) Commun. ACM , vol.51 , pp. 107-113
- Dean, J.¹ Ghemawat, S.²

5
- 79961138274
- Delma: dynamically elastic mapreduce framework for cpu-intensive applications
- Fadika, Z., Govindaraju, M.: Delma: dynamically elastic mapreduce framework for cpu-intensive applications. In: IEEE/ACM International Symposium on Cluster, Cloud and Grid, Computing, pp.454–463 (2011)
- (2011) IEEE/ACM International Symposium on Cluster, Cloud and Grid, Computing , pp. 454-463
- Fadika, Z.¹ Govindaraju, M.²

6
- 21644437974
- The google file system
- Ghemawat, S., Gobioff, H., Leung, S.-T.: The google file system. In: ACM SIGOPS Operating Systems Review, ACM, pp. 29–43 (2003)
- (2003) ACM SIGOPS Operating Systems Review, ACM , pp. 29-43
- Ghemawat, S.¹ Gobioff, H.² Leung, S.-T.³

7
- 80051635126
- Jumbo: Beyond MapReduce for Workload Balancing. VLDB
- Groot, S., Kitsuregawa, M.: Jumbo: Beyond MapReduce for Workload Balancing. VLDB, Phd Workshop (2010)
- (2010) Phd Workshop
- Groot, S.¹ Kitsuregawa, M.²

8
- 84924221518
- HBase
- HBase, http://hadoop.apache.org/hbase/

9
- 0038564328
- Burst tries: a fast, efficient data structure for string keys
- Heinz, S., Zobel, J., Williams, H.: Burst tries: a fast, efficient data structure for string keys. Trans. Inf. Syst. (TOIS) 20(12), 192–223 (2002)
- (2002) Trans. Inf. Syst. (TOIS) , vol.20 , Issue.12 , pp. 192-223
- Heinz, S.¹ Zobel, J.² Williams, H.³

10
- 79961142851
- Ex-mate: data intensive computing with large reduction objects and its application to graph mining
- Jiang, W., Agrawal, G.: Ex-mate: data intensive computing with large reduction objects and its application to graph mining. In: Cluster, Cloud and Grid Computing (CCGrid): 11th IEEE/ACM International Symposium on, IEEE 2011, pp. 475–484 (2011)
- (2011) Cluster, Cloud and Grid Computing (CCGrid): 11th IEEE/ACM International Symposium on, IEEE 2011 , pp. 475-484
- Jiang, W.¹ Agrawal, G.²

11
- 62749166510
- MRPGA: an extension of MapReduce for parallelizing genetic algorithms
- Jin, C., Vecchiola, C., Buyya, R.: MRPGA: an extension of MapReduce for parallelizing genetic algorithms. In: IEEE Fourth International Conference on eScience pp. 214–221 (2008)
- (2008) IEEE Fourth International Conference on eScience , pp. 214-221
- Jin, C.¹ Vecchiola, C.² Buyya, R.³

12
- 77954901315
- An analysis of traces from a production mapreduce cluster
- Kavulya, S., Tan, J., Gandhi, R., Narasimhan, P.: An analysis of traces from a production mapreduce cluster. In: Cluster, Cloud and Grid Computing (CCGrid), 2010 10th IEEE/ACM International Conference, pp. 94–103 (2010)
- (2010) Cluster, Cloud and Grid Computing (CCGrid), 2010 10th IEEE/ACM International Conference , pp. 94-103
- Kavulya, S.¹ Tan, J.² Gandhi, R.³ Narasimhan, P.⁴

13
- 27644464706
- GridBLAST: a globus-based high-throughput implementation of BLAST in a Grid computing framework
- Krishnan, A.: GridBLAST: a globus-based high-throughput implementation of BLAST in a Grid computing framework. Concurr. Comput. Pract. Exp. 17(13), 1607–1623 (2005)
- (2005) Concurr. Comput. Pract. Exp. , vol.17 , Issue.13 , pp. 1607-1623
- Krishnan, A.¹

14
- 79961137349
- Cloud mapreduce: a mapreduce implementation on top of a cloud operating system
- Liu, H., Orban, D.: Cloud mapreduce: a mapreduce implementation on top of a cloud operating system. In: 11th IEEE/ACM International Symposium, pp. 464–474 (2011)
- (2011) 11th IEEE/ACM International Symposium , pp. 464-474
- Liu, H.¹ Orban, D.²

15
- 84857178178
- Dynamic data redistribution for MapReduce joins
- Lynden, S., Tanimura, Y., Kojima, I., Matono, A.: Dynamic data redistribution for MapReduce joins. In: IEEE International Conference on Coud Computing Technology and Science, pp. 713–717 (2011)
- (2011) IEEE International Conference on Coud Computing Technology and Science , pp. 713-717
- Lynden, S.¹ Tanimura, Y.² Kojima, I.³ Matono, A.⁴

16
- 84890571720
- Fortes, J.: “Programming abstractions for data intensive computing on clouds and grids
- Matsunaga, A., Tsugawa, M., Fortes, J.: “Programming abstractions for data intensive computing on clouds and grids. In: IEEE Fourth International Conference on eScience, pp. 489–493 (2008)
- (2008) IEEE Fourth International Conference on eScience , pp. 489-493
- Matsunaga, A.¹ Tsugawa, M.²

17
- 70349755440
- Programming abstractions for data intensive computing on clouds and grids
- Miceli, C., Miceli, M., Jha, S., Kaiser, H., Merzky, A.: Programming abstractions for data intensive computing on clouds and grids. In: IEEE/ACM International Symposium on Cluster Computing and the Grid, pp. 480–483 (2009)
- (2009) IEEE/ACM International Symposium on Cluster Computing and the Grid , pp. 480-483
- Miceli, C.¹ Miceli, M.² Jha, S.³ Kaiser, H.⁴ Merzky, A.⁵

18
- 84924221517
- O’Malley, O.: TeraByte Sort on Apache Hadoop (2008
- O’Malley, O.: TeraByte Sort on Apache Hadoop (2008)

19
- 77952774392
- The model-summary problem and a solution for trees
- Panda, B., Riedewald, M., Fink, D.: The model-summary problem and a solution for trees. In: Data Engineering, International Conference on Data, Engineering, pp. 452–455 (2010)
- (2010) Data Engineering, International Conference on Data, Engineering , pp. 452-455
- Panda, B.¹ Riedewald, M.² Fink, D.³

20
- 67149126890
- Disco: distributed co-clustering with map-reduce: a case study towards petabyte-scale end-to-end mining
- Papadimitriou, S., Sun, J.: Disco: distributed co-clustering with map-reduce: a case study towards petabyte-scale end-to-end mining. In: IEEE International Conference on Data Mining, p. 519 (2008)
- (2008) IEEE International Conference on Data Mining , pp. 519
- Papadimitriou, S.¹ Sun, J.²

21
- 77952577122
- The Hadoop distributed filesystem: balancing portability and performance
- Shafer, J., Rixner, S., Cox, A.L.: The Hadoop distributed filesystem: balancing portability and performance. In: IEEE International Symposium on Performance Analysis of System and Software(ISPASS), p. 123 (2010)
- (2010) IEEE International Symposium on Performance Analysis of System and Software(ISPASS) , pp. 123
- Shafer, J.¹ Rixner, S.² Cox, A.L.³

22
- 84890563322
- An improved partitioning mechanism for optimizing massive data analysis using MapReduce
- Slagter, K., Hsu, C.-H., Chung, Y.-C., Zhang, D.: An improved partitioning mechanism for optimizing massive data analysis using MapReduce. J. Supercomput. 66(1), 539–555 (2013)
- (2013) J. Supercomput , vol.66 , Issue.1 , pp. 539-555
- Slagter, K.¹ Hsu, C.-H.² Chung, Y.-C.³ Zhang, D.⁴

23
- 38449085073
- Grid approach to embarrassingly parallel CPU-intensive bioinformatics problems
- Stockinger, H., Pagni, M., Cerutti, L., Falquet, L.: Grid approach to embarrassingly parallel CPU-intensive bioinformatics problems. In: IEEE International Conference on e-Science and Grid Computing (2006)
- (2006) IEEE International Conference on e-Science and Grid Computing
- Stockinger, H.¹ Pagni, M.² Cerutti, L.³ Falquet, L.⁴

24
- 84904441123
- Mochi: visual log-analysis based tools for debugging Hadoop
- Tan, J., Pan, X., Kavulya, S., Gandhi, R., Narasimhan, P.: Mochi: visual log-analysis based tools for debugging Hadoop. In: USENIX Workshop on Hot Topics in Cloud Computing (HotCloud) (2009)
- (2009) USENIX Workshop on Hot Topics in Cloud Computing (HotCloud)
- Tan, J.¹ Pan, X.² Kavulya, S.³ Gandhi, R.⁴ Narasimhan, P.⁵

25
- 78049340525
- Moving text analysis tools to the cloud
- Vashishtha, H., Smit, M., Stroulia, E.: Moving text analysis tools to the cloud. In: IEEE World Congress on Services, pp. 110–112 (2010)
- (2010) IEEE World Congress on Services , pp. 110-112
- Vashishtha, H.¹ Smit, M.² Stroulia, E.³

26
- 77949580645
- Scaling genetic algorithms using mapreduce
- Verma, A., Llora, X., Goldberg, D.E., Campbell, R.H.: Scaling genetic algorithms using mapreduce. In: Intelligent Systems Design and Applications (2009)
- (2009) Intelligent Systems Design and Applications
- Verma, A.¹ Llora, X.² Goldberg, D.E.³ Campbell, R.H.⁴

27
- 85016706063
- Hadoop the definitive guide 2nd edition
- White, T.: “Hadoop the definitive guide 2nd edition”, Published Oreilly (2010)
- (2010) Published Oreilly
- White, T.¹

28
- 72249121870
- Detecting large-scale system problems by mining console logs
- Xu, W., Huang, L., Fox, A., Patterson, D., Jordan, M.I.: Detecting large-scale system problems by mining console logs. In: Proceedings of the ACM SIGOPS 22nd Symposium on Operating Systems Principles (2009)
- (2009) Proceedings of the ACM SIGOPS 22nd Symposium on Operating Systems Principles
- Xu, W.¹ Huang, L.² Fox, A.³ Patterson, D.⁴ Jordan, M.I.⁵

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.