메뉴 건너뛰기




Volumn , Issue , 2013, Pages

Optimization of cloud task processing with checkpoint-restart mechanism

Author keywords

BLCR; Checkpoint Restart Mechanism; Cloud Computing; Google; Optimal Checkpointing Interval

Indexed keywords

ADAPTIVE ALGORITHMS; CLOUD COMPUTING; DIGITAL STORAGE; FAULT TOLERANCE; PROBABILITY DISTRIBUTIONS;

EID: 84899679452     PISSN: 21674329     EISSN: 21674337     Source Type: Conference Proceeding    
DOI: 10.1145/2503210.2503217     Document Type: Conference Paper
Times cited : (67)

References (31)
  • 2
    • 33646410614 scopus 로고    scopus 로고
    • Virtual machines: Versatile platforms for systems and processes
    • J. E. Smith and R. Nair. Virtual Machines: Versatile Platforms For Systems And Processes. Morgan Kaufmann, 2005.
    • (2005) Morgan Kaufmann
    • Smith, J.E.1    Nair, R.2
  • 4
    • 74349095095 scopus 로고    scopus 로고
    • Cloud computing for parallel scientific hpc applications: Feasibility of running coupled atmosphere-ocean climate models on amazon's ec2
    • C Evangelinos and C. N. Hill. Cloud Computing for parallel Scientific HPC Applications: Feasibility of Running Coupled Atmosphere-Ocean Climate Models on Amazon's EC2. in Computability and Complexity in Analysis (CAA'08), 2008.
    • (2008) Computability and Complexity in Analysis (CAA'08)
    • Evangelinos, C.1    Hill, C.N.2
  • 5
    • 84899696576 scopus 로고    scopus 로고
    • Amazon elastic compute cloud: on line at
    • Amazon elastic compute cloud: on line at http://aws. amazon. com/ec2/.
  • 8
    • 84863909234 scopus 로고    scopus 로고
    • Google research blog. Nov. posted at
    • J. Wilkes. More Google cluster data. Google research blog, Nov. 2011, posted at http://googleresearch. blogspot. com/2011/11/moregoogle-cluster-data. html.
    • (2011) More Google Cluster Data
    • Wilkes, J.1
  • 9
    • 84899701973 scopus 로고    scopus 로고
    • Google cluster-usage traces: format + schema Google Inc., Mountain View, CA, USA, Technical Report, Nov. revised 2012. 03. 20
    • C. Reiss, J. Wilkes, and J. L. Hellerstein. Google cluster-usage traces: format + schema. Google Inc., Mountain View, CA, USA, Technical Report, Nov. 2011, revised 2012. 03. 20.
    • (2011)
    • Reiss, C.1    Wilkes, J.2    Hellerstein, J.L.3
  • 10
    • 84870486464 scopus 로고    scopus 로고
    • Towards understanding heterogeneous clouds at scale: Google trace analysis
    • Carnegie Mellon University, Pittsburgh, PA, USA, Tech. Rep. ISTC-CC-TR-12-101, Apr.
    • C. Reiss, A. Tumanov, G. R. Ganger, R. H. Katz, and M. A. Kozuch. Towards understanding heterogeneous clouds at scale: Google trace analysis. Intel science and technology center for cloud computing, Carnegie Mellon University, Pittsburgh, PA, USA, Tech. Rep. ISTC-CC-TR-12-101, Apr. 2012.
    • (2012) Intel Science and Technology Center for Cloud Computing
    • Reiss, C.1    Tumanov, A.2    Ganger, G.R.3    Katz, R.H.4    Kozuch, M.A.5
  • 12
    • 84870912418 scopus 로고    scopus 로고
    • Monetary cost-aware checkpointing and migration on amazon cloud spot instances
    • S. Yi, A. Andrzejak and D. Kondo. Monetary Cost-Aware Checkpointing and Migration on Amazon Cloud Spot Instances. in IEEE Trans. on Services Computing, 5(4):512-524, 2012.
    • (2012) IEEE Trans. on Services Computing , vol.5 , Issue.4 , pp. 512-524
    • Yi, S.1    Andrzejak, A.2    Kondo, D.3
  • 14
    • 28044460018 scopus 로고    scopus 로고
    • A higher order estimate of the optimum checkpoint interval for restart dumps
    • J. T. Daly. A higher order estimate of the optimum checkpoint interval for restart dumps. in Future Generation Computer Systems, 22(3):303-312, 2006.
    • (2006) Future Generation Computer Systems , vol.22 , Issue.3 , pp. 303-312
    • Daly, J.T.1
  • 15
    • 54149107334 scopus 로고    scopus 로고
    • Optimization of checkpointing-related I/O for high-performance parallel and distributed computing
    • R. Subramaniyan, E. Grobelny, S. Studham, A. George. Optimization of checkpointing-related I/O for high-performance parallel and distributed computing. in Journal of Supercomputing, 46(2):150-180, 2008.
    • (2008) Journal of Supercomputing , vol.46 , Issue.2 , pp. 150-180
    • Subramaniyan, R.1    Grobelny, E.2    Studham, S.3    George, A.4
  • 17
    • 84976846528 scopus 로고
    • A first order approximation to the optimum checkpoint interval
    • J. W. Young. A first order approximation to the optimum checkpoint interval. in Communications ACM, 17(9):530-531, 1974.
    • (1974) Communications ACM , vol.17 , Issue.9 , pp. 530-531
    • Young, J.W.1
  • 19
    • 37549003336 scopus 로고    scopus 로고
    • MapReduce: Simplified data processing on large clusters
    • J. Dean and S. Ghemawat. MapReduce: simplified data processing on large clusters. in Commun. ACM, 51(1):107-113, 2008.
    • (2008) Commun. ACM , vol.51 , Issue.1 , pp. 107-113
    • Dean, J.1    Ghemawat, S.2
  • 20
    • 33749067567 scopus 로고    scopus 로고
    • Berkeley lab checkpoint/restart (BLCR) for Linux clusters
    • P. H. Hargrove and J. C. Duell. Berkeley lab checkpoint/restart (BLCR) for Linux clusters. in Journal of Physics: Conference Series, 46(1):494, 2006.
    • (2006) Journal of Physics: Conference Series , vol.46 , Issue.1 , pp. 494
    • Hargrove, P.H.1    Duell, J.C.2
  • 21
    • 0037619265 scopus 로고    scopus 로고
    • Web search for a planet: The Google cluster architecture
    • L. A. Barroso, J. Dean and U. Holzle. Web search for a planet: The Google cluster architecture. in Journal of Micro, 23(2):22-28, 2003.
    • (2003) Journal of Micro , vol.23 , Issue.2 , pp. 22-28
    • Barroso, L.A.1    Dean, J.2    Holzle, U.3
  • 25
    • 84877772120 scopus 로고    scopus 로고
    • Error-tolerant resource allocation and payment minimization for cloud system
    • S. Di and C. L. Wang. Error-Tolerant Resource Allocation and Payment Minimization for Cloud System. in IEEE Trans. on Parallel and Distributed Systems (TPDS), 24(6):1097-1106, 2013.
    • (2013) IEEE Trans. on Parallel and Distributed Systems (TPDS) , vol.24 , Issue.6 , pp. 1097-1106
    • Di, S.1    Wang, C.L.2
  • 26
    • 84899684912 scopus 로고    scopus 로고
    • Gideon-II Cluster:
    • Gideon-II Cluster: http://i. cs. hku. hk/?clwang/Gideon-II.
  • 27
    • 0021453195 scopus 로고
    • On the execution of large batch programs in unreliable computing systems
    • C. H. C. Leung and Q. H. Choo. On the Execution of Large Batch Programs in Unreliable Computing Systems. in IEEE Trans. on Software Engineering, 10(4):444-450, 1984.
    • (1984) IEEE Trans. on Software Engineering , vol.10 , Issue.4 , pp. 444-450
    • Leung, C.H.C.1    Choo, Q.H.2
  • 28
    • 84899668302 scopus 로고    scopus 로고
    • Stochastic models for checkpointing
    • Springer Berlin Heidelberg
    • K. Wolter, Stochastic models for checkpointing. in Stochastic Models for Fault Tolerance, pages 177-236, Springer Berlin Heidelberg, 2010.
    • (2010) Stochastic Models for Fault Tolerance , pp. 177-236
    • Wolter, K.1
  • 29
    • 84951811772 scopus 로고    scopus 로고
    • Optimum retrial number of reliability models
    • ser. Springer Series in Reliability Engineering. Springer London
    • T. Nakagawa. Optimum retrial number of reliability models. in Advanced Reliability Models and Maintenance Policies, pages 101-122, ser. Springer Series in Reliability Engineering. Springer London, 2008.
    • (2008) Advanced Reliability Models and Maintenance Policies , pp. 101-122
    • Nakagawa, T.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.