메뉴 건너뛰기




Volumn , Issue , 2008, Pages 71-79

A scalable checkpoint encoding algorithm for diskless checkpointing

Author keywords

Checkpoint; Diskless checkpointing; Fault tolerance; High performance computing; Parallel and distributed systems; Reed solomon encoding

Indexed keywords

CONJUGATE GRADIENT METHOD; ENCODING (SYMBOLS); ERRORS; FAULT TOLERANCE; HIGH PERFORMANCE LIQUID CHROMATOGRAPHY; QUALITY ASSURANCE; RELIABILITY; SAFETY ENGINEERING; SYSTEMS ENGINEERING;

EID: 58449086437     PISSN: 15302059     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/HASE.2008.13     Document Type: Conference Paper
Times cited : (19)

References (18)
  • 4
    • 0029715009 scopus 로고    scopus 로고
    • Evaluation of checkpoint mechanisms for massively parallel machines
    • T. cker Chiueh and P. Deng. Evaluation of checkpoint mechanisms for massively parallel machines. In FTCS, pages 370-379, 1996.
    • (1996) FTCS , pp. 370-379
    • cker Chiueh, T.1    Deng, P.2
  • 5
    • 58449099052 scopus 로고    scopus 로고
    • J. Dongarra, H. Meuer, and E. Strohmaier. TOP500 Supercomputer Sites, 24th edition. In Proceedings of the Supercomputing Conference (SC'2004), Pittsburgh PA, USA. ACM, 2004.
    • J. Dongarra, H. Meuer, and E. Strohmaier. TOP500 Supercomputer Sites, 24th edition. In Proceedings of the Supercomputing Conference (SC'2004), Pittsburgh PA, USA. ACM, 2004.
  • 6
    • 84940567900 scopus 로고    scopus 로고
    • FT-MPI: Fault tolerant MPI, supporting dynamic applications in a dynamic world
    • G. E. Fagg and J. Dongarra. FT-MPI: Fault tolerant MPI, supporting dynamic applications in a dynamic world. In PVM/MPI 2000, pages 346-353, 2000.
    • (2000) PVM/MPI 2000 , pp. 346-353
    • Fagg, G.E.1    Dongarra, J.2
  • 9
    • 0018454850 scopus 로고
    • On the optimum checkpoint interval
    • E. Gelenbc. On the optimum checkpoint interval. J. ACM, 26(2):259-270, 1979.
    • (1979) J. ACM , vol.26 , Issue.2 , pp. 259-270
    • Gelenbc, E.1
  • 12
    • 0031223146 scopus 로고    scopus 로고
    • A tutorial on Reed-Solomon coding for fault-tolerance in RAID-like systems
    • September
    • J. S. Plank. A tutorial on Reed-Solomon coding for fault-tolerance in RAID-like systems. Software -Practice & Experience, 27(9):995-1012, September 1997.
    • (1997) Software -Practice & Experience , vol.27 , Issue.9 , pp. 995-1012
    • Plank, J.S.1
  • 13
    • 0031570636 scopus 로고    scopus 로고
    • Fault-tolerant matrix operations for networks of workstations using diskless checkpointing
    • J. S. Plank, Y. Kim, and J. Dongarra. Fault-tolerant matrix operations for networks of workstations using diskless checkpointing. J. Parallel Distrib. Comput., 43(2):125-138, 1997.
    • (1997) J. Parallel Distrib. Comput , vol.43 , Issue.2 , pp. 125-138
    • Plank, J.S.1    Kim, Y.2    Dongarra, J.3
  • 14
    • 0028060943 scopus 로고
    • Faster checkpointing with n+1 parity
    • J. S. Plank and K. Li. Faster checkpointing with n+1 parity. In FTCS, pages 288-297, 1994.
    • (1994) FTCS , pp. 288-297
    • Plank, J.S.1    Li, K.2
  • 16
    • 0035201417 scopus 로고    scopus 로고
    • Processor allocation and checkpoint interval selection in cluster computing systems
    • November
    • J. S. Plank and M. G. Thomason. Processor allocation and checkpoint interval selection in cluster computing systems. J. Parallel Distrib. Comput., 61(11):1570-1590, November 2001.
    • (2001) J. Parallel Distrib. Comput , vol.61 , Issue.11 , pp. 1570-1590
    • Plank, J.S.1    Thomason, M.G.2
  • 17
    • 84864756973 scopus 로고    scopus 로고
    • An experimental study about diskless checkpointing
    • L. M. Silva and J. G. Silva. An experimental study about diskless checkpointing. In EUROMI-CRO'98, pages 395-402, 1998.
    • (1998) EUROMI-CRO'98 , pp. 395-402
    • Silva, L.M.1    Silva, J.G.2
  • 18
    • 84976846528 scopus 로고
    • A first order approximation to the optimal checkpoint interval
    • J. W. Young. A first order approximation to the optimal checkpoint interval. Commun. ACM, 17(9):530-531, 1974.
    • (1974) Commun. ACM , vol.17 , Issue.9 , pp. 530-531
    • Young, J.W.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.