메뉴 건너뛰기




Volumn , Issue , 2003, Pages 84-94

Automated application-level checkpointing of MPI programs

Author keywords

Application level Checkpointing; Fault tolerance; MPI; Non FIFO Communication; Scientific Computing

Indexed keywords

ALGORITHMS; COMPUTER HARDWARE; COMPUTER SCIENCE; COMPUTER SYSTEM RECOVERY; FAULT TOLERANT COMPUTER SYSTEMS; NETWORK PROTOCOLS;

EID: 0038040085     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (125)

References (21)
  • 2
    • 0038335808 scopus 로고
    • Compiler-assisted checkpointing
    • Dept. of Computer Science, University of Tennessee
    • M. Beck, J. S. Plank, and G. Kingsley. Compiler-assisted checkpointing. Technical Report UT-CS-94-269, Dept. of Computer Science, University of Tennessee, 1994.
    • (1994) Technical Report , vol.UT-CS-94-269
    • Beck, M.1    Plank, J.S.2    Kingsley, G.3
  • 5
    • 0022020346 scopus 로고
    • Distributed snapshots: Determining global states of distributed systems
    • M. Chandy and L. Lamport. Distributed snapshots: Determining global states of distributed systems. ACM Transactions on Computing Systems, 3(1):63-75, 1985.
    • (1985) ACM Transactions on Computing Systems , vol.3 , Issue.1 , pp. 63-75
    • Chandy, M.1    Lamport, L.2
  • 6
    • 0026867749 scopus 로고
    • Manetho: Transparent rollback-recovery with low overhead, limited rollback and fast output
    • May
    • E. N. Elnozahy and W. Zwaenepoel. Manetho: Transparent rollback-recovery with low overhead, limited rollback and fast output. IEEE Transactions on Computers, 41(5), May 1992.
    • (1992) IEEE Transactions on Computers , vol.41 , Issue.5
    • Elnozahy, E.N.1    Zwaenepoel, W.2
  • 7
    • 0004096191 scopus 로고    scopus 로고
    • A survey of rollback-recovery protocols in message passing systems
    • School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA, Oct.
    • M. Elnozahy, L. Alvisi, Y. M. Wang, and D. B. Johnson. A survey of rollback-recovery protocols in message passing systems. Technical Report CMU-CS-96-181, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA, Oct. 1996.
    • (1996) Technical Report , vol.CMU-CS-96-181
    • Elnozahy, M.1    Alvisi, L.2    Wang, Y.M.3    Johnson, D.B.4
  • 8
    • 0003413672 scopus 로고
    • MPI: A message-passing interface standard
    • University of Tennessee
    • M. P. I. Forum. MPI: A message-passing interface standard. Technical Report UT-CS-94-230, University of Tennessee, 1994.
    • (1994) Technical Report , vol.UT-CS-94-230
  • 12
    • 0037660091 scopus 로고    scopus 로고
    • IBM Research. Blue gene project overview. Online at http://www.research.ibm.com/bluegene/, 2002.
    • (2002) Blue Gene Project Overview
  • 13
    • 0026142735 scopus 로고
    • Transparent optimistic rollback recovery
    • D. B. Johnson and W. Zwaenepoel. Transparent optimistic rollback recovery. Operating Systems Review, 25(2):99-102, 1991.
    • (1991) Operating Systems Review , vol.25 , Issue.2 , pp. 99-102
    • Johnson, D.B.1    Zwaenepoel, W.2
  • 14
    • 0004215089 scopus 로고    scopus 로고
    • Morgan Kaufmann, San Francisco, California, first edition
    • N. Lynch. Distributed Algorithms. Morgan Kaufmann, San Francisco, California, first edition, 1996.
    • (1996) Distributed Algorithms
    • Lynch, N.1
  • 15
    • 0003912256 scopus 로고    scopus 로고
    • Checkpoint and migration of UNIX processes in the condor distributed processing system
    • University of Wisconsin-Madison
    • J. B. M. Litzkow, T. Tannenbaum and M. Livny. Checkpoint and migration of UNIX processes in the condor distributed processing system. Technical Report 1346, University of Wisconsin-Madison, 1997.
    • (1997) Technical Report , vol.1346
    • Litzkow, J.B.M.1    Tannenbaum, T.2    Livny, M.3
  • 16
    • 1442359688 scopus 로고    scopus 로고
    • National Nuclear Security Administration. Asci home. Online at http://www.nnsa.doe.gov/asc/, 2002.
    • (2002) Asci Home
  • 17
    • 0002067202 scopus 로고
    • Transparent checkpointing under UNIX
    • Dept. of Computer Science, University of Tennessee
    • J. S. Plank, M. Beck, G. Kingsley, and K. Li. Libckpt: Transparent checkpointing under UNIX. Technical Report UT-CS-94-242, Dept. of Computer Science, University of Tennessee, 1994.
    • (1994) Technical Report , vol.UT-CS-94-242
    • Plank, J.S.1    Beck, M.2    Kingsley, G.3    Libckpt, K.Li.4
  • 19
  • 20
    • 1342295420 scopus 로고    scopus 로고
    • The use of the MPI communication library in the NAS parallel benchmarks
    • Advanced Computer Architecture Laboratory, Dept. of Electrical Engineering and Computer Science, University of Michigan, 17
    • T. Tabe and Q. F. Stout. The use of the MPI communication library in the NAS parallel benchmarks. Technical Report CSE-TR-386-99, Advanced Computer Architecture Laboratory, Dept. of Electrical Engineering and Computer Science, University of Michigan, 17, 1999.
    • (1999) Technical Report , vol.CSE-TR-386-99
    • Tabe, T.1    Stout, Q.F.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.