



Volume 3741 LNCS, 2005, Pages 153-158

Self-refined fault tolerance in HPC using dynamic dependent process groups

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; APPROXIMATION THEORY; DISTRIBUTED COMPUTER SYSTEMS; INFORMATION RETRIEVAL; MATHEMATICAL MODELS;

EID: 33745305678     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/11603771_18     Document Type: Conference Paper
Times cited: 5

References (7)
  • 2
    • EID: 0032000230
    • L. Alvisi and K. Marzullo, Message Logging: Pessimistic, optimistic, causal and optimal, IEEE Transactions on Software Engineering, 24(2): 149-159, Feb. 1998.
  • 3
    • EID: 4344718367
    • G. Bosilca et al., MPICH-V: Toward a scalable fault tolerant MPI for volatile nodes, Proceedings of Super Computing Conference, pp. 23-41, ACM/IEEE CS Press, 2002.
  • 4
    • EID: 60449096682
    • Bouteiller et al., MPICH-V2: A fault tolerant MPI for volatile nodes based on pessimistic sender based message logging, Super Computing, 2003.
  • 5
    • EID: 0022020346
    • K.M. Chandy and L. Lamport, Distributed snapshots: Determining global states of distributed systems, ACM Transactions on Computing Systems, 3(1): 63-75, Aug. 1985.
  • 6
    • EID: 0042078549
    • E.N. Elnozahy, L. Alvisi, Y.M. Wang, and D.B. Johnson, A survey of rollback-recovery protocols in message-passing systems, ACM Computing Surveys, 34(3): 375-408, 2002.
  • 7
    • EID: 0029713612
    • G. Stellner, Cocheck: Checkpointing and process migration for MPI, IPPS, pp. 526-531, 1996.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.