메뉴 건너뛰기




Volumn 18, Issue , 2004, Pages 2943-2950

A fault tolerant protocol for massively parallel systems

Author keywords

[No Author keywords available]

Indexed keywords

FAULT TOLERANT PROTOCOLS; MEAN TIME BETWEEN FAILURE (MTBF); PARALLEL SYSTEMS;

EID: 12444281734     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (29)

References (24)
  • 2
    • 0009228886 scopus 로고    scopus 로고
    • Understanding the message logging paradigm for masking process crashes
    • L. Alvisi. Understanding the message logging paradigm for masking process crashes. Technical Report TR96-1577, 1, 1996.
    • (1996) Technical Report , vol.TR96-1577 , pp. 1
    • Alvisi, L.1
  • 11
    • 0022020346 scopus 로고
    • Distributed snapshots: Determining global states of distributed systems
    • February
    • K. Chandy and L. Lamport. Distributed snapshots: Determining global states of distributed systems. In ACM Transactions on Computer Systems, pages 3(1):63-75, February 1985.
    • (1985) ACM Transactions on Computer Systems , vol.3 , Issue.1 , pp. 63-75
    • Chandy, K.1    Lamport, L.2
  • 14
    • 0004096191 scopus 로고    scopus 로고
    • A survey of rollback-recovery protocols in message passing systems
    • School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA, Oct.
    • M. Elnozahy, L. Alvisi, Y. M. Wang, and D. B. Johnson. A survey of rollback-recovery protocols in message passing systems. Technical Report CMU-CS-96-181, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA, Oct. 1996.
    • (1996) Technical Report , vol.CMU-CS-96-181
    • Elnozahy, M.1    Alvisi, L.2    Wang, Y.M.3    Johnson, D.B.4
  • 15
    • 84940567900 scopus 로고    scopus 로고
    • FT-MPI: Fault tolerant MPI, supporting dynamic applications in dynamic world
    • S. Verlag, editor, Berlin, Germany
    • G. Fagg and J. Dongarra. FT-MPI: Fault Tolerant MPI, Supporting Dynamic Applications in Dynamic World. In S. Verlag, editor, Euro PVM/MPI User's Group Meeting, pages 346-353, Berlin, Germany, 2000.
    • (2000) Euro PVM/MPI User's Group Meeting , pp. 346-353
    • Fagg, G.1    Dongarra, J.2
  • 16
    • 12444260048 scopus 로고    scopus 로고
    • Adaptive MPI
    • College Station, Texas, October
    • C. Huang, O. Lawlor, and L. V. Kalé. Adaptive MPI. In LCPC, College Station, Texas, October 2003.
    • (2003) LCPC
    • Huang, C.1    Lawlor, O.2    Kalé, L.V.3
  • 18
    • 0002479236 scopus 로고    scopus 로고
    • Charm++: Parallel programming with message-driven objects
    • G. V. Wilson and P. Lu, editors, MIT Press
    • L. V. Kale and S. Krishnan. Charm++: Parallel Programming with Message-Driven Objects. In G. V. Wilson and P. Lu, editors, Parallel Programming using C++, pages 175-213. MIT Press, 1996.
    • (1996) Parallel Programming Using C++ , pp. 175-213
    • Kale, L.V.1    Krishnan, S.2
  • 19
    • 84976815497 scopus 로고
    • Fail-stop processors: An approach to designing fault-tolerant computing systems
    • R. D. Schlichting and F. B. Schneider. Fail-stop processors: An approach to designing fault-tolerant computing systems. ACM Transactions on Computer Systems, 1(3):222-238, 1983.
    • (1983) ACM Transactions on Computer Systems , vol.1 , Issue.3 , pp. 222-238
    • Schlichting, R.D.1    Schneider, F.B.2
  • 20
    • 0029713612 scopus 로고    scopus 로고
    • CoCheck: Checkpointing and process migration for MPI
    • Honolulu, Hawaii
    • G. Stellner. CoCheck: Checkpointing and Process Migration for MPI. In Proceedings of the 10th IPPS, Honolulu, Hawaii, 1996.
    • (1996) Proceedings of the 10th IPPS
    • Stellner, G.1
  • 21
    • 0022112420 scopus 로고
    • Optimistic recovery in distributed systems
    • R. Strom and S. Yemini. Optimistic recovery in distributed systems. ACM Trans. Comput. Syst., 3(3):204-226, 1985.
    • (1985) ACM Trans. Comput. Syst. , vol.3 , Issue.3 , pp. 204-226
    • Strom, R.1    Yemini, S.2
  • 24
    • 12444339819 scopus 로고    scopus 로고
    • Bigsim: A parallel simulator for performance prediction of extremely large parallel machines
    • Santa Fe, New Mexico, April
    • G. Zheng, G. Kakulapati, and L. V. Kalé. Bigsim: A parallel simulator for performance prediction of extremely large parallel machines. In 2004 IPDPS Conference, Santa Fe, New Mexico, April 2004.
    • (2004) 2004 IPDPS Conference
    • Zheng, G.1    Kakulapati, G.2    Kalé, L.V.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.