Volume , Issue , 2009, Pages

Reasons for a pessimistic or optimistic message logging protocol in MPI uncoordinated failure recovery

Author keywords

[No Author keywords available]

Indexed keywords

EVENT LOGGING; FAILURE RATE; FAILURE RECOVERY; HIGH PERFORMANCE COMPUTING; HIGH PERFORMANCE NETWORKS; MESSAGE LOGGING; MESSAGE LOGGING PROTOCOLS; MESSAGE RECEPTION; MPI APPLICATIONS; NEW MODEL; SEVERAL PROTOCOLS; STABLE STORAGE;

EID: 72149132074     PISSN: 15525244     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/CLUSTR.2009.5289157     Document Type: Conference Paper
Times cited : (27)

References (29)
  • 4
    • 0042078549
    • A survey of rollback-recovery protocols in message-passing systems
    • E. N. M. Elnozahy, L. Alvisi, Y.-M. Wang, and D. B. Johnson, "A survey of rollback-recovery protocols in message-passing systems," ACM Comput. Surv., vol.34, no.3, pp. 375-408, 2002.
    • (2002) ACM Comput. Surv. , vol.34 , Issue.3 , pp. 375-408
    • Elnozahy, E.N.M.1    Alvisi, L.2    Wang, Y.-M.3    Johnson, D.B.4
  • 7
    • 0017996760
    • Time, clocks, and the ordering of events in a distributed system
    • L. Lamport, "Time, clocks, and the ordering of events in a distributed system," Communications of the ACM, vol.21, no.7, pp. 558-565, 1978.
    • (1978) Communications of the ACM , vol.21 , Issue.7 , pp. 558-565
    • Lamport, L.1
  • 8
    • 50649083601
    • O2P: An extremely optimistic message logging protocol
    • November
    • T. Ropars and C. Morin, "O2P: An Extremely Optimistic Message Logging Protocol," INRIA Research Report 6357, November 2007.
    • (2007) INRIA Research Report 6357
    • Ropars, T.1    Morin, C.2
  • 11
    • 0026907967
    • An efficient implementation of vector clocks
    • M. Singhal and A. Kshemkalyani, "An Efficient Implementation of Vector Clocks," Information Processing Letters, vol.43, no.1, pp. 47-52, 1992.
    • (1992) Information Processing Letters , vol.43 , Issue.1 , pp. 47-52
    • Singhal, M.1    Kshemkalyani, A.2
  • 15
    • 0030286802
    • Algorithm-based fault location and recovery for matrix computations on multiprocessor systems
    • A. Roy-Chowdhury and P. Banerjee, "Algorithm-based fault location and recovery for matrix computations on multiprocessor systems," IEEE Trans. Comput., vol.45, no.11, pp. 1239-1247, 1996.
    • (1996) IEEE Trans. Comput. , vol.45 , Issue.11 , pp. 1239-1247
    • Roy-Chowdhury, A.1    Banerjee, P.2
  • 17
    • 84940567900
    • FT-MPI: Fault tolerant MPI, supporting dynamic applications in a dynamic world
    • Balatonfüred, Hungary: Springer-Verlag Heidelberg, September
    • G. Fagg and J. Dongarra, "FT-MPI: Fault tolerant MPI, supporting dynamic applications in a dynamic world," in 7th Euro PVM/MPI User's Group Meeting 2000, vol.1908 / 2000. Balatonfüred, Hungary: Springer-Verlag Heidelberg, September 2000.
    • (2000) 7th Euro PVM/MPI User's Group Meeting 2000 , vol.1908
    • Fagg, G.1    Dongarra, J.2
  • 18
    • 0035480335
    • HARNESS and fault tolerant MPI
    • October
    • G. E. Fagg, A. Bukovsky, and J. J. Dongarra, "HARNESS and fault tolerant MPI," Parallel Computing, vol.27, no.11, pp. 1479-1495, October 2001.
    • (2001) Parallel Computing , vol.27 , Issue.11 , pp. 1479-1495
    • Fagg, G.E.1    Bukovsky, A.2    Dongarra, J.J.3
  • 19
    • 0022020346
    • Distributed snapshots: Determining global states of distributed systems
    • ACM, February
    • K. M. Chandy and L. Lamport, "Distributed snapshots: Determining global states of distributed systems," in Transactions on Computer Systems, vol.3(1). ACM, February 1985, pp. 63-75.
    • (1985) Transactions on Computer Systems , vol.3 , Issue.1 , pp. 63-75
    • Chandy, K.M.1    Lamport, L.2
  • 23
    • 0026867749
    • Manetho: Transparent rollback-recovery with low overhead, limited rollback and fast output
    • May
    • E. N. Elnozahy and W. Zwaenepoel, "Manetho: Transparent rollback-recovery with low overhead, limited rollback and fast output," IEEE Transactions on Computers, vol.41, no.5, May 1992.
    • (1992) IEEE Transactions on Computers , vol.41 , Issue.5
    • Elnozahy, E.N.1    Zwaenepoel, W.2
  • 26
    • 0022112420
    • Optimistic recovery in distributed systems
    • R. Strom and S. Yemini, "Optimistic Recovery in Distributed Systems," ACM Transactions on Computer Systems, vol.3, no.3, pp. 204-226, 1985.
    • (1985) ACM Transactions on Computer Systems , vol.3 , Issue.3 , pp. 204-226
    • Strom, R.1    Yemini, S.2
  • 29
    • 0042078549
    • A survey of rollback-recovery protocols in message-passing systems
    • September
    • M. Elnozahy, L. Alvisi, Y. M. Wang, and D. B. Johnson, "A survey of rollback-recovery protocols in message-passing systems," ACM Computing Surveys (CSUR), vol.34, no.3, pp. 375-408, September 2002.
    • (2002) ACM Computing Surveys (CSUR) , vol.34 , Issue.3 , pp. 375-408
    • Elnozahy, M.1    Alvisi, L.2    Wang, Y.M.3    Johnson, D.B.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.