Volume , Issue , 2007, Pages

ABARIS: An adaptable fault detection/recovery component framework for MPIs

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER SYSTEM RECOVERY; FAULT DETECTION; INTERFACES (COMPUTER); MESSAGE PASSING; PROBLEM SOLVING; RESPONSE TIME (COMPUTER SYSTEMS)

EID: 34548714331     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IPDPS.2007.370603     Document Type: Conference Paper
Times cited : (11)

References (17)
  • 6
    • 0022020346
    • K. M. Chandy and L. Lamport. Distributed snapshots: Determining global states of distributed systems. ACM Transactions on Computer Systems, 3(1):63-75, February 1985.
  • 7
    • 0042078549
    • A survey of rollback-recovery protocols in message-passing systems
    • E. Elnozahy, D. Johnson, and Y. Wang. A survey of rollback-recovery protocols in message-passing systems. ACM Computing Surveys, 34(3):375-408, 2002.
    • (2002) ACM Computing Surveys , vol.34 , Issue.3 , pp. 375-408
    • Elnozahy, E.1    Johnson, D.2    Wang, Y.3
  • 8
    • 84940567900
    • G. Fagg and J. Dongarra. FT-MPI: Fault tolerant MPI, supporting dynamic applications in a dynamic world. In Euro PVM/MPI User's Group Meeting 2000, Springer-Verlag, Berlin, Germany, pages 346-353, 2000.
  • 10
    • 0030243005
    • High-performance, portable implementation of the MPI Message Passing Interface Standard
    • W. Gropp, E. Lusk, N. Doss, and A. Skjellum. High-performance, portable implementation of the MPI Message Passing Interface Standard. Parallel Computing, 22(6):789-828, 1996.
    • (1996) Parallel Computing , vol.22 , Issue.6 , pp. 789-828
    • Gropp, W.1    Lusk, E.2    Doss, N.3    Skjellum, A.4
  • 13
    • 3342966061
    • The ganglia distributed monitoring system: Design, implementation and experience
    • M. L. Massie, B. N. Chun, and D. E. Culler. The ganglia distributed monitoring system: Design, implementation and experience. Parallel Computing, 30, July 2004.
    • (2004) Parallel Computing , vol.30
    • Massie, M.L.1    Chun, B.N.2    Culler, D.E.3
  • 14
    • 33847764225
    • Model-based check-point scheduling for volatile resource environments
    • Technical Report 2004-25, University of California Santa Barbara, Department of Computer Science, Santa Barbara, CA, 93106
    • D. Nurmi, R. Wolski, and J. Brevik. Model-based check-point scheduling for volatile resource environments. Technical Report 2004-25, University of California Santa Barbara, Department of Computer Science, Santa Barbara, CA, 93106, 2004.
    • (2004)
    • Nurmi, D.1    Wolski, R.2    Brevik, J.3
  • 15
    • 0032597696
    • S. Rao, L. Alvisi, and H. M. Vin. Egida: An extensible toolkit for low overhead fault tolerance. In Proceedings of the 29th Fault-tolerant Computing Symposium (FTCS-29), Madison, Wisconsin, pages 48-55, June 1999.
  • 16
    • 35248827046
    • A Component Architecture for LAM/MPI
    • Proceedings, 10th European PVM/MPI Users' Group Meeting, number 2840 in Lecture Notes in Computer Science, Venice, Italy, September / October 2003, Springer-Verlag
    • J. M. Squyres and A. Lumsdaine. A Component Architecture for LAM/MPI. In Proceedings, 10th European PVM/MPI Users' Group Meeting, number 2840 in Lecture Notes in Computer Science, pages 379-387, Venice, Italy, September / October 2003. Springer-Verlag.
    • (2003) Lecture Notes in Computer Science , vol.2840 , pp. 379-387
    • Squyres, J.M.1    Lumsdaine, A.2
  • 17
    • 34548708373
    • V. C. Zandy. ckpt: A process checkpoint library, 2002. http://www.cs.wisc.edu/~zandy/ckpt.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.