메뉴 건너뛰기




Volumn 4297 LNCS, Issue , 2006, Pages 485-496

Proactive fault tolerance in MPI applications via task migration

Author keywords

[No Author keywords available]

Indexed keywords

CHARM++; DYNAMIC TASKS; HARDWARE DEVICES; MPI APPLICATIONS; PARALLEL APPLICATION; PERFORMANCE DATA; PROACTIVE FAULT; PROCESSOR VIRTUALIZATION; TASK MIGRATION;

EID: 50649108554     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/11945918_47     Document Type: Conference Paper
Times cited : (52)

References (25)
  • 3
    • 33646395818 scopus 로고    scopus 로고
    • Master's thesis, Dep. of Computer Science, University of Illinois, Urbana, IL () Available at
    • Huang, C.: System support for checkpoint and restart of Charm++ and AMPI applications. Master's thesis, Dep. of Computer Science, University of Illinois, Urbana, IL (2004) Available at http://charm.cs.uiuc.edu/papers/ CheckpointThesis.html.
    • (2004) System support for checkpoint and restart of Charm++ and AMPI applications
    • Huang, C.1
  • 5
    • 12444281734 scopus 로고    scopus 로고
    • Chakravorty, S., Kalé, L.V.: A fault tolerant protocol for massively parallel machines. In: FTPDSWorkshop at IPDPS'2004, Santa Fe, NM, IEEE Press (2004)
    • Chakravorty, S., Kalé, L.V.: A fault tolerant protocol for massively parallel machines. In: FTPDSWorkshop at IPDPS'2004, Santa Fe, NM, IEEE Press (2004)
  • 7
    • 77049095467 scopus 로고    scopus 로고
    • Hewlett-Packard, Intel, Microsoft, Phoenix, Toshiba: Advanced configuration and power interface specification. ACPI Specification Document, Revision 3.0 (2004) Available from http://www.acpi.info.
    • Hewlett-Packard, Intel, Microsoft, Phoenix, Toshiba: Advanced configuration and power interface specification. ACPI Specification Document, Revision 3.0 (2004) Available from http://www.acpi.info.
  • 9
    • 77049110419 scopus 로고    scopus 로고
    • Oliner, A.J., Sahoo, R.K., Moreira, J.E., Gupta, M., Sivasubramaniam, A.: Fault-aware job scheduling for BlueGene/L systems. Technical Report RC23077, IBM Research ((2004))
    • Oliner, A.J., Sahoo, R.K., Moreira, J.E., Gupta, M., Sivasubramaniam, A.: Fault-aware job scheduling for BlueGene/L systems. Technical Report RC23077, IBM Research ((2004))
  • 10
    • 0002479236 scopus 로고    scopus 로고
    • Charm++: Parallel programming with message-driven objects
    • Wilson, G.V, Lu, P, eds, MIT Press
    • Kalé, L.V., Krishnan, S.: Charm++: Parallel programming with message-driven objects. In Wilson, G.V., Lu, P., eds.: Parallel Programming using C++. MIT Press (1996) 175-213
    • (1996) Parallel Programming using C , pp. 175-213
    • Kalé, L.V.1    Krishnan, S.2
  • 15
    • 70349122591 scopus 로고    scopus 로고
    • 2 runtime system
    • Proc. 3rdWorkshop on Runtime Systems for Parallel Programming (RTSPP) San Juan, Puerto Rico, Springer-Verlag
    • 2 runtime system. In: Proc. 3rdWorkshop on Runtime Systems for Parallel Programming (RTSPP) San Juan, Puerto Rico. Lecture Notes in Computer Science 1586, Springer-Verlag (1999) 496-510
    • (1999) Lecture Notes in Computer Science , vol.1586 , pp. 496-510
    • Antoniu, G.1    Bouge, L.2    Namyst, R.3
  • 17
    • 33646420251 scopus 로고    scopus 로고
    • Starfish: Fault-tolerant dynamic MPI programs on clusters of workstations
    • Agbaria, A., Friedman, R.: Starfish: Fault-tolerant dynamic MPI programs on clusters of workstations. Cluster Computing 6(3) (2003) 227-236
    • (2003) Cluster Computing , vol.6 , Issue.3 , pp. 227-236
    • Agbaria, A.1    Friedman, R.2
  • 23
    • 79961061539 scopus 로고    scopus 로고
    • MPICHV2: A fault tolerant MPI for volatile nodes based on the pessimistic sender based message logging programming via processor virtualization
    • Phoenix, AZ
    • Bouteiller, A., Cappello, F., Hérault, T., Krawezik, G., Lemarinier, P.,Magniette, F.: MPICHV2: A fault tolerant MPI for volatile nodes based on the pessimistic sender based message logging programming via processor virtualization. In: Proceedings of Supercomputing'03, Phoenix, AZ (2003)
    • (2003) Proceedings of Supercomputing'03
    • Bouteiller, A.1    Cappello, F.2    Hérault, T.3    Krawezik, G.4    Lemarinier, P.5    Magniette, F.6
  • 24
    • 0026867749 scopus 로고
    • Manetho: Transparent rollback-recovery with low overhead, limited rollback, and fast output commit
    • Elnozahy, E.N., Zwaenepoel, W.: Manetho: Transparent rollback-recovery with low overhead, limited rollback, and fast output commit. IEEE Transactions on Computers 41(5) (1992) 526-531
    • (1992) IEEE Transactions on Computers , vol.41 , Issue.5 , pp. 526-531
    • Elnozahy, E.N.1    Zwaenepoel, W.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.