Volume 8097 LNCS, 2013, Pages 420-431

Multi-criteria checkpointing strategies: Response-time versus resource utilization

Author keywords

[No Author keywords available]

Indexed keywords

CHECK POINTING; COMPLETION TIME; EXASCALE; EXTENDED MODEL; IDLE TIME; MULTI-CRITERIA; RESOURCE UTILIZATIONS; ROLL-BACK RECOVERIES

EID: 84883201136     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-642-40047-6_43     Document Type: Conference Paper
Times cited: 8

References (19)
  • 2
  • 4
    • 0042078549
    • A survey of rollback-recovery protocols in message-passing systems
    • Elnozahy, E.N.M., Alvisi, L., Wang, Y.M., Johnson, D.B.: A survey of rollback-recovery protocols in message-passing systems. ACM Survey 34, 375-408 (2002)
    • (2002) ACM Survey, vol. 34, pp. 375-408
    • Elnozahy, E.N.M.; Alvisi, L.; Wang, Y.M.; Johnson, D.B.
  • 5
    • 80052306159
    • Correlated set coordination in fault tolerant message logging protocols
    • Bouteiller, A., Herault, T., Bosilca, G., Dongarra, J.J.: Correlated set coordination in fault tolerant message logging protocols. In: Jeannot, E., Namyst, R., Roman, J. (eds.) Euro-Par 2011, Part II. LNCS, vol. 6853, pp. 51-64. Springer, Heidelberg (2011)
    • (2011) LNCS, vol. 6853, pp. 51-64
    • Bouteiller, A.; Herault, T.; Bosilca, G.; Dongarra, J.J.
  • 6
    • 84866852589
    • HydEE: Failure containment without event logging for large scale send-deterministic MPI applications
    • Guermouche, A., Ropars, T., Snir, M., Cappello, F.: HydEE: Failure containment without event logging for large scale send-deterministic MPI applications. In: Proc. 26th IPDPS, pp. 1216-1227. IEEE (May 2012)
    • (2012) Proc. 26th IPDPS, pp. 1216-1227
    • Guermouche, A.; Ropars, T.; Snir, M.; Cappello, F.
  • 8
    • 0021439162
    • Algorithm-based fault tolerance for matrix operations
    • Huang, K., Abraham, J.: Algorithm-based fault tolerance for matrix operations. IEEE Transactions on Computers 100(6), 518-528 (1984)
    • (1984) IEEE Transactions on Computers, vol. 100, issue 6, pp. 518-528
    • Huang, K.; Abraham, J.
  • 13
    • 34548782109
    • A fault tolerance protocol with fast fault recovery
    • Chakravorty, S., Kale, L.: A fault tolerance protocol with fast fault recovery. In: Proc. 21st IPDPS, pp. 1-10. IEEE (March 2007)
    • (2007) Proc. 21st IPDPS, pp. 1-10
    • Chakravorty, S.; Kale, L.
  • 16
    • 84976769480
    • The effectiveness of multiple hardware contexts
    • Thekkath, R., Eggers, S.J.: The effectiveness of multiple hardware contexts. In: Proc. of the 6th ASPLOS, pp. 328-337. ACM (1994)
    • (1994) Proc. of the 6th ASPLOS, pp. 328-337
    • Thekkath, R.; Eggers, S.J.
  • 18
    • 33646940970
    • Hybrid preemptive scheduling of message passing interface applications on grids
    • Bouteiller, A., Bouziane, H.L., Herault, T., Lemarinier, P., Cappello, F.: Hybrid preemptive scheduling of message passing interface applications on grids. IJHPCA 20(1), 77-90 (2006)
    • (2006) IJHPCA, vol. 20, issue 1, pp. 77-90
    • Bouteiller, A.; Bouziane, H.L.; Herault, T.; Lemarinier, P.; Cappello, F.
  • 19
    • 28044460018
    • A higher order estimate of the optimum checkpoint interval for restart dumps
    • Daly, J.T.: A higher order estimate of the optimum checkpoint interval for restart dumps. FGCS 22(3), 303-312 (2004)
    • (2004) FGCS, vol. 22, issue 3, pp. 303-312
    • Daly, J.T.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.