Volume 2005, 2005

Performance implications of periodic checkpointing on large-scale cluster systems

Author keywords

[No Author keywords available]

Indexed keywords

BLUEGENE/L; CHECKPOINT; LARGE-SCALE CLUSTER SYSTEMS; TOROIDAL INTERCONNECT ARCHITECTURE

EID: 33746286070     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IPDPS.2005.337     Document Type: Conference Paper
Times cited: 42

References (23)
  • 1
    • 11144287593
    • N. Adiga et al. An overview of the BlueGene/L supercomputer. In Supercomputing (SC2002) Technical Papers, November 2002.
  • 2
    • 8344232253
    • S. Agarwal, R. Garg, M. S. Gupta, and J. E. Moreira. Adaptive incremental checkpointing for massively parallel systems. In ICS 2004, pages 277-286, 2004.
  • 3
    • 0035877334
    • S. Albers and G. Schmidt. Scheduling with unexpected machine breakdowns. Discrete Applied Mathematics, 110(2-3):85-99, 2001.
  • 9
    • 0011625222
    • B. Gorda and R. Wolski. Time sharing massively parallel machines. In Proc. of ICPP'95, Portland, OR, pages 214-217, August 1995.
  • 10
    • 84974701617
    • E. Krevat, J. G. Castanos, and J. E. Moreira. Job scheduling for the BlueGene/L system. In JSSPP, pages 38-54, 2002.
  • 15
    • 0035201417
    • J. S. Plank and M. G. Thomason. Processor allocation and checkpoint interval selection in cluster computing systems. Journal of Parallel and Distributed Computing, 61(11):1570-1590, November 2001.
  • 16
    • 84948470299
    • X. Qin, H. Jiang, and D. R. Swanson. An efficient fault-tolerant scheduling algorithm for real-time tasks with precedence constraints in heterogeneous systems. In Proceedings of the 30th International Conference on Parallel Processing, pages 360-368, August 2002.
  • 20
    • 84976696875
    • A. N. Tantawi and M. Ruschitzka. Performance analysis of checkpointing strategies. ACM Transactions on Computer Systems, volume 110, pages 123-144, May 1984.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.