메뉴 건너뛰기




Volumn , Issue , 2009, Pages 99-108

Fast checkpointing by write aggregation with dynamic buffer and interleaving on multicore architecture

Author keywords

[No Author keywords available]

Indexed keywords

APPLICATION EXECUTION; CHECK POINTING; CHECKPOINT/RESTART; DE FACTO STANDARD; DYNAMIC BUFFER POOL; JOB SIZE; LARGE CLUSTERS; MEAN TIME BETWEEN FAILURES; MULTICORE ARCHITECTURES; PARALLEL APPLICATION; PROCESSOR CORES;

EID: 77952145003     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/HIPC.2009.5433218     Document Type: Conference Paper
Times cited : (18)

References (25)
  • 1
    • 77952225154 scopus 로고    scopus 로고
    • MPI over InfiniBand Project
    • MPI over InfiniBand Project. In http://nowlab.cse.ohiostate.edu/projects/ mpi-iba/.
  • 5
    • 0032592492 scopus 로고    scopus 로고
    • Harness: A next generation distributed virtual machine
    • Micah Beck, Jack J. Dongarra, and etc. Graham E. Fagg. Harness: a next generation distributed virtual machine. Future Gener. Comput. Syst., 15(5-6):571-582, 1999.
    • (1999) Future Gener. Comput. Syst. , vol.15 , Issue.5-6 , pp. 571-582
    • Beck, M.1    Dongarra, J.J.2    Fagg, G.E.3
  • 15
    • 74049121711 scopus 로고    scopus 로고
    • Berkeley Lab Checkpoint/Restart (BLCR) for Linux Clusters
    • 6
    • Paul H. Hargrove and Jason C. Duell. Berkeley Lab Checkpoint/Restart (BLCR) for Linux Clusters. In SciDAC, 6 2006.
    • (2006) SciDAC
    • Hargrove, P.H.1    Duell, J.C.2
  • 19
    • 20444444457 scopus 로고    scopus 로고
    • The lam/mpi checkpoint/restart framework: System-initiated checkpointing
    • Oct.
    • S. Sankaran and J. M. Squyres and B. Barrett etc. The lam/mpi checkpoint/restart framework: System-initiated checkpointing. LACSI, Oct. 2003.
    • (2003) LACSI
    • Sankaran, S.1    Squyres, J.M.2    Barrett, B.3
  • 21
    • 34548768671 scopus 로고    scopus 로고
    • A job pause service under lam/mpi+blcr for transparent fault tolerance
    • Chao Wang, Frank Mueller, Christian Engelmann, and Stephen L. Scott. A job pause service under lam/mpi+blcr for transparent fault tolerance. In IPDPS, pages 1-10, 2007.
    • (2007) IPDPS , pp. 1-10
    • Wang, C.1    Mueller, F.2    Engelmann, C.3    Scott, S.L.4
  • 23
    • 85014969248 scopus 로고    scopus 로고
    • Architectural requirements and scalability of the nas parallel benchmarks
    • Frederick C. Wong and Richard P. Martin etc. Architectural requirements and scalability of the nas parallel benchmarks. In Supercomputing '99, page 41, 1999.
    • (1999) Supercomputing '99 , pp. 41
    • Wong, F.C.1    Martin, R.P.2
  • 24
    • 77951447133 scopus 로고    scopus 로고
    • Accelerating checkpoint operation by node-level write aggregation on multicore systems
    • To appear in September
    • Xiangyong Ouyang, Karthik Gopalakrishnan and Dhabaleswar K. Panda. Accelerating checkpoint operation by node-level write aggregation on multicore systems. To appear in ICPP 2009, September 2009.
    • (2009) ICPP 2009
    • Ouyang, X.1    Gopalakrishnan, K.2    Panda, D.K.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.