메뉴 건너뛰기




Volumn 2, Issue , 2006, Pages 3353-3358

CCK: An improved coordinated checkpoint/rollback protocol for dataflow applications in KAAPI

Author keywords

Checkpoint recovery; Dataflow graph; Parallel application

Indexed keywords

DATA FLOW GRAPHS; FAULT TOLERANCE;

EID: 36849023369     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICTTA.2006.1684955     Document Type: Conference Paper
Times cited : (3)

References (19)
  • 1
    • 84866225421 scopus 로고    scopus 로고
    • A communication-induced checkpointing protocol that ensures rollback-dependency trackability
    • (FTCS '97), IEEE Computer Society
    • R. Baldoni. A communication-induced checkpointing protocol that ensures rollback-dependency trackability. In Proceedings of the 27th International Symposium on Fault-Tolerant Computing (FTCS '97), page 68. IEEE Computer Society, 1997.
    • (1997) Proceedings of the 27th International Symposium on Fault-Tolerant Computing , pp. 68
    • Baldoni, R.1
  • 4
    • 0042078549 scopus 로고    scopus 로고
    • A survey of rollback-recovery protocols in message-passing systems
    • E. N. Mootaz Elnozahy, L. Alvisi, Y.-M. Wang, and Johnson D. B. A survey of rollback-recovery protocols in message-passing systems. ACM Comput. Surv., 34(3):375-08, 2002.
    • (2002) ACM Comput. Surv , vol.34 , Issue.3 , pp. 375-408
    • Johnson, D.B.1    Elnozahy, E.N.M.2    Alvisi, L.3    Wang, Y.-M.4
  • 5
    • 0343950907 scopus 로고    scopus 로고
    • Athapascan-1: On-line building data flow graph in a parallel language
    • editor, Pact'98, Paris, France, October
    • G. Cavalheiro, M. Doreille, F. Galilee, J.-L. Roch. Athapascan-1: On-line building data flow graph in a parallel language. In IEEE, editor, Pact'98, pages 88-95, Paris, France, October 1998.
    • (1998) IEEE , pp. 88-95
    • Cavalheiro, G.1    Doreille, M.2    Galilee, F.3    Roch, J.-L.4
  • 6
    • 20444463494 scopus 로고    scopus 로고
    • Ftc-charm++: An in-memory checkpoint-based fault tolerant runtime for charm++ and mpi
    • San Dieago, CA, September
    • L. V. Kale, G. Zheng, L. Shi. Ftc-charm++: An in-memory checkpoint-based fault tolerant runtime for charm++ and mpi. In 2004 IEEE International Conference on Cluster Computing, San Dieago, CA, September 2004.
    • (2004) 2004 IEEE International Conference on Cluster Computing
    • Kale, L.V.1    Zheng, G.2    Shi, L.3
  • 7
    • 3042751916 scopus 로고    scopus 로고
    • Athapascan: Api for asynchronous parallel programming
    • Projet APACHE, INRIA, February
    • Revire J., L. Roch, T. Gautier. Athapascan: Api for asynchronous parallel programming. Technical Report RT-0276, www-id.imag.fr/software/athl, Projet APACHE, INRIA, February 2003.
    • (2003) Technical Report RT-0276
    • Revire, J.1    Roch, L.2    Gautier, T.3
  • 9
    • 27144432456 scopus 로고    scopus 로고
    • A checkpoint/recovery model for heterogeneous dataflow computations using work-stealing
    • Lisboa, Portugal, August
    • S. Jafar, T. Gautier, A. Krings, and J-L. Roch. A checkpoint/recovery model for heterogeneous dataflow computations using work-stealing. In Proceedings of (LNCS) Euro Par '05, Lisboa, Portugal, August 2005.
    • (2005) Proceedings of (LNCS) Euro Par '05
    • Jafar, S.1    Gautier, T.2    Krings, A.3    Roch, J.-L.4
  • 11
    • 0022020346 scopus 로고
    • Distributed snapshots: Determining global states of distributed systems
    • L. Lamport K. M. Chandy. Distributed snapshots: determining global states of distributed systems. ACM Trans. Comput. Syst., 3(1):63-75, 1985.
    • (1985) ACM Trans. Comput. Syst , vol.3 , Issue.1 , pp. 63-75
    • Chandy, L.L.K.M.1
  • 15
    • 0040769741 scopus 로고    scopus 로고
    • Experimental analysis of the dual recursive bipartitioning algorithm for static mapping
    • F. Pellegrini and J. Roman. Experimental analysis of the dual recursive bipartitioning algorithm for static mapping. Technical Report 1038-96, 1996.
    • (1996) Technical Report 1038-96
    • Pellegrini, F.1    Roman, J.2
  • 17
    • 35248844154 scopus 로고    scopus 로고
    • Efficient and easy parallel implementation of large numerical simulation
    • Springer, editor, Venice, Italy
    • R. Revire, F. Zara, and T. Gautier. Efficient and easy parallel implementation of large numerical simulation. In Springer, editor, Proceedings of ParSim03 of Eu- roPVM/MP103, pages 663-666, Venice, Italy, 2003.
    • (2003) Proceedings of ParSim03 of Eu- RoPVM/MP103 , pp. 663-666
    • Revire, R.1    Zara, F.2    Gautier, T.3
  • 18
    • 0012243052 scopus 로고    scopus 로고
    • Compiler technology for portable checkpoints
    • MIT Laboratory for Computer Science, Cambridge
    • V. Strumpen. Compiler technology for portable checkpoints. Technical Report MA-02139, MIT Laboratory for Computer Science, Cambridge, 1998.
    • (1998) Technical Report MA-02139
    • Strumpen, V.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.