메뉴 건너뛰기




Volumn 13, Issue 2, 2003, Pages 291-312

SRS: A framework for developing malleable and migratable parallel applications for distributed systems

Author keywords

Checkpointing; Distributed; Malleable; Migrati on; MPI; Parallel; Reconfiguration

Indexed keywords

COMPUTER SOFTWARE; DIGITAL LIBRARIES; RESOURCE ALLOCATION; SCHEDULING;

EID: 0141682129     PISSN: 01296264     EISSN: None     Source Type: Journal    
DOI: 10.1142/S0129626403001288     Document Type: Article
Times cited : (72)

References (39)
  • 1
    • 0141710897 scopus 로고    scopus 로고
    • LAM-MPI
    • LAM-MPI. http://www.lam-mpi.org.
  • 7
    • 0003661864 scopus 로고    scopus 로고
    • Application level fault tolerance in heterogeneous networks of workstations
    • Technical Report CMU-CS-96-157, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA, August
    • A. Beguelin, E. Seligman, and P. Stephan. Application Level Fault Tolerance in Heterogeneous Networks of Workstations. Technical Report CMU-CS-96-157, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA, August 1996.
    • (1996)
    • Beguelin, A.1    Seligman, E.2    Stephan, P.3
  • 11
    • 0012941855 scopus 로고
    • MPVM: A migration transparent version of PVM
    • Technical Report CSE-95-002
    • J. Casas, D. Clark, R. Konuru, S. Otto, R. Prouty, and J. Walpole. MPVM: A Migration Transparent Version of PVM. Technical Report CSE-95-002, 1, 1995.
    • (1995) , vol.1
    • Casas, J.1    Clark, D.2    Konuru, R.3    Otto, S.4    Prouty, R.5    Walpole, J.6
  • 16
    • 0004096191 scopus 로고    scopus 로고
    • A survey of rollback-recovery protocols in message passing systems
    • Technical Report CMU-CS-96-181, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA, October
    • M. Elnozahy, L. Alvisi, Y.M. Wang, and D.B. Johnson. A Survey of Rollback-Recovery Protocols in Message Passing Systems. Technical Report CMU-CS-96-181, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA, October 1996.
    • (1996)
    • Elnozahy, M.1    Alvisi, L.2    Wang, Y.M.3    Johnson, D.B.4
  • 17
    • 0010976041 scopus 로고    scopus 로고
    • Process introspection: A heterogeneous checkpoint/restart mechanism based on automatic code modification
    • Technical Report Technical Report CS-97-05, Department of Computer Science, University of Virginia, March
    • A.J. Ferrari, S.J. Chapin, and A.S. Grimshaw. Process Introspection: A Heterogeneous Checkpoint/Restart Mechanism Based on Automatic Code Modification. Technical Report Technical Report CS-97-05, Department of Computer Science, University of Virginia, March 1997.
    • (1997)
    • Ferrari, A.J.1    Chapin, S.J.2    Grimshaw, A.S.3
  • 18
    • 0003982659 scopus 로고    scopus 로고
    • I. Foster and C. Kesselman eds.; Morgan Kaufmann, ISBN 1-55860-475-8
    • I. Foster and C. Kesselman eds. The Grid: Blueprint for a New Computing Infrastructure. Morgan Kaufmann, ISBN 1-55860-475-8, 1999.
    • (1999) The Grid: Blueprint for a New Computing Infrastructure
  • 25
    • 0023090161 scopus 로고
    • Checkpointing and rollback recovery for distributed systems
    • R. Koo and S. Toueg. Checkpointing and Rollback Recovery for Distributed Systems. IEEE Transactions on Software Engineering, 13(1):23-31, 1987.
    • (1987) IEEE Transactions on Software Engineering , vol.13 , Issue.1 , pp. 23-31
    • Koo, R.1    Toueg, S.2
  • 27
    • 84900340299 scopus 로고    scopus 로고
    • A checkpointing strategy for scalable recovery on distributed parallel systems
    • San Jose, November
    • V. K. Naik, S. P. Midkiff, and J. E. Moreira. A checkpointing strategy for scalable recovery on distributed parallel systems. In SuperComputing (SC) '97, San Jose, November 1997.
    • (1997) SuperComputing (SC) '97
    • Naik, V.K.1    Midkiff, S.P.2    Moreira, J.E.3
  • 29
    • 0003820750 scopus 로고    scopus 로고
    • An overview of checkpointing in uniprocessor and distributed systems, focusing on implementation and performance
    • Technical Report UT-CS-97-372
    • James S. Plank. An Overview of Checkpointing in Uniprocessor and Distributed Systems, Focusing on Implementation and Performance. Technical Report UT-CS-97-372, 1997.
    • (1997)
    • Plank, J.S.1
  • 30
    • 0141599174 scopus 로고
    • Libckpt: Transparent checkpointing under unix
    • Technical Report UT-CS-94-242
    • James S. Plank, Micah Beck, Gerry Kingsley, and Kai Li. Libckpt: Transparent Checkpointing under Unix. Technical Report UT-CS-94-242, 1994.
    • (1994)
    • Plank, J.S.1    Beck, M.2    Kingsley, G.3    Li, K.4
  • 31
    • 0141822128 scopus 로고    scopus 로고
    • An asynchronous checkpoint and rollback facility for distributed computations
    • P. Pruitt. An Asynchronous Checkpoint and Rollback Facility for Distributed Computations, 1998.
    • (1998)
    • Pruitt, P.1
  • 35
    • 4243899605 scopus 로고    scopus 로고
    • Portable checkpointing and recovery in heterogeneous environments
    • Technical Report Technical Report 96-6-1, Department of Electrical and Computer Engineering, University of Iowa, June
    • V. Strumpen and B. Ramkumar. Portable Checkpointing and Recovery in Heterogeneous Environments. Technical Report Technical Report 96-6-1, Department of Electrical and Computer Engineering, University of Iowa, June 1996.
    • (1996)
    • Strumpen, V.1    Ramkumar, B.2
  • 37
    • 0029251277 scopus 로고
    • The condor distributed processing system
    • February
    • T. Tannenbaum and M. Litzkow. The condor distributed processing system. Dr. Dobb's Journal, pages 40-48, February 1995.
    • (1995) Dr. Dobb's Journal , pp. 40-48
    • Tannenbaum, T.1    Litzkow, M.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.