-
1
-
-
0141710897
-
-
LAM-MPI
-
LAM-MPI. http://www.lam-mpi.org.
-
-
-
-
4
-
-
0347324590
-
Dome: Parallel programming in a heterogeneous multi-user environment
-
J.N.C. Arabe, A.B.B. Lowekamp, E. Seligman, M. Starkey, and P. Stephan. Dome: Parallel Programming in a Heterogeneous Multi-User Environment. Supercomputing, 1995.
-
(1995)
Supercomputing
-
-
Arabe, J.N.C.1
Lowekamp, A.B.B.2
Seligman, E.3
Starkey, M.4
Stephan, P.5
-
6
-
-
85084773602
-
An end-to-end approach to globally scalable network storage
-
M. Beck, T. Moore, and J. Plank. An End-to-End Approach to Globally Scalable Network Storage. In ACM SIGCOMM 2002 Conference, Pittsburgh, PA, USA, August 2002.
-
ACM SIGCOMM 2002 Conference, Pittsburgh, PA, USA, August 2002
-
-
Beck, M.1
Moore, T.2
Plank, J.3
-
7
-
-
0003661864
-
Application level fault tolerance in heterogeneous networks of workstations
-
Technical Report CMU-CS-96-157, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA, August
-
A. Beguelin, E. Seligman, and P. Stephan. Application Level Fault Tolerance in Heterogeneous Networks of Workstations. Technical Report CMU-CS-96-157, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA, August 1996.
-
(1996)
-
-
Beguelin, A.1
Seligman, E.2
Stephan, P.3
-
8
-
-
65549122370
-
Object-based adaptive load balancing for MPI programs
-
May
-
M. Bhandarkar, L. V. Kale, E. de Sturler, and J. Hoeflinger. Object-Based Adaptive Load Balancing for MPI Programs. In Proceedings of the International Conference on Computational Science, San Francisco, CA, LINCS 2074, pages 108-117, May 2001.
-
(2001)
Proceedings of the International Conference on Computational Science, San Francisco, CA, LINCS 2074
, pp. 108-117
-
-
Bhandarkar, M.1
Kale, L.V.2
De Sturler, E.3
Hoeflinger, J.4
-
9
-
-
10044246310
-
Reconfiguration and check-pointing in massively parallel systems
-
Springer-Verlag, October
-
B. Bieker, G. Deconinck, E. Maehle, and J. Vounckx. Reconfiguration and Check-pointing in Massively Parallel Systems. In Proceedings of 1st European Dependable Computing Conference (EDCC-1), volume Lecture Notes in Computer Science Vol. 852, pages 353-370. Springer-Verlag, October 1994.
-
(1994)
Proceedings of 1st European Dependable Computing Conference (EDCC-1), Volume Lecture Notes in Computer Science
, vol.852
, pp. 353-370
-
-
Bieker, B.1
Deconinck, G.2
Maehle, E.3
Vounckx, J.4
-
10
-
-
0042838995
-
MIST: PVM with transparent migration and checkpointing
-
J. Casas, D. Clark, P. Galbiati, R. Konuru, S. Otto, R. Prouty, and J. Walpole. MIST: PVM with Transparent Migration and Checkpointing, 1995.
-
(1995)
-
-
Casas, J.1
Clark, D.2
Galbiati, P.3
Konuru, R.4
Otto, S.5
Prouty, R.6
Walpole, J.7
-
11
-
-
0012941855
-
MPVM: A migration transparent version of PVM
-
Technical Report CSE-95-002
-
J. Casas, D. Clark, R. Konuru, S. Otto, R. Prouty, and J. Walpole. MPVM: A Migration Transparent Version of PVM. Technical Report CSE-95-002, 1, 1995.
-
(1995)
, vol.1
-
-
Casas, J.1
Clark, D.2
Konuru, R.3
Otto, S.4
Prouty, R.5
Walpole, J.6
-
14
-
-
0141822124
-
User-triggered checkpointing library for computation-intensive applications
-
Washington, DC, October
-
G. Deconinck, J. Vounckx, R. Lauwereins, and J.A. Peperstraete. User-triggered Checkpointing Library for Computation-intensive Applications. In Proceedings of 7th IASTED-ISMM International Conference On Parallel and Distributed Computing and Systems (IASTED, Anaheim-Calgary-Zurich) (ISCC97), pages 321-324, Washington, DC, October 1995.
-
(1995)
Proceedings of 7th IASTED-ISMM International Conference On Parallel and Distributed Computing and Systems (IASTED, Anaheim-Calgary-Zurich) (ISCC97)
, pp. 321-324
-
-
Deconinck, G.1
Vounckx, J.2
Lauwereins, R.3
Peperstraete, J.A.4
-
15
-
-
85026787682
-
Dynamic PVM: Dynamic load balancing on parallel systems
-
Wolfgang Gentzsch and Uwe Harms, editors; Munich, Germany, April; Springer Verlag
-
L. Dikken, F. van der Linden, J. J. J. Vesseur, and P. M. A. Sloot. Dynamic PVM: Dynamic Load Balancing on Parallel Systems. In Wolfgang Gentzsch and Uwe Harms, editors, Lecture notes in computer science 797, High Performance Computing and Networking, volume Proceedings Volume II, Networking and Tools, pages 273-277, Munich, Germany, April 1994. Springer Verlag.
-
(1994)
Lecture Notes in Computer Science 797, High Performance Computing and Networking, Volume Proceedings Volume II, Networking and Tools
, pp. 273-277
-
-
Dikken, L.1
Van Der Linden, F.2
Vesseur, J.J.J.3
Sloot, P.M.A.4
-
16
-
-
0004096191
-
A survey of rollback-recovery protocols in message passing systems
-
Technical Report CMU-CS-96-181, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA, October
-
M. Elnozahy, L. Alvisi, Y.M. Wang, and D.B. Johnson. A Survey of Rollback-Recovery Protocols in Message Passing Systems. Technical Report CMU-CS-96-181, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA, October 1996.
-
(1996)
-
-
Elnozahy, M.1
Alvisi, L.2
Wang, Y.M.3
Johnson, D.B.4
-
17
-
-
0010976041
-
Process introspection: A heterogeneous checkpoint/restart mechanism based on automatic code modification
-
Technical Report Technical Report CS-97-05, Department of Computer Science, University of Virginia, March
-
A.J. Ferrari, S.J. Chapin, and A.S. Grimshaw. Process Introspection: A Heterogeneous Checkpoint/Restart Mechanism Based on Automatic Code Modification. Technical Report Technical Report CS-97-05, Department of Computer Science, University of Virginia, March 1997.
-
(1997)
-
-
Ferrari, A.J.1
Chapin, S.J.2
Grimshaw, A.S.3
-
18
-
-
0003982659
-
-
I. Foster and C. Kesselman eds.; Morgan Kaufmann, ISBN 1-55860-475-8
-
I. Foster and C. Kesselman eds. The Grid: Blueprint for a New Computing Infrastructure. Morgan Kaufmann, ISBN 1-55860-475-8, 1999.
-
(1999)
The Grid: Blueprint for a New Computing Infrastructure
-
-
-
20
-
-
0031221351
-
CUMULVS: Providing fault-tolerance, visualization and steering of parallel applications
-
August
-
G. A. Geist, J. A. Kohl, and P. M. Papadopoulos. CUMULVS: Providing Fault-Tolerance, Visualization and Steering of Parallel Applications. International Journal of High Performance Computing Applications, 11(3):224-236, August 1997.
-
(1997)
International Journal of High Performance Computing Applications
, vol.11
, Issue.3
, pp. 224-236
-
-
Geist, G.A.1
Kohl, J.A.2
Papadopoulos, P.M.3
-
21
-
-
0141822131
-
DyRecT: Software support for adaptive parallelism on NOWs
-
E. Godard, S. Setia, and E. White. DyRecT: Software Support for Adaptive Parallelism on NOWs. In in IPDPS Workshop on Runtime Systems for Parallel Programming, Cancun, Mexico, May 2000.
-
In IPDPS Workshop on Runtime Systems for Parallel Programming, Cancun, Mexico, May 2000
-
-
Godard, E.1
Setia, S.2
White, E.3
-
25
-
-
0023090161
-
Checkpointing and rollback recovery for distributed systems
-
R. Koo and S. Toueg. Checkpointing and Rollback Recovery for Distributed Systems. IEEE Transactions on Software Engineering, 13(1):23-31, 1987.
-
(1987)
IEEE Transactions on Software Engineering
, vol.13
, Issue.1
, pp. 23-31
-
-
Koo, R.1
Toueg, S.2
-
27
-
-
84900340299
-
A checkpointing strategy for scalable recovery on distributed parallel systems
-
San Jose, November
-
V. K. Naik, S. P. Midkiff, and J. E. Moreira. A checkpointing strategy for scalable recovery on distributed parallel systems. In SuperComputing (SC) '97, San Jose, November 1997.
-
(1997)
SuperComputing (SC) '97
-
-
Naik, V.K.1
Midkiff, S.P.2
Moreira, J.E.3
-
28
-
-
27544493915
-
The internet backplane protocol: Storage in the network
-
J. S. Plank, M. Beck, W. R. Elwasif, T. Moore, M. Swany, and R. Wolski. The Internet Backplane Protocol: Storage in the Network. NetStore99: The Network Storage Symposium, 1999.
-
NetStore99: The Network Storage Symposium, 1999
-
-
Plank, J.S.1
Beck, M.2
Elwasif, W.R.3
Moore, T.4
Swany, M.5
Wolski, R.6
-
29
-
-
0003820750
-
An overview of checkpointing in uniprocessor and distributed systems, focusing on implementation and performance
-
Technical Report UT-CS-97-372
-
James S. Plank. An Overview of Checkpointing in Uniprocessor and Distributed Systems, Focusing on Implementation and Performance. Technical Report UT-CS-97-372, 1997.
-
(1997)
-
-
Plank, J.S.1
-
30
-
-
0141599174
-
Libckpt: Transparent checkpointing under unix
-
Technical Report UT-CS-94-242
-
James S. Plank, Micah Beck, Gerry Kingsley, and Kai Li. Libckpt: Transparent Checkpointing under Unix. Technical Report UT-CS-94-242, 1994.
-
(1994)
-
-
Plank, J.S.1
Beck, M.2
Kingsley, G.3
Li, K.4
-
31
-
-
0141822128
-
An asynchronous checkpoint and rollback facility for distributed computations
-
P. Pruitt. An Asynchronous Checkpoint and Rollback Facility for Distributed Computations, 1998.
-
(1998)
-
-
Pruitt, P.1
-
33
-
-
0029706450
-
Hector: Automated task allocation for MPI
-
Honolulu, Hawaii, April
-
S. H. Russ, B. K. Flachs, J. Robinson, and B. Heckel. Hector: Automated Task Allocation for MPI. In Proceedings of IPPS '96, The 10th International Parallel Processing Symposium, pages 344-348, Honolulu, Hawaii, April 1996.
-
(1996)
Proceedings of IPPS '96, The 10th International Parallel Processing Symposium
, pp. 344-348
-
-
Russ, S.H.1
Flachs, B.K.2
Robinson, J.3
Heckel, B.4
-
35
-
-
4243899605
-
Portable checkpointing and recovery in heterogeneous environments
-
Technical Report Technical Report 96-6-1, Department of Electrical and Computer Engineering, University of Iowa, June
-
V. Strumpen and B. Ramkumar. Portable Checkpointing and Recovery in Heterogeneous Environments. Technical Report Technical Report 96-6-1, Department of Electrical and Computer Engineering, University of Iowa, June 1996.
-
(1996)
-
-
Strumpen, V.1
Ramkumar, B.2
-
37
-
-
0029251277
-
The condor distributed processing system
-
February
-
T. Tannenbaum and M. Litzkow. The condor distributed processing system. Dr. Dobb's Journal, pages 40-48, February 1995.
-
(1995)
Dr. Dobb's Journal
, pp. 40-48
-
-
Tannenbaum, T.1
Litzkow, M.2
|