-
1
-
-
0031570635
-
Application level fault tolerance in heterogeneous networks of workstations
-
Adam Beguelin, Erik Seligman, and Peter Stephan. Application Level Fault Tolerance in Heterogeneous Networks of Workstations. Journal of Parallel and Distributed Computing, 43(2): 147-155, 1997.
-
(1997)
Journal of Parallel and Distributed Computing
, vol.43
, Issue.2
, pp. 147-155
-
-
Beguelin, A.1
Seligman, E.2
Stephan, P.3
-
3
-
-
1142268808
-
Collective operations in an application-level fault tolerant MPI system
-
San Francisco, CA, June 23-26
-
Greg Bronevetsky, Daniel Marques, Keshav Pingali, and Paul Stodghill. Collective Operations in an Application-level Fault Tolerant MPI System. In International Conference on Supercomputing (ICS) 2003, San Francisco, CA, June 23-26 2003.
-
(2003)
International Conference on Supercomputing (ICS) 2003
-
-
Bronevetsky, G.1
Marques, D.2
Pingali, K.3
Stodghill, P.4
-
4
-
-
20744442381
-
Heterogeneous process state capture and recovery through process introspection
-
Adam Ferrari, Steve J. Chapin, and Andrew S. Grimshaw. Heterogeneous Process State Capture and Recovery through Process Introspection. In Cluster Computing, volume 3, 2000.
-
(2000)
Cluster Computing
, vol.3
-
-
Ferrari, A.1
Chapin, S.J.2
Grimshaw, A.S.3
-
5
-
-
0345584934
-
The cactus framework and toolkit: Design and applications
-
Tom Goodale, Gabrielle Allen, Gerd Lanfermann, Joan Mass, Thomas Radke, Edward Seidel, and John Shalf. The Cactus Framework and Toolkit: Design and Applications. In VECPAR, 2002.
-
(2002)
VECPAR
-
-
Goodale, T.1
Allen, G.2
Lanfermann, G.3
Mass, J.4
Radke, T.5
Seidel, E.6
Shalf, J.7
-
8
-
-
0037173433
-
Data collection and restoration for heterogeneous process migration
-
Kasidit Chanchio and Xian-He Sun. Data Collection and Restoration for Heterogeneous Process Migration. In Softw., Pract. Exper.
-
Softw., Pract. Exper.
-
-
Chanchio, K.1
Sun, X.-H.2
-
9
-
-
12344324502
-
Process/thread migration and checkpointing in heterogeneous distributed systems
-
Hai Jiang and Vipin Chaudhary. Process/Thread Migration and Checkpointing in Heterogeneous Distributed Systems. In HICSS 2004.
-
HICSS 2004
-
-
Hai, J.1
Chaudhary, V.2
-
10
-
-
0004096191
-
-
Technical Report CMU-CS-96-181, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA, October
-
M. Elnozahy, L. Alvisi, Y. M. Wang, and D. B. Johnson. A Survey of Rollback-recovery Protocols in Message Passing Systems. Technical Report CMU-CS-96-181, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA, October 1996.
-
(1996)
A Survey of Rollback-recovery Protocols in Message Passing Systems
-
-
Elnozahy, M.1
Alvisi, L.2
Wang, Y.M.3
Johnson, D.B.4
-
13
-
-
0003265656
-
The internet backplane protocol: Storage in the network
-
Seattle, WA
-
James S. Plank, Micah Beck, Wael R. Elwasif, Terry Moore, Martin Swany, and Rich Wolski. The Internet Backplane Protocol: Storage in the Network. In NetStore99: The Network Storage Symposium, Seattle, WA, 1999.
-
(1999)
NetStore99: The Network Storage Symposium
-
-
Plank, J.S.1
Beck, M.2
Elwasif, W.R.3
Moore, T.4
Swany, M.5
Wolski, R.6
-
15
-
-
84934312471
-
Implementation and evaluation of a scalable application-level checkpoint-recovery scheme for MPI programs
-
Pittsburgh, PA, November 6-12
-
Martin Schulz, Greg Bronevetsky, Rohit Fernandes, Daniel Marques, Keshav Pingali, and Paul Stodghill. Implementation and Evaluation of a Scalable Application-level Checkpoint-recovery Scheme for MPI Programs. In Supercomputing 2004, Pittsburgh, PA, November 6-12 2004.
-
(2004)
Supercomputing 2004
-
-
Schulz, M.1
Bronevetsky, G.2
Fernandes, R.3
Marques, D.4
Pingali, K.5
Stodghill, P.6
-
16
-
-
0032071579
-
Heterogeneous process migration: The Tui system
-
Peter Smith and Norman C. Hutchinson. Heterogeneous Process Migration: The Tui System. Softw., Pract. Exper., 28, 1998.
-
(1998)
Softw., Pract. Exper.
, pp. 28
-
-
Smith, P.1
Hutchinson, N.C.2
-
19
-
-
0141682129
-
SRS - A framework for developing malleable and migratable parallel software
-
June
-
S. Vadhiyar and J. Dongarra. SRS - A Framework for Developing Malleable and Migratable Parallel Software. Parallel Processing Letters, 13(2):291-312, June 2003.
-
(2003)
Parallel Processing Letters
, vol.13
, Issue.2
, pp. 291-312
-
-
Vadhiyar, S.1
Dongarra, J.2
|