-
2
-
-
0029237761
-
Message logging: Pessimistic, optimistic, and causal
-
IEEE CS Press, May-June
-
L. Alvisi and K. Marzullo. Message logging : Pessimistic, optimistic, and causal. In Proceedings of the 15th International Conference on Distributed Computing Systems (ICDCS 1995), pages 229-236. IEEE CS Press, May-June 1995.
-
(1995)
Proceedings of the 15th International Conference on Distributed Computing Systems (ICDCS 1995)
, pp. 229-236
-
-
Alvisi, L.1
Marzullo, K.2
-
3
-
-
0003605996
-
-
Report NAS-95-020, Numerical Aerodynamic Simulation Facility, NASA Ames Research Center
-
David Bailey, Tim Harris, William Saphir, Rob Van Der Wijngaart, Alex Woo, and Maurice Yarrow. The NAS Parallel Benchmarks 2.0. Report NAS-95-020, Numerical Aerodynamic Simulation Facility, NASA Ames Research Center, 1995.
-
(1995)
The NAS Parallel Benchmarks 2.0
-
-
Bailey, D.1
Harris, T.2
Saphir, W.3
Van Der Wijngaart, R.4
Woo, A.5
Yarrow, M.6
-
4
-
-
77954003885
-
Mpi/ft™: Architecture and taxonomies for fault-tolerant, messagepassing middleware for performance-portable parallel computing
-
IEEE/ACM
-
R. Batchu, J. Neelamegam, Z. Cui, M. Beddhua, A. Skjellum, Y. Dandass, and M. Apte. Mpi/ft™ : Architecture and taxonomies for fault-tolerant, messagepassing middleware for performance-portable parallel computing. In In Proceedings of the 1st International Symposium of Cluster Computing and the Grid (CCGRID2001, Melbourne, Australia, May 2001. IEEE/ACM.
-
Proceedings of the 1st International Symposium of Cluster Computing and the Grid CCGRID2001, Melbourne, Australia, May 2001
-
-
Batchu, R.1
Neelamegam, J.2
Cui, Z.3
Beddhua, M.4
Skjellum, A.5
Dandass, Y.6
Apte, M.7
-
5
-
-
84884662651
-
Mpich-v: Toward a scalable fault tolerant mpi for volatile nodes
-
IEEE/ACM
-
George Bosilca, Aurélien Bouteiller, Franck Cappello, Samir Djilali, Gilles Fédak, Cécile Germain, Thomas Hérault, Pierre Lemarinier, Oleg Lodygensky, Frédéric Magniette, Vincent Néri, and Anton Selikhov. Mpich-v: Toward a scalable fault tolerant mpi for volatile nodes. In SC2002: High Performance Networking and Computing (SC2002), Baltimore USA, Novembre 2002. IEEE/ACM.
-
SC2002: High Performance Networking and Computing (SC2002), Baltimore USA, Novembre 2002
-
-
Bosilca, G.1
Bouteiller, A.2
Cappello, F.3
Djilali, S.4
Fédak, G.5
Germain, C.6
Hérault, T.7
Lemarinier, P.8
Lodygensky, O.9
Magniette, F.10
Néri, V.11
Selikhov, A.12
-
6
-
-
0022020346
-
Distributed snapshots: Determining global states of distributed systems
-
ACM, February
-
K. M. Chandy and L.Lamport. Distributed snapshots : Determining global states of distributed systems. In Transactions on Computer Systems, volume 3(1), pages 63-75. ACM, February 1985.
-
(1985)
Transactions on Computer Systems
, vol.3
, Issue.1
, pp. 63-75
-
-
Chandy, K.M.1
Lamport, L.2
-
8
-
-
0026867749
-
Manetho: Transparent rollback-recovery with low overhead, limited rollback and fast output
-
May
-
Elnozahy, Elmootazbellah, and Zwaenepoel. Manetho: Transparent rollback-recovery with low overhead, limited rollback and fast output. IEEE Transactions on Computers, 41(5), May 1992.
-
(1992)
IEEE Transactions on Computers
, vol.41
, Issue.5
-
-
Elnozahy1
Elmootazbellah2
Zwaenepoel3
-
10
-
-
0004096191
-
-
Technical Report CMU-CS-96-181, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA, October
-
M. Elnozahy, L. Alvisi, Y. M. Wang, and D. B. Johnson. A survey of rollback-recovery protocols in message passing systems. Technical Report CMU-CS-96-181, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA, October 1996.
-
(1996)
A Survey of Rollback-recovery Protocols in Message Passing Systems
-
-
Elnozahy, M.1
Alvisi, L.2
Wang, Y.M.3
Johnson, D.B.4
-
12
-
-
84940567900
-
FT-MPI: Fault tolerant MPI, supporting dynamic applications in a dynamic world
-
Balatonfred, Hungary, september Springer-Verlag Heidelberg
-
G. Fagg and J. Dongarra. FT-MPI : Fault tolerant MPI, supporting dynamic applications in a dynamic world. In 7th Euro PVM/MPI User's Group Meeting2000, volume 1908 / 2000, Balatonfred, Hungary, september 2000. Springer-Verlag Heidelberg.
-
(2000)
7th Euro PVM/MPI User's Group Meeting2000
, vol.1908
-
-
Fagg, G.1
Dongarra, J.2
-
13
-
-
0035480335
-
HARNESS and fault tolerant MPI
-
DOI 10.1016/S0167-8191(01)00100-4, PII S0167819101001004
-
G. E. Fagg, A. Bukovsky, and J. J. Dongarra. Harness and fault tolerant mpi. Parallel Computing, 27(11):1479-1495, October 2001. (Pubitemid 32723675)
-
(2001)
Parallel Computing
, vol.27
, Issue.11
, pp. 1479-1495
-
-
Fagg, G.E.1
Bukovsky, A.2
Dongarra, J.J.3
-
15
-
-
0030243005
-
High-performance, portable implementation of the mpi message passing interface standard
-
September
-
William Gropp, Ewing Lusk, Nathan Doss, and Anthony Skjellum. High-performance, portable implementation of the mpi message passing interface standard. Parallel Computing, 22(6):789-828, September 1996.
-
(1996)
Parallel Computing
, vol.22
, Issue.6
, pp. 789-828
-
-
Gropp, W.1
Lusk, E.2
Doss, N.3
Skjellum, A.4
-
16
-
-
16144366475
-
The network architecture of the Connection Machine CM-5
-
Charles E. Leiserson, Zahi S. Abuhamdeh, David C. Douglas, Carl R. Feynman, Mahesh N. Ganmukhi, Jeffrey V. Hill, W. Daniel Hillis, Bradley C. Kuszmaul, Margaret A. St Pierre, David S. Wells, Monica C. Wong-Chan, Shaw-Wen Yang, and Robert Zak. The network architecture of the Connection Machine CM-5. Journal of Parallel and Distributed Computing, 33(2):145-158, 1996.
-
(1996)
Journal of Parallel and Distributed Computing
, vol.33
, Issue.2
, pp. 145-158
-
-
Leiserson, C.E.1
Abuhamdeh, Z.S.2
Douglas, D.C.3
Feynman, C.R.4
Ganmukhi, M.N.5
Hill, J.V.6
Hillis, W.D.7
Kuszmaul, B.C.8
St Pierre, M.A.9
Wells, D.S.10
Wong-Chan, M.C.11
Yang, S.-W.12
Zak, R.13
-
17
-
-
0003912256
-
-
Technical Report Technical Report 1346, University of Wisconsin-Madison
-
M. Litzkow, T. Tannenbaum, J. Basney, and M. Livny. Checkpoint and migration of unix processes in the condor distributed processing system. Technical Report Technical Report 1346, University of Wisconsin-Madison, 1997.
-
(1997)
Checkpoint and Migration of Unix Processes in the Condor Distributed Processing System
-
-
Litzkow, M.1
Tannenbaum, T.2
Basney, J.3
Livny, M.4
-
18
-
-
0034439137
-
Mpi-ft: Portable fault tolerance scheme for mpi
-
World Scientific Publishing Company
-
S. Louca, N. Neophytou, A. Lachanas, and P. Evripidou. Mpi-ft: Portable fault tolerance scheme for mpi. In Parallel Processing Letters(PPL), volume 10(4). World Scientific Publishing Company, 2000.
-
(2000)
Parallel Processing Letters(PPL)
, vol.10
, Issue.4
-
-
Louca, S.1
Neophytou, N.2
Lachanas, A.3
Evripidou, P.4
-
19
-
-
0022045868
-
Impossibility of distributed consensus with one faulty process
-
April
-
M. Paterson M. Fischer, N. Lynch. Impossibility of distributed consensus with one faulty process. Journal of the ACM, 32:374-382, April 1985.
-
(1985)
Journal of the ACM
, vol.32
, pp. 374-382
-
-
Paterson, M.1
Fischer, M.2
Lynch, N.3
-
21
-
-
0003710740
-
-
The MIT Press
-
M. Snir, S. Otto, S. Huss-Lederman, D. Walker, and J. Dongarra. MPI: The Complete Reference. The MIT Press, 1996. http://www.netlib.org/utk/papers/mpi- book/mpi-book.html.
-
(1996)
MPI: The Complete Reference
-
-
Snir, M.1
Otto, S.2
Huss-Lederman, S.3
Walker, D.4
Dongarra, J.5
-
23
-
-
0022112420
-
Optimistic recovery in distributed systems
-
ACM, August
-
E. Strom and S. Yemini. Optimistic recovery in distributed systems. In Transactions on Computer Systems, volume 3(3), pages 204-226. ACM, August 1985.
-
(1985)
Transactions on Computer Systems
, vol.3
, Issue.3
, pp. 204-226
-
-
Strom, E.1
Yemini, S.2
|