-
1
-
-
78149270835
-
-
Technical report SAND2009-6753, Sandia National Laboratories
-
Ferreira, K., Riesen, R., Oldfield, R., Stearley, J., Laros, J., Pedretti, K., Brightwell, R., Kordenbrock, T.: Increasing fault resiliency in a message-passing environment. Technical report SAND2009-6753, Sandia National Laboratories (2009)
-
(2009)
Increasing Fault Resiliency in a Message-passing Environment
-
-
Ferreira, K.1
Riesen, R.2
Oldfield, R.3
Stearley, J.4
Laros, J.5
Pedretti, K.6
Brightwell, R.7
Kordenbrock, T.8
-
2
-
-
77956584397
-
See applications run and throughput jump: The case for redundant computing in HPC
-
Riesen, R., Ferreira, K., Stearley, J.: See applications run and throughput jump: The case for redundant computing in HPC. In: 1st International Workshop on Fault-Tolerance for HPC at Extreme Scale, FTXS 2010 (2010)
-
(2010)
1st International Workshop on Fault-Tolerance for HPC at Extreme Scale, FTXS 2010
-
-
Riesen, R.1
Ferreira, K.2
Stearley, J.3
-
3
-
-
78149275568
-
-
Network-Based Computing Laboratory, Ohio State University: OMB
-
Network-Based Computing Laboratory, Ohio State University: OSU MPI benchmarks, OMB (2010), http://mvapich.cse.ohio-state.edu/benchmarks/
-
(2010)
OSU MPI Benchmarks
-
-
-
6
-
-
70350567175
-
Symmetric active/active metadata service for high availability parallel file systems
-
He, X., Ou, L., Engelmann, C., Chen, X., Scott, S.L.: Symmetric active/active metadata service for high availability parallel file systems. Journal of Parallel and Distributed Computing (JPDC) 69(12), 961-973 (2009)
-
(2009)
Journal of Parallel and Distributed Computing (JPDC)
, vol.69
, Issue.12
, pp. 961-973
-
-
He, X.1
Ou, L.2
Engelmann, C.3
Chen, X.4
Scott, S.L.5
-
9
-
-
60449096682
-
MPICH-V2: A fault tolerant MPI for volatile nodes based on pessimistic sender based message logging
-
Bouteiller, A., Cappello, F., Herault, T., Krawezik, G., Lemarinier, P., Magniette, F.: MPICH-V2: a fault tolerant MPI for volatile nodes based on pessimistic sender based message logging. In: Proceedings of the ACM/IEEE International Conference on High Performance Computing and Networking (2003)
-
Proceedings of the ACM/IEEE International Conference on High Performance Computing and Networking (2003)
-
-
Bouteiller, A.1
Cappello, F.2
Herault, T.3
Krawezik, G.4
Lemarinier, P.5
Magniette, F.6
-
10
-
-
34548789748
-
The design and implementation of checkpoint/restart process fault tolerance for Open MPI
-
Hursey, J., Squyres, J., Mattox, T., Lumsdaine, A.: The design and implementation of checkpoint/restart process fault tolerance for Open MPI. In: Proceedings of the IEEE International Parallel and Distributed Processing Symposium (2007)
-
Proceedings of the IEEE International Parallel and Distributed Processing Symposium (2007)
-
-
Hursey, J.1
Squyres, J.2
Mattox, T.3
Lumsdaine, A.4
-
11
-
-
51849121653
-
Providing non-stop service for message-passing based parallel applications with RADIC
-
Luque, E., Margalef, T., Benítez, D. (eds.) Euro-Par 2008. Springer, Heidelberg
-
Santos, G., Duarte, A., Rexachs, D., Luque, E.: Providing non-stop service for message-passing based parallel applications with RADIC. In: Luque, E., Margalef, T., Benítez, D. (eds.) Euro-Par 2008. LNCS, vol. 5168, pp. 58-67. Springer, Heidelberg (2008)
-
(2008)
LNCS
, vol.5168
, pp. 58-67
-
-
Santos, G.1
Duarte, A.2
Rexachs, D.3
Luque, E.4
-
12
-
-
34250362376
-
P2P-MPI: A peer-to-peer framework for robust execution of message passing parallel programs on grids
-
DOI 10.1007/s10723-006-9056-2
-
Genaud, S., Rattanapoka, C.: P2P-MPI: A peer-to-peer framework for robust execution of message passing parallel programs on grids. J. Grid Comput. 5(1), 27-42 (2007) (Pubitemid 46909011)
-
(2007)
Journal of Grid Computing
, vol.5
, Issue.1
, pp. 27-42
-
-
Genaud, S.1
Rattanapoka, C.2
-
13
-
-
69049089649
-
Fault-management in P2P-MPI
-
Genaud, S., Jeannot, E., Rattanapoka, C.: Fault-management in P2P-MPI. Int. J. Parallel Program. 37(5), 433-461 (2009)
-
(2009)
Int. J. Parallel Program
, vol.37
, Issue.5
, pp. 433-461
-
-
Genaud, S.1
Jeannot, E.2
Rattanapoka, C.3
-
14
-
-
34547401052
-
Scaling MPI to short-memory MPPs such as BG/L
-
Farreras, M., Cortes, T., Labarta, J., Almasi, G.: Scaling MPI to short-memory MPPs such as BG/L. In: Proceeding of the International Conference on Supercomputing, pp. 209-218 (2006)
-
(2006)
Proceeding of the International Conference on Supercomputing
, pp. 209-218
-
-
Farreras, M.1
Cortes, T.2
Labarta, J.3
Almasi, G.4
|