-
1
-
-
77954003885
-
MPI/FT (TM): Architecture and taxonomies for fault-tolerant, message-passing middleware for performance-portable parallel computing
-
Melbourne, Australia
-
Batchu, R., Neelamegam, J., Dui, Z., Beddhua, M., Skjellum, A., Dandass, Y., and Apte, M. 2001. MPI/FT (TM): Architecture and taxonomies for fault-tolerant, message-passing middleware for performance-portable parallel computing. In Proceedings of the 1st IEEE International Symposium of Cluster Computing and the Grid, Melbourne, Australia.
-
(2001)
Proceedings of the 1st IEEE International Symposium of Cluster Computing and the Grid
-
-
Batchu, R.1
Neelamegam, J.2
Dui, Z.3
Beddhua, M.4
Skjellum, A.5
Dandass, Y.6
Apte, M.7
-
2
-
-
84884662651
-
MPICH-V: Toward a scalable fault tolerant MPI for volatile nodes
-
IEEE
-
Bosilca, G., Bouteiller, A., Cappello, F., Djilali, S., Fedak, G., Germain, C., Herault, T., Lemarinier, P., Lodygensky, O., Magniette, F., Neri, V., and Selikhov, A. 2002. MPICH-V: Toward a scalable fault tolerant MPI for volatile nodes. In Proceedings of SC 2002, IEEE.
-
(2002)
Proceedings of SC 2002
-
-
Bosilca, G.1
Bouteiller, A.2
Cappello, F.3
Djilali, S.4
Fedak, G.5
Germain, C.6
Herault, T.7
Lemarinier, P.8
Lodygensky, O.9
Magniette, F.10
Neri, V.11
Selikhov, A.12
-
3
-
-
0035480353
-
Components and interfaces of a process management system for parallel programs
-
Butler, R., Gropp, W., and Lusk, E. 2001. Components and interfaces of a process management system for parallel programs. Parallel Computing, 27: 1417-1429.
-
(2001)
Parallel Computing
, vol.27
, pp. 1417-1429
-
-
Butler, R.1
Gropp, W.2
Lusk, E.3
-
4
-
-
84940567900
-
Fault-tolerant MPI: Supporting dynamic applications in a dynamic world
-
J. Dongarra, P. Kacsuk, and N. Podhorszki, editors, 7th European PVM/MPI Users' Group Meeting No 1908 in Springer Lecture Notes in Computer Science
-
Fagg, G. and Dongarra, J. 2000. Fault-tolerant MPI: Supporting dynamic applications in a dynamic world. In J. Dongarra, P. Kacsuk, and N. Podhorszki, editors, Recent Advances in Parallel Virtual Machine and Message Passing Interface, 7th European PVM/MPI Users' Group Meeting No 1908 in Springer Lecture Notes in Computer Science, pp. 346-353.
-
(2000)
Recent Advances in Parallel Virtual Machine and Message Passing Interface
, pp. 346-353
-
-
Fagg, G.1
Dongarra, J.2
-
6
-
-
0035480335
-
HARNESS and fault tolerant MPI
-
Fagg, G.E., Bukovsky, A., and Dongarra, J.J. 2001. HARNESS and fault tolerant MPI. Parallel Computing, 27(11): 1479-1495.
-
(2001)
Parallel Computing
, vol.27
, Issue.11
, pp. 1479-1495
-
-
Fagg, G.E.1
Bukovsky, A.2
Dongarra, J.J.3
-
7
-
-
12444258147
-
Development of naturally fault tolerant algorithms for computing on 100,000 processors
-
submitted
-
Geist, A. and Engelmann, C. 2004. Development of naturally fault tolerant algorithms for computing on 100,000 processors. Journal of Parallel and Distributed Computing, submitted.
-
(2004)
Journal of Parallel and Distributed Computing
-
-
Geist, A.1
Engelmann, C.2
-
9
-
-
0029507454
-
Dynamic process management in an MPI setting
-
October 25-28, San Antonio, TX, IEEE Computer Society Press
-
Gropp, W. and Lusk, E. 1995. Dynamic process management in an MPI setting. In Proceedings of the 7th IEEE Symposium on Parallel and Distributed Processing, October 25-28, San Antonio, TX, IEEE Computer Society Press, pp. 530-534.
-
(1995)
Proceedings of the 7th IEEE Symposium on Parallel and Distributed Processing
, pp. 530-534
-
-
Gropp, W.1
Lusk, E.2
-
10
-
-
0003417929
-
-
2nd edition. MIT Press, Cambridge, MA
-
Gropp, W., Lusk, E., and Skjellum, A. 1999. Using MPI: Portable Parallel Programming with the Message Passing Interface, 2nd edition. MIT Press, Cambridge, MA.
-
(1999)
Using MPI: Portable Parallel Programming With the Message Passing Interface
-
-
Gropp, W.1
Lusk, E.2
Skjellum, A.3
-
11
-
-
0026174913
-
An efficient checkpointing method for multicomputers with wormhole routing
-
Li, K., Naughton, J.F., and Plank, J.S. 1992. An efficient checkpointing method for multicomputers with wormhole routing. International Journal of Parallel Processing, 20(3): 150-180.
-
(1992)
International Journal of Parallel Processing
, vol.20
, Issue.3
, pp. 150-180
-
-
Li, K.1
Naughton, J.F.2
Plank, J.S.3
-
12
-
-
0028485392
-
Low-latency, concurrent checkpointing for parallel programs
-
Li, K., Naughton, J.F., and Plank, J.S. 1994. Low-latency, concurrent checkpointing for parallel programs. IEEE Transactions on Parallel and Distributed Systems, 5(8):874-879.
-
(1994)
IEEE Transactions on Parallel and Distributed Systems
, vol.5
, Issue.8
, pp. 874-879
-
-
Li, K.1
Naughton, J.F.2
Plank, J.S.3
-
13
-
-
0034439137
-
MPI-FT: Portable fault tolerance scheme for MPI
-
Louca, S., Neophytou, N., Lachanas, A., and Evrepidou, P. 2000. MPI-FT: Portable fault tolerance scheme for MPI. Parallel Processing Letters, 10(4): 371-382.
-
(2000)
Parallel Processing Letters
, vol.10
, Issue.4
, pp. 371-382
-
-
Louca, S.1
Neophytou, N.2
Lachanas, A.3
Evrepidou, P.4
-
14
-
-
0001439335
-
MPI: A message-passing interface standard
-
Message Passing Interface Forum
-
Message Passing Interface Forum. 1994. MPI: A Message-Passing Interface standard. International Journal of Supercomputer Applications, 8(3/4):165-414.
-
(1994)
International Journal of Supercomputer Applications
, vol.8
, Issue.3-4
, pp. 165-414
-
-
-
15
-
-
0003413675
-
The MPI message-passing interface standard
-
Message Passing Interface Forum. 1995. The MPI message-passing interface standard. http://www.mpi-forum.org.
-
(1995)
Message Passing Interface Forum
-
-
-
16
-
-
0032597696
-
Egida: An extensible toolkit for low-overhead fault-tolerance
-
Rao, S., Alvisi, L., and Vin, H.M. 1999. Egida: An extensible toolkit for low-overhead fault-tolerance. In Symposium on Fault-Tolerant Computing, pp. 48-55.
-
(1999)
Symposium on Fault-Tolerant Computing
, pp. 48-55
-
-
Rao, S.1
Alvisi, L.2
Vin, H.M.3
-
17
-
-
0003710740
-
-
2nd edition. MIT Press, Cambridge, MA
-
Snir, M., Otto, S.W., Huss-Lederman, S., Walker, D.W., and Dongarra, J. 1998. MPI - The Complete Reference: Volume 1, The MPI Core, 2nd edition. MIT Press, Cambridge, MA.
-
(1998)
MPI - The Complete Reference: The MPI Core
, vol.1
-
-
Snir, M.1
Otto, S.W.2
Huss-Lederman, S.3
Walker, D.W.4
Dongarra, J.5
-
18
-
-
0029713612
-
CoCheck: Checkpointing and process migration for MPI
-
The 10th International Parallel Processing Symposium, April 15-19, Honolulu, HI, IEEE Computer Society Press
-
Stellner, G. 1996. CoCheck: checkpointing and process migration for MPI. In Proceedings of IPPS '96. The 10th International Parallel Processing Symposium, April 15-19, Honolulu, HI, IEEE Computer Society Press, pp. 526-531.
-
(1996)
Proceedings of IPPS '96
, pp. 526-531
-
-
Stellner, G.1