-
1
-
-
12444268370
-
Architecture of LA-MPI, a network-fault-tolerant MPI
-
R. T. Aulwes, D. J. Daniel, N. N. Desai, R. L. Graham, L. D. Risinger, M. A. Taylor, and T. S. Woodall. Architecture of LA-MPI, a network-fault-tolerant MPI. In IPDPS, 2004.
-
(2004)
IPDPS
-
-
Aulwes, R.T.
Daniel, D.J.
Desai, N.N.
Graham, R.L.
Risinger, L.D.
Taylor, M.A.
Woodall, T.S.
-
2
-
-
84973836157
-
The NAS Parallel Benchmarks
-
Fall
-
D. H. Bailey, E. Barszcz, J. T. Barton, D. S. Browning, R. L. Carter, D. Dagum, R. A. Fatoohi, P. O. Frederickson, T. A. Lasinski, R. S. Schreiber, H. D. Simon, V. Venkatakrishnan, and S. K. Weeratunga. The NAS Parallel Benchmarks. The International Journal of Supercomputer Applications, 5(3):63-73, Fall 1991.
-
(1991)
The International Journal of Supercomputer Applications
, vol.5
, Issue.3
, pp. 63-73
-
-
Bailey, D.H.
Barszcz, E.
Barton, J.T.
Browning, D.S.
Carter, R.L.
Dagum, D.
Fatoohi, R.A.
Frederickson, P.O.
Lasinski, T.A.
Schreiber, R.S.
Simon, H.D.
Venkatakrishnan, V.
Weeratunga, S.K.
-
3
-
-
34548717478
-
MPICH-V: A multiprotocol fault tolerant MPI
-
A. Bouteiller, T. Herault, G. Krawezik, P. Lemarinier, and F. Cappello. MPICH-V: A multiprotocol fault tolerant MPI. International Journal of High Performance Computing and Applications, 2005.
-
(2005)
International Journal of High Performance Computing and Applications
-
-
Bouteiller, A.
Herault, T.
Krawezik, G.
Lemarinier, P.
Cappello, F.
-
6
-
-
0022020346
-
-
K. M. Chandy and L. Lamport. Distributed snapshots: Determining global states of distributed systems. ACM Transactions on Computer Systems, 3(1):63-75, February 1985.
-
-
-
-
7
-
-
0042078549
-
A survey of rollback-recovery protocols in message-passing systems
-
E. Elnozahy, D. Johnson, and Y. Wang. A survey of rollback-recovery protocols in message-passing systems. ACM Computing Surveys, 34(3):375-408, 2002.
-
(2002)
ACM Computing Surveys
, vol.34
, Issue.3
, pp. 375-408
-
-
Elnozahy, E.
Johnson, D.
Wang, Y.
-
8
-
-
84940567900
-
-
G. Fagg and J. Dongarra. FT-MPI: Fault tolerant MPI, supporting dynamic applications in a dynamic world. In Euro PVM/MPI User's Group Meeting 2000, Springer-Verlag, Berlin, Germany, pages 346-353, 2000.
-
-
-
-
9
-
-
35048884271
-
Open MPI: Goals, concept, and design of a next generation MPI implementation
-
Budapest, Hungary, September
-
E. Gabriel, G. E. Fagg, G. Bosilca, T. Angskun, J. J. Dongarra, J. M. Squyres, V. Sahay, P. Kambadur, B. Barrett, A. Lumsdaine, R. H. Castain, D. J. Daniel, R. L. Graham, and T. S. Woodall. Open MPI: Goals, concept, and design of a next generation MPI implementation. In Proceedings, 11th European PVM/MPI Users' Group Meeting, pages 97-104, Budapest, Hungary, September 2004.
-
(2004)
Proceedings, 11th European PVM/MPI Users' Group Meeting
, pp. 97-104
-
-
Gabriel, E.
Fagg, G.E.
Bosilca, G.
Angskun, T.
Dongarra, J.J.
Squyres, J.M.
Sahay, V.
Kambadur, P.
Barrett, B.
Lumsdaine, A.
Castain, R.H.
Daniel, D.J.
Graham, R.L.
Woodall, T.S.
-
10
-
-
0030243005
-
High-performance, portable implementation of the MPI Message Passing Interface Standard
-
W. Gropp, E. Lusk, N. Doss, and A. Skjellum. High-performance, portable implementation of the MPI Message Passing Interface Standard. Parallel Computing, 22(6):789-828, 1996.
-
(1996)
Parallel Computing
, vol.22
, Issue.6
, pp. 789-828
-
-
Gropp, W.
Lusk, E.
Doss, N.
Skjellum, A.
-
11
-
-
33646146876
-
MPI development tools and applications for the grid
-
June
-
R. Keller, B. Krammer, M. S. Mueller, M. M. Resch, and E. Gabriel. MPI development tools and applications for the grid. In Workshop on Grid Applications and Programming Tools, held in conjunction with the GGF8 meetings, Seattle, WA, USA, June 2003.
-
(2003)
Workshop on Grid Applications and Programming Tools, held in conjunction with the GGF8 meetings, Seattle, WA, USA
-
-
Keller, R.
Krammer, B.
Mueller, M.S.
Resch, M.M.
Gabriel, E.
-
12
-
-
0034771266
-
Topology discovery for large ethernet networks
-
B. Lowekamp, D. O'Hallaron, and T. Gross. Topology discovery for large ethernet networks. In Proceedings of the 2001 conference on Applications, technologies, architectures, and protocols for computer communications (SIGCOMM '01), pages 237-248, 2001.
-
(2001)
Proceedings of the 2001 conference on Applications, technologies, architectures, and protocols for computer communications (SIGCOMM '01)
, pp. 237-248
-
-
Lowekamp, B.
O'Hallaron, D.
Gross, T.
-
13
-
-
3342966061
-
The ganglia distributed monitoring system: Design, implementation and experience
-
July
-
M. L. Massie, B. N. Chun, and D. E. Culler. The ganglia distributed monitoring system: Design, implementation and experience. Parallel Computing, 30, July 2004.
-
(2004)
Parallel Computing
, vol.30
-
-
Massie, M.L.
Chun, B.N.
Culler, D.E.
-
14
-
-
33847764225
-
Model-based check-point scheduling for volatile resource environments
-
Technical Report 2004-25, University of California Santa Barbara, Department of Computer Science, Santa Barbara, CA, 93106
-
D. Nurmi, R. Wolski, and J. Brevik. Model-based check-point scheduling for volatile resource environments. Technical Report 2004-25, University of California Santa Barbara, Department of Computer Science, Santa Barbara, CA, 93106, 2004.
-
(2004)
-
-
Nurmi, D.
Wolski, R.
Brevik, J.
-
15
-
-
0032597696
-
-
S. Rao, L. Alvisi, and H. M. Vin. Egida: An extensible toolkit for low overhead fault tolerance. In Proceedings of the 29th Fault-tolerant Computing Symposium (FTCS-29), Madison, Wisconsin, pages 48-55, June 1999.
-
-
-
-
16
-
-
35248827046
-
A Component Architecture for LAM/MPI
-
Proceedings, 10th European PVM/MPI Users' Group Meeting, number 2840 in Lecture Notes in Computer Science, Venice, Italy, September / October, Springer-Verlag
-
J. M. Squyres and A. Lumsdaine. A Component Architecture for LAM/MPI. In Proceedings, 10th European PVM/MPI Users' Group Meeting, number 2840 in Lecture Notes in Computer Science, pages 379-387, Venice, Italy, September / October 2003. Springer-Verlag.
-
(2003)
Lecture Notes in Computer Science
, vol.2840
, pp. 379-387
-
-
Squyres, J.M.
Lumsdaine, A.
-
17
-
-
34548708373
-
-
V. C. Zandy. ckpt: A process checkpoint library, 2002. http://www.cs.wisc.edu/~zandy/ckpt.
-
-
-