-
2
-
-
0003473816
-
-
SIAM, Philadelphia, PA
-
R. Barrett, M. Berry, T. F. Chan, J. Demmel, J. Donato, J. Dongarra, V. Eijkhout, R. Pozo, C. Romine, and H. V. der Vorst. Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods, 2nd Edition. SIAM, Philadelphia, PA, 1994.
-
(1994)
Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods, 2nd Edition
-
-
Barrett, R.1
Berry, M.2
Chan, T.F.3
Demmel, J.4
Donato, J.5
Dongarra, J.6
Eijkhout, V.7
Pozo, R.8
Romine, C.9
Der Vorst, H.V.10
-
3
-
-
31844452364
-
Recovery patterns for iterative methods in a parallel unstable environment
-
University of Tennessee, Knoxville, Tennessee, USA
-
G. Bosilca, Z. Chen, J. Dongarra, and J. Langou. Recovery patterns for iterative methods in a parallel unstable environment. Technical Report ut-cs-04-538, University of Tennessee, Knoxville, Tennessee, USA, 2004.
-
(2004)
Technical Report
, vol.UT-CS-04-538
-
-
Bosilca, G.1
Chen, Z.2
Dongarra, J.3
Langou, J.4
-
4
-
-
31844450567
-
Condition numbers of gaussian random matrices
-
University of Tennessee, Knoxville, Tennessee, USA
-
Z. Chen and J. Dongarra. Condition numbers of gaussian random matrices. Technical Report ut-cs-04-539, University of Tennessee, Knoxville, Tennessee, USA, 2004.
-
(2004)
Technical Report
, vol.UT-CS-04-539
-
-
Chen, Z.1
Dongarra, J.2
-
5
-
-
0242658775
-
Self-adapting software for numerical linear algebra and LAPACK for clusters
-
November-December
-
Z. Chen, J. Dongarra, P. Luszczek, and K. Roche. Self-adapting software for numerical linear algebra and LAPACK for clusters. Parallel Computing, 29(11-12):1723-1743, November-December 2003.
-
(2003)
Parallel Computing
, vol.29
, Issue.11-12
, pp. 1723-1743
-
-
Chen, Z.1
Dongarra, J.2
Luszczek, P.3
Roche, K.4
-
6
-
-
0029715009
-
Evaluation of checkpoint mechanisms for massively parallel machines
-
T. cker Chiueh and P. Deng. Evaluation of checkpoint mechanisms for massively parallel machines. In FTCS, pages 370-379, 1996.
-
(1996)
FTCS
, pp. 370-379
-
-
Chiueh, T.C.1
Deng, P.2
-
7
-
-
75449119828
-
TOP500 supercomputer sites, 24th edition
-
ACM
-
J. Dongarra, H. Meuer, and E. Strohmaier. TOP500 Supercomputer Sites, 24th edition. In Proceedings of the Supercomputing Conference (SC'2004), Pittsburgh PA, USA. ACM, 2004.
-
(2004)
Proceedings of the Supercomputing Conference (SC'2004), Pittsburgh PA, USA
-
-
Dongarra, J.1
Meuer, H.2
Strohmaier, E.3
-
8
-
-
0000324960
-
Eigenvalues and condition numbers of random matrices
-
A. Edelman. Eigenvalues and condition numbers of random matrices. SIAM J. Matrix Anal. Appl., 9(4):543-560, 1988.
-
(1988)
SIAM J. Matrix Anal. Appl.
, vol.9
, Issue.4
, pp. 543-560
-
-
Edelman, A.1
-
9
-
-
84940567900
-
FT-MPI: Fault tolerant MPI, supporting dynamic applications in a dynamic world
-
G. E. Fagg and J. Dongarra. FT-MPI: Fault tolerant MPI, supporting dynamic applications in a dynamic world. In PVM/MPI 2000, pages 346-353, 2000.
-
(2000)
PVM/MPI 2000
, pp. 346-353
-
-
Fagg, G.E.1
Dongarra, J.2
-
10
-
-
33646110228
-
Extending the MPI specification for process fault tolerance on high performance computing systems
-
G. E. Fagg, E. Gabriel, G. Bosilca, T. Angskun, Z. Chen, J. Pjesivac-Grbovic, K. London, and J. J. Dongarra. Extending the MPI specification for process fault tolerance on high performance computing systems. In Proceedings of the International Supercomputer Conference, Heidelberg, Germany, 2004.
-
(2004)
Proceedings of the International Supercomputer Conference, Heidelberg, Germany
-
-
Fagg, G.E.1
Gabriel, E.2
Bosilca, G.3
Angskun, T.4
Chen, Z.5
Pjesivac-Grbovic, J.6
London, K.7
Dongarra, J.J.8
-
11
-
-
31844456437
-
Process fault-tolerance: Semantics, design and applications for high performance computing
-
Submitted to
-
G. E. Fagg, E. Gabriel, Z. Chen, T. Angskun, G. Bosilca, J. Pjesivac-Grbovic, and J. J. Dongarra. Process fault-tolerance: Semantics, design and applications for high performance computing. Submitted to International Journal of High Performance Computing Applications, 2004.
-
(2004)
International Journal of High Performance Computing Applications
-
-
Fagg, G.E.1
Gabriel, E.2
Chen, Z.3
Angskun, T.4
Bosilca, G.5
Pjesivac-Grbovic, J.6
Dongarra, J.J.7
-
12
-
-
12444258147
-
Development of naturally fault tolerant algortihms for computing on 100,000 processors
-
Submited to
-
A. Geist and C. Engelmann. Development of naturally fault tolerant algortihms for computing on 100,000 processors. Submited to J. Parallel Distrib. Comput., 2002.
-
(2002)
J. Parallel Distrib. Comput.
-
-
Geist, A.1
Engelmann, C.2
-
13
-
-
0018454850
-
On the optimum checkpoint interval
-
E. Gelenbe. On the optimum checkpoint interval. J. ACM, 26(2):259-270, 1979.
-
(1979)
J. ACM
, vol.26
, Issue.2
, pp. 259-270
-
-
Gelenbe, E.1
-
14
-
-
0030243005
-
A high-performance, portable implementation of the MPI message passing interface standard
-
September
-
W. Gropp, E. Lusk, N. Doss, and A. Skjellum. A high-performance, portable implementation of the MPI message passing interface standard. Parallel Computing, 22(6):789-828, September 1996.
-
(1996)
Parallel Computing
, vol.22
, Issue.6
, pp. 789-828
-
-
Gropp, W.1
Lusk, E.2
Doss, N.3
Skjellum, A.4
-
17
-
-
0003413671
-
Message passing interface forum. MPI: A message passing interface standard
-
University of Tennessee, Knoxville, Tennessee, USA
-
Message Passing Interface Forum. MPI: A Message Passing Interface Standard. Technical Report ut-cs-94-230, University of Tennessee, Knoxville, Tennessee, USA, 1994.
-
(1994)
Technical Report
, vol.UT-CS-94-230
-
-
-
18
-
-
0031223146
-
A tutorial on Reed-Solomon coding for fault-tolerance in RAID-like systems
-
September
-
J. S. Plank. A tutorial on Reed-Solomon coding for fault-tolerance in RAID-like systems. Software - Practice & Experience. 27(9):995-1012, September 1997.
-
(1997)
Software - Practice & Experience
, vol.27
, Issue.9
, pp. 995-1012
-
-
Plank, J.S.1
-
19
-
-
0031570636
-
Fault-tolerant matrix operations for networks of workstations using diskless checkpointing
-
J. S. Plank, Y. Kim, and J. Dongarra. Fault-tolerant matrix operations for networks of workstations using diskless checkpointing. J. Parallel Distrib. Comput., 43(2):125-138, 1997.
-
(1997)
J. Parallel Distrib. Comput.
, vol.43
, Issue.2
, pp. 125-138
-
-
Plank, J.S.1
Kim, Y.2
Dongarra, J.3
-
20
-
-
0028060943
-
Faster checkpointing with n+1 parity
-
J. S. Plank and K. Li. Faster checkpointing with n+1 parity. In FTCS, pages 288-297, 1994.
-
(1994)
FTCS
, pp. 288-297
-
-
Plank, J.S.1
Li, K.2
-
21
-
-
0032179680
-
Diskless checkpointing
-
J. S. Plank, K. Li, and M. A. Puening. Diskless checkpointing. IEEE Trans. Parallel Distrib. Syst., 9(10):972-986, 1998.
-
(1998)
IEEE Trans. Parallel Distrib. Syst.
, vol.9
, Issue.10
, pp. 972-986
-
-
Plank, J.S.1
Li, K.2
Puening, M.A.3
-
22
-
-
0035201417
-
Processor allocation and checkpoint interval selection in cluster computing systems
-
November
-
J. S. Plank and M. G. Thomason. Processor allocation and checkpoint interval selection in cluster computing systems. J. Parallel Distrib. Comput., 61(11):1570-1590, November 2001.
-
(2001)
J. Parallel Distrib. Comput.
, vol.61
, Issue.11
, pp. 1570-1590
-
-
Plank, J.S.1
Thomason, M.G.2
-
23
-
-
84864756973
-
An experimental study about diskless checkpointing
-
L. M. Silva and J. G. Silva. An experimental study about diskless checkpointing. In EUROMICRO'98. pages 395-402, 1998.
-
(1998)
EUROMICRO'98
, pp. 395-402
-
-
Silva, L.M.1
Silva, J.G.2
-
24
-
-
0345442370
-
A case for two-level recovery schemes
-
N. H. Vaidya. A case for two-level recovery schemes. IEEE Trans. Computers, 47(6):656-666, 1998.
-
(1998)
IEEE Trans. Computers
, vol.47
, Issue.6
, pp. 656-666
-
-
Vaidya, N.H.1
-
25
-
-
84976846528
-
A first order approximation to the optimal checkpoint interval
-
J. W. Young. A first order approximation to the optimal checkpoint interval. Commun. ACM, 17(9):530-531, 1974.
-
(1974)
Commun. ACM
, vol.17
, Issue.9
, pp. 530-531
-
-
Young, J.W.1
|