-
1
-
-
84870548923
-
An overview of the bluegene/l supercomputer
-
N.R. Adiga et al. "An Overview of the BlueGene/L Supercomputer," Proc. Supercomputing Conf. (SC '02), pp. 1-22, 2002.
-
(2002)
Proc. Supercomputing Conf. (SC '02)
, pp. 1-22
-
-
Adiga, N.R.1
-
2
-
-
0003203438
-
Templates for the solution of linear systems: Building blocks for iterative methods
-
R. Barrett, M. Berry, T.F. Chan, J. Demmel, J. Donato, J. Dongarra, V. Eijkhout, R. Pozo, C. Romine, and H.V. Der Vorst, Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods, second ed. SIAM, 1994.
-
(1994)
Second Ed. SIAM
-
-
Barrett, R.1
Berry, M.2
Chan, T.F.3
Demmel, J.4
Donato, J.5
Dongarra, J.6
Eijkhout, V.7
Pozo, R.8
Romine, C.9
Vorst Der, H.V.10
-
5
-
-
33746136466
-
Condition numbers of gaussian random matrices
-
Z. Chen and J. Dongarra, "Condition Numbers of Gaussian Random Matrices," SIAM J. Matrix Analysis and Applications, vol.27, no.3, pp. 603-620, 2005.
-
(2005)
SIAM J. Matrix Analysis and Applications
, vol.27
, Issue.3
, pp. 603-620
-
-
Chen, Z.1
Dongarra, J.2
-
6
-
-
0242658775
-
Self-adapting software for numerical linear algebra and LAPACK for clusters
-
Nov./Dec.
-
Z. Chen, J. Dongarra, P. Luszczek, and K. Roche, "Self-Adapting Software for Numerical Linear Algebra and LAPACK for Clusters," Parallel Computing, vol.29, nos. 11/12, pp. 1723-1743, Nov./Dec. 2003.
-
(2003)
Parallel Computing
, vol.29
, Issue.11-12
, pp. 1723-1743
-
-
Chen, Z.1
Dongarra, J.2
Luszczek, P.3
Roche, K.4
-
7
-
-
31844451082
-
Fault tolerant high performance computing by a coding approach
-
June
-
Z. Chen, G.E. Fagg, E. Gabriel, J. Langou, T. Angskun, G. Bosilca, and J. Dongarra, "Fault Tolerant High Performance Computing by a Coding Approach," Proc. ACM SIGPLAN Symp. Principles and Practice of Parallel Programming (PPoPP '05), June 2005.
-
(2005)
Proc. ACM SIGPLAN Symp. Principles and Practice of Parallel Programming (PPoPP '05)
-
-
Chen, Z.1
Fagg, G.E.2
Gabriel, E.3
Langou, J.4
Angskun, T.5
Bosilca, G.6
Dongarra, J.7
-
9
-
-
75449119828
-
TOP500 supercomputer sites, 24th edition
-
J. Dongarra, H. Meuer, and E. Strohmaier, "TOP500 Supercomputer Sites, 24th Edition," Proc. Supercomputing Conf. (SC '2004), 2004.
-
(2004)
Proc. Supercomputing Conf. (SC '2004)
-
-
Dongarra, J.1
Meuer, H.2
Strohmaier, E.3
-
10
-
-
0000324960
-
Eigenvalues and condition numbers of random matrices
-
A. Edelman, "Eigenvalues and Condition Numbers of Random Matrices," SIAM J. Matrix Analysis and Applications, vol.9, no.4, pp. 543-560, 1988.
-
(1988)
SIAM J. Matrix Analysis and Applications
, vol.9
, Issue.4
, pp. 543-560
-
-
Edelman, A.1
-
12
-
-
33646110228
-
Extending the MPI specification for process fault tolerance on high performance computing systems
-
G.E. Fagg, E. Gabriel, G. Bosilca, T. Angskun, Z. Chen, J. Pjesivac- Grbovic, K. London, and J.J. Dongarra, "Extending the MPI Specification for Process Fault Tolerance on High Performance Computing Systems," Proc. Int'l Supercomputer Conf., 2004.
-
(2004)
Proc. Int'l Supercomputer Conf.
-
-
Fagg, G.E.1
Gabriel, E.2
Bosilca, G.3
Angskun, T.4
Chen, Z.5
Pjesivac- Grbovic, J.6
London, K.7
Dongarra, J.J.8
-
13
-
-
27844508605
-
Process fault-tolerance: Semantics, design and applications for high performance computing
-
G.E. Fagg, E. Gabriel, Z. Chen, T. Angskun, G. Bosilca, J. Pjesivac- Grbovic, and J.J. Dongarra, "Process Fault-Tolerance: Semantics, Design and Applications for High Performance Computing," Int'l J. High Performance Computing Applications, vol.19, no.4, pp. 465- 477, 2005.
-
(2005)
Int'l J. High Performance Computing Applications
, vol.19
, Issue.4
, pp. 465-477
-
-
Fagg, G.E.1
Gabriel, E.2
Chen, Z.3
Angskun, T.4
Bosilca, G.5
Pjesivac- Grbovic, J.6
Dongarra, J.J.7
-
15
-
-
0018454850
-
On the optimum checkpoint interval
-
E. Gelenbe, "On the Optimum Checkpoint Interval," J. ACM, vol.26, no.2, pp. 259-270, 1979.
-
(1979)
J. ACM
, vol.26
, Issue.2
, pp. 259-270
-
-
Gelenbe, E.1
-
16
-
-
0030243005
-
A high-performance, portable implementation of the MPI message passing interface standard
-
Sept.
-
W. Gropp, E. Lusk, N. Doss, and A. Skjellum, "A High- Performance, Portable Implementation of the MPI Message Passing Interface Standard," Parallel Computing, vol.22, no.6, pp. 789-828, Sept. 1996.
-
(1996)
Parallel Computing
, vol.22
, Issue.6
, pp. 789-828
-
-
Gropp, W.1
Lusk, E.2
Doss, N.3
Skjellum, A.4
-
19
-
-
0003413672
-
MPI: A message passing interface standard
-
Message Passing Interface Forum Univ. of Tennessee
-
Message Passing Interface Forum "MPI: A Message Passing Interface Standard," Technical Report ut-cs-94-230, Univ. of Tennessee, 1994.
-
(1994)
Technical Report ut-cs-94-230
-
-
-
20
-
-
0031223146
-
A tutorial on reed-solomon coding for fault-tolerance in RAID-like systems
-
Sept.
-
J.S. Plank, "A Tutorial on Reed-Solomon Coding for Fault- Tolerance in RAID-Like Systems," Software-Practice & Experience, vol.27, no.9, pp. 995-1012, Sept. 1997.
-
(1997)
Software-Practice & Experience
, vol.27
, Issue.9
, pp. 995-1012
-
-
Plank, J.S.1
-
21
-
-
0031570636
-
Fault-tolerant matrix operations for networks of workstations using diskless checkpointing
-
J.S. Plank, Y. Kim, and J. Dongarra, "Fault-Tolerant Matrix Operations for Networks of Workstations Using Diskless Checkpointing," J. Parallel and Distributed Computing, vol.43, no.2, pp. 125-138, 1997.
-
(1997)
J. Parallel and Distributed Computing
, vol.43
, Issue.2
, pp. 125-138
-
-
Plank, J.S.1
Kim, Y.2
Dongarra, J.3
-
23
-
-
0032179680
-
Diskless checkpointing
-
Oct.
-
J.S. Plank, K. Li, and M.A. Puening, "Diskless Checkpointing," IEEE Trans. Parallel and Distributed Systems, vol.9, no.10, pp. 972- 986, Oct. 1998.
-
(1998)
IEEE Trans. Parallel and Distributed Systems
, vol.9
, Issue.10
, pp. 972-986
-
-
Plank, J.S.1
Li, K.2
Puening, M.A.3
-
24
-
-
0035201417
-
Processor allocation and checkpoint interval selection in cluster computing systems
-
Nov.
-
J.S. Plank and M.G. Thomason, "Processor Allocation and Checkpoint Interval Selection in Cluster Computing Systems," J. Parallel and Distributed Computing, vol.61, no.11, pp. 1570-1590, Nov. 2001.
-
(2001)
J. Parallel and Distributed Computing
, vol.61
, Issue.11
, pp. 1570-1590
-
-
Plank, J.S.1
Thomason, M.G.2
-
25
-
-
84864756973
-
An experimental study about diskless checkpointing
-
L.M. Silva and J.G. Silva, "An Experimental Study about Diskless Checkpointing," Proc. EUROMICRO '98 Conf., pp. 395-402, 1998.
-
(1998)
Proc. EUROMICRO '98 Conf.
, pp. 395-402
-
-
Silva, L.M.1
Silva, J.G.2
-
26
-
-
0345442370
-
A case for two-level recovery schemes
-
June
-
N.H. Vaidya, "A Case for Two-Level Recovery Schemes," IEEE Trans. Computers, vol.47, no.6, pp. 656-666, June 1998.
-
(1998)
IEEE Trans. Computers
, vol.47
, Issue.6
, pp. 656-666
-
-
Vaidya, N.H.1
-
27
-
-
84976846528
-
A first order approximation to the optimal checkpoint interval
-
J.W. Young, "A First Order Approximation to the Optimal Checkpoint Interval," Comm. ACM, vol.17, no.9, pp. 530-531, 1974.
-
(1974)
Comm. ACM
, vol.17
, Issue.9
, pp. 530-531
-
-
Young, J.W.1
|