-
1
-
-
66749092384
-
-
P. Kogge, K. Bergman, S. Borkar, D. Campbell, W. Carlson, W. Dally, M. Denneau, P. Franzon, W. Harrod, J. Hiller, S. Karp, S. Keckler, D. Klein, R. Lucas, M. Richards, A. Scarpelli, S. Scott, A. Snavely, T. Sterling, R. S. Williams, and K. Yelick, "Exascale computing study: Technology challenges in achieving exascale systems," 2008.
-
(2008)
Exascale Computing Study: Technology Challenges in Achieving Exascale Systems
-
-
Kogge, P.1
Bergman, K.2
Borkar, S.3
Campbell, D.4
Carlson, W.5
Dally, W.6
Denneau, M.7
Franzon, P.8
Harrod, W.9
Hiller, J.10
Karp, S.11
Keckler, S.12
Klein, D.13
Lucas, R.14
Richards, M.15
Scarpelli, A.16
Scott, S.17
Snavely, A.18
Sterling, T.19
Williams, R.S.20
Yelick, K.21
more..
-
2
-
-
78649256719
-
-
J. Dongarra, P. Beckman, T. Moore, P. Aerts, G. Aloisio, D. Barkai, T. Boku, B. Chapman, X. Chi, A. Choudhary, S. Dosanjh, T. Dunning, R. Fiore, A. Geist, R. Harrison, M. Hereld, M. Heroux, K. Hotta, Y. Ishikawa, Z. Jin, F. Johnson, S. Kale, R. Kenway, D. Keyes, B. Kramer, J. Labarta, A. Lichnewsky, B. Lucas, S. Matsuoka, P. Messina, P. Michielse, B. Mohr, M. Mueller, J. Shalf, D. Skinner, M. Snir, T. Sterling, R. Stevens, F. Streitz, B. Sugar, A. V. D. Steen, J. Vetter, P. Williams, R. Wisniewski, and K. Yelick, "The international exascale software project roadmap 1."
-
The International Exascale Software Project Roadmap 1
-
-
Dongarra, J.1
Beckman, P.2
Moore, T.3
Aerts, P.4
Aloisio, G.5
Barkai, D.6
Boku, T.7
Chapman, B.8
Chi, X.9
Choudhary, A.10
Dosanjh, S.11
Dunning, T.12
Fiore, R.13
Geist, A.14
Harrison, R.15
Hereld, M.16
Heroux, M.17
Hotta, K.18
Ishikawa, Y.19
Jin, Z.20
Johnson, F.21
Kale, S.22
Kenway, R.23
Keyes, D.24
Kramer, B.25
Labarta, J.26
Lichnewsky, A.27
Lucas, B.28
Matsuoka, S.29
Messina, P.30
Michielse, P.31
Mohr, B.32
Mueller, M.33
Shalf, J.34
Skinner, D.35
Snir, M.36
Sterling, T.37
Stevens, R.38
Streitz, F.39
Sugar, B.40
Steen, A.V.D.41
Vetter, J.42
Williams, P.43
Wisniewski, R.44
Yelick, K.45
more..
-
3
-
-
78650831692
-
Design, modeling, and evaluation of a scalable multi-level checkpointing system
-
A. Moody, G. Bronevetsky, K. Mohror, and B. R. de Supinski, "Design, modeling, and evaluation of a scalable multi-level checkpointing system," in SC, 2010, pp. 1-11.
-
(2010)
SC
, pp. 1-11
-
-
Moody, A.1
Bronevetsky, G.2
Mohror, K.3
De Supinski, B.R.4
-
4
-
-
20444463494
-
FTC-Charm++: An In-Memory Checkpoint-Based Fault Tolerant Runtime for Charm++ and MPI
-
G. Zheng, L. Shi, and L. V. Kalé, "FTC-Charm++: An In-Memory Checkpoint-Based Fault Tolerant Runtime for Charm++ and MPI," in 2004 IEEE International Conference on Cluster Computing, San Diego, CA, September 2004, pp. 93-103.
-
2004 IEEE International Conference on Cluster Computing, San Diego, CA, September 2004
, pp. 93-103
-
-
Zheng, G.1
Shi, L.2
Kalé, L.V.3
-
5
-
-
83455181657
-
Evaluation of simple causal message logging for large-scale fault tolerant hpc systems
-
E. Meneses, G. Bronevetsky, and L. V. Kale, "Evaluation of simple causal message logging for large-scale fault tolerant hpc systems," in 16th IEEE Workshop on Dependable Parallel, Distributed and Network-Centric Systems in 25th IEEE International Parallel and Distributed Processing Symposium (IPDPS 2011)., May 2011.
-
16th IEEE Workshop on Dependable Parallel, Distributed and Network-Centric Systems in 25th IEEE International Parallel and Distributed Processing Symposium (IPDPS 2011)., May 2011
-
-
Meneses, E.1
Bronevetsky, G.2
Kale, L.V.3
-
7
-
-
74049121711
-
Berkeley lab checkpoint/restart (blcr) for linux clusters
-
P. H. Hargrove and J. C. Duell, "Berkeley lab checkpoint/restart (blcr) for linux clusters," in SciDAC, 2006.
-
(2006)
SciDAC
-
-
Hargrove, P.H.1
Duell, J.C.2
-
8
-
-
84976846528
-
A first order approximation to the optimal checkpoint interval
-
J. W. Young, "A first order approximation to the optimal checkpoint interval," Commun. ACM, vol. 17, no. 9, pp. 530-531, 1974.
-
(1974)
Commun. ACM
, vol.17
, Issue.9
, pp. 530-531
-
-
Young, J.W.1
-
9
-
-
28044460018
-
A higher order estimate of the optimum checkpoint interval for restart dumps
-
J. T. Daly, "A higher order estimate of the optimum checkpoint interval for restart dumps," Future Generation Comp. Syst., vol. 22, no. 3, pp. 303-312, 2006.
-
(2006)
Future Generation Comp. Syst.
, vol.22
, Issue.3
, pp. 303-312
-
-
Daly, J.T.1
-
10
-
-
0029237761
-
Message logging: Pessimistic, optimistic, and causal
-
vol. 0
-
L. Alvisi and K. Marzullo, "Message logging: pessimistic, optimistic, and causal," Distributed Computing Systems, International Conference on, vol. 0, p. 0229, 1995.
-
(1995)
Distributed Computing Systems, International Conference on
, pp. 0229
-
-
Alvisi, L.1
Marzullo, K.2
-
12
-
-
83155188951
-
Evaluating the viability of process replication reliability for exascale systems
-
New York, NY, USA: ACM, [Online]. Available
-
K. Ferreira, J. Stearley, J. H. Laros, III, R. Oldfield, K. Pedretti, R. Brightwell, R. Riesen, P. G. Bridges, and D. Arnold, "Evaluating the viability of process replication reliability for exascale systems," in Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis. New York, NY, USA: ACM, 2011, pp. 44:1-44:12. [Online]. Available: http://doi.acm.org/10.1145/2063384.2063443
-
(2011)
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
-
-
Ferreira, K.1
Stearley, J.2
Laros III, J.H.3
Oldfield, R.4
Pedretti, K.5
Brightwell, R.6
Riesen, R.7
Bridges, P.G.8
Arnold, D.9
-
13
-
-
84871668004
-
Energy considerations in checkpointing and fault tolerance protocols
-
M. Diouri, O. Gluck, L. Lefevre, and F. Cappello, "Energy considerations in checkpointing and fault tolerance protocols," in 2nd Workshop on Fault-Tolerance for HPC at Extreme Scale (FTXS 2012), Boston, USA, Jun. 2012.
-
2nd Workshop on Fault-Tolerance for HPC at Extreme Scale (FTXS 2012), Boston, USA, Jun. 2012
-
-
Diouri, M.1
Gluck, O.2
Lefevre, L.3
Cappello, F.4
-
14
-
-
20444435911
-
Improved message logging versus improved coordinated checkpointing for fault tolerant MPI
-
vol. 0
-
P. Lemarinier, A. Bouteiller, T. Herault, G. Krawezik, and F. Cappello, "Improved message logging versus improved coordinated checkpointing for fault tolerant MPI," Cluster Computing, IEEE International Conference on, vol. 0, pp. 115-124, 2004.
-
(2004)
Cluster Computing, IEEE International Conference on
, pp. 115-124
-
-
Lemarinier, P.1
Bouteiller, A.2
Herault, T.3
Krawezik, G.4
Cappello, F.5
|