-
1
-
-
12344308304
-
Basic concepts and taxonomy of dependable and secure computing
-
Avizienis, A., Laprie, J.-C., Randell, B., Landwehr, C.: Basic concepts and taxonomy of dependable and secure computing. IEEE Transactions on Dependable and Secure Computing 1(1), 11–33 (2004)
-
(2004)
IEEE Transactions on Dependable and Secure Computing
, vol.1
, Issue.1
, pp. 11-33
-
-
Avizienis, A.1
Laprie, J.-C.2
Randell, B.3
Landwehr, C.4
-
2
-
-
83155160949
-
FTI: High performance fault tolerance interface for hybrid systems
-
ACM, New York
-
Bautista-Gomez, L., Tsuboi, S., Komatitsch, D., Cappello, F., Maruyama, N., Matsuoka, S.: FTI: high performance fault tolerance interface for hybrid systems. In: Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2011, pp. 32:1–32:32. ACM, New York (2011)
-
(2011)
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2011
, pp. 1-32
-
-
Bautista-Gomez, L.1
Tsuboi, S.2
Komatitsch, D.3
Cappello, F.4
Maruyama, N.5
Matsuoka, S.6
-
3
-
-
70450206305
-
Toward exascale resilience
-
Cappello, F., Geist, A., Gropp, B., Kale, L., Kramer, B., Snir, M.: Toward exascale resilience. Int. J. High Perform. Comput. Appl. 23(4), 374–388 (2009)
-
(2009)
Int. J. High Perform. Comput. Appl
, vol.23
, Issue.4
, pp. 374-388
-
-
Cappello, F.1
Geist, A.2
Gropp, B.3
Kale, L.4
Kramer, B.5
Snir, M.6
-
4
-
-
0032002385
-
Xception: A technique for the experimental evaluation of dependability in modern computers
-
Carreira, J., Madeira, H., Silva, J.G.: Xception: a technique for the experimental evaluation of dependability in modern computers. IEEE Transactions on Software Engineering 24(2), 36–125 (1998)
-
(1998)
IEEE Transactions on Software Engineering
, vol.24
, Issue.2
, pp. 36-125
-
-
Carreira, J.1
Madeira, H.2
Silva, J.G.3
-
5
-
-
84864068316
-
Fault resilience of the algebraic multi-grid solver
-
ACM, New York
-
Casas, M., de Supinski, B.R., Bronevetsky, G., Schulz, M.: Fault resilience of the algebraic multi-grid solver. In: Proceedings of the 26th ACM International Conference on Supercomputing, ICS 2012, pp. 91–100. ACM, New York (2012)
-
(2012)
Proceedings of the 26th ACM International Conference on Supercomputing, ICS 2012
, pp. 91-100
-
-
Casas, M.1
De Supinski, B.R.2
Bronevetsky, G.3
Schulz, M.4
-
7
-
-
84877705582
-
Detection and correction of silent data corruption for large-scale high-performance computing
-
IEEE Computer Society Press, Los Alamitos
-
Fiala, D., Mueller, F., Engelmann, C., Riesen, R., Ferreira, K., Brightwell, R.: Detection and correction of silent data corruption for large-scale high-performance computing. In: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, SC 2012, pp. 1–78. IEEE Computer Society Press, Los Alamitos (2012)
-
(2012)
Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, SC 2012
, pp. 1-78
-
-
Fiala, D.1
Mueller, F.2
Engelmann, C.3
Riesen, R.4
Ferreira, K.5
Brightwell, R.6
-
10
-
-
84894169143
-
Facing the exascale energy wall
-
IEEE Computer Society, Washington, DC (2010)
-
Kogge, P.M., La Fratta, P., Vance, M.: [2010] Facing the exascale energy wall. In: Proceedings of the 2010 International Workshop on Innovative Architecture for Future Generation High-Performance Processors and Systems, IWIA 2010, pp. 51–58. IEEE Computer Society, Washington, DC (2010)
-
(2010)
Proceedings of the 2010 International Workshop on Innovative Architecture for Future Generation High-Performance Processors and Systems, IWIA 2010
, pp. 51-58
-
-
Kogge, P.M.1
La Fratta, P.2
Vance, M.3
-
12
-
-
84877692741
-
Classifying soft error vulnerabilities in extreme-scale scientific applications using a binary instrumentation tool
-
IEEE Computer Society Press, Los Alamitos
-
Li, D., Vetter, J.S., Yu, W.: Classifying soft error vulnerabilities in extreme-scale scientific applications using a binary instrumentation tool. In: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, SC 2012, pp. 57:1–57:11. IEEE Computer Society Press, Los Alamitos (2012)
-
(2012)
Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, SC 2012
, pp. 1-57
-
-
Li, D.1
Vetter, J.S.2
Yu, W.3
-
13
-
-
84934311843
-
Assessing fault sensitivity in MPI applications
-
IEEE Computer Society, Washington, DC
-
Lu, C.-d., Reed, D.A.: Assessing fault sensitivity in MPI applications. In: Proceedings of the 2004 ACM/IEEE Conference on Supercomputing, SC 2004, p. 37. IEEE Computer Society, Washington, DC (2004)
-
(2004)
Proceedings of the 2004 ACM/IEEE Conference on Supercomputing, SC 2004
, pp. 37
-
-
Lu, C.-D.1
Reed, D.A.2
-
14
-
-
84877686804
-
Alleviating scalability issues of checkpointing protocols
-
IEEE Computer Society Press, Los Alamitos
-
Riesen, R., Ferreira, K., Da Silva, D., Lemarinier, P., Arnold, D., Bridges, P.G.: Alleviating scalability issues of checkpointing protocols. In: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, SC 2012, pp. 1–18. IEEE Computer Society Press, Los Alamitos (2012)
-
(2012)
Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, SC 2012
, pp. 1-18
-
-
Riesen, R.1
Ferreira, K.2
Da Silva, D.3
Lemarinier, P.4
Arnold, D.5
Bridges, P.G.6
-
15
-
-
84891618936
-
Design and modeling of non-blocking checkpoint system
-
A*STAR Computational Resource Centre, Singapore
-
Sato, K., Gamblin, T., Moody, A., de Supinski, B.R., Mohror, K., Maruyama, N.: Design and modeling of non-blocking checkpoint system. In: Proceedings of the ATIP/A*CRC Workshop on Accelerator Technologies for High-Performance Computing: Does Asia Lead the Way?, ATIP 2012, pp. 39:1–39:2. A*STAR Computational Resource Centre, Singapore (2012)
-
(2012)
Proceedings of the ATIP/A*CRC Workshop on Accelerator Technologies for High-Performance Computing: Does Asia Lead the Way?, ATIP 2012
, pp. 1-39
-
-
Sato, K.1
Gamblin, T.2
Moody, A.3
De Supinski, B.R.4
Mohror, K.5
Maruyama, N.6
-
16
-
-
84905657146
-
Towards formal approaches to system resilience
-
Sharma, V.C., Haran, A., Rakamarić, Z., Gopalakrishnan, G.: Towards formal approaches to system resilience. In: Proceedings of the 19th IEEE Pacific Rim International Symposium on Dependable Computing, PRDC (2013)
-
(2013)
Proceedings of the 19th IEEE Pacific Rim International Symposium on Dependable Computing, PRDC
-
-
Sharma, V.C.1
Haran, A.2
Rakamarić, Z.3
Gopalakrishnan, G.4
-
17
-
-
84877721508
-
A study of DRAM failures in the field
-
IEEE Computer Society Press, Los Alamitos
-
Sridharan, V., Liberty, D.: A study of DRAM failures in the field. In: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, SC 2012, pp. 76:1–76:11. IEEE Computer Society Press, Los Alamitos (2012)
-
(2012)
Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, SC 2012
, pp. 1-76
-
-
Sridharan, V.1
Liberty, D.2
-
18
-
-
0033875633
-
NFTAPE: A framework for assessing dependability in distributed systems with lightweight fault injectors
-
Stott, D.T., Floering, B., Burke, D., Kalbarczyk, Z., Iyer, R.K.: NFTAPE: A framework for assessing dependability in distributed systems with lightweight fault injectors. In: Proceedings of the IEEE International Computer Performance and Dependability Symposium, pp. 91–100 (2000)
-
(2000)
Proceedings of the IEEE International Computer Performance and Dependability Symposium
, pp. 91-100
-
-
Stott, D.T.1
Floering, B.2
Burke, D.3
Kalbarczyk, Z.4
Iyer, R.K.5
|