-
1
-
-
51049106838
-
OVIS-2: A robust distributed architecture for scalable RAS
-
J. M. Brandt, B. J. Debusschere, A. C. Gentile, J. R. Mayo, P. P. Pébay, D. Thompson, and M. H. Wong. OVIS-2: A robust distributed architecture for scalable RAS. In Proceedings of the IEEE International Parallel and Distributed Processing Symposium (IPDPS): Workshop on System Management Techniques, Processes, and Services (SMTPS), 2008.
-
(2008)
Proceedings of the IEEE International Parallel and Distributed Processing Symposium (IPDPS): Workshop on System Management Techniques, Processes, and Services (SMTPS)
-
-
Brandt, J.M.1
Debusschere, B.J.2
Gentile, A.C.3
Mayo, J.R.4
Pébay, P.P.5
Thompson, D.6
Wong, M.H.7
-
2
-
-
50649108554
-
Proactive fault tolerance in MPI applications via task migration
-
S. Chakravorty, C. L. Mendes, and L. V. Kalé. Proactive fault tolerance in MPI applications via task migration. In Lecture Notes in Computer Science: Proceedings ofthe International Conference on High Performance Computing (HiPC), volume 4297, pages 485-496, 2006.
-
(2006)
Lecture Notes in Computer Science: Proceedings Ofthe International Conference on High Performance Computing (HiPC)
, vol.4297
, pp. 485-496
-
-
Chakravorty, S.1
Mendes, C.L.2
Kalé, L.V.3
-
5
-
-
3342966061
-
The Ganglia distributed monitoring system: Design, implementation, and experience
-
M. L. Massie, B. N. Chun, and D. E. Culler. The Ganglia distributed monitoring system: Design, implementation, and experience. Parallel Computing, 30(7):817-840, 2004.
-
(2004)
Parallel Computing
, vol.30
, Issue.7
, pp. 817-840
-
-
Massie, M.L.1
Chun, B.N.2
Culler, D.E.3
-
6
-
-
34548046749
-
Proactive fault tolerance for HPC with Xen virtualization
-
A. B. Nagarajan, F. Mueller, C. Engelmann, and S. L. Scott. Proactive fault tolerance for HPC with Xen virtualization. In Proceedings of the ACM International Conference on Supercomputing (ICS), pages 23-32, 2007.
-
(2007)
Proceedings of the ACM International Conference on Supercomputing (ICS)
, pp. 23-32
-
-
Nagarajan, A.B.1
Mueller, F.2
Engelmann, C.3
Scott, S.L.4
-
10
-
-
70349736737
-
Towards a fault-aware computing environment
-
X.-H. Sun, Z. Lan, Y. Li, H. Jin, and Z. Zheng. Towards a fault-aware computing environment. In Proceedings of the High Availability and Performance Workshop (HAPCW), in conjunction with the High-Performance Computer Science Week (HPCSW), 2008.
-
(2008)
Proceedings of the High Availability and Performance Workshop (HAPCW), in Conjunction with the High-Performance Computer Science Week (HPCSW)
-
-
Sun, X.-H.1
Lan, Z.2
Li, Y.3
Jin, H.4
Zheng, Z.5
-
11
-
-
53349098075
-
Evaluation of fault-tolerant policies using simulation
-
A. Tikotekar, G. Vallée, T. Naughton, S. L. Scott, and C. Leangsuksun. Evaluation of fault-tolerant policies using simulation. In Proceedings ofthe IEEEInternational Conference on Cluster Computing (Cluster), 2007.
-
(2007)
Proceedings Ofthe IEEEInternational Conference on Cluster Computing (Cluster)
-
-
Tikotekar, A.1
Vallée, G.2
Naughton, T.3
Scott, S.L.4
Leangsuksun, C.5
-
12
-
-
49049111154
-
A framework for proactive fault tolerance
-
G. R. Vallée, K. Charoenpornwattana, C. Engelmann, A. Tikotekar, C. B. Leangsuksun, T. Naughton, and S. L. Scott. A framework for proactive fault tolerance. In Proceedings ofthe International Conference on Availability, Reliability and Security (ARES), pages 659-664, 2007.
-
(2007)
Proceedings Ofthe International Conference on Availability, Reliability and Security (ARES)
, pp. 659-664
-
-
Vallée, G.R.1
Charoenpornwattana, K.2
Engelmann, C.3
Tikotekar, A.4
Leangsuksun, C.B.5
Naughton, T.6
Scott, S.L.7
-
13
-
-
70350755748
-
Proactive process-level live migration in HPC environments
-
To appear
-
C. Wang, F. Mueller, C. Engelmann, and S. L. Scott. Proactive process-level live migration in HPC environments. In Proceedings ofthe IEEE/ACM International Conference on High Performance Computing, Networking, Storage and Analysis (SC), 2008. To appear.
-
(2008)
Proceedings Ofthe IEEE/ACM International Conference on High Performance Computing, Networking, Storage and Analysis (SC)
-
-
Wang, C.1
Mueller, F.2
Engelmann, C.3
Scott, S.L.4
|