-
3
-
-
33646395818
-
-
Master's thesis, Dep. of Computer Science, University of Illinois, Urbana, IL () Available at
-
Huang, C.: System support for checkpoint and restart of Charm++ and AMPI applications. Master's thesis, Dep. of Computer Science, University of Illinois, Urbana, IL (2004) Available at http://charm.cs.uiuc.edu/papers/ CheckpointThesis.html.
-
(2004)
System support for checkpoint and restart of Charm++ and AMPI applications
-
-
Huang, C.1
-
4
-
-
20444463494
-
FTC-Charm++: An in-memory checkpoint-based fault tolerant runtime for Charm++ and MPI
-
San Diego, CA
-
Zheng, G., Shi, L., Kalé, L.V.: FTC-Charm++: An in-memory checkpoint-based fault tolerant runtime for Charm++ and MPI. In: 2004 IEEE International Conference on Cluster Computing, San Diego, CA (2004)
-
(2004)
2004 IEEE International Conference on Cluster Computing
-
-
Zheng, G.1
Shi, L.2
Kalé, L.V.3
-
5
-
-
12444281734
-
-
Chakravorty, S., Kalé, L.V.: A fault tolerant protocol for massively parallel machines. In: FTPDSWorkshop at IPDPS'2004, Santa Fe, NM, IEEE Press (2004)
-
Chakravorty, S., Kalé, L.V.: A fault tolerant protocol for massively parallel machines. In: FTPDSWorkshop at IPDPS'2004, Santa Fe, NM, IEEE Press (2004)
-
-
-
-
7
-
-
77049095467
-
-
Hewlett-Packard, Intel, Microsoft, Phoenix, Toshiba: Advanced configuration and power interface specification. ACPI Specification Document, Revision 3.0 (2004) Available from http://www.acpi.info.
-
Hewlett-Packard, Intel, Microsoft, Phoenix, Toshiba: Advanced configuration and power interface specification. ACPI Specification Document, Revision 3.0 (2004) Available from http://www.acpi.info.
-
-
-
-
8
-
-
77952378080
-
Critical event prediction for proactive management in large-scale computer clusters
-
Sahoo, R.K., Oliner, A.J., Rish, I., Gupta, M., Moreira, J.E., Ma, S., Vilalta, R., Sivasubramaniam, A.: Critical event prediction for proactive management in large-scale computer clusters. In: Proceedings og the ACM SIGKDD, Intl. Conf. on Knowledge Discovery Data Mining. (2003) 426-435
-
(2003)
Proceedings og the ACM SIGKDD, Intl. Conf. on Knowledge Discovery Data Mining
, pp. 426-435
-
-
Sahoo, R.K.1
Oliner, A.J.2
Rish, I.3
Gupta, M.4
Moreira, J.E.5
Ma, S.6
Vilalta, R.7
Sivasubramaniam, A.8
-
9
-
-
77049110419
-
-
Oliner, A.J., Sahoo, R.K., Moreira, J.E., Gupta, M., Sivasubramaniam, A.: Fault-aware job scheduling for BlueGene/L systems. Technical Report RC23077, IBM Research ((2004))
-
Oliner, A.J., Sahoo, R.K., Moreira, J.E., Gupta, M., Sivasubramaniam, A.: Fault-aware job scheduling for BlueGene/L systems. Technical Report RC23077, IBM Research ((2004))
-
-
-
-
10
-
-
0002479236
-
Charm++: Parallel programming with message-driven objects
-
Wilson, G.V, Lu, P, eds, MIT Press
-
Kalé, L.V., Krishnan, S.: Charm++: Parallel programming with message-driven objects. In Wilson, G.V., Lu, P., eds.: Parallel Programming using C++. MIT Press (1996) 175-213
-
(1996)
Parallel Programming using C
, pp. 175-213
-
-
Kalé, L.V.1
Krishnan, S.2
-
11
-
-
12444260048
-
Adaptive MPI
-
College Station, TX
-
Huang, C., Lawlor, O., Kalé, L.V.: Adaptive MPI. In: Proceedings of the 16th International Workshop on Languages and Compilers for Parallel Computing (LCPC 03), College Station, TX (2003)
-
(2003)
Proceedings of the 16th International Workshop on Languages and Compilers for Parallel Computing (LCPC 03)
-
-
Huang, C.1
Lawlor, O.2
Kalé, L.V.3
-
12
-
-
34548782726
-
Scalable cosmology simulations on parallel machines
-
Gioachin, F., Sharma, A., Chackravorty, S., Mendes, C., Kale, L.V., Quinn, T.R.: Scalable cosmology simulations on parallel machines. In: 7th International Meeting on High Performance Computing for Computational Science (VECPAR). (2006)
-
(2006)
7th International Meeting on High Performance Computing for Computational Science (VECPAR)
-
-
Gioachin, F.1
Sharma, A.2
Chackravorty, S.3
Mendes, C.4
Kale, L.V.5
Quinn, T.R.6
-
13
-
-
33646903735
-
Scaling molecular dynamics to 3000 processors with projections: A performance analysis case study
-
Melbourne, Australia
-
Kalé, L.V., Kumar, S., Zheng, G., Lee, C.W.: Scaling molecular dynamics to 3000 processors with projections: A performance analysis case study. In: Terascale Performance AnalysisWorkshop, International Conference on Computational Science(ICCS), Melbourne, Australia (2003)
-
(2003)
Terascale Performance AnalysisWorkshop, International Conference on Computational Science(ICCS)
-
-
Kalé, L.V.1
Kumar, S.2
Zheng, G.3
Lee, C.W.4
-
15
-
-
70349122591
-
2 runtime system
-
Proc. 3rdWorkshop on Runtime Systems for Parallel Programming (RTSPP) San Juan, Puerto Rico, Springer-Verlag
-
2 runtime system. In: Proc. 3rdWorkshop on Runtime Systems for Parallel Programming (RTSPP) San Juan, Puerto Rico. Lecture Notes in Computer Science 1586, Springer-Verlag (1999) 496-510
-
(1999)
Lecture Notes in Computer Science
, vol.1586
, pp. 496-510
-
-
Antoniu, G.1
Bouge, L.2
Namyst, R.3
-
17
-
-
33646420251
-
Starfish: Fault-tolerant dynamic MPI programs on clusters of workstations
-
Agbaria, A., Friedman, R.: Starfish: Fault-tolerant dynamic MPI programs on clusters of workstations. Cluster Computing 6(3) (2003) 227-236
-
(2003)
Cluster Computing
, vol.6
, Issue.3
, pp. 227-236
-
-
Agbaria, A.1
Friedman, R.2
-
21
-
-
77954003885
-
Mpi/fttm: Architecture and taxonomies for fault-tolerant, message-passing middleware for performance-portable parallel computing
-
IEEE Computer Society
-
Batchu, R., Skjellum, A., Cui, Z., Beddhu, M., Neelamegam, J.P., Dandass, Y., Apte, M.: Mpi/fttm: Architecture and taxonomies for fault-tolerant, message-passing middleware for performance-portable parallel computing. In: Proceedings of the 1st International Symposium on Cluster Computing and the Grid, IEEE Computer Society (2001) 26
-
(2001)
Proceedings of the 1st International Symposium on Cluster Computing and the Grid
, pp. 26
-
-
Batchu, R.1
Skjellum, A.2
Cui, Z.3
Beddhu, M.4
Neelamegam, J.P.5
Dandass, Y.6
Apte, M.7
-
22
-
-
0034439137
-
MPI-FT: Portable fault tolerance scheme for MPI
-
Louca, S., Neophytou, N., Lachanas, A., Evripidou, P.: MPI-FT: Portable fault tolerance scheme for MPI. Parallel Processing Letters 10(4) (2000) 371-382
-
(2000)
Parallel Processing Letters
, vol.10
, Issue.4
, pp. 371-382
-
-
Louca, S.1
Neophytou, N.2
Lachanas, A.3
Evripidou, P.4
-
23
-
-
79961061539
-
MPICHV2: A fault tolerant MPI for volatile nodes based on the pessimistic sender based message logging programming via processor virtualization
-
Phoenix, AZ
-
Bouteiller, A., Cappello, F., Hérault, T., Krawezik, G., Lemarinier, P.,Magniette, F.: MPICHV2: A fault tolerant MPI for volatile nodes based on the pessimistic sender based message logging programming via processor virtualization. In: Proceedings of Supercomputing'03, Phoenix, AZ (2003)
-
(2003)
Proceedings of Supercomputing'03
-
-
Bouteiller, A.1
Cappello, F.2
Hérault, T.3
Krawezik, G.4
Lemarinier, P.5
Magniette, F.6
-
24
-
-
0026867749
-
Manetho: Transparent rollback-recovery with low overhead, limited rollback, and fast output commit
-
Elnozahy, E.N., Zwaenepoel, W.: Manetho: Transparent rollback-recovery with low overhead, limited rollback, and fast output commit. IEEE Transactions on Computers 41(5) (1992) 526-531
-
(1992)
IEEE Transactions on Computers
, vol.41
, Issue.5
, pp. 526-531
-
-
Elnozahy, E.N.1
Zwaenepoel, W.2
|