-
1
-
-
34548153543
-
-
at Lawrence Berkeley National Laboratory, Berkeley, CA, USA. Available at
-
Berkeley Lab Checkpoint/Restart (BLCR) project at Lawrence Berkeley National Laboratory, Berkeley, CA, USA. Available at http://ftg.lbl.gov/ checkpoint.
-
Berkeley Lab Checkpoint/Restart (BLCR) project
-
-
-
2
-
-
0001776520
-
Group communication specifications: A comprehensive study
-
G. V. Chockler, I. Keidar, and R. Vitenberg. Group communication specifications: A comprehensive study. ACM Computing Surveys, 33(4):1-43, 2001.
-
(2001)
ACM Computing Surveys
, vol.33
, Issue.4
, pp. 1-43
-
-
Chockler, G.V.1
Keidar, I.2
Vitenberg, R.3
-
3
-
-
13644278157
-
Total order broadcast and multicast algorithms: Taxonomy and survey
-
X. Defago, A. Schiper, and P. Urban. Total order broadcast and multicast algorithms: Taxonomy and survey. ACM Computing Surveys, 36(4):372-421, 2004.
-
(2004)
ACM Computing Surveys
, vol.36
, Issue.4
, pp. 372-421
-
-
Defago, X.1
Schiper, A.2
Urban, P.3
-
4
-
-
0030129232
-
The Transis approach to high availability cluster communication
-
D. Dolev and D. Malki. The Transis approach to high availability cluster communication. Communications of the ACM, 39(4):64-70, 1996.
-
(1996)
Communications of the ACM
, vol.39
, Issue.4
, pp. 64-70
-
-
Dolev, D.1
Malki, D.2
-
5
-
-
33746635500
-
RMIX: A dynamic, heterogeneous, reconfigurable communication framework
-
3992, Reading, UK, May 28-31
-
C. Engelmann and G. A. Geist. RMIX: A dynamic, heterogeneous, reconfigurable communication framework. In Lecture Notes in Computer Science: Proceedings of International Conference on Computational Science, Part II, volume 3992, pages 573-580, Reading, UK, May 28-31, 2006.
-
(2006)
Lecture Notes in Computer Science: Proceedings of International Conference on Computational Science, Part
, vol.2
, pp. 573-580
-
-
Engelmann, C.1
Geist, G.A.2
-
6
-
-
38049153567
-
-
C. Engelmann, H. Ong, and S. L. Scott. Middleware in modern high performance computing system architectures. In Lecture Notes in Computer Science: Proceedings of International Conference on Computational Science, Beijing, China, May 27-30, 2007.
-
C. Engelmann, H. Ong, and S. L. Scott. Middleware in modern high performance computing system architectures. In Lecture Notes in Computer Science: Proceedings of International Conference on Computational Science, Beijing, China, May 27-30, 2007.
-
-
-
-
7
-
-
33646432010
-
MOLAR: Adaptive runtime support for high-end computing operating and runtime systems
-
C. Engelmann, S. L. Scott, D. E. Bernholdt, N. R. Gottumukkala, C. Leangsuksun, J. Varma, C. Wang, Mueller, A. G. Shet, and P. Sadayappan. MOLAR: Adaptive runtime support for high-end computing operating and runtime systems. ACM SIGOPS Operating Systems Review (OSR), 40(2):63-72, 2006.
-
(2006)
ACM SIGOPS Operating Systems Review (OSR)
, vol.40
, Issue.2
, pp. 63-72
-
-
Engelmann, C.1
Scott, S.L.2
Bernholdt, D.E.3
Gottumukkala, N.R.4
Leangsuksun, C.5
Varma, J.6
Wang, C.7
Mueller8
Shet, A.G.9
Sadayappan, P.10
-
8
-
-
33750954729
-
Active/active replication for highly available HPC system services
-
Vienna, Austria, Apr. 20-22
-
st International Conference on Availability, Reliability and Security, pages 639-645, Vienna, Austria, Apr. 20-22, 2006.
-
(2006)
st International Conference on Availability, Reliability and Security
, pp. 639-645
-
-
Engelmann, C.1
Scott, S.L.2
Leangsuksun, C.3
He, X.4
-
9
-
-
34548190800
-
Symmetric active/active high availability for high-performance computing system services
-
C. Engelmann, S. L. Scott, C. Leangsuksun, and X. He. Symmetric active/active high availability for high-performance computing system services. Journal of Computers, 1(8):43-54, 2006.
-
(2006)
Journal of Computers
, vol.1
, Issue.8
, pp. 43-54
-
-
Engelmann, C.1
Scott, S.L.2
Leangsuksun, C.3
He, X.4
-
10
-
-
34548183060
-
Towards high availability for high-performance computing system services: Accomplishments and limitations
-
Santa Fe, NM, USA, Oct. 17
-
C. Engelmann, S. L. Scott, C. Leangsuksun, and X. He. Towards high availability for high-performance computing system services: Accomplishments and limitations. In Proceedings of High Availability and Performance Workshop, Santa Fe, NM, USA, Oct. 17, 2006.
-
(2006)
Proceedings of High Availability and Performance Workshop
-
-
Engelmann, C.1
Scott, S.L.2
Leangsuksun, C.3
He, X.4
-
11
-
-
34548190322
-
On programming models for service-level high availability
-
Vienna, Austria, Apr. 10-13
-
C. Engelmann, S. L. Scott, C. Leangsuksun, and X. He. On programming models for service-level high availability. In Proceedings of 2 International Conference on Availability, Reliability and Security, Vienna, Austria, Apr. 10-13, 2007.
-
(2007)
Proceedings of 2 International Conference on Availability, Reliability and Security
-
-
Engelmann, C.1
Scott, S.L.2
Leangsuksun, C.3
He, X.4
-
12
-
-
4644300495
-
-
Prentice Hall PTR, Upper Saddle River, NJ, USA
-
T. Erl. Service-Oriented Architecture: Concepts, Technology, and Design. Prentice Hall PTR, Upper Saddle River, NJ, USA, 2005.
-
(2005)
Service-Oriented Architecture: Concepts, Technology, and Design
-
-
Erl, T.1
-
17
-
-
34548344147
-
-
Modular Linux and Adaptive Runtime Support for High-end Computing Operating and Runtime Systems (MOLAR). Available at http://www.fastos.org/molar.
-
Modular Linux and Adaptive Runtime Support for High-end Computing Operating and Runtime Systems (MOLAR). Available at http://www.fastos.org/molar.
-
-
-
-
18
-
-
0028576754
-
Extended virtual synchrony
-
June 21-24
-
L. Moser, Y. Amir, P. Melliar-Smith, and D. Agarwal. Extended virtual synchrony. Proceedings of IEEE 14 International Conference on Distributed Computing Systems, pages 56-65, June 21-24, 1994.
-
(1994)
Proceedings of IEEE 14 International Conference on Distributed Computing Systems
, pp. 56-65
-
-
Moser, L.1
Amir, Y.2
Melliar-Smith, P.3
Agarwal, D.4
-
21
-
-
34548178009
-
-
at Hebrew University of Jerusalem, Israel. Available at
-
Transis Project at Hebrew University of Jerusalem, Israel. Available at http://www.cs.huji.ac.il/labs/transis.
-
Transis Project
-
-
-
22
-
-
46049083585
-
JOSHUA: Symmetric active/active replication for highly available HPC job and resource management
-
Barcelona, Spain, Sept. 25-28
-
K. Uhlemann, C. Engelmann, and S. L. Scott. JOSHUA: Symmetric active/active replication for highly available HPC job and resource management. In Proceedings of IEEE International Conference on Cluster Computing, Barcelona, Spain, Sept. 25-28, 2006.
-
(2006)
Proceedings of IEEE International Conference on Cluster Computing
-
-
Uhlemann, K.1
Engelmann, C.2
Scott, S.L.3
|