-
2
-
-
0030083764
-
TreadMarksi Shared memory computing on networks of workstations
-
February
-
C. Amza, A. Cox, S. Dwarkadas, P. Keleher, H. Lu, R. Rajamony, W. Yu, and W. Zwaenepoel. TreadMarksi Shared memory computing on networks of workstations. IEEE Computer, 29(2):18-28, February 1995.
-
(1995)
IEEE Computer
, vol.29
, Issue.2
, pp. 18-28
-
-
Amza, C.1
Cox, A.2
Dwarkadas, S.3
Keleher, P.4
Lu, H.5
Rajamony, R.6
Yu, W.7
Zwaenepoel, W.8
-
3
-
-
0031570635
-
Application level fault tolerance in heterogeneous networks of workstations
-
Adam Beguelin, Erik Seligman, and Peter Stephan. Application level fault tolerance in heterogeneous networks of workstations. Journal of Parallel and Distributed Computing, 43(2): 147-155, 1997. Also available as http://citeseer.nj.nec.com/beguelin97application.html.
-
(1997)
Journal of Parallel and Distributed Computing
, vol.43
, Issue.2
, pp. 147-155
-
-
Beguelin, A.1
Seligman, E.2
Stephan, P.3
-
4
-
-
1142268808
-
Collective operations in an application-level fault tolerant MPI system
-
June
-
G. Bronevetsky, D. Marques, K. Pingali, and P. Stodghill. Collective operations in an application-level fault tolerant MPI system. In Proceedings of the 2003 International Conference on Supercomputing, pages 234-243, June 2003.
-
(2003)
Proceedings of the 2003 International Conference on Supercomputing
, pp. 234-243
-
-
Bronevetsky, G.1
Marques, D.2
Pingali, K.3
Stodghill, P.4
-
5
-
-
1442337674
-
Automated application-level checkpointing of MPI programs
-
June
-
Greg Bronevetsky, Daniel Marques, Keshav Pingali, and Paul Stodghill. Automated application-level checkpointing of MPI programs. In Principles and Practice of Parallel Programming (PPoPP), pages 84-94, June 2003.
-
(2003)
Principles and Practice of Parallel Programming (PPoPP)
, pp. 84-94
-
-
Bronevetsky, G.1
Marques, D.2
Pingali, K.3
Stodghill, P.4
-
6
-
-
0022020346
-
Distributed snapshots: Determining global states of distributed systems
-
M. Chandy and L. Lamport. Distributed snapshots: Determining global states of distributed systems. IEEE Transactions on Computing Systems, 3(1):63-75, 1985.
-
(1985)
IEEE Transactions on Computing Systems
, vol.3
, Issue.1
, pp. 63-75
-
-
Chandy, M.1
Lamport, L.2
-
8
-
-
84860082128
-
-
Condor, http://www.cs.wisc.edu/condor/manual.
-
-
-
-
11
-
-
0004096191
-
A survey of rollback-recovery protocols in message passing systems
-
School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA, October
-
M. Elnozahy, L. Alvisi, Y. M. Wang, and D. B. Johnson. A survey of rollback-recovery protocols in message passing systems. Technical Report CMU-CS-96-181, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA, October 1996.
-
(1996)
Technical Report CMU-CS-96-181
-
-
Elnozahy, M.1
Alvisi, L.2
Wang, Y.M.3
Johnson, D.B.4
-
14
-
-
0003912256
-
Checkpoint and migration of unix processes in the condor distributed processing system
-
University of Wisconsin-Madison
-
T. Tannenbaum J. B. M. Litzkow and M. Livny. Checkpoint and Migration of Unix Processes in the Condor Distributed Processing System. Technical Report Technical Report 1346, University of Wisconsin-Madison, 1997.
-
(1997)
Technical Report
, vol.1346
-
-
Tannenbaum, T.1
Litzkow, J.B.M.2
Livny, M.3
-
16
-
-
0004215089
-
-
Morgan Kaufmann, San Francisco, California, first edition
-
Nancy Lynch. Distributed Algorithms. Morgan Kaufmann, San Francisco, California, first edition, 1996.
-
(1996)
Distributed Algorithms
-
-
Lynch, N.1
-
17
-
-
0038335808
-
Compiler-assisted checkpointing
-
University of Tennessee, December
-
J.S. Plank M. Beck and G. Kingsley. Compiler-Assisted Checkpointing. Technical Report Technical Report CS-94-269, University of Tennessee, December 1994.
-
(1994)
Technical Report
, vol.CS-94-269
-
-
Plank, J.S.1
Beck, M.2
Kingsley, G.3
-
18
-
-
0004096191
-
A survey of rollback-recovery protocols in message passing systems
-
Carnegie Mellon University, October
-
Y. M. Wang M. Elnozahy, L. Alvisi and D. B. Johnson. A survey of rollback-recovery protocols in message passing systems. Technical Report Technical Report CMU-CS-96-181, Carnegie Mellon University, October 1996.
-
(1996)
Technical Report
, vol.CMU-CS-96-181
-
-
Wang, Y.M.1
Elnozahy, M.2
Alvisi, L.3
Johnson, D.B.4
-
20
-
-
0003023157
-
Design of OpenMP compiler for an SMP cluster
-
September
-
K. Kusano M. Sato, S. Satoh and Y. Tanaka. Design of OpenMP compiler for an SMP cluster. In EWOMP '99, pages 32-39, September 1999.
-
(1999)
EWOMP '99
, pp. 32-39
-
-
Kusano, K.1
Sato, M.2
Satoh, S.3
Tanaka, Y.4
-
21
-
-
0003413672
-
MPI: A message-passing interface standard
-
University of Tennessee, Knoxville, June
-
Message Passing Interface Forum (MPIF). MPI: A message-passing interface standard. Technical Report, University of Tennessee, Knoxville, June 1995.
-
(1995)
Technical Report
-
-
-
22
-
-
12344306566
-
-
N. Stone, J. Kochmar, R. Reddy, J. R. Scott, J. Sommerfield, C. Vizino. A checkpoint and recovery system for the Pittsburgh supercomputing center terascale computing system. http://www.psc.edu/publications/tech.reports/chkpt- rcvry/checkpoint-recovery-1.0.html.
-
A Checkpoint and Recovery System for the Pittsburgh Supercomputing Center Terascale Computing System.
-
-
Stone, N.1
Kochmar, J.2
Reddy, R.3
Scott, J.R.4
Sommerfield, J.5
Vizino, C.6
-
24
-
-
0003554155
-
-
Document Number 004-2229-01 edition, October
-
OpenMP Architecture Review Board. OpenMP C and C++ Application, Program Interface, Version 1.0, Document Number 004-2229-01 edition, October 1998. Available from http://www.openmp.org/.
-
(1998)
OpenMP C and C++ Application, Program Interface, Version 1.0
-
-
-
29
-
-
0029179077
-
The SPLASH-2 programs: Characterization and methodological considerations
-
June
-
S. Woo, M. Ohara, E. Torrie, J. Singh, and A. Gupta. The SPLASH-2 programs: Characterization and methodological considerations. In Proceedings of the International Symposium on Computer Architecture 1995, pages 24-36, June 1995.
-
(1995)
Proceedings of the International Symposium on Computer Architecture 1995
, pp. 24-36
-
-
Woo, S.1
Ohara, M.2
Torrie, E.3
Singh, J.4
Gupta, A.5
|