-
1
-
-
0034317011
-
Towards an operating system managing parallelism of computers on clusters
-
Goscinski A.M., Towards an operating system managing parallelism of computers on clusters. Future Gener. Comput. Syst. 17:2000;293-314.
-
(2000)
Future Gener. Comput. Syst.
, vol.17
, pp. 293-314
-
-
Goscinski, A.M.1
-
2
-
-
1842587235
-
Operating system support for general purpose single system image clusters
-
Las Vegas, Nevada, June 30-July 2
-
J.M.B. Auban, Y.A. Khalidi, Operating system support for general purpose single system image clusters, in: Proceedings of the 1997 International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA'97), Las Vegas, Nevada, June 30-July 2, 1997.
-
(1997)
Proceedings of the 1997 International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA'97)
-
-
Auban, J.M.B.1
Khalidi, Y.A.2
-
3
-
-
0008802172
-
A cluster operating system supporting parallel computing
-
Goscinski A.M., Hobbs M.J., Silcock J., A cluster operating system supporting parallel computing. Cluster Comput. 4:2001;145-156.
-
(2001)
Cluster Comput.
, vol.4
, pp. 145-156
-
-
Goscinski, A.M.1
Hobbs, M.J.2
Silcock, J.3
-
4
-
-
0032317368
-
System-level versus user-defined checkpointing
-
IEEE Computer Society, West Lafayette, IN
-
L.M. Silva, J.G. Silva, System-level versus user-defined checkpointing, in: Proceedings of the 17th Symposium on Reliable Distributed Systems, IEEE Computer Society, West Lafayette, IN, 1998, pp. 68-74.
-
(1998)
Proceedings of the 17th Symposium on Reliable Distributed Systems
, pp. 68-74
-
-
Silva, L.M.1
Silva, J.G.2
-
5
-
-
0004096191
-
A survey of rollback-recovery protocols in message-passing systems
-
Carnegie Mellon University, a revision of CMU-CS-96-181, June
-
E.N. Elnozahy, L. Alvisi, Y.-M. Wang, D.B. Johnson, A survey of rollback-recovery protocols in message-passing systems, Technical Report CMU-CS-99-148, Carnegie Mellon University, a revision of CMU-CS-96-181, June 1999.
-
(1999)
Technical Report
, vol.CMU-CS-99-148
-
-
Elnozahy, E.N.1
Alvisi, L.2
Wang, Y.-M.3
Johnson, D.B.4
-
6
-
-
0033360051
-
Quasi-synchronous checkpointing: Models, characterization, and classification
-
Manivannan D., Singhal M., Quasi-synchronous checkpointing: models, characterization, and classification. IEEE Trans. Parallel Distrib. Syst. 10(7):1999;703-713.
-
(1999)
IEEE Trans. Parallel Distrib. Syst.
, vol.10
, Issue.7
, pp. 703-713
-
-
Manivannan, D.1
Singhal, M.2
-
7
-
-
0033359224
-
Starfish: Fault-tolerant dynamic MPI programs on clusters of workstations
-
Redondo Beach, California, August 3-6
-
A.M. Agbaria, R. Friedman, Starfish: fault-tolerant dynamic MPI programs on clusters of workstations, in: Proceedings of the Eighth IEEE International Symposium on High Performance Distributed Computing, Redondo Beach, California, August 3-6, 1999, pp. 31-40.
-
(1999)
Proceedings of the Eighth IEEE International Symposium on High Performance Distributed Computing
, pp. 31-40
-
-
Agbaria, A.M.1
Friedman, R.2
-
8
-
-
0026867749
-
Manetho: Transparent rollback-recovery with low overhead, limited rollback, and fast output commit
-
Elnozahy E.N., Zwaenepoel W., Manetho: transparent rollback-recovery with low overhead, limited rollback, and fast output commit. IEEE Trans. Comput. 41(5):1992;526-531.
-
(1992)
IEEE Trans. Comput.
, vol.41
, Issue.5
, pp. 526-531
-
-
Elnozahy, E.N.1
Zwaenepoel, W.2
-
9
-
-
85084159983
-
Libckpt: Transparent checkpointing under Unix
-
January
-
J.S. Plank, M. Beck, G. Kingsley, K. Li, Libckpt: transparent checkpointing under Unix, in: Proceedings of the USENIX Winter 1995 Technical Conference, January 1995, pp. 213-223.
-
(1995)
Proceedings of the USENIX Winter 1995 Technical Conference
, pp. 213-223
-
-
Plank, J.S.1
Beck, M.2
Kingsley, G.3
Li, K.4
-
10
-
-
0003912256
-
Checkpoint and migration of UNIX processes in the Condor distributed processing system
-
University of Wisconsin-Madison, April
-
M. Litzkow, T. Tannenbaum, J. Basney, M. Livny, Checkpoint and migration of UNIX processes in the Condor distributed processing system, Technical Report 1346, University of Wisconsin-Madison, April 1997.
-
(1997)
Technical Report
, vol.1346
-
-
Litzkow, M.1
Tannenbaum, T.2
Basney, J.3
Livny, M.4
-
11
-
-
0034590510
-
Design, implementation, and performance of checkpointing in NetSolve
-
New York, June 25
-
A. Agbaria, J.S. Plank, Design, implementation, and performance of checkpointing in NetSolve, in: Proceedings of the International Conference on Dependable Systems and Networks, New York, June 25, 2000.
-
(2000)
Proceedings of the International Conference on Dependable Systems and Networks
-
-
Agbaria, A.1
Plank, J.S.2
-
12
-
-
84871146551
-
The performance of consistent checkpointing
-
IEEE Computer Society, Houston, TX, USA, October
-
E.N. Elnozahy, D.B. Johnson, W. Zwaenepoel, The performance of consistent checkpointing, in: Proceedings of the 11th Symposium on Reliable Distributed Systems, IEEE Computer Society, Houston, TX, USA, October 1992, pp. 39-47.
-
(1992)
Proceedings of the 11th Symposium on Reliable Distributed Systems
, pp. 39-47
-
-
Elnozahy, E.N.1
Johnson, D.B.2
Zwaenepoel, W.3
-
14
-
-
1842430279
-
Easy and high performance parallel execution on COWs using concurrent process creation
-
Fort Lauderdale, FL
-
M. Hobbs, A. Goscinski, Easy and high performance parallel execution on COWs using concurrent process creation, in: Proceedings of the ISCA 12th International Conference on Parallel and Distributed Computing Systems, Fort Lauderdale, FL, 1999, pp. 1-6.
-
(1999)
Proceedings of the ISCA 12th International Conference on Parallel and Distributed Computing Systems
, pp. 1-6
-
-
Hobbs, M.1
Goscinski, A.2
-
15
-
-
22644450661
-
Remote and concurrent process duplication for SPMD based parallel processing COWs
-
Proceedings of the High-performance Computing and Networking (HPCN'99), Springer, Berlin
-
M. Hobbs, A. Goscinski, Remote and concurrent process duplication for SPMD based parallel processing COWs, in: Proceedings of the High-performance Computing and Networking (HPCN'99), Lecture Notes in Computer Science 1593, Springer, Berlin, 1999, pp. 603-612.
-
(1999)
Lecture Notes in Computer Science
, vol.1593
, pp. 603-612
-
-
Hobbs, M.1
Goscinski, A.2
-
16
-
-
84944940986
-
A group communications facility for reliable computing on clusters
-
The International Society for Computers and Their Applications, Dallas, TX, USA
-
J. Rough, A.M. Goscinski, A group communications facility for reliable computing on clusters, in: Proceedings of the Parallel and Distributed Computing Systems (PDCS), The International Society for Computers and Their Applications, Dallas, TX, USA, 2001, pp. 19-24.
-
(2001)
Proceedings of the Parallel and Distributed Computing Systems (PDCS)
, pp. 19-24
-
-
Rough, J.1
Goscinski, A.M.2
-
17
-
-
0032686588
-
Finding, expressing and managing parallelism in programs executed on clusters of workstations
-
Goscinski A.M., Finding, expressing and managing parallelism in programs executed on clusters of workstations. Comput. Commun. 22:1999;998-1016.
-
(1999)
Comput. Commun.
, vol.22
, pp. 998-1016
-
-
Goscinski, A.M.1