-
1
-
-
0001298824
-
The MOSIX Distributed Operating System, Load Balancing for UNIX
-
Springer-Verlag
-
A. Barak, S. Guday, and R. Wheeler. The MOSIX Distributed Operating System, Load Balancing for UNIX. Number 672 in Lecture Notes in Computer Science. Springer-Verlag, 1993.
-
(1993)
Lecture Notes in Computer Science
, Issue.672
-
-
Barak, A.1
Guday, S.2
Wheeler, R.3
-
2
-
-
85033357209
-
-
M. Beck, J. S. Plank, and G. Kingsley. Compilerassisted checkpointing. Technical Report UT-CS-94-269, Dept. of Computer Science, University of Tennessee, 1994. Also available as http://citeseer.nj.nec.com/173887.html.
-
M. Beck, J. S. Plank, and G. Kingsley. Compilerassisted checkpointing. Technical Report UT-CS-94-269, Dept. of Computer Science, University of Tennessee, 1994. Also available as http://citeseer.nj.nec.com/173887.html.
-
-
-
-
6
-
-
12844286028
-
Application-level checkpointing for shared memory programs
-
G. Bronevetsky, M. Schulz, P. Szwed, D. Marques, and K. Pingali. Application-level checkpointing for shared memory programs. In Conference on Application Support for Programming Languages and Operating Systems, 2004.
-
(2004)
Conference on Application Support for Programming Languages and Operating Systems
-
-
Bronevetsky, G.1
Schulz, M.2
Szwed, P.3
Marques, D.4
Pingali, K.5
-
7
-
-
33847116602
-
Checkpointing shared memory programs at the application-level
-
G. Bronevetsky, M. Schulz, P. Szwed, D. Marques, and K. Pingali. Checkpointing shared memory programs at the application-level. In European Workshop on OpenMP, 2004.
-
(2004)
European Workshop on OpenMP
-
-
Bronevetsky, G.1
Schulz, M.2
Szwed, P.3
Marques, D.4
Pingali, K.5
-
9
-
-
85033331043
-
-
P. S. Center. Lemieux. Available at http://www.psc.edu/machines/tcs/ lemieux.html.
-
P. S. Center. Lemieux. Available at http://www.psc.edu/machines/tcs/ lemieux.html.
-
-
-
-
10
-
-
0022020346
-
Distributed snapshots: Determining global states of distributed systems
-
M. Chandy and L. Lamport. Distributed snapshots: Determining global states of distributed systems. ACM Transactions on Computing Systems, 3(1):63-75, 1985.
-
(1985)
ACM Transactions on Computing Systems
, vol.3
, Issue.1
, pp. 63-75
-
-
Chandy, M.1
Lamport, L.2
-
12
-
-
85033327928
-
Dynamic Data Driven Application Systems. Accessed February 8
-
8-10,2000 NSF sponsored workshop on
-
C. Douglas, A. Deshmukh, et al. Report from the March 8-10,2000 NSF sponsored workshop on Dynamic Data Driven Application Systems. Accessed February 8, 2003.
-
(2003)
Report from the March
-
-
Douglas, C.1
Deshmukh, A.2
-
14
-
-
0042078549
-
A survey of rollback-recovery protocols in messagepassing systems
-
Sept
-
E. N. M. Elnozahy, L. Alvisi, Y.-M. Wang, and D. B. Johnson. A survey of rollback-recovery protocols in messagepassing systems. ACM Computing Surveys, 34(3):375 -408, Sept. 2002.
-
(2002)
ACM Computing Surveys
, vol.34
, Issue.3
, pp. 375-408
-
-
Elnozahy, E.N.M.1
Alvisi, L.2
Wang, Y.-M.3
Johnson, D.B.4
-
16
-
-
20744442381
-
Heterogeneous Process State Capture and Recovery through Process Introspection
-
A. Ferrari, S. J. Chapin, and A. S. Grimshaw. Heterogeneous Process State Capture and Recovery through Process Introspection. In Cluster Computing, pages 63-73, 2000.
-
(2000)
Cluster Computing
, pp. 63-73
-
-
Ferrari, A.1
Chapin, S.J.2
Grimshaw, A.S.3
-
17
-
-
85033344899
-
-
M. P. I. Forum. MPI: A Message-Passing Interface Standard. Technical Report UT-CS-94-230, University of Tennessee, 1994.
-
M. P. I. Forum. MPI: A Message-Passing Interface Standard. Technical Report UT-CS-94-230, University of Tennessee, 1994.
-
-
-
-
18
-
-
0003912256
-
Checkpoint and Migration of UNIX Processes in the Condor Distributed Processing System
-
Technical Report 1346, University of Wisconsin-Madison
-
M. Litzkow, T. Tannenbaum, J. Basney, and M. Livny. Checkpoint and Migration of UNIX Processes in the Condor Distributed Processing System. Technical Report 1346, University of Wisconsin-Madison, 1997.
-
(1997)
-
-
Litzkow, M.1
Tannenbaum, T.2
Basney, J.3
Livny, M.4
-
19
-
-
0004215089
-
-
Morgan Kaufmann, San Francisco, California, first edition
-
N. Lynch. Distributed Algorithms. Morgan Kaufmann, San Francisco, California, first edition, 1996.
-
(1996)
Distributed Algorithms
-
-
Lynch, N.1
-
22
-
-
2442691414
-
Guaranteed-quality parallel Delaunay refinement for restricted polyhedral domains
-
June
-
D. Nave, N. Chrisochoides, and P. Chew. Guaranteed-quality parallel Delaunay refinement for restricted polyhedral domains. Computational Geometry: Theory and Applications, 28(2-3): 191-215, June 2004.
-
(2004)
Computational Geometry: Theory and Applications
, vol.28
, Issue.2-3
, pp. 191-215
-
-
Nave, D.1
Chrisochoides, N.2
Chew, P.3
-
24
-
-
85084159983
-
Transparent Checkpointing under UNIX
-
J. S. Plank, M. Beck, G. Kingsley, and K. Li. Libckpt: Transparent Checkpointing under UNIX. In USENIX Winter, pages 213-224, 1995.
-
(1995)
USENIX Winter
, pp. 213-224
-
-
Plank, J.S.1
Beck, M.2
Kingsley, G.3
Libckpt, K.L.4
-
25
-
-
0033077475
-
Memory exclusion: Optimizing the performance of checkpointing systems
-
Also available at
-
J. S. Plank, Y. Chen, K. Li, M. Beck, and G. Kingsley. Memory exclusion: optimizing the performance of checkpointing systems. Software Practice and Experience, 29(2):125-142, 1999. Also available at http://citeseer.nj.nec. com/plank96memory.html.
-
(1999)
Software Practice and Experience
, vol.29
, Issue.2
, pp. 125-142
-
-
Plank, J.S.1
Chen, Y.2
Li, K.3
Beck, M.4
Kingsley, G.5
-
26
-
-
0004097019
-
Compressed differences: An algorithm for fast incremental checkpointing
-
Technical Report CS-95-302, University of Tennessee, Aug, Also available at
-
J. S. Plank, J. Xu, and R. Netzer. Compressed differences: An algorithm for fast incremental checkpointing. Technical Report CS-95-302, University of Tennessee, Aug. 1995. Also available at http://www.cs.utk.edu/∼plank/plank/ papers/CS-95-302.ps.Z.
-
(1995)
-
-
Plank, J.S.1
Xu, J.2
Netzer, R.3
-
28
-
-
84934312471
-
Implementation and evaluation of a scalable application-level checkpoint-recovery scheme for mpi programs
-
M. Schulz, G. Bronevetsky, R. Fernandes, D. Marques, K. Pingali, and P. Stodghill. Implementation and evaluation of a scalable application-level checkpoint-recovery scheme for mpi programs. In Supercomputing '04, 2004.
-
(2004)
Supercomputing '04
-
-
Schulz, M.1
Bronevetsky, G.2
Fernandes, R.3
Marques, D.4
Pingali, K.5
Stodghill, P.6
-
29
-
-
35248827046
-
A Component Architecture for LAM/MPI
-
Proceedings, 10th European PVM/MPI Users' Group Meeting, number in, Venice, Italy, September /October, Springer-Verlag
-
J. M. Squyres and A. Lumsdaine. A Component Architecture for LAM/MPI. In Proceedings, 10th European PVM/MPI Users' Group Meeting, number 2840 in Lecture Notes in Computer Science, pages 379-387, Venice, Italy, September /October 2003. Springer-Verlag.
-
(2003)
Lecture Notes in Computer Science
, vol.2840
, pp. 379-387
-
-
Squyres, J.M.1
Lumsdaine, A.2
-
31
-
-
4544385770
-
SimSnap: Fast-forwarding via native execution and application-level checkpointing
-
P. Szwed, D. Marques, R. Buels, S. McKee, and M. Schulz. SimSnap: Fast-forwarding via native execution and application-level checkpointing. In Interact-8: Workshop on the Interaction between Compilers and Computer Architectures, 2004.
-
(2004)
Interact-8: Workshop on the Interaction between Compilers and Computer Architectures
-
-
Szwed, P.1
Marques, D.2
Buels, R.3
McKee, S.4
Schulz, M.5
|