-
1
-
-
0032597670
-
An analysis of communication-induced checkpointing
-
June
-
L. Alvisi, E. Elnozahy, S. Rao, S.A. Husain, A. De Mel, An analysis of communication-induced checkpointing, in: Proceedings of the 29th Fault-tolerant Computing Symposium, June 1999, pp. 242-249.
-
(1999)
Proceedings of the 29th Fault-tolerant Computing Symposium
, pp. 242-249
-
-
Alvisi, L.1
Elnozahy, E.2
Rao, S.3
Husain, S.A.4
De Mel, A.5
-
2
-
-
0029237761
-
Message logging: Pessimistic, optimistic, and causal
-
June
-
L. Alvisi, K. Marzullo, Message logging: pessimistic, optimistic, and causal, in: Proceedings of the 15th IEEE International Conference on Distributed Computing Systems, June 1995, pp. 229-236.
-
(1995)
Proceedings of the 15th IEEE International Conference on Distributed Computing Systems
, pp. 229-236
-
-
Alvisi, L.1
Marzullo, K.2
-
3
-
-
24544442574
-
On modeling consistent checkpoints and the domino effect in distributed systems
-
Rapporte de Recherche No. 2569, INRIA, France, June
-
R. Baldoni, J.M. Helary, A. Mostefaoui, M. Raynal, On modeling consistent checkpoints and the domino effect in distributed systems, Rapporte de Recherche No. 2569, INRIA, France, June 1995.
-
(1995)
-
-
Baldoni, R.1
Helary, J.M.2
Mostefaoui, A.3
Raynal, M.4
-
4
-
-
0029190387
-
Characterizing consistent checkpoints in large-scale distributed systems
-
Chejiu Islands, South Korea, August
-
R. Baldoni, J.M. Helary, A. Mostefaoui, M. Raynal, Characterizing consistent checkpoints in large-scale distributed systems, in: Proceedings of the Fifth IEEE International Conference on Parallel and Distributed Computing, Chejiu Islands, South Korea, August 1995, pp. 314-323.
-
(1995)
Proceedings of the Fifth IEEE International Conference on Parallel and Distributed Computing
, pp. 314-323
-
-
Baldoni, R.1
Helary, J.M.2
Mostefaoui, A.3
Raynal, M.4
-
5
-
-
0012280723
-
A communication induced algorithm that ensures the rollback dependency trackability
-
Seattle, July
-
R. Baldoni, J.M. Helary, A. Mostefaoui, M. Raynal, A communication induced algorithm that ensures the rollback dependency trackability, in: Proceedings of the 27th International Symposium on Fault-Tolerant Computing, Seattle, July 1997.
-
(1997)
Proceedings of the 27th International Symposium on Fault-Tolerant Computing
-
-
Baldoni, R.1
Helary, J.M.2
Mostefaoui, A.3
Raynal, M.4
-
7
-
-
0033074998
-
An index-based checkpointing algorithm for autonomous distributed systems
-
R. Baldoni, F. Quaglia, P. Fornara, An index-based checkpointing algorithm for autonomous distributed systems, IEEE Trans. Parallel Distrib. Systems 10 (2) (1999) 181-192.
-
(1999)
IEEE Trans. Parallel Distrib. Systems
, vol.10
, Issue.2
, pp. 181-192
-
-
Baldoni, R.1
Quaglia, F.2
Fornara, P.3
-
8
-
-
0021538527
-
A distributed domino-effect free recovery algorithm
-
IEEE
-
D. Briatico, A. Ciuffoletti, L. Simoncini, A distributed domino-effect free recovery algorithm, in: Proceedings of IEEE Fourth Symposium on Reliability in Distributed Software and Database Systems, IEEE, 1984, pp. 207-215.
-
(1984)
Proceedings of IEEE Fourth Symposium on Reliability in Distributed Software and Database Systems
, pp. 207-215
-
-
Briatico, D.1
Ciuffoletti, A.2
Simoncini, L.3
-
10
-
-
0004096191
-
A survey of rollback-recovery protocols in message-passing systems
-
Technical Report CMU-CS-99-148, School of Computer Science, Carnegie Mellon University, Pittsburg, PA
-
E.N. Elnozahy, L. Alvisi, Y.M. Wang, D.B. Johnson, A survey of rollback-recovery protocols in message-passing systems, Technical Report CMU-CS-99-148, School of Computer Science, Carnegie Mellon University, Pittsburg, PA, 1999.
-
(1999)
-
-
Elnozahy, E.N.1
Alvisi, L.2
Wang, Y.M.3
Johnson, D.B.4
-
11
-
-
0031339111
-
Preventing useless checkpoints in distributed computations
-
J.-M. Helary, A. Mostefaoui, M. Raynal, R. Netzer, Preventing useless checkpoints in distributed computations, in: Proceedings of the 16th Symposium on Reliable Distributed Systems, 1997.
-
(1997)
Proceedings of the 16th Symposium on Reliable Distributed Systems
-
-
Helary, J.-M.1
Mostefaoui, A.2
Raynal, M.3
Netzer, R.4
-
13
-
-
38249017422
-
Recovery in distributed systems using optimistic message logging and checkpointing
-
(September)
-
D.B. Johnson, W. Zwaenepoel, Recovery in distributed systems using optimistic message logging and checkpointing, J. Algorithms 11 (3) (September 1990) 462-491.
-
(1990)
J. Algorithms
, vol.11
, Issue.3
, pp. 462-491
-
-
Johnson, D.B.1
Zwaenepoel, W.2
-
16
-
-
0027644652
-
An efficient protocol for checkpointing recovery in distributed systems
-
(August)
-
Junguk L. Kim, Taesoon Park, An efficient protocol for checkpointing recovery in distributed systems, IEEE Trans. Parallel Distrib. Systems 4 (8) (August 1993) 955-960.
-
(1993)
IEEE Trans. Parallel Distrib. Systems
, vol.4
, Issue.8
, pp. 955-960
-
-
Junguk, L.1
Taesoon Park, K.2
-
17
-
-
0022566239
-
A scheme for coordinated execution of independently designed recoverable distributed processes
-
June
-
K.H. Kim, A scheme for coordinated execution of independently designed recoverable distributed processes, in: Proceedings of 16th IEEE Symposium on Fault-Tolerant Computing, June 1986, pp. 130-135.
-
(1986)
Proceedings of 16th IEEE Symposium on Fault-Tolerant Computing
, pp. 130-135
-
-
Kim, K.H.1
-
18
-
-
0023090161
-
Checkpointing and roll-back recovery for distributed systems
-
SE-13 (1) (January)
-
R. Koo, S. Toueg, Checkpointing and roll-back recovery for distributed systems, IEEE Trans. Software Eng. SE-13 (1) (January 1987) 23-31.
-
(1987)
IEEE Trans. Software Eng.
, pp. 23-31
-
-
Koo, R.1
Toueg, S.2
-
19
-
-
0017996760
-
Time, clocks and ordering of events in distributed systems
-
(July)
-
L. Lamport, Time, clocks and ordering of events in distributed systems, Comm. ACM 21 (7) (July 1978) 558-565.
-
(1978)
Comm. ACM
, vol.21
, Issue.7
, pp. 558-565
-
-
Lamport, L.1
-
20
-
-
0026242351
-
Checkpointing multicomputer applications
-
K. Li, J.F. Naughton, J.S. Plank, Checkpointing multicomputer applications, in: Proceedings of 10th Symposium on Reliable Distributed Systems, 199 1, pp. 2-11.
-
(1991)
Proceedings of 10th Symposium on Reliable Distributed Systems
, pp. 2-11
-
-
Li, K.1
Naughton, J.F.2
Plank, J.S.3
-
21
-
-
0031162195
-
Finding consistent global checkpoints in a distributed computation
-
(June)
-
D. Manivannan, R.H.B. Netzer, M. Singhal, Finding consistent global checkpoints in a distributed computation, IEEE Trans. Parallel Distrib. Systems 8 (6) (June 1997) 623-627.
-
(1997)
IEEE Trans. Parallel Distrib. Systems
, vol.8
, Issue.6
, pp. 623-627
-
-
Manivannan, D.1
Netzer, R.H.B.2
Singhal, M.3
-
22
-
-
0029723377
-
A low-overhead recovery technique using quasi-synchronous checkpointing
-
Hong Kong, May
-
D. Manivannan, M. Singhal, A low-overhead recovery technique using quasi-synchronous checkpointing, in: Proceedings of the 16th IEEE International Conference on Distributed Computing Systems, Hong Kong, May 1996, pp. 100-107.
-
(1996)
Proceedings of the 16th IEEE International Conference on Distributed Computing Systems
, pp. 100-107
-
-
Manivannan, D.1
Singhal, M.2
-
23
-
-
0033360051
-
Quasi-synchronous checkpointing: Models, characterization, and classification
-
(July)
-
D. Manivannan, M. Singhal, Quasi-synchronous checkpointing: models, characterization, and classification, IEEE Trans. Parallel Distrib. Systems 10 (7) (July 1999) 703-713.
-
(1999)
IEEE Trans. Parallel Distrib. Systems
, vol.10
, Issue.7
, pp. 703-713
-
-
Manivannan, D.1
Singhal, M.2
-
25
-
-
0029255243
-
Necessary and sufficient conditions for consistent global snapshots
-
(February)
-
R.H.B. Netzer, Jian Xu, Necessary and sufficient conditions for consistent global snapshots, IEEE Trans. Parallel Distrib. Systems 6 (2) (February 1995) 165-169.
-
(1995)
IEEE Trans. Parallel Distrib. Systems
, vol.6
, Issue.2
, pp. 165-169
-
-
Netzer, R.H.B.1
Xu, J.2
-
27
-
-
0030262195
-
Low-cost checkpointing and failure recovery in mobile computing systems
-
(October)
-
R. Prakash, M. Singhal, Low-cost checkpointing and failure recovery in mobile computing systems, IEEE Trans. Parallel Distrib. Systems 7 (10) (October 1996) 1035-1048.
-
(1996)
IEEE Trans. Parallel Distrib. Systems
, vol.7
, Issue.10
, pp. 1035-1048
-
-
Prakash, R.1
Singhal, M.2
-
28
-
-
0031124940
-
Complete process recovery: Using vector time to handle multiple failures in distributed systems
-
(April-June)
-
G. Richard III, M. Singhal, Complete process recovery: using vector time to handle multiple failures in distributed systems, IEEE Concurrency 5 (2) (April-June 1997) 50-58.
-
(1997)
IEEE Concurrency
, vol.5
, Issue.2
, pp. 50-58
-
-
Richard G. III1
Singhal, M.2
-
30
-
-
0028994250
-
Completely asynchronous optimistic recovery with minimal rollbacks
-
S.W. Smith, D.B. Johnson, J.D. Tygar, Completely asynchronous optimistic recovery with minimal rollbacks, in: Proceedings of 25th International Symposium on Fault-Tolerant Computing, IEEE, 1995, pp. 361-370.
-
(1995)
Proceedings of 25th International Symposium on Fault-Tolerant Computing, IEEE
, pp. 361-370
-
-
Smith, S.W.1
Johnson, D.B.2
Tygar, J.D.3
-
31
-
-
0022112420
-
Optimistic recovery in distributed systems
-
(August)
-
R.E. Strom, S. Yemini, Optimistic recovery in distributed systems, ACM Trans. Comput. Systems 3 (3) (August 1985) 204-226.
-
(1985)
ACM Trans. Comput. Systems
, vol.3
, Issue.3
, pp. 204-226
-
-
Strom, R.E.1
Yemini, S.2
|