-
1
-
-
3543102906
-
System fault tolerance
-
T. Anderson, and B. Randell, Eds., Cambridge, MA: Cambridge University Press
-
T. Anderson, P.A. Lee, and S.K. Shrivastava, "System fault tolerance", in Computing System Reliability, T. Anderson, and B. Randell, Eds., Cambridge, MA: Cambridge University Press, 1979, pp. 153-210.
-
(1979)
Computing System Reliability
, pp. 153-210
-
-
Anderson, T.1
Lee, P.A.2
Shrivastava, S.K.3
-
3
-
-
0019684078
-
A formal model of atomicity in asynchronous systems"
-
E. Best, and B. Ranbdel, "A formal model of atomicity in asynchronous systems", Acta Information, Vol. 16, 1981, pp. 93-124.
-
(1981)
Acta Information
, vol.16
, pp. 93-124
-
-
Best, E.1
Ranbdel, B.2
-
4
-
-
0024123530
-
Independent checkpointing and concurrent rollback recovery for distributed systems - An optimistic approach
-
Oct
-
B. Bhargava, and L. Lilien, "Independent checkpointing and concurrent rollback recovery for distributed systems - An Optimistic Approach", IEEE Proc. 7th Symp. Reliability in Distributed Systems, Oct 1988, pp. 3-12.
-
(1988)
IEEE Proc. 7th Symp. Reliability in Distributed Systems
, pp. 3-12
-
-
Bhargava, B.1
Lilien, L.2
-
5
-
-
0021538527
-
A distributed domino-effect free recovery algorithm
-
Oct
-
D. Briatico, A. Giuffoletti, and L. Simoncini, "A distributed domino-effect free recovery algorithm", IEEE Proc. 4th Symp. Reliability in Distributed Software and Database Systems, Oct 1984, pp. 207-215.
-
(1984)
IEEE Proc. 4th Symp. Reliability in Distributed Software and Database Systems
, pp. 207-215
-
-
Briatico, D.1
Giuffoletti, A.2
Simoncini, L.3
-
6
-
-
3543071398
-
An abstract model of rollback recovery control in distributed systems
-
Oct
-
J. Cao, and K.C. Wang, "An abstract model of rollback recovery control in distributed systems", ACM Oper. Sys. Review, Oct 1992, pp.62-76.
-
(1992)
ACM Oper. Sys. Review
, pp. 62-76
-
-
Cao, J.1
Wang, K.C.2
-
7
-
-
0032314841
-
On coordinated checkpointing in distributed systems
-
Dec
-
G. Cao, and M. Singhal, "On coordinated checkpointing in distributed systems", IEEE Trans. Parallel and Distributed Systems, Vol. 9, No. 12, Dec 1998, pp. 1213-1225.
-
(1998)
IEEE Trans. Parallel and Distributed Systems
, vol.9
, Issue.12
, pp. 1213-1225
-
-
Cao, G.1
Singhal, M.2
-
8
-
-
3543049206
-
Distributed computing research issues in Grid computing
-
Sep
-
H. Casanova, "Distributed computing research issues in Grid computing", ACM SICACT News, Vol. 33, No.3, Sep 2002, pp. 50-70.
-
(2002)
ACM SICACT News
, vol.33
, Issue.3
, pp. 50-70
-
-
Casanova, H.1
-
9
-
-
0022020346
-
Distributed snapshots: Determining global state of distributed systems
-
K.M. Chandy, and L. Lamport, "Distributed snapshots: determining global state of distributed systems", ACM Trans. Computer Systems, Vol. 3, No. 1, 1985, pp. 63-75.
-
(1985)
ACM Trans. Computer Systems
, vol.3
, Issue.1
, pp. 63-75
-
-
Chandy, K.M.1
Lamport, L.2
-
10
-
-
0042078549
-
A survey of rollback-recovery protocols in message-passing systems
-
Sep
-
E.N. Elnozahy, L. Alvisi, Y.M. Wang, and D.B. Johnson, "A survey of rollback-recovery protocols in message-passing systems", ACM Computing Surveys, Vol. 34, No. 3, Sep 2002, pp. 375-408.
-
(2002)
ACM Computing Surveys
, vol.34
, Issue.3
, pp. 375-408
-
-
Elnozahy, E.N.1
Alvisi, L.2
Wang, Y.M.3
Johnson, D.B.4
-
11
-
-
0035455653
-
The anatomy of the Grid: Enabling scalable virtual organizations
-
I. Foster, C. Kesselman, and S. Tuecke, "The anatomy of the Grid: enabling scalable virtual organizations", Int'l J. High Performance Computing Applications, Vol. 15, No. 3, 2001.
-
(2001)
Int'l J. High Performance Computing Applications
, vol.15
, Issue.3
-
-
Foster, I.1
Kesselman, C.2
Tuecke, S.3
-
12
-
-
38249017422
-
Recovery in distributed systems using optimistic message logging and checkpointng
-
D.B. Johnson, and W. Zwaenepoel, "Recovery in distributed systems using optimistic message logging and checkpointng", J. Algorithms, Vol. 11, 1990, pp. 462-491.
-
(1990)
J. Algorithms
, vol.11
, pp. 462-491
-
-
Johnson, D.B.1
Zwaenepoel, W.2
-
13
-
-
0023090161
-
Checkpointing and rollback recovery for distributed systems
-
Jan
-
R. Koo, and S. Toueg, "Checkpointing and rollback recovery for distributed systems", IEEE Trans. Software Eng., Vol. SE-13, No.1, Jan 1987.
-
(1987)
IEEE Trans. Software Eng.
, vol.SE-13
, Issue.1
-
-
Koo, R.1
Toueg, S.2
-
15
-
-
0023834241
-
Concurrent robust checkpointing and recovery in distributed systems
-
P.J. Leu, and B. Bhargava, "Concurrent robust checkpointing and recovery in distributed systems", IEEE 4th Conf. Data Eng., 1988, pp. 154-163.
-
(1988)
IEEE 4th Conf. Data Eng.
, pp. 154-163
-
-
Leu, P.J.1
Bhargava, B.2
-
16
-
-
0033360051
-
Quasi-synchronous checkpointing: Models, characterization, and classification
-
Jul
-
D. Manivannan, and M. Singhal, "Quasi-synchronous checkpointing: Models, characterization, and classification", IEEE Trans. Parallel and Distributed Systems, Vol. 10, No. 7, Jul 1999, pp. 703-713.
-
(1999)
IEEE Trans. Parallel and Distributed Systems
, vol.10
, Issue.7
, pp. 703-713
-
-
Manivannan, D.1
Singhal, M.2
-
19
-
-
0029255243
-
Necessary and sufficient conditions for consistent global snapshots
-
Feb
-
R.H.B. Netzer, and J. Xu, "Necessary and sufficient conditions for consistent global snapshots", IEEE Trans. Parallel and Distributed Systems, Vol. 6, No. 2, Feb 1995, pp. 65-169.
-
(1995)
IEEE Trans. Parallel and Distributed Systems
, vol.6
, Issue.2
, pp. 65-169
-
-
Netzer, R.H.B.1
Xu, J.2
-
21
-
-
0016522101
-
System Structure for Software Fault Tolerance
-
B. Randel, "System Structure for Software Fault Tolerance", IEEE Trans. Software Eng., Vol. SE-1, No. 2, 1975, pp.220-232.
-
(1975)
IEEE Trans. Software Eng.
, vol.SE-1
, Issue.2
, pp. 220-232
-
-
Randel, B.1
-
23
-
-
0022112420
-
Optimistic recovery in distributed systems
-
Aug
-
R. Strom, and S. temini, "Optimistic Recovery in Distributed Systems", ACM Trans. Computer Systems, Aug 1985, pp. 204-226.
-
(1985)
ACM Trans. Computer Systems
, pp. 204-226
-
-
Strom, R.1
Temini, S.2
-
24
-
-
0344591959
-
Dynamic recovery schemes for distributed processes
-
K. Tsuruoka, A. Kaneko, and Y. Nishihara, "Dynamic recovery schemes for distributed processes", Proc. IEEE 2nd Symp. Reliability in Distributed Software and Database Systems, 1981, pp. 124-130.
-
(1981)
Proc. IEEE 2nd Symp. Reliability in Distributed Software and Database Systems
, pp. 124-130
-
-
Tsuruoka, K.1
Kaneko, A.2
Nishihara, Y.3
-
25
-
-
0029305383
-
Checkpoint space reclamation for uncoordinated checkpointing in message-passing systems
-
May
-
Y.M. Wang, P.Y. Chung, I.J. Lin, and W.K. Fuchs, "Checkpoint space reclamation for uncoordinated checkpointing in message-passing systems", IEEE Trans. Parallel and Distributed Systems, Vol. 6, No. 5, May 1995, pp. 546-554.
-
(1995)
IEEE Trans. Parallel and Distributed Systems
, vol.6
, Issue.5
, pp. 546-554
-
-
Wang, Y.M.1
Chung, P.Y.2
Lin, I.J.3
Fuchs, W.K.4
-
26
-
-
0031124071
-
Consistent global checkpoints that contain a given set of local checkpoints
-
Apr
-
Y.M. Wang, "Consistent global checkpoints that contain a given set of local checkpoints", IEEE Trans. Computers, Vol. 46, No. 4, Apr 1997, pp. 456-468.
-
(1997)
IEEE Trans. Computers
, vol.46
, Issue.4
, pp. 456-468
-
-
Wang, Y.M.1
|