-
2
-
-
0036601844
-
Grid Services for distributed system integration
-
Jun.
-
I. Foster, C. Kesselman, J. Nick and S. Tuecke, Grid Services for distributed system integration", Computer, vol. 35, no. 6, Jun. 2002, pp. 37-46.
-
(2002)
Computer
, vol.35
, Issue.6
, pp. 37-46
-
-
Foster, I.1
Kesselman, C.2
Nick, J.3
Tuecke, S.4
-
3
-
-
33746312915
-
Current practice and a direction forward in checkpoint/restart implementations for fault tolerance
-
J. C. Sancho, F. Petrini, K. Davis, R. Gioiosa and S. Jiang, "Current Practice and a Direction Forward in Checkpoint/Restart Implementations for Fault Tolerance", Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium, 2005.
-
(2005)
Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium
-
-
Sancho, J.C.1
Petrini, F.2
Davis, K.3
Gioiosa, R.4
Jiang, S.5
-
4
-
-
0003912256
-
Checkpoint and migration of unix processes in the condor distributed processing system
-
University of Wisconsin, Madison, WI.
-
M. Litzkow, T. Tanenbaum, J. Basney, and M. Livny, "Checkpoint and migration of unix processes in the condor distributed processing system", Computer Sciences Technical Report 1346, University of Wisconsin, Madison, WI., 1997.
-
(1997)
Computer Sciences Technical Report 1346
-
-
Litzkow, M.1
Tanenbaum, T.2
Basney, J.3
Livny, M.4
-
5
-
-
0042078549
-
A survey of rollback-recovery protocols in message-passing systems
-
Sept
-
E. N. Elnozahy, L. Alvisi, Y. Wang, and D. B. Johnson, "A survey of rollback-recovery protocols in message-passing systems", ACM Computing Surveys, vol. 34, no. 3, Sept. 2002, pp. 375-408.
-
(2002)
ACM Computing Surveys
, vol.34
, Issue.3
, pp. 375-408
-
-
Elnozahy, E.N.1
Alvisi, L.2
Wang, Y.3
Johnson, D.B.4
-
6
-
-
48049114689
-
Berkeley lab checkpoint/restart (blcr) for linux clusters
-
publication LBNL-60520, June 2006
-
P. H. Hargrove and J.C. Duell, "Berkeley Lab Checkpoint/Restart (BLCR) for Linux Clusters", Proceedings of SciDAC 2006, publication LBNL-60520, June 2006.
-
Proceedings of SciDAC 2006
-
-
Hargrove, P.H.1
Duell, J.C.2
-
7
-
-
84978437417
-
The design and implementation of Zap: A system for migrating computing environments
-
S. Osman, D. Subhraveti, G. Su and J. Nieh, "The design and implementation of Zap: A system for migrating computing environments", Proceesings of the 5th Symposium on Operating System Design and Implementation (OSDI 2002), 2002, pp. 361-376.
-
(2002)
Proceesings of the 5th Symposium on Operating System Design and Implementation (OSDI
, vol.2002
, pp. 361-376
-
-
Osman, S.1
Subhraveti, D.2
Su, G.3
Nieh, J.4
-
8
-
-
84892751921
-
Science at llnl with ibm blue gene/q
-
B. Chan, et al., "Science at LLNL with IBM Blue Gene/Q", IBM Journal of Research and Development, vol. 57, pp. 11:1-11:18, 2013.
-
(2013)
IBM Journal of Research and Development
, vol.57
, pp. 111-1118
-
-
Chan, B.1
-
11
-
-
80052811907
-
Independent checkpointing in a heterogeneous grid environment
-
E. Feller, J. Mehnert-Spahn, M. Schoettner, and C. Morin, "Independent checkpointing in a heterogeneous grid environment", Future Generation Computer Systems, vol. 28 (1), 2012, pp. 163-170.
-
(2012)
Future Generation Computer Systems
, vol.28
, Issue.1
, pp. 163-170
-
-
Feller, E.1
Mehnert-Spahn, J.2
Schoettner, M.3
Morin, C.4
-
13
-
-
37849053411
-
Efficient checkpointing of java software using context-sensitive capture and replay
-
G. Xu, A. Rountev, Y. Tang, and F. Qin, "Efficient Checkpointing of Java Software Using Context-Sensitive Capture and Replay", Proceedings of the ESEC/FSE International Conference, 2007.
-
(2007)
Proceedings of the ESEC/FSE International Conference
-
-
Xu, G.1
Rountev, A.2
Tang, Y.3
Qin, F.4
|