-
1
-
-
8344232253
-
Adaptive incremental checkpointing for massively parallel systems
-
New York, NY, USA. ACM
-
Agarwal, S., Garg, R., Gupta, M. S., and Moreira, J. E. (2004). Adaptive incremental checkpointing for massively parallel systems. In Proceedings of the 18th annual international conference on Supercomputing, ICS '04, pages 277-286, New York, NY, USA. ACM.
-
(2004)
Proceedings of the 18th Annual International Conference on Supercomputing, ICS '04
, pp. 277-286
-
-
Agarwal, S.1
Garg, R.2
Gupta, M.S.3
Moreira, J.E.4
-
2
-
-
50649091841
-
Hierarchical replication techniques to ensure checkpoint storage reliability in grid environment
-
Washington, DC, USA. IEEE Computer Society
-
Bouabache, F., Herault, T., Fedak, G., and Cappello, F. (2008). Hierarchical replication techniques to ensure checkpoint storage reliability in grid environment. In Proceedings of the 2008 Eighth IEEE International Symposium on Cluster Computing and the Grid, pages 475-483, Washington, DC, USA. IEEE Computer Society.
-
(2008)
Proceedings of the 2008 Eighth IEEE International Symposium on Cluster Computing and the Grid
, pp. 475-483
-
-
Bouabache, F.1
Herault, T.2
Fedak, G.3
Cappello, F.4
-
3
-
-
77955097389
-
A flexible checkpoint/restart model in distributed systems
-
Berlin, Heidelberg. Springer-Verlag
-
Bouguerra, M.-S., Gautier, T., Trystram, D., and Vincent, J.-M. (2010). A Flexible Checkpoint/Restart Model in Distributed Systems. In Proceedings of the 8th international conference on Parallel processing and applied mathematics: Part I, PPAM'09, pages 206-215, Berlin, Heidelberg. Springer-Verlag.
-
(2010)
Proceedings of the 8th International Conference on Parallel Processing and Applied Mathematics: Part I, PPAM'09
, pp. 206-215
-
-
Bouguerra, M.-S.1
Gautier, T.2
Trystram, D.3
Vincent, J.-M.4
-
4
-
-
79961165170
-
On the scheduling of checkpoints in desktop grids
-
Washington, DC, USA. IEEE Computer Society
-
Bouguerra, M. S., Kondo, D., and Trystram, D. (2011). On the Scheduling of Checkpoints in Desktop Grids. In Proceedings of the 2011 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGRID '11, pages 305-313, Washington, DC, USA. IEEE Computer Society.
-
(2011)
Proceedings of the 2011 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGRID '11
, pp. 305-313
-
-
Bouguerra, M.S.1
Kondo, D.2
Trystram, D.3
-
5
-
-
33847113613
-
Gridsim: A toolkit for the modeling and simulation of distributed resource management and scheduling for grid computing
-
cs.DC/0203019
-
Buyya, R. and Murshed, M. M. (2002). Gridsim: A toolkit for the modeling and simulation of distributed resource management and scheduling for grid computing. CoRR, cs.DC/0203019.
-
(2002)
CoRR
-
-
Buyya, R.1
Murshed, M.M.2
-
6
-
-
0037413288
-
Checkpointing with mutable checkpoints
-
Cao, G. and Singhal, M. (2003). Checkpointing with mutable checkpoints. Theor. Comput. Sci., pages 1127-1148.
-
(2003)
Theor. Comput. Sci.
, pp. 1127-1148
-
-
Cao, G.1
Singhal, M.2
-
7
-
-
85008006673
-
Adaptive task checkpointing and replication: Toward efficient fault-tolerant grids
-
Chtepen, M., Claeys, F. H. A., Dhoedt, B., De Turck, F., Demeester, P., and Vanrolleghem, P. A. (2009). Adaptive task checkpointing and replication: Toward efficient fault-tolerant grids. IEEE Trans. Parallel Distrib. Syst., 20:180-190.
-
(2009)
IEEE Trans. Parallel Distrib. Syst.
, vol.20
, pp. 180-190
-
-
Chtepen, M.1
Claeys, F.H.A.2
Dhoedt, B.3
De Turck, F.4
Demeester, P.5
Vanrolleghem, P.A.6
-
8
-
-
0028420052
-
Checkpointing and rollback-recovery algorithms in distributed systems
-
Deng, Y. and Park, E. (1994). Checkpointing and rollback-recovery algorithms in distributed systems. Journal of Systems and Software, 25(1):59-71.
-
(1994)
Journal of Systems and Software
, vol.25
, Issue.1
, pp. 59-71
-
-
Deng, Y.1
Park, E.2
-
9
-
-
0026867749
-
Manetho: Transparent roll back-recovery with low overhead, limited rollback, and fast output commit
-
Elnozahy, E. and Zwaenepoel, W. (1992). Manetho: transparent roll back-recovery with low overhead, limited rollback, and fast output commit. Computers, IEEE Transactions on, 41(5):526-531.
-
(1992)
Computers, IEEE Transactions on
, vol.41
, Issue.5
, pp. 526-531
-
-
Elnozahy, E.1
Zwaenepoel, W.2
-
10
-
-
0004962477
-
-
Prentice-Hall, Inc., Upper Saddle River, NJ, USA
-
Johnson, B.W. (1996). An introduction to the design and analysis of fault-tolerant systems, pages 1-87. Prentice-Hall, Inc., Upper Saddle River, NJ, USA.
-
(1996)
An Introduction to the Design and Analysis of Fault-tolerant Systems
, pp. 1-87
-
-
Johnson, B.W.1
-
11
-
-
0023090161
-
Checkpointing and rollback-recovery for distributed systems
-
Koo, R. and Toueg, S. (1987). Checkpointing and rollback-recovery for distributed systems. IEEE Trans. Softw. Eng., 13:23-31.
-
(1987)
IEEE Trans. Softw. Eng.
, vol.13
, pp. 23-31
-
-
Koo, R.1
Toueg, S.2
-
12
-
-
0345446547
-
The workload on parallel supercomputers: Modeling the characteristics of rigid jobs
-
Lublin, U. and Feitelson, D. G. (2003). The workload on parallel supercomputers: modeling the characteristics of rigid jobs. J. Parallel Distrib. Comput., 63:1105-1122.
-
(2003)
J. Parallel Distrib. Comput.
, vol.63
, pp. 1105-1122
-
-
Lublin, U.1
Feitelson, D.G.2
-
13
-
-
33746060807
-
Transparent consistent replication of java RMI objects
-
Narasimhan, N., Moser, L., and Melliar-Smith, P. (2000). Transparent consistent replication of java rmi objects. In Distributed Objects and Applications, 2000. Proceedings. DOA '00. International Symposium on, pages 17 -26.
-
(2000)
Distributed Objects and Applications, 2000. Proceedings. DOA '00. International Symposium on
, pp. 17-26
-
-
Narasimhan, N.1
Moser, L.2
Melliar-Smith, P.3
-
14
-
-
0002503948
-
Compiler-assisted memory exclusion for fast checkpointing
-
Plank, J. S., Beck, M., and Kingsley, G. (1995). Compiler-assisted memory exclusion for fast checkpointing. IEEE TECHNICAL COMMITTEE ON OPERATING SYSTEMS AND APPLICATION ENVIRONMENTS, 7:62-67.
-
(1995)
IEEE Technical Committee on Operating Systems and Application Environments
, vol.7
, pp. 62-67
-
-
Plank, J.S.1
Beck, M.2
Kingsley, G.3
-
16
-
-
34547253958
-
Roam: A scalable replication system for mobile computing
-
Ratner, D., Reiher, P., and Popek, G. (1999). Roam: a scalable replication system for mobile computing. In Database and Expert Systems Applications, 1999. Proceedings. TenthInternational Workshop on, pages 96 -104.
-
(1999)
Database and Expert Systems Applications, 1999. Proceedings. TenthInternational Workshop on
, pp. 96-104
-
-
Ratner, D.1
Reiher, P.2
Popek, G.3
-
17
-
-
84958764962
-
Optimistic replication for internet data services
-
London, UK, UK. Springer-Verlag
-
Saito, Y. and Levy, H. M. (2000). Optimistic replication for internet data services. In Proceedings of the 14th International Conference on Distributed Computing, DISC '00, pages 297-314, London, UK, UK. Springer-Verlag.
-
(2000)
Proceedings of the 14th International Conference on Distributed Computing, DISC '00
, pp. 297-314
-
-
Saito, Y.1
Levy, H.M.2
-
18
-
-
84864756973
-
An experimental study about diskless checkpointing
-
vol.1
-
Silva, L. and Silva, J. (1998). An experimental study about diskless checkpointing. In Euromicro Conference, 1998. Proceedings. 24th, volume 1, pages 395 -402 vol.1.
-
(1998)
Euromicro Conference, 1998. Proceedings. 24th
, vol.1
, pp. 395-402
-
-
Silva, L.1
Silva, J.2
-
21
-
-
79952466726
-
Grid and P2P middleware for wide-area parallel processing
-
Xhafa, F., Pllana, S., Barolli, L., and Spaho, E. (2011). Grid and P2P Middleware for Wide-Area Parallel Processing. Concurrency and Computation: Practice and Experience, 23(5):458-476
-
(2011)
Concurrency and Computation: Practice and Experience
, vol.23
, Issue.5
, pp. 458-476
-
-
Xhafa, F.1
Pllana, S.2
Barolli, L.3
Spaho, E.4
|