-
4
-
-
12344277946
-
The design and implementation of Berkeley Lab's Linux checkpoint/restart
-
Available at
-
J. Duell, P. Hargrove, and E. Roman. The Design and Implementation of Berkeley Lab's Linux Checkpoint/Restart. Technical Report LBNL-54941, 2002. Available at https://ftg.lbl.gov/CheckpointRestart/Pubs/blcr.pdf.
-
Technical Report LBNL-54941
-
-
Duell, J.1
Hargrove, P.2
Roman, E.3
-
5
-
-
34848824452
-
A survey of checkpoint/restart implementations
-
Lawrence Berkeley National Laboratory, 2002. Available at
-
E. Roman. A Survey of Checkpoint/Restart Implementations. Technical Report LBNL-54942, Lawrence Berkeley National Laboratory, 2002. Available at https://ftg.lbl.gov/CheckpointRestart/CheckpointPapers.shtml.
-
(2002)
Technical Report LBNL-54942
-
-
Roman, E.1
-
6
-
-
0032317368
-
System-level versus user-defined checkpointing
-
J.G. Silva and L.M. Silva. System-level versus user-defined checkpointing. In SRDS, 1998.
-
(1998)
SRDS
-
-
Silva, J.G.1
Silva, L.M.2
-
7
-
-
34948863388
-
Migol: A fault-tolerant service framework for MPI applications in the grid
-
A. Luckow and B. Schnor. Migol: A Fault-Tolerant Service Framework for MPI Applications in the Grid. In Journal of Future Generation Computer Systems '08, volume 24, pages 142-152, 2008.
-
(2008)
Journal of Future Generation Computer Systems '08
, vol.24
, pp. 142-152
-
-
Luckow, A.1
Schnor, B.2
-
8
-
-
34548789748
-
The design and implementation of checkpoint/restart process fault tolerance for open MPI
-
J. Hursey, J. Squyres, T. Mattox, and A. Lumsdaine. The Design and Implementation of Checkpoint/Restart Process Fault Tolerance for Open MPI. In IPDPS, 2007.
-
(2007)
IPDPS
-
-
Hursey, J.1
Squyres, J.2
Mattox, T.3
Lumsdaine, A.4
-
9
-
-
34547940148
-
FEMPI: A lightweight fault-tolerant MPI for embedded cluster systems
-
R. Subramaniyan, V. Aggarwal, A. Jacobs, and A. George. FEMPI: A Lightweight Fault-Tolerant MPI for Embedded Cluster Systems. In ESA, 2006.
-
(2006)
ESA
-
-
Subramaniyan, R.1
Aggarwal, V.2
Jacobs, A.3
George, A.4
-
10
-
-
34547424834
-
Application-transparent checkpoint/restart for MPI programs over infiniband
-
Q. Gao, W. Yu, W. Huang, and D.K. Panda. Application-Transparent Checkpoint/Restart for MPI Programs over InfiniBand. In ICPP, 2006.
-
(2006)
ICPP
-
-
Gao, Q.1
Yu, W.2
Huang, W.3
Panda, D.K.4
-
11
-
-
33746779994
-
MPICH-V project: A multiprotocol automatic fault tolerant MPI
-
A. Bouteiler, T. Herault, G. Krawezik, P. Lemarinier, and F. Cappello. MPICH-V project: A multiprotocol automatic fault tolerant MPI. The International Journal of High Performance Computing Applications, 20:319-333, 2006.
-
(2006)
International Journal of High Performance Computing Applications
, vol.20
, pp. 319-333
-
-
Bouteiler, A.1
Herault, T.2
Krawezik, G.3
Lemarinier, P.4
Cappello, F.5
-
15
-
-
51049095700
-
Enhancing application robustness through adaptive fault tolerance
-
Z. Zheng, P. Gujrati, Z. Lan, and Y. Li. Enhancing Application Robustness through Adaptive Fault Tolerance. In IPDPS, 2008.
-
(2008)
IPDPS
-
-
Zheng, Z.1
Gujrati, P.2
Lan, Z.3
Li, Y.4
-
16
-
-
67650091156
-
A tunable holistic resiliency approach for high-performance computing systems
-
S. Scott, C. Engelmann, G. Vallee, and T. Naughton et al. A tunable holistic resiliency approach for high-performance computing systems. In PPoPP, 2009.
-
(2009)
PPoPP
-
-
Scott, S.1
Engelmann, C.2
Vallee, G.3
Naughton, T.4
-
19
-
-
0036534708
-
Implementing the JMS publish/subscribe API
-
P. Rousselle. Implementing the JMS publish/subscribe API. In Dr. Dobb's Journal, volume 27, pages 28-32, 2002.
-
(2002)
Dr. Dobb's Journal
, vol.27
, pp. 28-32
-
-
Rousselle, P.1
-
24
-
-
77951471742
-
-
S. Yemini, S. Kliger, E. Mozes, Y. Yemini, and D. Ohsie. High speed and robust event correlation. 34, 1996.
-
(1996)
High speed and robust event correlation
, vol.34
-
-
Yemini, S.1
Kliger, S.2
Mozes, E.3
Yemini, Y.4
Ohsie, D.5
-
28
-
-
36049039788
-
Data-driven, data-intensive computing for modelling and analysis of biological networks: Application to bioethanol production
-
B.H. Park, N.F. Samatova, A. Jallouk, S. Molony, S. Horton, and S. Arcangeli. Data-driven, data-intensive computing for modelling and analysis of biological networks: Application to bioethanol production. Journal of Physics: Conference Series, 78, 2007.
-
(2007)
Journal of Physics: Conference Series
, vol.78
-
-
Park, B.H.1
Samatova, N.F.2
Jallouk, A.3
Molony, S.4
Horton, S.5
Arcangeli, S.6
-
29
-
-
41349108025
-
From pull-down data to protein interaction networks and complexes with biological relevance
-
B. Zhang, B.H. Park, T. Karpinets, and N. Samatova. From pull-down data to protein interaction networks and complexes with biological relevance. Journal of Bioinformatics, (24), 2008.
-
(2008)
Journal of Bioinformatics
, Issue.24
-
-
Zhang, B.1
Park, B.H.2
Karpinets, T.3
Samatova, N.4
-
30
-
-
77951469575
-
-
Cifts website: Http://www.mcs.anl.gov/research/cifts/.
-
-
-
|