-
1
-
-
0025507093
-
Experimental evaluation of the fault tolerance of an atomic multicast system
-
Oct.
-
J. Arlat, M. Aguera, Y. Crouzet, J.C. Fabre, E. Martins, and D. Powell, "Experimental Evaluation of the Fault Tolerance of an Atomic Multicast System," IEEE Trans. Reliability, vol. 39, no. 4, pp. 455-467, Oct. 1990.
-
(1990)
IEEE Trans. Reliability
, vol.39
, Issue.4
, pp. 455-467
-
-
Arlat, J.1
Aguera, M.2
Crouzet, Y.3
Fabre, J.C.4
Martins, E.5
Powell, D.6
-
2
-
-
0004249985
-
Hierarchical error detection in a software-implemented fault tolerance (SIFT) environment
-
PhD thesis, Univ. of Illinois, Urbana
-
S. Bagchi, "Hierarchical Error Detection in a Software-Implemented Fault Tolerance (SIFT) Environment," PhD thesis, Univ. of Illinois, Urbana, 2001.
-
(2001)
-
-
Bagchi, S.1
-
3
-
-
77954003885
-
MPI/FT: Architecture and taxonomies for fault-tolerant, message-passing middleware for performance-portable parallel computing
-
R. Batchu, J.P. Neelamegam, Z. Cui, M. Beddhu, A. Skjellum, Y. Dandass, and M. Apte, "MPI/FT: Architecture and Taxonomies for Fault-Tolerant, Message-Passing Middleware for Performance-Portable Parallel Computing," Proc. First Int'l Symp. Cluster Computing and the Grid, pp. 26-33, 2001.
-
(2001)
Proc. First Int'l Symp. Cluster Computing and the Grid
, pp. 26-33
-
-
Batchu, R.1
Neelamegam, J.P.2
Cui, Z.3
Beddhu, M.4
Skjellum, A.5
Dandass, Y.6
Apte, M.7
-
4
-
-
0034431796
-
Detailed radiation fault modeling of the remote exploration and experimentation (REE) first generation testbed architecture
-
J. Beahan et al., "Detailed Radiation Fault Modeling of the Remote Exploration and Experimentation (REE) First Generation Testbed Architecture," Proc. IEEE Aerospace Conf., vol. 5, pp. 279-281, 2000.
-
(2000)
Proc. IEEE Aerospace Conf.
, vol.5
, pp. 279-281
-
-
Beahan, J.1
-
6
-
-
1942475825
-
Stabilis: A case study in writing fault-tolerant distributed applications using persistent objects
-
Technical Report 400, Univ. of Newcastle upon Tyne, U.K.
-
L. Buzato and A. Calsavara, "Stabilis: A Case Study in Writing Fault-Tolerant Distributed Applications Using Persistent Objects," Technical Report 400, Univ. of Newcastle upon Tyne, U.K., 1992.
-
(1992)
-
-
Buzato, L.1
Calsavara, A.2
-
7
-
-
0034590460
-
Demonstration of the remote exploration and experimentation (REE) fault-tolerant parallel-processing supercomputer for spacecraft onboard scientific data processing
-
F. Chen, L. Craymer, J. Deifik, A.J. Fogel, D.S. Katz, A.G. Silliman Jr., R.R. Some, S.A. Upchurch, and K. Whisnant, "Demonstration of the Remote Exploration and Experimentation (REE) Fault-Tolerant Parallel-Processing Supercomputer for Spacecraft Onboard Scientific Data Processing," Proc. Int'l Conf. Dependable Systems and Networks, pp. 367-372, 2000.
-
(2000)
Proc. Int'l Conf. Dependable Systems and Networks
, pp. 367-372
-
-
Chen, F.1
Craymer, L.2
Deifik, J.3
Fogel, A.J.4
Katz, D.S.5
Silliman Jr., A.G.6
Some, R.R.7
Upchurch, S.A.8
Whisnant, K.9
-
8
-
-
0035789554
-
Experimental evaluation of the fail-silent behavior of a distributed real-time run-time support built from COTS components
-
P. Chevochot and I. Puaut, "Experimental Evaluation of the Fail-Silent Behavior of a Distributed Real-Time Run-Time Support Built from COTS Components," Proc. Int'l Conf. Dependable Systems and Networks, pp. 304-313, 2001.
-
(2001)
Proc. Int'l Conf. Dependable Systems and Networks
, pp. 304-313
-
-
Chevochot, P.1
Puaut, I.2
-
9
-
-
0032306688
-
AQuA: An adaptive architecture that provides dependable distributed objects
-
Y.J. Ren, D.E. Bakken, T. Courtney, M. Cukier, D.A. Karr, P. Rubel, C. Sabnis, W.H. Sanders, R.E. Schantz, and M. Seri, "AQuA: An Adaptive Architecture that Provides Dependable Distributed Objects," Proc. 17th Symp. Reliable Distributed Systems, pp. 245-253, 1998.
-
(1998)
Proc. 17th Symp. Reliable Distributed Systems
, pp. 245-253
-
-
Ren, Y.J.1
Bakken, D.E.2
Courtney, T.3
Cukier, M.4
Karr, D.A.5
Rubel, P.6
Sabnis, C.7
Sanders, W.H.8
Schantz, R.E.9
Seri, M.10
-
10
-
-
0036821893
-
The Möbius framework and its implementation
-
Oct.
-
D.D. Deavours, G. Clark, T. Courtney, D. Daly, S. Derisavi, J.M. Doyle, W.H. Sanders, and P.G. Webster, "The Möbius Framework and Its Implementation," IEEE Trans. Software Eng., vol. 28, no. 10, pp. 956-969, Oct. 2002.
-
(2002)
IEEE Trans. Software Eng.
, vol.28
, Issue.10
, pp. 956-969
-
-
Deavours, D.D.1
Clark, G.2
Courtney, T.3
Daly, D.4
Derisavi, S.5
Doyle, J.M.6
Sanders, W.H.7
Webster, P.G.8
-
11
-
-
0031674242
-
A metaobject architecture for fault-tolerant distributed systems: The FRIENDS approach
-
Jan.
-
J.-C. Fabre and T. Pérennou, "A Metaobject Architecture for Fault-Tolerant Distributed Systems: The FRIENDS Approach," IEEE Trans. Computers, vol. 47, no. 1, pp. 78-95, Jan. 1998.
-
(1998)
IEEE Trans. Computers
, vol.47
, Issue.1
, pp. 78-95
-
-
Fabre, J.-C.1
Pérennou, T.2
-
12
-
-
84940567900
-
FT-MPI: Fault tolerant MPI, supporting dynamic applications in a dynamic world
-
G. Fagg and J. Dongarra, "FT-MPI: Fault Tolerant MPI, Supporting Dynamic Applications in a Dynamic World," Lecture Notes in Computer Science, vol. 1908, pp. 346-353, 2000.
-
(2000)
Lecture Notes in Computer Science
, vol.1908
, pp. 346-353
-
-
Fagg, G.1
Dongarra, J.2
-
14
-
-
0035789206
-
Fault-tolerant high-performance matrix multiplication: Theory and practice
-
J.A. Gunnels, R.A. van de Geijn, D.S. Katz, and E.S. Quintana-Ortí, "Fault-Tolerant High-Performance Matrix Multiplication: Theory and Practice," Proc. 2001 Int'l Conf. Dependable Systems and Networks, pp. 47-56, 2001.
-
(2001)
Proc. 2001 Int'l Conf. Dependable Systems and Networks
, pp. 47-56
-
-
Gunnels, J.A.1
Van De Geijn, R.A.2
Katz, D.S.3
Quintana-Ortí, E.S.4
-
15
-
-
0004136464
-
The Ensemble System
-
PhD thesis, Cornell Univ., Ithaca, N.Y.
-
M. Hayden, "The Ensemble System," PhD thesis, Cornell Univ., Ithaca, N.Y., 1998.
-
(1998)
-
-
Hayden, M.1
-
16
-
-
84956995398
-
Providing QoS customization in distributed object systems
-
J. He, M. Rajagopalan, M.A. Hiltunen, and R.D. Schlichting, "Providing QoS Customization in Distributed Object Systems," Proc. IFIP/ACM Int'l Conf. Distributed Systems Platforms, pp. 351-372, 2001.
-
(2001)
Proc. IFIP/ACM Int'l Conf. Distributed Systems Platforms
, pp. 351-372
-
-
He, J.1
Rajagopalan, M.2
Hiltunen, M.A.3
Schlichting, R.D.4
-
17
-
-
0021439162
-
Algorithm-based fault tolerance for matrix operations
-
June
-
K. Huang and J. Abraham, "Algorithm-Based Fault Tolerance for Matrix Operations," IEEE Trans. Computers, vol. 33, no. 6, pp. 518-528, June 1984.
-
(1984)
IEEE Trans. Computers
, vol.33
, Issue.6
, pp. 518-528
-
-
Huang, K.1
Abraham, J.2
-
18
-
-
0012240398
-
Fault-tolerant cluster management for reliable high-performance computing
-
M. Li, D. Goldberg, W. Tao, and Y. Tamir, "Fault-Tolerant Cluster Management for Reliable High-Performance Computing," Proc. 13th Conf. Parallel and Distributed Computing and Systems, pp. 480-485, 2001.
-
(2001)
Proc. 13th Conf. Parallel and Distributed Computing and Systems
, pp. 480-485
-
-
Li, M.1
Goldberg, D.2
Tao, W.3
Tamir, Y.4
-
19
-
-
0032686475
-
Chameleon: A software infrastructure for adaptive fault tolerance
-
June
-
Z. Kalbarczyk, S. Bagchi, K. Whisnant, and R. Iyer, "Chameleon: A Software Infrastructure for Adaptive Fault Tolerance," IEEE Trans. Parallel and Distributed Systems, vol. 10, no. 6, pp. 560-579, June 1999.
-
(1999)
IEEE Trans. Parallel and Distributed Systems
, vol.10
, Issue.6
, pp. 560-579
-
-
Kalbarczyk, Z.1
Bagchi, S.2
Whisnant, K.3
Iyer, R.4
-
20
-
-
0002598384
-
Application of three physical fault injection techniques to the experimental assessment of the MARS architecture
-
J. Karlsson, J. Arlat, and G. Leber, "Application of Three Physical Fault Injection Techniques to the Experimental Assessment of the MARS Architecture," Proc. Fifth Dependable Computing for Critical Applications Conf., pp. 150-161, 1995.
-
(1995)
Proc. Fifth Dependable Computing for Critical Applications Conf.
, pp. 150-161
-
-
Karlsson, J.1
Arlat, J.2
Leber, G.3
-
21
-
-
0024104188
-
The design of radiation-hardened ICs for space: A compendium of approaches
-
Nov.
-
S.E. Kerns, B.D. Shafer, L.R. Rockett, Jr., J.S. Pridmore, D.F. Berndt, N. van Vonno, and F.E. Barber, "The Design of Radiation-Hardened ICs for Space: A Compendium of Approaches," Proc. IEEE, vol. 76, no. 11, pp. 1470-1509, Nov. 1988.
-
(1988)
Proc. IEEE
, vol.76
, Issue.11
, pp. 1470-1509
-
-
Kerns, S.E.1
Shafer, B.D.2
Rockett Jr., L.R.3
Pridmore, J.S.4
Berndt, D.F.5
Van Vonno, N.6
Barber, F.E.7
-
22
-
-
0036921914
-
Experimental evaluation of a COTS system for space applications
-
H. Madeira, R. Some, F. Moreira, D. Costa, and D. Rennels, "Experimental Evaluation of a COTS System for Space Applications," Proc. 2002 Int'l Conf. Dependable Systems and Networks, 2002.
-
(2002)
Proc. 2002 Int'l Conf. Dependable Systems and Networks
-
-
Madeira, H.1
Some, R.2
Moreira, F.3
Costa, D.4
Rennels, D.5
-
23
-
-
0003604499
-
MPI-2: Extensions to the message passing interface
-
Message Passing Interface Forum, "MPI-2: Extensions to the Message Passing Interface," http://www.mpi-forum.org/docs/mpi-20.ps, 1997.
-
(1997)
-
-
-
24
-
-
0030130053
-
Totem: A fault-tolerant multicast group communication system
-
Apr.
-
L.E. Moser, P.M. Melliar-Smith, D.A. Agarwal, R.K. Budhia, and C.A. Lingley-Papadopoulos, "Totem: A Fault-Tolerant Multicast Group Communication System," Comm. ACM, vol. 39, pp. 54-63, Apr. 1996.
-
(1996)
Comm. ACM
, vol.39
, pp. 54-63
-
-
Moser, L.E.1
Melliar-Smith, P.M.2
Agarwal, D.A.3
Budhia, R.K.4
Lingley-Papadopoulos, C.A.5
-
25
-
-
0032597678
-
A fault tolerance framework for CORBA
-
L. Moser, P. Melliar-Smith, and P. Narasimhan, "A Fault Tolerance Framework for CORBA," Proc. 29th Symp. Fault-Tolerant Computing, pp. 150-157, 1999.
-
(1999)
Proc. 29th Symp. Fault-Tolerant Computing
, pp. 150-157
-
-
Moser, L.1
Melliar-Smith, P.2
Narasimhan, P.3
-
26
-
-
0035789849
-
State synchronization and recovery for strongly consistent replicated CORBA objects
-
P. Narasimhan, L. Moser, and P. Melliar-Smith, "State Synchronization and Recovery for Strongly Consistent Replicated CORBA Objects," Proc. 2001 Int'l Conf. Dependable Systems and Networks, pp. 261-270, 2001.
-
(2001)
Proc. 2001 Int'l Conf. Dependable Systems and Networks
, pp. 261-270
-
-
Narasimhan, P.1
Moser, L.2
Melliar-Smith, P.3
-
27
-
-
0024132223
-
The Delta-4 approach to dependability in open distributed computing systems
-
D. Powell, D. Seaton, G. Bonn, P. Verissimo, and F. Waeselynk, "The Delta-4 Approach to Dependability in Open Distributed Computing Systems," Proc. 18th Int'l Symp. Fault-Tolerant Computing, pp. 246-251, 1988.
-
(1988)
Proc. 18th Int'l Symp. Fault-Tolerant Computing
, pp. 246-251
-
-
Powell, D.1
Seaton, D.2
Bonn, G.3
Verissimo, P.4
Waeselynk, F.5
-
28
-
-
0032626767
-
GUARDS: A generic upgradable architecture for real-time dependable systems
-
June
-
D. Powell, J. Arlat, L. Beus-Dukic, A. Bondavalli, P. Coppola, A. Fantechi, E. Jenn, C. Rabéjac, and A. Wellings, "GUARDS: A Generic Upgradable Architecture for Real-Time Dependable Systems," IEEE Trans. Parallel and Distributed Systems, vol. 10, no. 6, pp. 580-599, June 1999.
-
(1999)
IEEE Trans. Parallel and Distributed Systems
, vol.10
, Issue.6
, pp. 580-599
-
-
Powell, D.1
Arlat, J.2
Beus-Dukic, L.3
Bondavalli, A.4
Coppola, P.5
Fantechi, A.6
Jenn, E.7
Rabéjac, C.8
Wellings, A.9
-
29
-
-
0012888231
-
AQuA: A framework for providing adaptive fault tolerance to distributed applications
-
PhD thesis, Univ. of Illinois, Urbana
-
J. Ren, "AQuA: A Framework for Providing Adaptive Fault Tolerance to Distributed Applications," PhD thesis, Univ. of Illinois, Urbana, 2001.
-
(2001)
-
-
Ren, J.1
-
30
-
-
0012288454
-
Lessons learned from building and using the Arjuna distributed programming system
-
S. Shrivastava, "Lessons Learned from Building and Using the Arjuna Distributed Programming System," Lecture Notes in Computer Science, vol. 938, 1995.
-
(1995)
Lecture Notes in Computer Science
, vol.938
-
-
Shrivastava, S.1
-
32
-
-
0033875633
-
Dependability assessment in distributed systems with lightweight fault injectors in NFTAPE
-
D.T. Stott, B. Floering, Z. Kalbarczyk, and R.K. Iyer, "Dependability Assessment in Distributed Systems with Lightweight Fault Injectors in NFTAPE," Proc. Fourth Int'l Computer Performance and Dependability Symp., pp. 91-100, 2000.
-
(2000)
Proc. Fourth Int'l Computer Performance and Dependability Symp.
, pp. 91-100
-
-
Stott, D.T.1
Floering, B.2
Kalbarczyk, Z.3
Iyer, R.K.4
-
33
-
-
0030130161
-
Horus: A flexible group communication system
-
Apr.
-
R. van Renesse, K. Birman, and S. Maffeis, "Horus: A Flexible Group Communication System," Comm. ACM, vol. 39, pp. 76-83, Apr. 1996.
-
(1996)
Comm. ACM
, vol.39
, pp. 76-83
-
-
Van Renesse, R.1
Birman, K.2
Maffeis, S.3
-
36
-
-
0036928463
-
An experimental evaluation of the REE SIFT environment for spaceborne applications
-
K. Whisnant, R.K. Iyer, P. Jones, R. Some, and D. Rennels, "An Experimental Evaluation of the REE SIFT Environment for Spaceborne Applications," Proc. Int'l Conf. Dependable Systems and Networks, pp. 585-594, 2002.
-
(2002)
Proc. Int'l Conf. Dependable Systems and Networks
, pp. 585-594
-
-
Whisnant, K.1
Iyer, R.K.2
Jones, P.3
Some, R.4
Rennels, D.5
-
37
-
-
0037234868
-
A system model for dynamically reconfigurable software
-
Apr.
-
K. Whisnant, Z. Kalbarczyk, and R.K. Iyer, "A System Model for Dynamically Reconfigurable Software," IBM Systems J., vol. 42, no. 1, pp. 45-59, Apr. 2003.
-
(2003)
IBM Systems J.
, vol.42
, Issue.1
, pp. 45-59
-
-
Whisnant, K.1
Kalbarczyk, Z.2
Iyer, R.K.3
-
38
-
-
1942539791
-
A process architecture and runtime environment for dependable distributed applications
-
PhD thesis, Univ. of Illinois, Urbana
-
K. Whisnant, "A Process Architecture and Runtime Environment for Dependable Distributed Applications," PhD thesis, Univ. of Illinois, Urbana, 2003.
-
(2003)
-
-
Whisnant, K.1