-
1
-
-
0026140643
-
Virtual Memory Primitives for User Programs
-
Santa Clara, Calif., Apr.
-
A. Appel and K. Li, "Virtual Memory Primitives for User Programs," Proc. Fourth Int'l Conf. Architectural Support for Program ming Languages and Operating Systems, pp. 96-107, Santa Clara, Calif., Apr. 1991.
-
(1991)
Proc. Fourth Int'l Conf. Architectural Support for Program Ming Languages and Operating Systems
, pp. 96-107
-
-
Appel, A.1
Li, K.2
-
2
-
-
0031570635
-
Application Level Fault Tolerance in Heterogeneous Networks of Workstations
-
Sept.
-
A. Beguelin, E. Seligman, and P. Stephan, "Application Level Fault Tolerance in Heterogeneous Networks of Workstations," J. Parallel and Distributed Computing, vol. 43, Sept. 1997.
-
(1997)
J. Parallel and Distributed Computing
, vol.43
-
-
Beguelin, A.1
Seligman, E.2
Stephan, P.3
-
3
-
-
0028194811
-
EVENODD: An Optimal Scheme for Tolerating Double Disk Failures in RAID Architectures
-
Chicago, Apr.
-
M. Blaum, J. Brady, J. Bruck, and J. Menon, "EVENODD: An Optimal Scheme for Tolerating Double Disk Failures in RAID Architectures," Proc. 21st Ann. Int'l Symp. Computer Architecture, pp. 245-254, Chicago, Apr. 1994.
-
(1994)
Proc. 21st Ann. Int'l Symp. Computer Architecture
, pp. 245-254
-
-
Blaum, M.1
Brady, J.2
Bruck, J.3
Menon, J.4
-
4
-
-
0042838995
-
MIST: PVM with Transparent Migration and Checkpointing
-
Pittsburgh, Pa., May
-
J. Casas, D.L. Clark, P.S. Galbiati, R. Konuru, S.W. Otto, R.M. Prouty, and J. Walpole, "MIST: PVM with Transparent Migration and Checkpointing," Proc. Third Ann. PVM Users' Group Meeting, Pittsburgh, Pa., May 1995.
-
(1995)
Proc. Third Ann. PVM Users' Group Meeting
-
-
Casas, J.1
Clark, D.L.2
Galbiati, P.S.3
Konuru, R.4
Otto, S.W.5
Prouty, R.M.6
Walpole, J.7
-
5
-
-
0028444938
-
RAID: High-Performance, Reliable Secondary Storage
-
June
-
P.M. Chen, E.K. Lee, G.A. Gibson, R.H. Katz, and D.A. Patterson, "RAID: High-Performance, Reliable Secondary Storage," ACM Computing Surveys, vol. 26, no. 2, pp. 145-185, June 1994.
-
(1994)
ACM Computing Surveys
, vol.26
, Issue.2
, pp. 145-185
-
-
Chen, P.M.1
Lee, E.K.2
Gibson, G.A.3
Katz, R.H.4
Patterson, D.A.5
-
6
-
-
0029715009
-
Efficient Checkpoint Mechanisms for Massively Parallel Machines
-
Sendai, June
-
T. Chiueh and P. Deng, "Efficient Checkpoint Mechanisms for Massively Parallel Machines," Proc. 26th Int'l Symp. Fault-Tolerant Computing, pp. 370-379, Sendai, June 1996.
-
(1996)
Proc. 26th Int'l Symp. Fault-Tolerant Computing
, pp. 370-379
-
-
Chiueh, T.1
Deng, P.2
-
7
-
-
0005029744
-
Lightweight Logging for Lazy Release Consistent Distributed Shared Memory
-
Oct.
-
M. Costa, P. Guedes, M. Sequeira, N. Neves, and M. Castro, "Lightweight Logging for Lazy Release Consistent Distributed Shared Memory," Proc. Second Symp. Operating Systems Design and Implementation, Oct. 1996.
-
(1996)
Proc. Second Symp. Operating Systems Design and Implementation
-
-
Costa, M.1
Guedes, P.2
Sequeira, M.3
Neves, N.4
Castro, M.5
-
8
-
-
0004096191
-
-
Technical Report CMU-CS-96-181, Carnegie Mellon Univ., Oct.
-
E.N. Elnozahy, D.B. Johnson, and Y.M. Wang, "A Survey of Roll-back-Recovery Protocols in Message-Passing Systems," Technical Report CMU-CS-96-181, Carnegie Mellon Univ., Oct. 1996.
-
(1996)
A Survey of Roll-back-Recovery Protocols in Message-Passing Systems
-
-
Elnozahy, E.N.1
Johnson, D.B.2
Wang, Y.M.3
-
9
-
-
84871146551
-
The Performance of Consistent Checkpointing
-
Oct.
-
E.N. Elnozahy, D.B. Johnson, and W. Zwaenepoel, "The Performance of Consistent Checkpointing," Proc. 11th Symp. Reliable Distributed Systems, pp. 39-47, Oct. 1992.
-
(1992)
Proc. 11th Symp. Reliable Distributed Systems
, pp. 39-47
-
-
Elnozahy, E.N.1
Johnson, D.B.2
Zwaenepoel, W.3
-
10
-
-
0026867749
-
Manetho: Transparent Roll-back-Recovery with Low Overhead, Limited Rollback and Fast Output Commit
-
May
-
E.N. Elnozahy and W. Zwaenepoel, "Manetho: Transparent Roll-back-Recovery with Low Overhead, Limited Rollback and Fast Output Commit," IEEE Trans. Computers, vol. 41, no. 5, May 1992.
-
(1992)
IEEE Trans. Computers
, vol.41
, Issue.5
-
-
Elnozahy, E.N.1
Zwaenepoel, W.2
-
11
-
-
84976813771
-
Igor: A System for Program Debugging via Reversible Execution
-
Jan.
-
S.I. Feldman and C.B. Brown, "Igor: A System for Program Debugging via Reversible Execution," ACM SIGPLAN Notices, Workshop Parallel and Distributed Debugging, vol. 24, no. 1, pp. 112-123, Jan. 1989.
-
(1989)
ACM SIGPLAN Notices, Workshop Parallel and Distributed Debugging
, vol.24
, Issue.1
, pp. 112-123
-
-
Feldman, S.I.1
Brown, C.B.2
-
12
-
-
0003637465
-
-
Boston: MIT Press
-
A. Geist, A. Beguelin, J. Dongarra, R. Manchek, W. Jaing, and V. Sunderam, PVM - A Users' Guide and Tutorial for Networked Parallel Computing. Boston: MIT Press, 1994.
-
(1994)
PVM - A Users' Guide and Tutorial for Networked Parallel Computing
-
-
Geist, A.1
Beguelin, A.2
Dongarra, J.3
Manchek, R.4
Jaing, W.5
Sunderam, V.6
-
14
-
-
0024866388
-
Failure Correction Techniques for Large Disk Arrays
-
Boston, Apr.
-
G.A. Gibson, L. Hellerstein, R.M. Karp, R.H. Katz, and D.A. Patterson, "Failure Correction Techniques for Large Disk Arrays," Proc. Third Int'l Conf. Architectural Support for Programming Languages and Operating Systems, pp. 123-132, Boston, Apr. 1989.
-
(1989)
Proc. Third Int'l Conf. Architectural Support for Programming Languages and Operating Systems
, pp. 123-132
-
-
Gibson, G.A.1
Hellerstein, L.2
Karp, R.M.3
Katz, R.H.4
Patterson, D.A.5
-
15
-
-
0003800693
-
-
Technical Report TN-388-STR, Nat'l Center for Atmospheric Research, Boulder, Colo.
-
J.J. Hack, R. Jakob, and D.L. Williamson, "Solutions to the Shallow Water Test Set Using the Spectral Transform Method," Technical Report TN-388-STR, Nat'l Center for Atmospheric Research, Boulder, Colo., 1993.
-
(1993)
Solutions to the Shallow Water Test Set Using the Spectral Transform Method
-
-
Hack, J.J.1
Jakob, R.2
Williamson, D.L.3
-
16
-
-
0030649441
-
Fault Tolerant Matrix Operations for Networks of Workstations Using Multiple Checkpointing
-
Seoul, Korea, Apr.
-
Y. Kim, J.S. Plank, and J.J. Dongarra, "Fault Tolerant Matrix Operations for Networks of Workstations Using Multiple Checkpointing," Proc. High Performance Computing on the Information Superhighway, HPC Asia '97, pp. 460-465, Seoul, Korea, Apr. 1997.
-
(1997)
Proc. High Performance Computing on the Information Superhighway, HPC Asia '97
, pp. 460-465
-
-
Kim, Y.1
Plank, J.S.2
Dongarra, J.J.3
-
17
-
-
0000674171
-
Job and Process Recovery in a UNIX-Based Operating System
-
San Diego, Calif., Jan.
-
B.A. Kingsbury and J.T. Kline, "Job and Process Recovery in a UNIX-Based Operating System," Proc. Usenix Winter 1989 Technical Conf., pp. 355-364, San Diego, Calif., Jan. 1989.
-
(1989)
Proc. Usenix Winter 1989 Technical Conf.
, pp. 355-364
-
-
Kingsbury, B.A.1
Kline, J.T.2
-
18
-
-
0003901150
-
-
Redwood City, Calif.: Benjamin/Cummings
-
V. Kumar, A. Grama, A. Gupta, and G. Karypis, Introduction to Parallel Computing. Redwood City, Calif.: Benjamin/Cummings, 1994.
-
(1994)
Introduction to Parallel Computing
-
-
Kumar, V.1
Grama, A.2
Gupta, A.3
Karypis, G.4
-
22
-
-
0028485392
-
Low-Latency, Concurrent Checkpointing for Parallel Programs
-
Aug.
-
K. Li, J.F. Naughton, and J.S. Plank, "Low-Latency, Concurrent Checkpointing for Parallel Programs," IEEE Trans. Parallel and Distributed Systems, vol. 5, no. 8, pp. 874-879, Aug. 1994.
-
(1994)
IEEE Trans. Parallel and Distributed Systems
, vol.5
, Issue.8
, pp. 874-879
-
-
Li, K.1
Naughton, J.F.2
Plank, J.S.3
-
23
-
-
0029204130
-
A Longitudinal Survey of Internet Host Reliability
-
Bad Neuenahr, Sept.
-
D. Long, A. Muir, and R. Golding, "A Longitudinal Survey of Internet Host Reliability," Proc. 14th Symp. Reliable Distributed Systems, pp. 2-9, Bad Neuenahr, Sept. 1995.
-
(1995)
Proc. 14th Symp. Reliable Distributed Systems
, pp. 2-9
-
-
Long, D.1
Muir, A.2
Golding, R.3
-
24
-
-
0030392072
-
Improving the Performance of Coordinated Checkpointers on Networks of Workstations Using RAID Techniques
-
Oct.
-
J.S. Plank, "Improving the Performance of Coordinated Checkpointers on Networks of Workstations Using RAID Techniques," Proc. 15th Symp. Reliable Distributed Systems, pp. 76-85, Oct. 1996.
-
(1996)
Proc. 15th Symp. Reliable Distributed Systems
, pp. 76-85
-
-
Plank, J.S.1
-
25
-
-
0031223146
-
A Tutorial on Reed-Solomon Coding for Fault-Tolerance in RAID-Like Systems
-
Sept.
-
J.S. Plank, "A Tutorial on Reed-Solomon Coding for Fault-Tolerance in RAID-Like Systems," Software - Practice & Experience, vol. 27, no. 9, pp. 995-1,012, Sept. 1997.
-
(1997)
Software - Practice & Experience
, vol.27
, Issue.9
-
-
Plank, J.S.1
-
26
-
-
85084159983
-
Libckpt: Transparent Checkpointing under Unix
-
Jan.
-
J.S. Plank, M. Beck, G. Kingsley, and K. Li, "Libckpt: Transparent Checkpointing Under Unix," Proc. Usenix Winter 1995 Technical Conf., pp. 213-223, Jan. 1995.
-
(1995)
Proc. Usenix Winter 1995 Technical Conf.
, pp. 213-223
-
-
Plank, J.S.1
Beck, M.2
Kingsley, G.3
Li, K.4
-
27
-
-
0031570636
-
Fault Tolerant Matrix Operations for Networks of Workstations Using Diskless Checkpointing
-
Sept.
-
J.S. Plank, Y. Kim, and J. Dongarra, "Fault Tolerant Matrix Operations for Networks of Workstations Using Diskless Checkpointing," J. Parallel and Distributed Computing, vol. 43, pp. 125-138, Sept. 1997.
-
(1997)
J. Parallel and Distributed Computing
, vol.43
, pp. 125-138
-
-
Plank, J.S.1
Kim, Y.2
Dongarra, J.3
-
28
-
-
0028060943
-
Faster Checkpointing with N + 1 Parity
-
Austin, Tex., June
-
J.S. Plank and K. Li, "Faster Checkpointing with N + 1 Parity," Proc. 24th Int'l Symp. Fault-Tolerant Computing, pp. 288-297, Austin, Tex., June 1994.
-
(1994)
Proc. 24th Int'l Symp. Fault-Tolerant Computing
, pp. 288-297
-
-
Plank, J.S.1
Li, K.2
-
29
-
-
0002991145
-
Ickp - A Consistent Checkpointer for Multicomputers
-
Summer
-
J.S. Plank and K. Li, "Ickp - A Consistent Checkpointer for Multicomputers," IEEE Parallel & Distributed Technology, vol. 2, no. 2, pp. 62-67, Summer 1994.
-
(1994)
IEEE Parallel & Distributed Technology
, vol.2
, Issue.2
, pp. 62-67
-
-
Plank, J.S.1
Li, K.2
-
30
-
-
0004097019
-
-
Technical Report CS-95-302, Univ. of Tennessee, Aug.
-
J.S. Plank, J. Xu, and R.H.B. Netzer, "Compressed Differences: An Algorithm for Fast Incremental Checkpointing," Technical Report CS-95-302, Univ. of Tennessee, Aug. 1995.
-
(1995)
Compressed Differences: An Algorithm for Fast Incremental Checkpointing
-
-
Plank, J.S.1
Xu, J.2
Netzer, R.H.B.3
-
31
-
-
0028994280
-
Fault-Tolerance for Off-the-Shelf Applications and Hardware
-
Pasadena, Calif., June
-
M. Russinovich and Z. Segall, "Fault-Tolerance for Off-the-Shelf Applications and Hardware," Proc. 25th Int'l Symp. Fault-Tolerant Computing, pp. 67-71, Pasadena, Calif., June 1995.
-
(1995)
Proc. 25th Int'l Symp. Fault-Tolerant Computing
, pp. 67-71
-
-
Russinovich, M.1
Segall, Z.2
-
33
-
-
0028578960
-
Checkpointing SPMD Applications on Transputer Networks
-
Knoxville, Tenn., May
-
L.M. Silva, B. Veer, and J.G. Silva, "Checkpointing SPMD Applications on Transputer Networks," Proc. Scalable High Performance Computing Conf., pp. 694-701, Knoxville, Tenn., May 1994.
-
(1994)
Proc. Scalable High Performance Computing Conf.
, pp. 694-701
-
-
Silva, L.M.1
Veer, B.2
Silva, J.G.3
-
35
-
-
0029251277
-
The Condor Distributed Processing System
-
Feb.
-
T. Tannenbaum and M. Litzkow, "The Condor Distributed Processing System," Dr. Dobb's J., no. 227, pp. 40-48, Feb. 1995.
-
(1995)
Dr. Dobb's J.
, Issue.227
, pp. 40-48
-
-
Tannenbaum, T.1
Litzkow, M.2
-
37
-
-
0031388399
-
Impact of Checkpoint Latency on Overhead Ratio of a Checkpointing Scheme
-
August
-
N.H. Vaidya, "Impact of Checkpoint Latency on Overhead Ratio of a Checkpointing Scheme," IEEE Trans. Computers, vol. 46, no. 8, pp. 942-947, August 1997.
-
(1997)
IEEE Trans. Computers
, vol.46
, Issue.8
, pp. 942-947
-
-
Vaidya, N.H.1
-
38
-
-
0028994273
-
Checkpointing and Its Applications
-
Pasadena, Calif., June
-
Y-M. Wang, Y. Huang, K.-P. Vo, P.-Y. Chung, and C. Kintala, "Checkpointing and Its Applications," Proc. 25th Int'l Symp. Fault-Tolerant Computing, pp. 22-31, Pasadena, Calif., June 1995.
-
(1995)
Proc. 25th Int'l Symp. Fault-Tolerant Computing
, pp. 22-31
-
-
Wang, Y.-M.1
Huang, Y.2
Vo, K.-P.3
Chung, P.-Y.4
Kintala, C.5
|