-
1
-
-
0003706460
-
-
Philadelphia: SIAM
-
Anderson E., Bai Z., Bischof C., Demmel J., Dongarra J., Du Croz J., Greenbaum A., Hammarling S., McKenney A., Ostrouchov S., Sorensen D. Lapack User's Guide. 1992;SIAM, Philadelphia.
-
(1992)
Lapack User's Guide
-
-
Anderson, E.1
Bai, Z.2
Bischof, C.3
Demmel, J.4
Dongarra, J.5
Du Croz, J.6
Greenbaum, A.7
Hammarling, S.8
McKenney, A.9
Ostrouchov, S.10
Sorensen, D.11
-
2
-
-
0029709764
-
Dome: Parallel programming in a distributed computing environment
-
IEEE Comput. Soc.
-
A. N. C. Arabe, A. Beguelin, B. Lowekamp, E. Seligman, M. Starkey, P. Stephan, 1996, Dome: Parallel programming in a distributed computing environment, Proc. 10th International Parallel Processing Symposium, IEEE Comput. Soc.
-
(1996)
Proc. 10th International Parallel Processing Symposium
-
-
Arabe, A.N.C.1
Beguelin, A.2
Lowekamp, B.3
Seligman, E.4
Starkey, M.5
Stephan, P.6
-
6
-
-
0028194811
-
EVENODD: An optimal scheme for tolerating double disk failures in RAID architectures
-
254.
-
M. Blaum, J. Brady, J. Bruck, J. Menon, 1994, EVENODD: An optimal scheme for tolerating double disk failures in RAID architectures, Proc. 21st Annual International Symposium on Computer Architecture, 245, 254.
-
(1994)
Proc. 21st Annual International Symposium on Computer Architecture
, pp. 245
-
-
Blaum, M.1
Brady, J.2
Bruck, J.3
Menon, J.4
-
7
-
-
0029254186
-
Floating point fault tolerance with backward error assertions
-
Boley D., Golub G. H., Makar S., Saxena N., McCluskey E. J. Floating point fault tolerance with backward error assertions. IEEE Trans. Comput. 44:Feb. 1995.
-
(1995)
IEEE Trans. Comput.
, vol.44
-
-
Boley, D.1
Golub, G.H.2
Makar, S.3
Saxena, N.4
McCluskey, E.J.5
-
8
-
-
0024606852
-
Fault tolerance under UNIX
-
Borg A., Blau W., Graetsch W., Herrman F., Oberle W. Fault tolerance under UNIX. ACM Trans. Comput. Systems. 7:Feb. 1989;1-24.
-
(1989)
ACM Trans. Comput. Systems
, vol.7
, pp. 1-24
-
-
Borg, A.1
Blau, W.2
Graetsch, W.3
Herrman, F.4
Oberle, W.5
-
9
-
-
0027878416
-
Disk array storage system reliability
-
441, IEEE Compt. Soc.
-
W. A. Burkhard, J. Menon, 1993, Disk array storage system reliability, Proc. 23rd International Symposium on Fault-Tolerant Computing, 432, 441, IEEE Compt. Soc.
-
(1993)
Proc. 23rd International Symposium on Fault-Tolerant Computing
, pp. 432
-
-
Burkhard, W.A.1
Menon, J.2
-
10
-
-
0029258203
-
MPVM: A migration transparent version of PVM
-
Casas J., Clark D. L., Konuru R., Otto S. W., Prouty R. M., Walpole J. MPVM: A migration transparent version of PVM. Compt. Systems. 8:Spring 1995;171-216.
-
(1995)
Compt. Systems
, vol.8
, pp. 171-216
-
-
Casas, J.1
Clark, D.L.2
Konuru, R.3
Otto, S.W.4
Prouty, R.M.5
Walpole, J.6
-
11
-
-
0042838995
-
MIST: PVM with transparent migration and checkpointing
-
J. Casas, D. L. Clark, P. S. Galbiati, R. Konuru, S. W. Otto, R. M. Prouty, J. Walpole, 1995, MIST: PVM with transparent migration and checkpointing, 3rd Annual PVM Users' Group Meeting.
-
(1995)
3rd Annual PVM Users' Group Meeting
-
-
Casas, J.1
Clark, D.L.2
Galbiati, P.S.3
Konuru, R.4
Otto, S.W.5
Prouty, R.M.6
Walpole, J.7
-
12
-
-
0022020346
-
Distributed snapshots: Determining global states of distributed systems
-
Chandy K. M., Lamport L. Distributed snapshots: Determining global states of distributed systems. ACM Trans. Comput. Systems. 3:Feb. 1985;3-75.
-
(1985)
ACM Trans. Comput. Systems
, vol.3
, pp. 3-75
-
-
Chandy, K.M.1
Lamport, L.2
-
13
-
-
0002924772
-
ScaLAPACK: A scalable linear algebra library for distributed memory concurrent computers
-
127, IEEE Compt. Soc.
-
J. Choi, J. Dongarra, R. Pozo, D. Walker, 1992, ScaLAPACK: A scalable linear algebra library for distributed memory concurrent computers, Proc. 4th Symposium on the Frontiers of Massively Parallel Computation, 120, 127, IEEE Compt. Soc.
-
(1992)
Proc. 4th Symposium on the Frontiers of Massively Parallel Computation
, pp. 120
-
-
Choi, J.1
Dongarra, J.2
Pozo, R.3
Walker, D.4
-
15
-
-
0027961906
-
Checkpoint/rollback in a distributed system using coarse-grained dataflow
-
433, IEEE Compt. Soc.
-
D. Cummings, L. Alkalaj, 1994, Checkpoint/rollback in a distributed system using coarse-grained dataflow, Proc. 24th International Symposium on Fault-Tolerant Computing, 424, 433, IEEE Compt. Soc.
-
(1994)
Proc. 24th International Symposium on Fault-Tolerant Computing
, pp. 424
-
-
Cummings, D.1
Alkalaj, L.2
-
17
-
-
84871146551
-
The performance of consistent checkpointing
-
47.
-
E. N. Elnozahy, D. B. Johnson, W. Zwaenepoel, 1992, The performance of consistent checkpointing, Proc. 11th Symposium on Reliable Distributed Systems, 39, 47.
-
(1992)
Proc. 11th Symposium on Reliable Distributed Systems
, pp. 39
-
-
Elnozahy, E.N.1
Johnson, D.B.2
Zwaenepoel, W.3
-
19
-
-
0003637465
-
-
Boston: MIT Press
-
Geist A., Beguelin A., Dongarra J., Manchek R., Jaing W., Sunderam V. PVM - A Users' Guide and Tutorial for Networked Parallel Computing. 1994;MIT Press, Boston.
-
(1994)
PVM - A Users' Guide and Tutorial for Networked Parallel Computing
-
-
Geist, A.1
Beguelin, A.2
Dongarra, J.3
Manchek, R.4
Jaing, W.5
Sunderam, V.6
-
20
-
-
84974763119
-
Supercomputing out of recycled garbage: Preliminary experience with piranha
-
427, ACM.
-
D. Gelernter, D. Kaminsky, 1992, Supercomputing out of recycled garbage: Preliminary experience with piranha, Proc. International Conference on Supercomputing, 417, 427, ACM.
-
(1992)
Proc. International Conference on Supercomputing
, pp. 417
-
-
Gelernter, D.1
Kaminsky, D.2
-
21
-
-
0024866388
-
Failure correction techniques for large disk arrays
-
132, ACM.
-
G. A. Gibson, L. Hellerstein, R. M. Karp, R. H. Katz, D. A. Patterson, 1989, Failure correction techniques for large disk arrays, Proc. 3rd International Conference on Architectural Support for Programming Languages and Operating Systems, 123, 132, ACM.
-
(1989)
Proc. 3rd International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 123
-
-
Gibson, G.A.1
Hellerstein, L.2
Karp, R.M.3
Katz, R.H.4
Patterson, D.A.5
-
23
-
-
0021439162
-
Algorithm-based fault tolerance for matrix operations
-
June
-
Huang K-H., Abraham J. A. Algorithm-based fault tolerance for matrix operations. IEEE Trans. Comput. C-33:June 1984;518-528.
-
(1984)
IEEE Trans. Comput. C-33
, pp. 518-528
-
-
Huang, K.-H.1
Abraham, J.A.2
-
24
-
-
38249017422
-
Recovery in distributed systems using optimistic message logging and checkpointing
-
Johnson D. B., Zwaenepoel W. Recovery in distributed systems using optimistic message logging and checkpointing. J. Algorithms. 11:Sep. 1990;462-491.
-
(1990)
J. Algorithms
, vol.11
, pp. 462-491
-
-
Johnson, D.B.1
Zwaenepoel, W.2
-
26
-
-
0023090161
-
Checkpointing and rollback-recovery for distributed systems
-
Koo R., Toueg S. Checkpointing and rollback-recovery for distributed systems. IEEE Trans. Software Engrg. SE-13:Jan. 1987;23-31.
-
(1987)
IEEE Trans. Software Engrg.
, vol.13
, pp. 23-31
-
-
Koo, R.1
Toueg, S.2
-
30
-
-
0026174913
-
An efficient checkpointing method for multicomputers with wormhole routing
-
Li K., Naughton J. F., Plank J. S. An efficient checkpointing method for multicomputers with wormhole routing. Int. J. Parallel Process. 20:June 1992;159-180.
-
(1992)
Int. J. Parallel Process.
, vol.20
, pp. 159-180
-
-
Li, K.1
Naughton, J.F.2
Plank, J.S.3
-
32
-
-
0029204130
-
A longitudinal survey of internet host reliability
-
9.
-
D. Long, A. Muir, R. Golding, 1995, A longitudinal survey of internet host reliability, Proc. 14th Symposium on Reliable Distributed Systems, 2, 9.
-
(1995)
Proc. 14th Symposium on Reliable Distributed Systems
, pp. 2
-
-
Long, D.1
Muir, A.2
Golding, R.3
-
33
-
-
0023995880
-
An analysis of algorithm-based fault tolerance techniques
-
Luk F. T., Park H. An analysis of algorithm-based fault tolerance techniques. J. Parallel Distrib. Comput. 5:1988;172-184.
-
(1988)
J. Parallel Distrib. Comput.
, vol.5
, pp. 172-184
-
-
Luk, F.T.1
Park, H.2
-
35
-
-
0000535669
-
The available capacity of a privately owned workstation environment
-
Mutka M. W., Livny M. The available capacity of a privately owned workstation environment. Performance Evaluation. 1991.
-
(1991)
Performance Evaluation
-
-
Mutka, M.W.1
Livny, M.2
-
36
-
-
0030392072
-
Improving the performance of coordinated checkpointers on networks of workstations using RAID techniques
-
J. S. Plank, 1996, Improving the performance of coordinated checkpointers on networks of workstations using RAID techniques, Proc. 15th Symposium on Reliable Distributed Systems, 76, 85.
-
(1996)
Proc. 15th Symposium on Reliable Distributed Systems
, pp. 76
-
-
Plank, J.S.1
-
38
-
-
0002991145
-
Ickp - A consistent checkpointer for multicomputers
-
Plank J. S., Li K. Ickp - a consistent checkpointer for multicomputers. IEEE Parallel Distrib. Technol. 2:Summer 1994;62-67.
-
(1994)
IEEE Parallel Distrib. Technol.
, vol.2
, pp. 62-67
-
-
Plank, J.S.1
Li, K.2
-
41
-
-
0029544460
-
Portable checkpointing and recovery
-
195.
-
L. M. Silva, J. G. Silva, S. Chapple, L. Clarke, 1995, Portable checkpointing and recovery, Proc. HPDC-4, High-Performance Distributed Computing, 188, 195.
-
(1995)
Proc. HPDC-4, High-Performance Distributed Computing
, pp. 188
-
-
Silva, L.M.1
Silva, J.G.2
Chapple, S.3
Clarke, L.4
-
42
-
-
0003710740
-
-
Boston: MIT Press
-
Snir M., Otto S. W., Huss-Lederman S., Walker D. W., Dongarra J. J. MPI: The Complete Reference. 1996;MIT Press, Boston.
-
(1996)
MPI: The Complete Reference
-
-
Snir, M.1
Otto, S.W.2
Huss-Lederman, S.3
Walker, D.W.4
Dongarra, J.J.5
-
44
-
-
0022112420
-
Optimistic recovery in distributed systems
-
Strom R. E., Yemini S. Optimistic recovery in distributed systems. ACM Trans. Comput. Systems. 3:Aug. 1985;204-226.
-
(1985)
ACM Trans. Comput. Systems
, vol.3
, pp. 204-226
-
-
Strom, R.E.1
Yemini, S.2
-
45
-
-
0028994256
-
Reduced overhead logging for rollback recovery in distributed shared memory
-
288.
-
G. Sure, R. Janssens, W. K. Fuchs, 1995, Reduced overhead logging for rollback recovery in distributed shared memory, Proc. 25th International Symposium on Fault-Tolerant Computing, 279, 288.
-
(1995)
Proc. 25th International Symposium on Fault-Tolerant Computing
, pp. 279
-
-
Sure, G.1
Janssens, R.2
Fuchs, W.K.3
|