-
1
-
-
0025536635
-
Lapack: A portable linear algebra library for high-performance computers
-
IEEE Press
-
Anderson, E., Bai, Z., Bischof, C., Demmel, J., Dongarra, J. J., DuCroz, J., Greenbaum, A., Hammarling, S., McKenney, A., and Sorensen, D. Lapack: A portable linear algebra library for high-performance computers. Proceedings of Supercomputing '90, IEEE Press, 1990. pp. 1-10.
-
(1990)
Proceedings of Supercomputing 90
, pp. 1-10
-
-
Anderson, E.1
Bai, Z.2
Bischof, C.3
Demmel, J.4
Dongarra, J.J.5
Ducroz, J.6
Greenbaum, A.7
Hammarling, S.8
McKenney, A.9
Sorensen, D.10
-
2
-
-
0003706460
-
-
Philadelphia
-
Anderson, E., Bai, Z., Demmel, J., Dongarra, J., DuCroz, J., Greenbaum, A., Hammarling, S., McKenney, A., Ostrouchov, S., and Sorensen, D. LAPACK Users' Guide. SIAM, Philadelphia, 1992.
-
(1992)
LAPACK Users' Guide. SIAM
-
-
Anderson, E.1
Bai, Z.2
Demmel, J.3
Dongarra, J.4
Ducroz, J.5
Greenbaum, A.6
Hammarling, S.7
McKenney, A.8
Ostrouchov, S.9
Sorensen, D.10
-
3
-
-
85067631613
-
Basic linear algebra communication subprograms
-
IEEE Comput. Soc. Press
-
Anderson, E., Benzoni, A., Dongarra, J., Moulton, S., Ostrouchov, S., Tourancheau, B., and van de Geijn, R. Basic linear algebra communication subprograms. Sixth Distributed Memory Computing Conference Proceedings. IEEE Comput. Soc. Press. 1991, pp. 287-290.
-
(1991)
Sixth Distributed Memory Computing Conference Proceedings
, pp. 287-290
-
-
Anderson, E.1
Benzoni, A.2
Dongarra, J.3
Moulton, S.4
Ostrouchov, S.5
Tourancheau, B.6
Van De Geijn, R.7
-
4
-
-
0242343480
-
LAPACK for distributed memory architectures: Progress report
-
SIAM
-
Anderson, E., Benzoni, A., Dongarra, J. J., Moulton, S., Ostrouchov, S., Tourancheau, B., and van de Geijn, R. LAPACK for distributed memory architectures: Progress report. Parallel Processing for Scientific Computing, Fifth SIAM Conference. SIAM, 1991.
-
(1991)
Parallel Processing for Scientific Computing, Fifth SIAM Conference
-
-
Anderson, E.1
Benzoni, A.2
Dongarra, J.J.3
Moulton, S.4
Ostrouchov, S.5
Tourancheau, B.6
Van De Geijn, R.7
-
8
-
-
0025997771
-
Using Strassen’s algorithm to accelerate the solution of linear systems
-
Bailey, D. H., Lee, K., and Simon, H. D. Using Strassen’s algorithm to accelerate the solution of linear systems. J. Supercomputing 4 (1990), 357-371.
-
(1990)
J. Supercomputing
, vol.4
, pp. 357-371
-
-
Bailey, D.H.1
Lee, K.2
Simon, H.D.3
-
9
-
-
3042648854
-
The LINPACK benchmark on the AP 1000: Preliminary report
-
Brent, R. P. The LINPACK benchmark on the AP 1000: Preliminary report. Proceedings of the 2nd CAP Workshop. Nov. 1991.
-
(1991)
Proceedings of the 2nd CAP Workshop
-
-
Brent, R.P.1
-
10
-
-
0002924772
-
Scalapack: A scalable linear algebra library for distributed memory concurrent computers
-
IEEE Comput. Soc. Press
-
Choi, J., Dongarra, J. J., Pozo, R., and Walker, D. W. Scalapack: A scalable linear algebra library for distributed memory concurrent computers. Proceedings of the Fourth Symposium on the Frontiers of Massively Parallel Computation. IEEE Comput. Soc. Press, 1992. pp. 120-127.
-
(1992)
Proceedings of the Fourth Symposium on the Frontiers of Massively Parallel Computation
, pp. 120-127
-
-
Choi, J.1
Dongarra, J.J.2
Pozo, R.3
Walker, D.W.4
-
11
-
-
85027610980
-
-
Elsevier, Amsterdam
-
Choi, J., Dongarra, J. J., and Walker, D. W. The design of scalable software libraries for distributed memory concurrent computers. Proceedings of the CNRS-NSF Workshop on Environments and Tools for Parallel Scientific Computing. Elsevier, Amsterdam, 1993.
-
(1993)
The Design of Scalable Software Libraries for Distributed Memory Concurrent Computers
-
-
Choi, J.1
Dongarra, J.J.2
Walker, D.W.3
-
12
-
-
35248831050
-
Electromagnetic scattering calculations on the Intel Touchstone Delta
-
IEEE Comput. Soc. Press
-
Cwik, T., Patterson, J., and Scott, D. Electromagnetic scattering calculations on the Intel Touchstone Delta. Proceedings of Supercomputing '92. IEEE Comput. Soc. Press, 1992. pp. 538-542.
-
(1992)
Proceedings of Supercomputing 92
, pp. 538-542
-
-
Cwik, T.1
Patterson, J.2
Scott, D.3
-
13
-
-
84855320879
-
-
Technical Report, Argonne National Laboratory, Mathematics and Computer Science Division
-
Demmel, J., Dongarra, J. J., Du Croz, J., Greenbaum, A., Hammarling, S., and Sorensen, D. Prospectus for the development of a linear algebra library for high performance computers. Technical Report 97, Argonne National Laboratory, Mathematics and Computer Science Division, Sept. 1987.
-
(1987)
Prospectus for the Development of a Linear Algebra Library for High Performance Computers
, vol.97
-
-
Demmel, J.1
Dongarra, J.J.2
Du Croz, J.3
Greenbaum, A.4
Hammarling, S.5
Sorensen, D.6
-
14
-
-
84947657247
-
LINPACK benchmark: Performance of various computers using standard linear equations software
-
Dongarra, J. J. LINPACK benchmark: Performance of various computers using standard linear equations software. Supercomputing Rev. 5, 3 (March 1992), 54-63.
-
(1992)
Supercomputing Rev
, vol.5
, Issue.3
, pp. 54-63
-
-
Dongarra, J.J.1
-
15
-
-
0003555195
-
-
Philadelphia
-
Dongarra, J. J., Bunch, J., Moler, C., and Stewart, G. W. LINPACK User's Guide. SIAM, Philadelphia, 1979.
-
(1979)
LINPACK User's Guide. SIAM
-
-
Dongarra, J.J.1
Bunch, J.2
Moler, C.3
Stewart, G.W.4
-
16
-
-
84911589505
-
-
Technical Report, Argonne National Laboratory, Mathematics and Computer Science Division
-
Dongarra, J. J., Du Croz, J., Duff, I., and Hammarling, S. A proposal for a set of level 3 basic linear algebra subprograms. Technical Report 88, Argonne National Laboratory, Mathematics and Computer Science Division, Apr. 1987.
-
(1987)
A Proposal for a Set of Level 3 Basic Linear Algebra Subprograms
, vol.88
-
-
Dongarra, J.J.1
Du Croz, J.2
Duff, I.3
Hammarling, S.4
-
17
-
-
0025402476
-
A set of level 3 basic linear algebra subprograms
-
Dongarra, J. J., Duff, I., Du Croz, J., and Hammarling, S. A set of level 3 basic linear algebra subprograms. ACM Trans. Math. Software 16 (March 1990), 1-17.
-
(1990)
ACM Trans. Math. Software
, vol.16
, pp. 1-17
-
-
Dongarra, J.J.1
Duff, I.2
Du Croz, J.3
Hammarling, S.4
-
18
-
-
0003793981
-
-
Philadelphia
-
Dongarra, J. J., Duff, I. S., Sorensen, D. C., and van der Vorst, H. A. Solving Linear Systems on Vector and Shared Memory Computers. SIAM, Philadelphia, 1990.
-
(1990)
Solving Linear Systems on Vector and Shared Memory Computers. SIAM
-
-
Dongarra, J.J.1
Duff, I.S.2
Sorensen, D.C.3
Van Der Vorst, H.A.4
-
19
-
-
0003517895
-
Technical Report TM-12231
-
Oak Ridge National Laboratory
-
Dongarra, J. J., Hempel, R., Hey, A. J. G., and Walker, D. W. A proposal for a user-level message passing interface in a distributed memory environment. Technical Report TM-12231. Oak Ridge National Laboratory. Feb. 1993.
-
(1993)
A Proposal for a User-Level Message Passing Interface in a Distributed Memory Environment
-
-
Dongarra, J.J.1
Hempel, R.2
Hey, A.J.G.3
Walker, D.W.4
-
22
-
-
0004060334
-
Two-dimensional basic linear algebra communication subprograms
-
Computer Science Department, University of Tennessee. Knoxville, TN
-
Dongarra, J. J., and van de Geijn, R. A. Two-dimensional basic linear algebra communication subprograms. Technical Report LAPACK working note 37, Computer Science Department, University of Tennessee. Knoxville, TN, Oct. 1991.
-
(1991)
Technical Report LAPACK Working Note
, vol.37
-
-
Dongarra, J.J.1
Van De Geijn, R.A.2
-
23
-
-
0026912004
-
Reduction to condensed form for the eigenvalue problem on distributed memory architectures
-
Dongarra, J. J. and van de Geijn, R. A. Reduction to condensed form for the eigenvalue problem on distributed memory architectures. Parallel Comput. 18 (1992), 973-982.
-
(1992)
Parallel Comput
, vol.18
, pp. 973-982
-
-
Dongarra, J.J.1
Van De Geijn, R.A.2
-
24
-
-
0026991394
-
A look at scalable dense linear algebra libraries
-
In J. H. Saltz (Ed.), IEEE Press
-
Dongarra, J. J., van de Geijn, R. A., and Walker, D. W. A look at scalable dense linear algebra libraries. In J. H. Saltz (Ed.), Proceedings of the 1992 Scalable High Performance Computing Conference. IEEE Press, 1992.
-
(1992)
Proceedings of the 1992 Scalable High Performance Computing Conference
-
-
Dongarra, J.J.1
Van De Geijn, R.A.2
Walker, D.W.3
-
25
-
-
0002663082
-
GEMMW: A portable level 3 BLAS Winograd variant of Strassen's matrix-matrix multiply algorithm
-
Douglas, C. C., Heroux, M., Slishman, G., and Smith, R. M. GEMMW: A portable level 3 BLAS Winograd variant of Strassen's matrix-matrix multiply algorithm. J. Comput. Phys. 110 (1994), 1-10.
-
(1994)
J. Comput. Phys.
, vol.110
, pp. 1-10
-
-
Douglas, C.C.1
Heroux, M.2
Slishman, G.3
Smith, R.M.4
-
26
-
-
84888771978
-
Large dense numerical linear algebra in 1993: The parallel computing influence
-
Edelman, A. Large dense numerical linear algebra in 1993: The parallel computing influence. Int. J. Supercomputing Appl. 7, 2 (1993).
-
(1993)
Int. J. Supercomputing Appl.
, vol.7
, Issue.2
-
-
Edelman, A.1
-
27
-
-
85027594380
-
Fortran D language specification. Technical Report CRPC-TR90079. Center for Research on Parallel Computation
-
Fox, G. C., Hiranandani, S., Kennedy, K., Koelbel, C., Kremer, U., Tseng, C-W., and Wu, M-Y. Fortran D language specification. Technical Report CRPC-TR90079. Center for Research on Parallel Computation, Rice University, Dec. 1990.
-
(1990)
Rice University
-
-
Fox, G.C.1
Hiranandani, S.2
Kennedy, K.3
Koelbel, C.4
Kremer, U.5
Tseng, C.-W.6
Wu, M.-Y.7
-
28
-
-
0003506603
-
-
Prentice-Hall, Englewood Cliffs, NJ
-
Fox, G. C., Johnson, M. A., Lyzenga, G. A., Otto, S. W., Salmon, J. K., and Walker, D. W. Solving Problems on Concurrent Processors. Vol. 1. Prentice-Hall, Englewood Cliffs, NJ, 1988.
-
(1988)
Solving Problems on Concurrent Processors
, vol.1
-
-
Fox, G.C.1
Johnson, M.A.2
Lyzenga, G.A.3
Otto, S.W.4
Salmon, J.K.5
Walker, D.W.6
-
29
-
-
0003407903
-
-
Lecture Notes in Computer Science, Springer-Verlag, Berlin
-
Garbow, B. S., Boyle, J. M., Dongarra, J. J., and Moler, C. B. Matrix Eigensystem Routines—EISPACK Guide Extension. Lecture Notes in Computer Science. Vol. 51. Springer-Verlag, Berlin. 1977.
-
(1977)
Matrix Eigensystem Routines—EISPACK Guide Extension
, vol.51
-
-
Garbow, B.S.1
Boyle, J.M.2
Dongarra, J.J.3
Moler, C.B.4
-
30
-
-
0042625581
-
Technical Report TM-11616
-
Oak Ridge National Laboratory
-
Geist, G. A., Heath, M. T., Peyton, B. W., and Worley, P. H. A user's guide to PICL: A portable instrumented communication library. Technical Report TM-11616, Oak Ridge National Laboratory, Oct. 1990.
-
(1990)
A User's Guide to PICL: A Portable Instrumented Communication Library
-
-
Geist, G.A.1
Heath, M.T.2
Peyton, B.W.3
Worley, P.H.4
-
31
-
-
0027644684
-
The scalability of FFT on parallel computers
-
A detailed version is available as Technical Report TR 90-53, Department of Computer Science, University of Minnesota. MN 55455
-
Gupta, A., and Kumar, V. The scalability of FFT on parallel computers. IEEE Trans. Parallel Distrib. Systems 4, 7 (July 1993). A detailed version is available as Technical Report TR 90-53, Department of Computer Science, University of Minnesota. MN 55455.
-
(1993)
IEEE Trans. Parallel Distrib. Systems
, vol.4
, Issue.7
-
-
Gupta, A.1
Kumar, V.2
-
32
-
-
0024012163
-
Reevaluating Amdahl's law
-
Gustafson, J. Reevaluating Amdahl's law. Comm. ACM 31, 5 (1988), 532-533.
-
(1988)
Comm. ACM
, vol.31
, Issue.5
, pp. 532-533
-
-
Gustafson, J.1
-
33
-
-
0026202198
-
The design of a scalable, fixed-time computer benchmark
-
Gustafson, J., Rover, D., Elbert, S., and Carter, M. The design of a scalable, fixed-time computer benchmark. J. Parallel Distrib. Comput. 12 (1991), 388-401.
-
(1991)
J. Parallel Distrib. Comput
, vol.12
, pp. 388-401
-
-
Gustafson, J.1
Rover, D.2
Elbert, S.3
Carter, M.4
-
36
-
-
0025637437
-
Exploiting fast matrix multiplication within the level 3 BLAS
-
Higham, N. J. Exploiting fast matrix multiplication within the level 3 BLAS. ACM Trans. Math. Software 16, 4 (1990), 352-368.
-
(1990)
ACM Trans. Math. Software
, vol.16
, Issue.4
, pp. 352-368
-
-
Higham, N.J.1
-
39
-
-
3543092493
-
Analyzing scalability of parallel algorithms and architectures. Technical report, TR-91-18. Computer Science Department, University of Minnesota. June 1991
-
A short version of the paper, Urbana, IL, Oct, 1991
-
Kumar, V., and Gupta, A. Analyzing scalability of parallel algorithms and architectures. Technical report, TR-91-18. Computer Science Department, University of Minnesota. June 1991. J. Parallel Distrib. Comput. 22, 3 (1994) 379-391. A short version of the paper appears in the Proceedings of the 1991 International Conference on Supercomputing. Germany, and as an invited paper in the Proceedings of the 29th Annual Allerton Conference on Communication, Control and Computing. Urbana, IL, Oct. 1991.
-
(1994)
J. Parallel Distrib. Comput
, vol.22
, Issue.3
, pp. 379-391
-
-
Kumar, V.1
Gupta, A.2
-
43
-
-
10444243598
-
LU factorization of sparse, unsymmetric, Jacobian matrices on multicomputers
-
Walker, D. W. and Stout, Q. F. (Eds.)
-
Skjellum, A. J., and Leung, A. LU factorization of sparse, unsymmetric, Jacobian matrices on multicomputers. In Walker, D. W. and Stout, Q. F. (Eds.). Proceedings of the Fifth Distributed Memory Concurrent Computing Conference. IEEE Press. 1990, pp. 328-337.
-
(1990)
Proceedings of the Fifth Distributed Memory Concurrent Computing Conference. IEEE Press
, pp. 328-337
-
-
Skjellum, A.J.1
Leung, A.2
-
44
-
-
0003595562
-
-
Lecture Notes in Computer Science, Springer-Verlag, Berlin
-
Smith, B. T., Boyle, J. M., Dongarra, J. J., Garbow, B. S., Ikebe, Y., Klema, V. C., and Moler, C. B. Matrix Eigensystem Routines—EISPACK Guide, Lecture Notes in Computer Science, Vol. 6. Springer-Verlag, Berlin, 1976.
-
(1976)
Matrix Eigensystem Routines—EISPACK Guide
, vol.6
-
-
Smith, B.T.1
Boyle, J.M.2
Dongarra, J.J.3
Garbow, B.S.4
Ikebe, Y.5
Klema, V.C.6
Moler, C.B.7
-
45
-
-
34250487811
-
Gaussian elimination is not optimal
-
Strassen, V. Gaussian elimination is not optimal. Numer. Math. 13 (1969), 354-356.
-
(1969)
Numer. Math.
, vol.13
, pp. 354-356
-
-
Strassen, V.1
-
46
-
-
0002853545
-
Scalable problems and memory-bounded speedup
-
Sun, X.-H., and Ni, L. Scalable problems and memory-bounded speedup. J. Parallel Distrib. Computing 19, 1 (1993), 27-37.
-
(1993)
J. Parallel Distrib. Computing
, vol.19
, Issue.1
, pp. 27-37
-
-
Sun, X.-H.1
Ni, L.2
-
47
-
-
85027614460
-
-
Cambridge, MA
-
Thinking Machines Corporation. CM5 Technical Summary. Cambridge, MA, 1991.
-
(1991)
CM5 Technical Summary
-
-
-
49
-
-
0025639404
-
Data redistribution and concurrency
-
Van de Velde, E. F. Data redistribution and concurrency. Parallel Comput. 16 (Dec 1990).
-
(1990)
Parallel Comput
, vol.16
-
-
Van De Velde, E.F.1