-
1
-
-
0028513316
-
Exploiting functional parallelism of POWER2 to design high-performance numerical algorithms
-
AGARWAL, R., GUSTAVSON, F., AND ZUBAIR, M. 1994. Exploiting functional parallelism of POWER2 to design high-performance numerical algorithms. IBM J. Res. Dev. 38, 5 (Sept.), 563-576.
-
(1994)
IBM J. Res. Dev.
, vol.38
, Issue.5 SEPT
, pp. 563-576
-
-
Agarwal, R.1
Gustavson, F.2
Zubair, M.3
-
2
-
-
0003706460
-
-
SIAM, Philadelphia
-
ANDERSON, E., BAI, Z., DEMMEL, J., DONGARRA, J. E., DUCROZ, J., GREENBAUM, A., HAMMARLING, S., MCKENNEY, A. E., OSTROUCHOV, S., AND SORENSEN, D. 1992. LAPACK Users' Guide. SIAM, Philadelphia.
-
(1992)
LAPACK Users' Guide
-
-
Anderson, E.1
Bai, Z.2
Demmel, J.3
Dongarra, J.E.4
Ducroz, J.5
Greenbaum, A.6
Hammarling, S.7
McKenney, A.E.8
Ostrouchov, S.9
Sorensen, D.10
-
3
-
-
0039637901
-
-
LAPACK Working Note 146 CS-00-441, University of Tennessee, Knoxville (May)
-
ANDERSEN, B. S., GUSTAVSON, F. G., AND WASNIEWSKI, J. 2000. A recursive formulation of Cholesky factorization of a matrix in packed storage. LAPACK Working Note 146 CS-00-441, University of Tennessee, Knoxville (May).
-
(2000)
A Recursive Formulation of Cholesky Factorization of a Matrix in Packed Storage
-
-
Andersen, B.S.1
Gustavson, F.G.2
Wasniewski, J.3
-
4
-
-
0002924772
-
Scalapack: A scalable linear algebra library for distributed memory concurrent computers
-
IEEE Computer Society Press
-
CHOI, J., DONGARRA, J. J., POZO, R., AND WALKER, D. W. 1992. Scalapack: A scalable linear algebra library for distributed memory concurrent computers. In Proceedings of the Fourth Symposium on the Frontiers of Massively Parallel Computation. IEEE Computer Society Press, 120-127.
-
(1992)
Proceedings of the Fourth Symposium on the Frontiers of Massively Parallel Computation
, pp. 120-127
-
-
Choi, J.1
Dongarra, J.J.2
Pozo, R.3
Walker, D.W.4
-
5
-
-
0001642550
-
A short method for evaluating determinants and solving systems of linear equations with real or complex coefficients
-
CROUT, P. D. 1941. A short method for evaluating determinants and solving systems of linear equations with real or complex coefficients. Trans AIEE 60, 1235-1240.
-
(1941)
Trans AIEE
, vol.60
, pp. 1235-1240
-
-
Crout, P.D.1
-
6
-
-
0040229822
-
Under the spell of Leibniz's dream
-
The University of Texas at Austin (April).
-
DIJKSTRA, E. W. 2000. Under the spell of Leibniz's dream. Tech. Rep. EWD1298, The University of Texas at Austin (April). http://uww.cs.utexas.edu/users/EWD/.
-
(2000)
Tech. Rep. EWD1298
-
-
Dijkstra, E.W.1
-
7
-
-
0003555195
-
-
SIAM, Philadelphia
-
DONGARRA, J. J., BUNCH, J. R., MOLER, C. B., AND STEWART, G. W. 1979. LINPACK Users' Guide SIAM, Philadelphia.
-
(1979)
LINPACK Users' Guide
-
-
Dongarra, J.J.1
Bunch, J.R.2
Moler, C.B.3
Stewart, G.W.4
-
8
-
-
0025402476
-
A set of level 3 basic linear algebra subprograms
-
DONGARRA, J. J., DU CROZ, J., HAMMARLING, S., AND DUFF, I. 1990. A set of level 3 basic linear algebra subprograms. ACM Trans. Math. Soft. 16, 1 (March), 1-17.
-
(1990)
ACM Trans. Math. Soft.
, vol.16
, Issue.1 MARCH
, pp. 1-17
-
-
Dongarra, J.J.1
Du Croz, J.2
Hammarling, S.3
Duff, I.4
-
9
-
-
0023983122
-
An extended set of FORTRAN basic linear algebra subprograms
-
DONGARRA, J. J., DU CROZ, J., HAMMARLING, S., AND HANSON, R. J. 1988. An extended set of FORTRAN basic linear algebra subprograms. ACM Trans. Math. Soft. 14, 1 (March), 1-17.
-
(1988)
ACM Trans. Math. Soft.
, vol.14
, Issue.1 MARCH
, pp. 1-17
-
-
Dongarra, J.J.1
Du Croz, J.2
Hammarling, S.3
Hanson, R.J.4
-
10
-
-
0003793981
-
-
SIAM, Philadelphia, PA
-
DONGARRA, J. J., DUFF, I. S., SORENSEN, D. C., AND VAN DER VORST, H. A. 1991. Solving Linear Systems on Vector and Shared Memory Computers. SIAM, Philadelphia, PA.
-
(1991)
Solving Linear Systems on Vector and Shared Memory Computers
-
-
Dongarra, J.J.1
Duff, I.S.2
Sorensen, D.C.3
Van Der Vorst, H.A.4
-
11
-
-
0021310295
-
Implementing linear algebra algorithms for dense matrices on a vector pipeline machine
-
DONGARRA, J. J., GUSTAVSON, F. G., AND KARP, A. 1984. Implementing linear algebra algorithms for dense matrices on a vector pipeline machine. SIAM Review 26, 1 (Jan.), 91-112.
-
(1984)
SIAM Review
, vol.26
, Issue.1 JAN
, pp. 91-112
-
-
Dongarra, J.J.1
Gustavson, F.G.2
Karp, A.3
-
12
-
-
0034224207
-
Applying recursion to serial and parallel QR factorization leads to better performance
-
ELMROTH, E. AND GUSTAVSON, F. 2000. Applying recursion to serial and parallel QR factorization leads to better performance. IBM J. Res. Dev. 44, 4, 605-624.
-
(2000)
IBM J. Res. Dev.
, vol.44
, Issue.4
, pp. 605-624
-
-
Elmroth, E.1
Gustavson, F.2
-
15
-
-
84949665448
-
A family of high-performance matrix multiplication algorithms
-
Computational Science - ICCS 2001, Part I, V. N. Alexandrov, J. J. Dongarra, B. A. Juliano, R. S. Renner, and C. K. Tan, Eds. Springer-Verlag, New York
-
GUNNELS, J. A., HENRY, G. M., AND VAN DE GEIJN, R. A. 2001. A family of high-performance matrix multiplication algorithms. In Computational Science - ICCS 2001, Part I, V. N. Alexandrov, J. J. Dongarra, B. A. Juliano, R. S. Renner, and C. K. Tan, Eds. Lecture Notes in Computer Science 2073. Springer-Verlag, New York, 51-60.
-
(2001)
Lecture Notes in Computer Science
, vol.2073
, pp. 51-60
-
-
Gunnels, J.A.1
Henry, G.M.2
Van De Geijn, R.A.3
-
16
-
-
0031647665
-
A flexible class of parallel matrix multiplication algorithms
-
GUNNELS, J., LIN, C., MORROW, G., AND VAN DE GEIJN, R. 1998. A flexible class of parallel matrix multiplication algorithms. In Proceedings of First Merged International Parallel Processing Symposium and Symposium on Parallel and Distributed Processing (1998 IPPS/SPDP '98). 110-116.
-
(1998)
Proceedings of First Merged International Parallel Processing Symposium and Symposium on Parallel and Distributed Processing (1998 IPPS/SPDP '98)
, pp. 110-116
-
-
Gunnels, J.1
Lin, C.2
Morrow, G.3
Van De Geijn, R.4
-
17
-
-
84860927033
-
Developing linear algebra algorithms: A collection of class projects
-
Department of Computer Sciences, The University of Texas at Austin, May
-
GUNNELS, J. A. AND VAN DE GEIJN, R. A. 2001a. Developing linear algebra algorithms: A collection of class projects. Tech. Rep. CS-TR-01-19, Department of Computer Sciences, The University of Texas at Austin, May. http://www.cs.utexas.edu/users/flame/pubs.html.
-
(2001)
Tech. Rep. CS-TR-01-19
-
-
Gunnels, J.A.1
Van De Geijn, R.A.2
-
18
-
-
84901946998
-
Formal methods for high-performance linear algebra libraries
-
R. F. Boisvert and P. T. P. Tang, Eds. Kluwer Academic Press, Orlando, FL
-
GUNNELS, J. A. AND VAN DE GEIJN, R. A. 2001b. Formal methods for high-performance linear algebra libraries. In The Architecture of Scientific Software, R. F. Boisvert and P. T. P. Tang, Eds. Kluwer Academic Press, Orlando, FL, 193-210.
-
(2001)
The Architecture of Scientific Software
, pp. 193-210
-
-
Gunnels, J.A.1
Van De Geijn, R.A.2
-
19
-
-
0040824510
-
Parallel out-of-core cholesky and qr factorizations with pooclapack
-
IEEE Computer Society Press, Los Alamitos, CA
-
GUNTER, B. C., REILEY, W. C., AND VAN DE GEIJN, R. A. 2001. Parallel out-of-core cholesky and qr factorizations with pooclapack. In Proceedings of the 15th International Parallel and Distributed Processing Symposium (IPDPS). IEEE Computer Society Press, Los Alamitos, CA.
-
(2001)
Proceedings of the 15th International Parallel and Distributed Processing Symposium (IPDPS)
-
-
Gunter, B.C.1
Reiley, W.C.2
Van De Geijn, R.A.3
-
20
-
-
0031273280
-
Recursion leads to automatic variable blocking for dense linear-algebra algorithms
-
GUSTAVSON, F. G. 1997. Recursion leads to automatic variable blocking for dense linear-algebra algorithms. IBM J. of Res. Dev. 41, 6 (November), 737-755.
-
(1997)
IBM J. of Res. Dev.
, vol.41
, Issue.6 NOVEMBER
, pp. 737-755
-
-
Gustavson, F.G.1
-
21
-
-
0012482835
-
New generalized matrix data structures lead to a variety of high-performance algorithms
-
R. F. Boisvert and P. T. P. Tang, Eds: Kluwer Academic Press, Orlando, FL
-
GUSTAVSON, F. G. 2001. New generalized matrix data structures lead to a variety of high-performance algorithms. In The Architecture of Scientific Software, R. F. Boisvert and P. T. P. Tang, Eds: Kluwer Academic Press, Orlando, FL.
-
(2001)
The Architecture of Scientific Software
-
-
Gustavson, F.G.1
-
22
-
-
84947926251
-
Recursive blocked data formats and BLAS's for dense linear algebra algorithms
-
Applied Parallel Computing, Large Scale Scientific and Industrial Problems, B. K. et al., Ed. Springer-Verlag, New York
-
GUSTAVSON, F., HENRIKSSON, A., JONSSON, I., KÅGSTRÖM, B., AND LING, P. 1998a. Recursive blocked data formats and BLAS's for dense linear algebra algorithms. In Applied Parallel Computing, Large Scale Scientific and Industrial Problems, B. K. et al., Ed. Lecture Notes in Computer Science 1541. Springer-Verlag, New York, 195-206.
-
(1998)
Lecture Notes in Computer Science
, vol.1541
, pp. 195-206
-
-
Gustavson, F.1
Henriksson, A.2
Jonsson, I.3
Kågström, B.4
Ling, P.5
-
23
-
-
84947907655
-
Superscalar GEMM-based level 3 BLAS - The on-going evolution of a portable and high-performance library
-
Applied Parallel Computing, Large Scale Scientific and Industrial Problems, B. K. et al., Ed. Springer-Verlag, New York
-
GUSTAVSON, F., HENRIKSSON, A., JONSSON, I., KÅGSTRÖM, B., AND LING, P. 1998b. Superscalar GEMM-based level 3 BLAS - the on-going evolution of a portable and high-performance library. In Applied Parallel Computing, Large Scale Scientific and Industrial Problems, B. K. et al., Ed. Lecture Notes in Computer Science 1541. Springer-Verlag, New York, 207-215.
-
(1998)
Lecture Notes in Computer Science
, vol.1541
, pp. 207-215
-
-
Gustavson, F.1
Henriksson, A.2
Jonsson, I.3
Kågström, B.4
Ling, P.5
-
24
-
-
0034312453
-
Minimal storage high-performance Cholesky factorization via blocking and recursion
-
GUSTAVSON, F. AND JONSSON, I. 2000. Minimal storage high-performance Cholesky factorization via blocking and recursion. IBM J. Res. Dev. 44, 6 (November), 823-850.
-
(2000)
IBM J. Res. Dev.
, vol.44
, Issue.6 NOVEMBER
, pp. 823-850
-
-
Gustavson, F.1
Jonsson, I.2
-
25
-
-
0032155271
-
GEMM-based level 3 BLAS: High performance model implementations and performance evaluation benchmark
-
KÅGSTRÖM, B., LING, P., AND LOAN, C. V. 1998. GEMM-based level 3 BLAS: High performance model implementations and performance evaluation benchmark. ACM Trans. Math. Soft. 24, 3, 268-302.
-
(1998)
ACM Trans. Math. Soft.
, vol.24
, Issue.3
, pp. 268-302
-
-
Kågström, B.1
Ling, P.2
Loan, C.V.3
-
26
-
-
0018515759
-
Basic linear algebra subprograms for Fortran usage
-
LAWSON, C. L., HANSON, R. J., KINCAID, D. R., AND KROGH, F. T. 1979. Basic linear algebra subprograms for Fortran usage. ACM Trans. Math. Soft. 5, 3 (Sept.), 308-323.
-
(1979)
ACM Trans. Math. Soft.
, vol.5
, Issue.3 SEPT
, pp. 308-323
-
-
Lawson, C.L.1
Hanson, R.J.2
Kincaid, D.R.3
Krogh, F.T.4
-
27
-
-
0040229821
-
Efficient parallel out-of-core implementation of the Cholesky factorization
-
Department of Computer Sciences, The University of Texas at Austin. (Dec.) Undergraduate Honors Thesis
-
REILEY, W. C. 1999. Efficient parallel out-of-core implementation of the Cholesky factorization. Tech. Rep. CS-TR-99-33, Department of Computer Sciences, The University of Texas at Austin. (Dec.) Undergraduate Honors Thesis.
-
(1999)
Tech. Rep. CS-TR-99-33
-
-
Reiley, W.C.1
-
28
-
-
0040824509
-
POOCLAPACK: Parallel out-of-core linear algebra package
-
Department of Computer Sciences, The University of Texas at Austin (Nov.)
-
REILEY, W. C. AND VAN DE GEIJN, R. A. 1999. POOCLAPACK: Parallel Out-of-Core Linear Algebra Package. Tech. Rep. CS-TR-99-33, Department of Computer Sciences, The University of Texas at Austin (Nov.).
-
(1999)
Tech. Rep. CS-TR-99-33
-
-
Reiley, W.C.1
Van De Geijn, R.A.2
-
29
-
-
0003595562
-
Matrix eigensystem routines - EISPACK guide, second ed
-
Springer-Verlag, New York
-
SMITH, B. T. ET AL. 1976. Matrix Eigensystem Routines - EISPACK Guide, Second ed. Lecture Notes in Computer Science 6. Springer-Verlag, New York.
-
(1976)
Lecture Notes in Computer Science
, vol.6
-
-
Smith, B.T.1
-
30
-
-
0003710740
-
-
The MIT Press
-
SNIR, M., OTTO, S. W., HUSS-LEDERMAN, S., WALKER, D. W., AND DONGARRA, J. 1996. MPI: The Complete Reference. The MIT Press.
-
(1996)
MPI: The Complete Reference
-
-
Snir, M.1
Otto, S.W.2
Huss-Lederman, S.3
Walker, D.W.4
Dongarra, J.5
|