-
1
-
-
0028427170
-
Improving performance of linear algebra algorithms for dense matrices, using algorithmic prefetch
-
May
-
R. C. Agarwal, F. G. Gustavson, and M. Zubair. Improving performance of linear algebra algorithms for dense matrices, using algorithmic prefetch. IBM J. Res. Develop, 38(3):265-275, May 1994.
-
(1994)
IBM J. Res. Develop
, vol.38
, Issue.3
, pp. 265-275
-
-
Agarwal, R.C.1
Gustavson, F.G.2
Zubair, M.3
-
2
-
-
0028513316
-
Exploiting functional parallelism of POWER2 to design high-performance numerical algorithms
-
September
-
R. C. Agarwal, F. G. Gustavson, and M. Zubair. Exploiting functional parallelism of POWER2 to design high-performance numerical algorithms. IBM J. Res. Develop, 38(5):563-576, September 1994.
-
(1994)
IBM J. Res. Develop
, vol.38
, Issue.5
, pp. 563-576
-
-
Agarwal, R.C.1
Gustavson, F.G.2
Zubair, M.3
-
3
-
-
0030661485
-
Optimizing matrix multiply using PHiPAC: A portable, high performance, ANSI C coding methodology
-
New York, July 7-11, ACM Press
-
J. Bilmes, K. Asanovic, C.-W. Chin, and J. Demmel. Optimizing matrix multiply using PHiPAC: A portable, high performance, ANSI C coding methodology. In Proceedings of the 11th International Conference on Supercomputing (ICS-97), pages 340-347, New York, July 7-11 1997. ACM Press.
-
(1997)
In Proceedings of the 11Th International Conference on Supercomputing (ICS-97)
, pp. 340-347
-
-
Bilmes, J.1
Asanovic, K.2
Chin, C.-W.3
Demmel, J.4
-
4
-
-
0025402476
-
A Set of Level 3 Basic Linear Algebra Subprograms
-
18-28, March
-
J. Dongarra, J. DuCroz, I. Duff, and S. Hammarling. A Set of Level 3 Basic Linear Algebra Subprograms. ACM Trans. Math. Softw., 16(1):1-17, 18-28, March 1990.
-
(1990)
ACM Trans. Math. Softw
, vol.16
, Issue.1
, pp. 1-17
-
-
Dongarra, J.1
Ducroz, J.2
Duff, I.3
Hammarling, S.4
-
5
-
-
0028443077
-
A parallel block implementation of level- 3 BLAS for MIMD vector processors
-
M. J. Dayde, I. S. Duff, and A. Petitet. A parallel block implementation of level- 3 BLAS for MIMD vector processors. ACM Trans. Math. Softw., 20(2):178-193, June 1994.
-
(1994)
ACM Trans. Math. Softw.
, vol.20
, Issue.2
, pp. 178-193
-
-
Dayde, M.J.1
Duff, I.S.2
Petitet, A.3
-
6
-
-
0040352753
-
-
This Proceedings, Springer Verlag
-
F. Gustavson, A. Henriksson, I. Jonsson, B. Ks and P. Ling. Recursive Blocked Data Formats and BLAS's for Dense Linear Algebra Algorithms. This Proceedings, Springer Verlag, 1998.
-
(1998)
Recursive Blocked Data Formats and Blas's for Dense Linear Algebra Algorithms
-
-
Gustavson, F.1
Henriksson, A.2
Jonsson, I.3
Ks, B.4
Ling, P.5
-
7
-
-
0343910469
-
-
Master Thesis, UMNAD 98.235, Department of Computing Science, Umes University, S-901 87 Umes June
-
A. Henriksson and I. Jonsson. High-Performance Matrix Multiplication on the IBM SP High Node. Master Thesis, UMNAD 98.235, Department of Computing Science, Umes University, S-901 87 Umes June 1998.
-
(1998)
High-Performance Matrix Multiplication on the IBM SP High Node
-
-
Henriksson, A.1
Jonsson, I.2
-
8
-
-
10844292223
-
-
Technical Report CTC91TR47, Department of Computer Science, Cornell University, Dec
-
B. Kågström and C. Van Loan. GEMM-Based Level-3 BLAS. Technical Report CTC91TR47, Department of Computer Science, Cornell University, Dec. 1989.
-
(1989)
Gemm-Based Level-3 BLAS
-
-
Kågström, B.1
Van Loan, C.2
-
9
-
-
0032155271
-
GEMM-based level 3 BLAS: Highperformance model implementations and performance evaluation benchmark
-
To appear
-
B. Kågström P. Ling, and C. Van Loan. GEMM-based level 3 BLAS: Highperformance model implementations and performance evaluation benchmark. ACM Trans. Math. Software, 1997. To appear.
-
(1997)
ACM Trans. Math. Software
-
-
Kågström, B.1
Ling, P.2
Van Loan, C.3
-
11
-
-
0027656965
-
A set of high-performance level 3 BLAS structured and tuned for the IBM 3090 VF and implemented in Fortran 77
-
September
-
P. Ling. A set of high-performance level 3 BLAS structured and tuned for the IBM 3090 VF and implemented in Fortran 77. The Journal of Supercomputing, 7(3):323-355, September 1993.
-
(1993)
The Journal of Supercomputing
, vol.7
, Issue.3
, pp. 323-355
-
-
Ling, P.1
|