메뉴 건너뛰기




Volumn 1541, Issue , 1998, Pages 207-215

Superscalar GEMM-based level 3 BLAS - the on-going evolution of a portable and high-performance library

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; COMPUTER SCIENCE; COMPUTERS;

EID: 84947907655     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/bfb0095338     Document Type: Conference Paper
Times cited : (19)

References (12)
  • 1
    • 0028427170 scopus 로고
    • Improving performance of linear algebra algorithms for dense matrices, using algorithmic prefetch
    • May
    • R. C. Agarwal, F. G. Gustavson, and M. Zubair. Improving performance of linear algebra algorithms for dense matrices, using algorithmic prefetch. IBM J. Res. Develop, 38(3):265-275, May 1994.
    • (1994) IBM J. Res. Develop , vol.38 , Issue.3 , pp. 265-275
    • Agarwal, R.C.1    Gustavson, F.G.2    Zubair, M.3
  • 2
    • 0028513316 scopus 로고
    • Exploiting functional parallelism of POWER2 to design high-performance numerical algorithms
    • September
    • R. C. Agarwal, F. G. Gustavson, and M. Zubair. Exploiting functional parallelism of POWER2 to design high-performance numerical algorithms. IBM J. Res. Develop, 38(5):563-576, September 1994.
    • (1994) IBM J. Res. Develop , vol.38 , Issue.5 , pp. 563-576
    • Agarwal, R.C.1    Gustavson, F.G.2    Zubair, M.3
  • 4
    • 0025402476 scopus 로고
    • A Set of Level 3 Basic Linear Algebra Subprograms
    • 18-28, March
    • J. Dongarra, J. DuCroz, I. Duff, and S. Hammarling. A Set of Level 3 Basic Linear Algebra Subprograms. ACM Trans. Math. Softw., 16(1):1-17, 18-28, March 1990.
    • (1990) ACM Trans. Math. Softw , vol.16 , Issue.1 , pp. 1-17
    • Dongarra, J.1    Ducroz, J.2    Duff, I.3    Hammarling, S.4
  • 5
    • 0028443077 scopus 로고
    • A parallel block implementation of level- 3 BLAS for MIMD vector processors
    • M. J. Dayde, I. S. Duff, and A. Petitet. A parallel block implementation of level- 3 BLAS for MIMD vector processors. ACM Trans. Math. Softw., 20(2):178-193, June 1994.
    • (1994) ACM Trans. Math. Softw. , vol.20 , Issue.2 , pp. 178-193
    • Dayde, M.J.1    Duff, I.S.2    Petitet, A.3
  • 8
    • 10844292223 scopus 로고
    • Technical Report CTC91TR47, Department of Computer Science, Cornell University, Dec
    • B. Kågström and C. Van Loan. GEMM-Based Level-3 BLAS. Technical Report CTC91TR47, Department of Computer Science, Cornell University, Dec. 1989.
    • (1989) Gemm-Based Level-3 BLAS
    • Kågström, B.1    Van Loan, C.2
  • 9
    • 0032155271 scopus 로고    scopus 로고
    • GEMM-based level 3 BLAS: Highperformance model implementations and performance evaluation benchmark
    • To appear
    • B. Kågström P. Ling, and C. Van Loan. GEMM-based level 3 BLAS: Highperformance model implementations and performance evaluation benchmark. ACM Trans. Math. Software, 1997. To appear.
    • (1997) ACM Trans. Math. Software
    • Kågström, B.1    Ling, P.2    Van Loan, C.3
  • 10
  • 11
    • 0027656965 scopus 로고
    • A set of high-performance level 3 BLAS structured and tuned for the IBM 3090 VF and implemented in Fortran 77
    • September
    • P. Ling. A set of high-performance level 3 BLAS structured and tuned for the IBM 3090 VF and implemented in Fortran 77. The Journal of Supercomputing, 7(3):323-355, September 1993.
    • (1993) The Journal of Supercomputing , vol.7 , Issue.3 , pp. 323-355
    • Ling, P.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.