메뉴 건너뛰기




Volumn 27, Issue 1-2, 2001, Pages 3-35

Automated empirical optimizations of software and the ATLAS project

Author keywords

[No Author keywords available]

Indexed keywords

AUTOMATION; COMPUTER AIDED SOFTWARE ENGINEERING; DATA STRUCTURES; DATABASE SYSTEMS;

EID: 0343462141     PISSN: 01678191     EISSN: None     Source Type: Journal    
DOI: 10.1016/S0167-8191(00)00087-9     Document Type: Article
Times cited : (969)

References (24)
  • 2
    • 0031215987 scopus 로고    scopus 로고
    • The spectral decomposition of nonsymmetric matrices on distributed-memory computers
    • Bai Z., Demmel J., Dongarra J., Petitet A., Robinson H., Stanley K. The spectral decomposition of nonsymmetric matrices on distributed-memory computers. SIAM J. Sci. Comput. 18(5):1997;1446-1461.
    • (1997) SIAM J. Sci. Comput. , vol.18 , Issue.5 , pp. 1446-1461
    • Bai, Z.1    Demmel, J.2    Dongarra, J.3    Petitet, A.4    Robinson, H.5    Stanley, K.6
  • 3
    • 0009129146 scopus 로고    scopus 로고
    • Optimizing Matrix Multiply using PHiPAC: A Portable
    • Technical Report UT CS-96-326, LAPACK Working Note No. 111, University of Tennessee
    • J. Bilmes, K. Asanovic, J. Demmel, D. Lam, C. Chin, Optimizing Matrix Multiply using PHiPAC: A Portable, High-Performance, ANSI C Coding Methodology, Technical Report UT CS-96-326, LAPACK Working Note No. 111, University of Tennessee, 1996.
    • (1996) High-Performance, ANSI C Coding Methodology
    • Bilmes, J.1    Asanovic, K.2    Demmel, J.3    Lam, D.4    Chin, C.5
  • 4
    • 0028443077 scopus 로고
    • A parallel block implementation of level 3 BLAS for MIMD vector processors
    • Dayde M., Duff I., Petitet A. A parallel block implementation of level 3 BLAS for MIMD vector processors. ACM Trans. Math. Software. 20(2):1994;178-193.
    • (1994) ACM Trans. Math. Software , vol.20 , Issue.2 , pp. 178-193
    • Dayde, M.1    Duff, I.2    Petitet, A.3
  • 6
    • 0023982822 scopus 로고
    • Algorithm 656: An extended set of basic linear algebra subprograms: Model implementation and test programs
    • Dongarra J., Du Croz J., Hammarling S., Hanson R. Algorithm 656: An extended set of basic linear algebra subprograms: Model implementation and test programs. ACM Trans. Math. Software. 14(1):1988;18-32.
    • (1988) ACM Trans. Math. Software , vol.14 , Issue.1 , pp. 18-32
    • Dongarra, J.1    Du Croz, J.2    Hammarling, S.3    Hanson, R.4
  • 10
    • 0031636309 scopus 로고    scopus 로고
    • FFTW: An Adaptive Software Architecture for the FFT
    • M. Frigo, FFTW: An Adaptive Software Architecture for the FFT, in: Proceedings of the ICASSP Conference, vol. 3, 1998, p. 1381.
    • (1998) In: Proceedings of the ICASSP Conference , vol.3 , pp. 1381
    • Frigo, M.1
  • 12
    • 84947926251 scopus 로고    scopus 로고
    • Recursive blocked data formats and blas's for dense linear algebra algorithms
    • in: B. Kågström, J. Dongarra, E. Elmroth, J. Waśniewski (Eds.), Applied Parallel Computing, PARA '98
    • F. Gustavson, A. Henriksson, I. Jonsson, B. Kågström, P. Ling, Recursive blocked data formats and blas's for dense linear algebra algorithms, in: B. Kågström, J. Dongarra, E. Elmroth, J. Waśniewski (Eds.), Applied Parallel Computing, PARA '98, Lecture Notes in Computer Science, No. 1541, 1998, pp. 195-206.
    • (1998) Lecture Notes in Computer Science , vol.1541 , pp. 195-206
    • Gustavson, F.1    Henriksson, A.2    Jonsson, I.3    Kågström, B.4    Ling, P.5
  • 13
    • 84947907655 scopus 로고    scopus 로고
    • Superscalar GEMM-based level 3 BLAS - The on-going evolution of a portable and high-performance library
    • in: B. Kågström, J. Dongarra, E. Elmroth, J. Waśniewski (Eds.), Applied Parallel Computing, PARA'98
    • F. Gustavson, A. Henriksson, I. Jonsson, B. Kågström, P. Ling, Superscalar GEMM-based level 3 BLAS - the on-going evolution of a portable and high-performance library, in: B. Kågström, J. Dongarra, E. Elmroth, J. Waśniewski (Eds.), Applied Parallel Computing, PARA'98, Lecture Notes in Computer Science, No. 1541, 1998, pp. 207-215.
    • (1998) Lecture Notes in Computer Science , vol.1541 , pp. 207-215
    • Gustavson, F.1    Henriksson, A.2    Jonsson, I.3    Kågström, B.4    Ling, P.5
  • 14
    • 0042014175 scopus 로고
    • A proposal for standard linear algebra subprograms
    • Hanson R., Krogh F., Lawson C. A proposal for standard linear algebra subprograms. ACM SIGNUM Newsl. 8(16):1973.
    • (1973) ACM SIGNUM Newsl. , vol.8 , Issue.16
    • Hanson, R.1    Krogh, F.2    Lawson, C.3
  • 15
    • 0032155271 scopus 로고    scopus 로고
    • Gemm-Based Level 3 BLAS: High-Performance Model Implementations and Performance Evaluation Benchmark
    • Technical Report UMINF 95-18, Department of Computing Science, Umeå University, 1995
    • B. Kågström, P. Ling, C. van Loan, Gemm-Based Level 3 BLAS: High-Performance Model Implementations and Performance Evaluation Benchmark, Technical Report UMINF 95-18, Department of Computing Science, Umeå University, 1995, ACM TOMS 24 (3) (1998) 268-302.
    • (1998) ACM TOMS , vol.24 , Issue.3 , pp. 268-302
    • Kågström, B.1    Ling, P.2    Van Loan, C.3
  • 16
    • 0033906425 scopus 로고    scopus 로고
    • Telescoping Languages: A Compiler Strategy for Implementation of High-Level Domain-Specific Programming Systems
    • May to appear
    • Ken Kennedy, Telescoping Languages: A Compiler Strategy for Implementation of High-Level Domain-Specific Programming Systems, in: Proceedings of IPDPS 2000, May 2000, to appear.
    • (2000) In: Proceedings of IPDPS 2000
    • Kennedy, K.1
  • 21
    • 0003418094 scopus 로고    scopus 로고
    • Winner, best paper in the systems category, SC98: High Performance Networking and Computing
    • R. Clint Whaley, Jack Dongarra, Automatically Tuned Linear Algebra Software, http://www.cs.utk.edu/ rwhaley/ATL/INDEX.HTM , 1998, Winner, best paper in the systems category, SC98: High Performance Networking and Computing.
    • (1998) Automatically Tuned Linear Algebra Software
    • Whaley, R.C.1    Dongarra, J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.