메뉴 건너뛰기




Volumn 5544 LNCS, Issue PART 1, 2009, Pages 248-258

Generating empirically optimized composed matrix kernels from matlab prototypes

Author keywords

Code generation; Empirical performance tuning; MATLAB

Indexed keywords

AUTOMATED SYSTEMS; C CODES; COARSE-GRAINED; CODE GENERATION; COMPUTATIONAL SCIENTISTS; EMPIRICAL PERFORMANCE; EMPIRICAL PERFORMANCE TUNING; LOOP UNROLLING; MATLAB SCRIPTS; MATRIX; OPTIMIZATION SYSTEM; OPTIMIZATION TECHNIQUES; SCIENTIFIC APPLICATIONS; THREE-STEP APPROACH; TUNING SYSTEM;

EID: 68849096760     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-642-01970-8_25     Document Type: Conference Paper
Times cited : (6)

References (21)
  • 8
    • 0343462141 scopus 로고    scopus 로고
    • Automated empirical optimization of software and the ATLAS project
    • Whaley, R.C., Petitet, A., Dongarra, J.J.: Automated empirical optimization of software and the ATLAS project. Parallel Computing 27(1-2), 3-35 (2001)
    • (2001) Parallel Computing , vol.27 , Issue.1-2 , pp. 3-35
    • Whaley, R.C.1    Petitet, A.2    Dongarra, J.J.3
  • 9
    • 0030661485 scopus 로고    scopus 로고
    • Optimizing matrix multiply using PHiPAC: A portable, high-performance, ANSI C coding methodology
    • Bilmes, J., Asanovic, K., Chin, C.W., Demmel, J.: Optimizing matrix multiply using PHiPAC: A portable, high-performance, ANSI C coding methodology. In: International Conference on Supercomputing, pp. 340-347 (1997)
    • (1997) International Conference on Supercomputing , pp. 340-347
    • Bilmes, J.1    Asanovic, K.2    Chin, C.W.3    Demmel, J.4
  • 10
    • 24344485098 scopus 로고    scopus 로고
    • OSKI: A library of automatically tuned sparse matrix kernels
    • Proceedings of SciDAC 2005. Journal of Physics:, Institute of Physics Publishing June
    • Vuduc, R., Demmel, J., Yelick, K.: OSKI: A library of automatically tuned sparse matrix kernels. In: Proceedings of SciDAC 2005. Journal of Physics: Conference Series, vol. 16, pp. 521-530. Institute of Physics Publishing (June 2005)
    • (2005) Conference Series , vol.16 , pp. 521-530
    • Vuduc, R.1    Demmel, J.2    Yelick, K.3
  • 11
    • 68849109682 scopus 로고    scopus 로고
    • Fowler, R., Jin, G., Mellor-Crummey, J.: Increasing temporal locality with skewing and recursive blocking. In: Proceedings of SC 2001: High-Performance Computing and Networking (November 2001)
    • Fowler, R., Jin, G., Mellor-Crummey, J.: Increasing temporal locality with skewing and recursive blocking. In: Proceedings of SC 2001: High-Performance Computing and Networking (November 2001)
  • 15
    • 34548762396 scopus 로고    scopus 로고
    • High-performance implementation of the level-3 BLAS
    • Technical Report TR-2006-23, The University of Texas at Austin, Department of Computer Sciences
    • Goto, K., van de Geijn, R.: High-performance implementation of the level-3 BLAS. Technical Report TR-2006-23, The University of Texas at Austin, Department of Computer Sciences (2006)
    • (2006)
    • Goto, K.1    van de Geijn, R.2
  • 17
    • 1842829625 scopus 로고    scopus 로고
    • Society for Industrial and Applied Mathematics, Philadelphia, PA, USA
    • Saad, Y.: Iterative Methods for Sparse Linear Systems. Society for Industrial and Applied Mathematics, Philadelphia, PA, USA (2003)
    • (2003) Iterative Methods for Sparse Linear Systems
    • Saad, Y.1
  • 19
    • 68849123940 scopus 로고    scopus 로고
    • Norris, B., Hartono, A., Gropp, W.: Annotations for productivity and performance portability. In: Petascale Computing: Algorithms and Applications. Computational Science, pp. 443-462. Chapman & Hall/CRC Press, Taylor and Francis Group (2007)
    • Norris, B., Hartono, A., Gropp, W.: Annotations for productivity and performance portability. In: Petascale Computing: Algorithms and Applications. Computational Science, pp. 443-462. Chapman & Hall/CRC Press, Taylor and Francis Group (2007)


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.