메뉴 건너뛰기




Volumn , Issue , 2011, Pages 1058-1067

Model-driven SIMD code generation for a multi-resolution tensor kernel

Author keywords

library generators; model driven code optimization; tensor contraction; vectorization

Indexed keywords

BLAS LIBRARY; CODE GENERATION; CODE GENERATORS; COMPILE TIME; LIBRARY GENERATORS; MODEL-DRIVEN; MULTI-RESOLUTIONS; SIMD ARCHITECTURE; TENSOR CONTRACTION; TENSOR CONTRACTION EXPRESSIONS; VECTORIZATION;

EID: 80053258041     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IPDPS.2011.101     Document Type: Conference Paper
Times cited : (13)

References (24)
  • 3
    • 0030661485 scopus 로고    scopus 로고
    • Optimizing matrix multiply using PHiPAC: A portable, highperformance, ANSI C coding methodology
    • J. Bilmes, K. Asanovic, C.-W. Chin, and J. Demmel. Optimizing matrix multiply using PHiPAC: A portable, highperformance, ANSI C coding methodology. In International Conference on Supercomputing, pages 340-347, 1997.
    • (1997) International Conference on Supercomputing , pp. 340-347
    • Bilmes, J.1    Asanovic, K.2    Chin, C.-W.3    Demmel, J.4
  • 7
    • 0031636309 scopus 로고    scopus 로고
    • FFTW: An adaptive software architecture for the FFT. In
    • M. Frigo. FFTW: An adaptive software architecture for the FFT. In Proceedings of the ICASSP Conference, volume 3, page 1381, 1998.
    • (1998) Proceedings of the ICASSP Conference, Volume , vol.3 , pp. 1381
    • Frigo, M.1
  • 8
    • 44249094647 scopus 로고    scopus 로고
    • Anatomy of high-performance matrix multiplication
    • K. Goto and R. A. v. d. Geijn. Anatomy of high-performance matrix multiplication. ACM Trans. Math. Softw., 34(3):1-25, 2008.
    • (2008) ACM Trans. Math. Softw. , vol.34 , Issue.3 , pp. 1-25
    • Goto, K.1    Geijn, R.A.V.D.2
  • 9
    • 48849089104 scopus 로고    scopus 로고
    • High-performance implementation of the level-3 BLAS
    • K. Goto and R. van De Geijn. High-performance implementation of the level-3 BLAS. ACM Trans. Math. Softw., 35(1):1-14, 2008.
    • (2008) ACM Trans. Math. Softw. , vol.35 , Issue.1 , pp. 1-14
    • Goto, K.1    Van De Geijn, R.2
  • 11
    • 11044224123 scopus 로고    scopus 로고
    • Multiresolution quantum chemistry: Basic theory and initial applications
    • DOI 10.1063/1.1791051, 12
    • R. J. Harrison, G. I. Fann, T. Yanai, Z. Gan, and G. Beylkin. Multiresolution quantum chemistry: Basic theory and initial applications. Journal of Chemical Physics, 121(23):11587-11598, 2004. (Pubitemid 40044262)
    • (2004) Journal of Chemical Physics , vol.121 , Issue.23 , pp. 11587-11598
    • Harrison, R.J.1    Fann, G.I.2    Yanai, T.3    Gan, Z.4    Beylkin, G.5
  • 22
    • 24344485098 scopus 로고    scopus 로고
    • OSKI: A library of automatically tuned sparse matrix kernels
    • Proceedings of SciDAC 2005, Institute of Physics Publishing, June
    • R. Vuduc, J. Demmel, and K. Yelick. OSKI: A library of automatically tuned sparse matrix kernels. In Proceedings of SciDAC 2005, volume 16 of Journal of Physics: Conference Series, pages 521-530. Institute of Physics Publishing, June 2005.
    • (2005) Journal of Physics: Conference Series , vol.16 , pp. 521-530
    • Vuduc, R.1    Demmel, J.2    Yelick, K.3
  • 23
    • 0343462141 scopus 로고    scopus 로고
    • Automated empirical optimization of software and the ATLAS project
    • Also available as University of Tennessee LAPACK Working Note #147, UTCS-00-448
    • R. C. Whaley, A. Petitet, and J. J. Dongarra. Automated empirical optimization of software and the ATLAS project. Parallel Computing, 27(1-2):3-35, 2001. Also available as University of Tennessee LAPACK Working Note #147, UTCS-00-448, 2000. www.netlib.org/lapack/lawns/lawn147.ps.
    • (2000) Parallel Computing , vol.27 , Issue.1-2 , pp. 3-35
    • Whaley, R.C.1    Petitet, A.2    Dongarra, J.J.3
  • 24
    • 4344648428 scopus 로고    scopus 로고
    • Multiresolution quantum chemistry in multiwavelet bases: Analytic derivatives for hartree-fock and density functional theory
    • T. Yanai, G. I. Fann, Z. Gan, R. J. Harrison, and G. Beylkin. Multiresolution quantum chemistry in multiwavelet bases: Analytic derivatives for hartree-fock and density functional theory. Journal of Chemical Physics, 121(7):2866-2876, 2004.
    • (2004) Journal of Chemical Physics , vol.121 , Issue.7 , pp. 2866-2876
    • Yanai, T.1    Fann, G.I.2    Gan, Z.3    Harrison, R.J.4    Beylkin, G.5


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.