메뉴 건너뛰기




Volumn , Issue , 2009, Pages

A Cross-Input Adaptive Framework for GPU Program Optimizations

Author keywords

Cross input adaptation; CUDA; Empirical search; G ADAPT; GPU; Program optimizations

Indexed keywords

ADAPTIVE FRAMEWORK; ADAPTIVE OPTIMIZATION; GENERAL-PURPOSE COMPUTING; GPU PROGRAMMING; GPU PROGRAMS; GRAPHIC PROCESSING UNITS; HIGH QUALITY; INPUT ADAPTATION; NEW DIMENSIONS; NUMERICAL APPLICATIONS; OPTIMAL CONFIGURATIONS; PREDICTIVE MODELS; SINGLE-CHIP;

EID: 70450103746     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IPDPS.2009.5160988     Document Type: Conference Paper
Times cited : (108)

References (25)
  • 1
    • 84869690427 scopus 로고    scopus 로고
    • NVIDIA CUDA
    • NVIDIA CUDA. http://www.nvidia.com/cuda.
  • 3
    • 57349180412 scopus 로고    scopus 로고
    • M. M. Baskaran, U. Bondhugula, S. Krishnamoorthy, J. Ramanujam, A. Rountev, and P. Sadayappan. A compiler framework for optimization of affine loop nests for GPGPUs. In ICS'08: Proceedings of the 22nd Annual International Conference on Supercomputing, pages 225-234, 2008.
    • M. M. Baskaran, U. Bondhugula, S. Krishnamoorthy, J. Ramanujam, A. Rountev, and P. Sadayappan. A compiler framework for optimization of affine loop nests for GPGPUs. In ICS'08: Proceedings of the 22nd Annual International Conference on Supercomputing, pages 225-234, 2008.
  • 8
    • 20744449792 scopus 로고    scopus 로고
    • The design and implementation of FFTW3
    • M. Frigo and S. G. Johnson. The design and implementation of FFTW3. Proceedings of the IEEE, 93(2):216-231, 2005.
    • (2005) Proceedings of the IEEE , vol.93 , Issue.2 , pp. 216-231
    • Frigo, M.1    Johnson, S.G.2
  • 12
    • 1542501019 scopus 로고    scopus 로고
    • Sparsity: Optimizationframework for sparse matrix kernels
    • Eun-Jin Im, Katherine Yelick, and Richard Vuduc. Sparsity: Optimizationframework for sparse matrix kernels. Int. J. High Perform. Comput. Appl., 18(1):135-158, 2004.
    • (2004) Int. J. High Perform. Comput. Appl , vol.18 , Issue.1 , pp. 135-158
    • Im, E.-J.1    Yelick, K.2    Vuduc, R.3
  • 14
    • 35048854568 scopus 로고    scopus 로고
    • S. Lee, T. Johnson, and R. Eigenmann. Cetus - an extensible compiler infrastructure for source-to-source transformation. In In Proceedings of the 16th Annual Workshop on Languages and Compilers for Parallel Computing (LCPC), pages 539-553, 2003.
    • S. Lee, T. Johnson, and R. Eigenmann. Cetus - an extensible compiler infrastructure for source-to-source transformation. In In Proceedings of the 16th Annual Workshop on Languages and Compilers for Parallel Computing (LCPC), pages 539-553, 2003.
  • 17
    • 78651550268 scopus 로고    scopus 로고
    • Scalable parallel programming with CUDA
    • March/ April
    • John Nickolls, Ian Buck, Michael Garland, and Kevin Skadron. Scalable parallel programming with CUDA. ACM Queue, pages 40-53, March/ April 2008.
    • (2008) ACM Queue , pp. 40-53
    • Nickolls, J.1    Buck, I.2    Garland, M.3    Skadron, K.4
  • 25
    • 0343462141 scopus 로고    scopus 로고
    • Automated empirical optimizations of software and the ATLAS project
    • R. C. Whaley, A. Petitet, and J. Dongarra. Automated empirical optimizations of software and the ATLAS project. Parallel Computing, 27(1-2):3-35, 2001.
    • (2001) Parallel Computing , vol.27 , Issue.1-2 , pp. 3-35
    • Whaley, R.C.1    Petitet, A.2    Dongarra, J.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.