메뉴 건너뛰기




Volumn 9, Issue 4, 2013, Pages

Polyhedral parallel code generation for CUDA

Author keywords

C to CUDA; Code generation; Compilers; CUDA; GPU; Loop transformations; Par4All; Polyhedral model; PPCG

Indexed keywords

C-TO-CUDA; CODE GENERATION; CUDA; GPU; LOOP TRANSFORMATION; PAR4ALL; POLYHEDRAL MODELS; PPCG;

EID: 84872943015     PISSN: 15443566     EISSN: 15443973     Source Type: Journal    
DOI: 10.1145/2400682.2400713     Document Type: Article
Times cited : (311)

References (50)
  • 1
    • 0023438847 scopus 로고
    • Automatic translation of fortran programs to vector form
    • ALLEN, R. AND KENNEDY, K. 1987. Automatic translation of fortran programs to vector form. ACM Trans. Program. Lang. Syst. 9, 4, 491-542.
    • (1987) ACM Trans. Program. Lang. Syst. , vol.9 , Issue.4 , pp. 491-542
    • Allen, R.1    Kennedy, K.2
  • 11
    • 57349139452 scopus 로고    scopus 로고
    • A practical automatic polyhedral parallelizer and locality optimizer
    • BONDHUGULA, U., HARTONO, A., RAMANUJAM, J., AND SADAYAPPAN, P. 2008a. A practical automatic polyhedral parallelizer and locality optimizer. SIGPLAN Not. 43, 6, 101-113.
    • (2008) SIGPLAN Not. , vol.43 , Issue.6 , pp. 101-113
    • Bondhugula, U.1    Hartono, A.2    Ramanujam, J.3    Sadayappan, P.4
  • 13
    • 0032066690 scopus 로고    scopus 로고
    • Loop parallelization algorithms: From parallelism extraction to code generation
    • BOULET, P., DARTE, A., SILBER, G.-A., AND VIVIEN, F. 1998. Loop parallelization algorithms: From parallelism extraction to code generation. Parallel Comput. 24, 421-444.
    • (1998) Parallel Comput. , vol.24 , pp. 421-444
    • Boulet, P.1    Darte, A.2    Silber, G.-A.3    Vivien, F.4
  • 16
    • 0026109335 scopus 로고
    • Dataflow analysis of array and scalar references
    • FEAUTRIER, P. 1991. Dataflow analysis of array and scalar references. Int. J. Parallel Program. 20, 1, 23-53.
    • (1991) Int. J. Parallel Program. , vol.20 , Issue.1 , pp. 23-53
    • Feautrier, P.1
  • 17
    • 0026933251 scopus 로고
    • Some efficient solutions to the affine scheduling problem. Part I. one-dimensional time
    • FEAUTRIER, P. 1992a. Some efficient solutions to the affine scheduling problem. Part I. One-Dimensional time. Int. J. Parallel Program. 21, 313-347.
    • (1992) Int. J. Parallel Program. , vol.21 , pp. 313-347
    • Feautrier, P.1
  • 18
    • 0001448065 scopus 로고
    • Some efficient solutions to the affine scheduling problem. Part II. Multidimensional time
    • FEAUTRIER, P. 1992b. Some efficient solutions to the affine scheduling problem. Part II. Multidimensional time. Int. J. Parallel Program. 21, 389-420.
    • (1992) Int. J. Parallel Program. , vol.21 , pp. 389-420
    • Feautrier, P.1
  • 21
    • 70350627685 scopus 로고    scopus 로고
    • Precise management of scratchpad memories for localising array accesses in scientific codes
    • Springer
    • GRÖSSLINGER, A. 2009. Precise management of scratchpad memories for localising array accesses in scientific codes. In CC'09. Springer, 236-250.
    • (2009) CC'09 , pp. 236-250
    • Grösslinger, A.1
  • 31
    • 35948991669 scopus 로고    scopus 로고
    • NVIDIA Corporation, NVIDIA Corporation
    • NVIDIA Corporation 2011. NVIDIA CUDA Programming guide 4.0. NVIDIA Corporation.
    • (2011) NVIDIA CUDA Programming guide 4.0
  • 46
    • 78149237521 scopus 로고    scopus 로고
    • Isl: An integer set library for the polyhedral model
    • K. Fukuda, J. Hoeven, M. Joswig, and N. Takayama, Eds. Lecture Notes in Computer Science Series Springer
    • VERDOOLAEGE, S. 2010. isl: An integer set library for the polyhedral model. In International Conference on Mathematical Software (ICMS'10), K. Fukuda, J. Hoeven, M. Joswig, and N. Takayama, Eds. Lecture Notes in Computer Science Series, vol. 6327. Springer, 299-302.
    • (2010) International Conference on Mathematical Software (ICMS'10) , vol.6327 , pp. 299-302
    • Verdoolaege, S.1
  • 48
    • 13244279577 scopus 로고    scopus 로고
    • Minimizing development and maintenance costs in supporting persistently optimized BLAS
    • WHALEY, R. C. AND PETITET, A. 2005. Minimizing development and maintenance costs in supporting persistently optimized BLAS. Softw. Pract. Exper. 35, 2, 101-121. http://www.cs.utsa.edu/-whaley/papers/spercw04.ps+.
    • (2005) Softw. Pract. Exper. , vol.35 , Issue.2 , pp. 101-121
    • Whaley, R.C.1    Petitet, A.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.