메뉴 건너뛰기




Volumn , Issue , 2012, Pages

Early evaluation of directive-based GPU programming models for productive exascale computing

Author keywords

[No Author keywords available]

Indexed keywords

EARLY EVALUATION; EXASCALE COMPUTING; GRAPHICS PROCESSING UNIT; HIGH PERFORMANCE COMPUTING; LEVELS OF ABSTRACTION; PARALLEL COMPUTER ARCHITECTURE; PERFORMANCE POTENTIALS; PROGRAMMING COMPLEXITY;

EID: 84877704241     PISSN: 21674329     EISSN: 21674337     Source Type: Conference Proceeding    
DOI: 10.1109/SC.2012.51     Document Type: Conference Paper
Times cited : (59)

References (25)
  • 7
    • 77951558943 scopus 로고    scopus 로고
    • A performance-oriented data parallel virtual machine for GPUs
    • New York, NY, USA: ACM
    • M. Peercy, M. Segal, and D. Gerstmann, "A performance-oriented data parallel virtual machine for GPUs," in SIGGRAPH '06: ACM SIGGRAPH 2006 Sketches. New York, NY, USA: ACM, 2006, p. 184.
    • (2006) SIGGRAPH '06: ACM SIGGRAPH 2006 Sketches , pp. 184
    • Peercy, M.1    Segal, M.2    Gerstmann, D.3
  • 8
    • 84870766925 scopus 로고    scopus 로고
    • CUDA, available: (accessed April 02, 2012)
    • CUDA, "NVIDIA CUDA [online]. available: http://developer.nvidia.com/ category/zone/cuda-zone," 2012, (accessed April 02, 2012).
    • (2012) NVIDIA CUDA [Online]
  • 9
    • 84870744206 scopus 로고    scopus 로고
    • OpenCL, Available: (accessed April 02, 2012)
    • OpenCL, "OpenCL [Online]. Available: http://www.khronos.org/opencl/, " 2012, (accessed April 02, 2012).
    • (2012) OpenCL [Online]
  • 10
    • 84877712851 scopus 로고    scopus 로고
    • Available: (accessed April 02, 2012)
    • OpenMP, "OpenMP [Online]. Available: http://openmp.org/wp/," 2012, (accessed April 02, 2012).
    • (2012) OpenMP [Online]
  • 13
    • 77952268356 scopus 로고    scopus 로고
    • PGI-Accelerator, Available: (accessed April 02, 2012)
    • PGI-Accelerator, "The Portland Group, PGI Fortran and C Accelarator Programming Model [Online]. Available: http://www.pgroup.com/resources/accel. htm," 2009, (accessed April 02, 2012).
    • (2009) PGI Fortran and C Accelarator Programming Model [Online]
  • 15
    • 77952264175 scopus 로고    scopus 로고
    • A mapping path for multi-GPGPU accelerated computers from a portable high level programming abstraction
    • Proceedings of the 3rd Workshop on General-Purpose Computation on Graphics Processing Units, ser. New York, NY, USA: ACM
    • A. Leung, N. Vasilache, B. Meister, M. Baskaran, D. Wohlford, C. Bastoul, and R. Lethin, "A mapping path for multi-GPGPU accelerated computers from a portable high level programming abstraction," in Proceedings of the 3rd Workshop on General-Purpose Computation on Graphics Processing Units, ser. GPGPU '10. New York, NY, USA: ACM, 2010, pp. 51-61.
    • (2010) GPGPU '10 , pp. 51-61
    • Leung, A.1    Vasilache, N.2    Meister, B.3    Baskaran, M.4    Wohlford, D.5    Bastoul, C.6    Lethin, R.7
  • 16
    • 84867263494 scopus 로고    scopus 로고
    • Available: (accessed April 02, 2012)
    • OpenACC, "OpenACC: Directives for Accelerators [Online]. Available: http://www.openacc-standard.org," 2011, (accessed April 02, 2012).
    • (2011) OpenACC: Directives for Accelerators [Online]
  • 20
    • 84877716238 scopus 로고    scopus 로고
    • Available: (accessed April 02, 2012)
    • L. L. Pilla, "Hpcgpu Project [Online]. Available: http://hpcgpu.codeplex.com/," 2012, (accessed April 02, 2012).
    • (2012) Hpcgpu Project [Online]
    • Pilla, L.L.1
  • 22
    • 77956200064 scopus 로고    scopus 로고
    • An effective GPU implementation of breadth-first search
    • Proceedings of the 47th Design Automation Conference, ser. New York, NY, USA: ACM
    • L. Luo, M. Wong, and W.-m. Hwu, "An effective GPU implementation of breadth-first search," in Proceedings of the 47th Design Automation Conference, ser. DAC '10. New York, NY, USA: ACM, 2010, pp. 52-55.
    • (2010) DAC '10 , pp. 52-55
    • Luo, L.1    Wong, M.2    Hwu, W.-M.3
  • 23
    • 84877693197 scopus 로고    scopus 로고
    • CUDA-reduction, available: (accessed April 02, 2012)
    • CUDA-reduction, "NVIDIA CUDA SDK - CUDA Parallel Reduction [online]. available: http://developer.nvidia.com/cuda-cc-sdk-code-samples#reduction, " 2012, (accessed April 02, 2012).
    • (2012) NVIDIA CUDA SDK - CUDA Parallel Reduction [Online]
  • 24
    • 80054871942 scopus 로고    scopus 로고
    • Performance implications of nonuniform device topologies in scalable heterogeneous architectures
    • [Online]. Available
    • J. S. Meredith, P. C. Roth, K. L. Spafford, and J. S. Vetter, "Performance implications of nonuniform device topologies in scalable heterogeneous architectures," IEEE Micro, vol. 31, no. 5, pp. 66-75, 2011. [Online]. Available: http://dx.doi.org/10.1109/MM.2011.79
    • (2011) IEEE Micro , vol.31 , Issue.5 , pp. 66-75
    • Meredith, J.S.1    Roth, P.C.2    Spafford, K.L.3    Vetter, J.S.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.