메뉴 건너뛰기




Volumn , Issue , 2013, Pages 1177-1185

OpenCL performance evaluation on modern multi core CPUs

Author keywords

Data Transfer; ILP; Locality; OpenCL Performance on CPU; Scheduling Overhead; Vectorization

Indexed keywords

DATA TRANSFER; PARALLEL PROGRAMMING; PROGRAM PROCESSORS; SCHEDULING;

EID: 84899746576     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IPDPSW.2013.141     Document Type: Conference Paper
Times cited : (10)

References (23)
  • 1
    • 84899707201 scopus 로고    scopus 로고
    • Intel, "Sandy Bridge," http://software.intel.com/enus/articles/ sandy-bridge/.
    • Intel, Sandy Bridge
  • 2
    • 84899725358 scopus 로고    scopus 로고
    • AMD
    • AMD, "Fusion," http://fusion.amd.com/.
    • Fusion
  • 4
    • 84870550035 scopus 로고    scopus 로고
    • Intel Corporation
    • Intel Corporation, "Intel OpenCL SDK," http://software.intel. com/en-us/articles/intel-opencl-sdk/.
    • Intel OpenCL SDK
  • 5
    • 84899711242 scopus 로고    scopus 로고
    • NVIDIA Corporation
    • NVIDIA Corporation, "NVIDIA OpenCL SDK," http://developer. nvidia.com/cuda/opencl/.
    • NVIDIA OpenCL SDK
  • 11
  • 14
    • 34247142766 scopus 로고    scopus 로고
    • ICC
    • ICC, "Intel c++ compiler," http://www.intel.com/cd/software/ products/asmo-na/eng/compilers/clin/277618.htm.
    • Intel C++ Compiler
  • 15
    • 84870653904 scopus 로고    scopus 로고
    • Ispc: A SPMD compiler for high-performance CPU programming
    • May
    • M. Pharr and W. R. Mark, "ispc: A SPMD Compiler for High-Performance CPU Programming," in InPar 2012, May 2012.
    • (2012) Par 2012
    • Pharr, M.1    Mark, W.R.2
  • 16
    • 79953286075 scopus 로고    scopus 로고
    • A static task partitioning approach for heterogeneous systems using OpenCL
    • D. Grewe and M. F. P. O'Boyle, "A static task partitioning approach for heterogeneous systems using OpenCL," in CC'11, 2011, pp. 286-305.
    • (2011) CC'11 , pp. 286-305
    • Grewe, D.1    O'Boyle, M.F.P.2
  • 18
    • 70450231944 scopus 로고    scopus 로고
    • An analytical model for a GPU architecture with memory-level and thread-level parallelism awareness
    • S. Hong and H. Kim, "An Analytical Model for a GPU Architecture with Memory-level and Thread-level Parallelism Awareness," in ISCA, 2009, pp. 152-163.
    • (2009) ISCA , pp. 152-163
    • Hong, S.1    Kim, H.2
  • 19
    • 84899737954 scopus 로고    scopus 로고
    • CUDA Programming Guide V4.0, NVIDIA Corporation
    • CUDA Programming Guide, V4.0, NVIDIA Corporation.
  • 20
    • 84899769427 scopus 로고    scopus 로고
    • Intel Corporation, Intel Corporation
    • Intel Corporation, Intel Corporation, http://software.intel. com/.
  • 22
    • 84862909323 scopus 로고    scopus 로고
    • Performance characterization of the NAS Parallel Benchmarks in OpenCL
    • S. Seo, G. Jo, and J. Lee, "Performance characterization of the NAS Parallel Benchmarks in OpenCL," in IISWC'11, 2011, pp. 137-148.
    • (2011) IISWC'11 , pp. 137-148
    • Seo, S.1    Jo, G.2    Lee, J.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.