메뉴 건너뛰기




Volumn , Issue , 2012, Pages 557-568

Productive programming of GPU clusters with OmpSs

Author keywords

accelerators; Cluster programming; GPGPU computing; OpenMP

Indexed keywords

ASYNCHRONY; COMPUTATIONAL TASK; GPGPU COMPUTING; GPU CLUSTERS; HYBRID MODEL; OPENMP; PARALLELIZATIONS; REMOTE NODE; RUNTIME SYSTEMS; TASK PARALLELISM; TASK-BASED;

EID: 84866856745     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IPDPS.2012.58     Document Type: Conference Paper
Times cited : (135)

References (36)
  • 2
    • 57949083229 scopus 로고    scopus 로고
    • A dependency-aware task-based programming environment for multi-core architectures
    • September
    • J. M. Perez, R. M. Badia, and J. Labarta, "A dependency-aware task-based programming environment for multi-core architectures," IEEE Int. Conference on Cluster Computing, pp. 142-151, September 2008.
    • (2008) IEEE Int. Conference on Cluster Computing , pp. 142-151
    • Perez, J.M.1    Badia, R.M.2    Labarta, J.3
  • 3
    • 35649006026 scopus 로고    scopus 로고
    • CellSs: Making it easier to program the Cell Broadband Engine processor
    • September
    • J. M. Perez, P. Bellens, R. M. Badia, and J. Labarta, "CellSs: Making it easier to program the Cell Broadband Engine processor," IBM Journal of Research and Development, vol. 51, no. 5, pp. 593-604, September 2007.
    • (2007) IBM Journal of Research and Development , vol.51 , Issue.5 , pp. 593-604
    • Perez, J.M.1    Bellens, P.2    Badia, R.M.3    Labarta, J.4
  • 11
    • 79957528059 scopus 로고    scopus 로고
    • Trace-driven Simulation of Multithreaded Applications
    • to appear
    • "Trace-driven Simulation of Multithreaded Applications," in Proceedings of the 2011 ISPASS (to appear), 2011.
    • (2011) Proceedings of the 2011 ISPASS
  • 12
    • 84866874333 scopus 로고    scopus 로고
    • Master's thesis, Computer Architecture Department, Universitat Politècnica de Catalunya
    • L. Martinell, ""Memory usage improvements for the SMPSs runtime"," Master's thesis, Computer Architecture Department, Universitat Politècnica de Catalunya, 2010.
    • (2010) Memory Usage Improvements for the SMPSs Runtime
    • Martinell, L.1
  • 22
    • 79951728783 scopus 로고    scopus 로고
    • 8 December [Online]. Available
    • Khronos OpenCLWorking Group, The OpenCL Specification, version 1.0.29, 8 December 2008. [Online]. Available: http://khronos.org/registry/cl/specs/opencl- 1.0.29.pdf
    • (2008) The OpenCL Specification, Version 1.0.29
  • 24
    • 84856841346 scopus 로고    scopus 로고
    • Performance analysis of a hybrid mpi/cuda implementation of the naslu benchmark
    • March
    • S. J. Pennycook, S. D. Hammond, S. A. Jarvis, and G. R. Mudalige, "Performance analysis of a hybrid mpi/cuda implementation of the naslu benchmark," SIGMETRICS Perform. Eval. Rev., vol. 38, pp. 23-29, March 2011.
    • (2011) SIGMETRICS Perform. Eval. Rev. , vol.38 , pp. 23-29
    • Pennycook, S.J.1    Hammond, S.D.2    Jarvis, S.A.3    Mudalige, G.R.4
  • 29
    • 79952596877 scopus 로고    scopus 로고
    • Unified parallel c for gpu clusters: Language extensions and compiler implementation
    • Languages and Compilers for Parallel Computing, ser. K. Cooper, J. Mellor-Crummey, and V. Sarkar, Eds. Springer Berlin / Heidelberg
    • L. Chen, L. Liu, S. Tang, L. Huang, Z. Jing, S. Xu, D. Zhang, and B. Shou, "Unified parallel c for gpu clusters: Language extensions and compiler implementation," in Languages and Compilers for Parallel Computing, ser. Lecture Notes in Computer Science, K. Cooper, J. Mellor-Crummey, and V. Sarkar, Eds. Springer Berlin / Heidelberg, 2011, vol. 6548, pp. 151-165.
    • (2011) Lecture Notes in Computer Science , vol.6548 , pp. 151-165
    • Chen, L.1    Liu, L.2    Tang, S.3    Huang, L.4    Jing, Z.5    Xu, S.6    Zhang, D.7    Shou, B.8
  • 31
  • 33
    • 78649498878 scopus 로고    scopus 로고
    • Offload - Automating code migration to heterogeneous multicore systems
    • Lecture Notes in Computer Science
    • P. Cooper, U. Dolinsky, A. F. Donaldson, A. Richards, C. Riley, and G. Russell, "Offload - automating code migration to heterogeneous multicore systems," in Lecture Notes in Computer Science, HiPEAC Conference 2010, 2010, pp. 307-321.
    • (2010) HiPEAC Conference 2010 , pp. 307-321
    • Cooper, P.1    Dolinsky, U.2    Donaldson, A.F.3    Richards, A.4    Riley, C.5    Russell, G.6
  • 35
    • 84865717999 scopus 로고    scopus 로고
    • Portland Group Inc., Sep
    • Portland Group Inc., "PGI Accelerator Compilers," Sep 2011.
    • (2011) PGI Accelerator Compilers


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.