SCOPUS 정보 검색 플랫폼

International Conference for High Performance Computing, Networking, Storage and Analysis, SC

Volumn , Issue , 2012, Pages

Early evaluation of directive-based GPU programming models for productive exascale computing

(2) Lee, Seyong a Vetter, Jeffrey S a,b

a OAK RIDGE NATIONAL LABORATORY (United States)

b GEORGIA INSTITUTE OF TECHNOLOGY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

EARLY EVALUATION; EXASCALE COMPUTING; GRAPHICS PROCESSING UNIT; HIGH PERFORMANCE COMPUTING; LEVELS OF ABSTRACTION; PARALLEL COMPUTER ARCHITECTURE; PERFORMANCE POTENTIALS; PROGRAMMING COMPLEXITY;

COMPUTER ARCHITECTURE; COMPUTER GRAPHICS; PROGRAM PROCESSORS;

COMPUTER GRAPHICS EQUIPMENT;

EID: 84877704241 PISSN: 21674329 EISSN: 21674337 Source Type: Conference Proceeding
DOI: 10.1109/SC.2012.51 Document Type: Conference Paper

Times cited : (59)

References (25)

1
- 80052312080
- Keeneland: Bringing heterogeneous gpu computing to the computational science community
- J. S. Vetter, R. Glassbrook, J. Dongarra, K. Schwan, B. Loftis, S. McNally, J. Meredith, J. Rogers, P. Roth, K. Spafford, and S. Yalamanchili, "Keeneland: Bringing heterogeneous gpu computing to the computational science community," IEEE Computing in Science and Engineering, vol. 13, no. 5, pp. 90-95, 2011.
- (2011) IEEE Computing in Science and Engineering , vol.13 , Issue.5 , pp. 90-95
- Vetter, J.S.¹ Glassbrook, R.² Dongarra, J.³ Schwan, K.⁴ Loftis, B.⁵ McNally, S.⁶ Meredith, J.⁷ Rogers, J.⁸ Roth, P.⁹ Spafford, K.¹⁰ Yalamanchili, S.¹¹

2
- 79951595196
- The International Exascale Software Project RoadMap
- J. Dongarra, P. Beckman, T. Moore, P. Aerts, G. Aloisio, J.-. Andre, D. Barkai, J.-. Berthou, T. Boku, B. Braunschweig, F. Cappello, B. Chapman, X. Chi, A. Choudhary, S. Dosanjh, T. Dunning, S. Fiore, A. Geist, B. Gropp, RobertHarrison, M. Hereld, M. Heroux, A. Hoisie, K. Hotta, Y. Ishikawa, Z. Jin, F. Johnson, S. Kale, R. Kenway, D. Keyes, B. Kramer, J. Labarta, A. Lichnewsky, T. Lippert, B. Lucas, B. Maccabe, S. Matsuoka, P. Messina, P. Michielse, B. Mohr, M. Mueller, W. Nagel, H. Nakashima, M. E. Papka, D. Reed, M. Sato, E. Seidel, J. Shalf, D. Skinner, M. Snir, T. Sterling, R. Stevens, F. Streitz, B. Sugar, S. Sumimoto, W. Tang, J. Taylor, R. Thakur, A. Trefethen, M. Valero, A. van der Steen, J. Vetter, P. Williams, R. Wisniewski, and K. Yelick, "The International Exascale Software Project RoadMap," Journal of High Performance Computer Applications, vol. 25, no. 1, 2011.
- (2011) Journal of High Performance Computer Applications , vol.25 , Issue.1
- Dongarra, J.¹ Beckman, P.² Moore, T.³ Aerts, P.⁴ Aloisio, G.⁵ Andre, J.⁶ Barkai, D.⁷ Berthou, J.⁸ Boku, T.⁹ Braunschweig, B.¹⁰ Cappello, F.¹¹ Chapman, B.¹² Chi, X.¹³ Choudhary, A.¹⁴ Dosanjh, S.¹⁵ Dunning, T.¹⁶ Fiore, S.¹⁷ Geist, A.¹⁸ Gropp, B.¹⁹ Harrison, R.²⁰ more..

3
- 84877709144
- US Department of Energy, Tech. Rep.
- S. Amarasinghe, M. Hall, R. Lethin, K. Pingali, D. Quinlan, V. Sarkar, J. Shalf, R. Lucas, K. Yelick, P. Balaji, P. C. Diniz, A. Koniges, M. Snir, and S. R. Sachs, "Report of the 2011 workshop on exascale programming challenges," US Department of Energy, Tech. Rep., 2011.
- (2011) Report of the 2011 Workshop on Exascale Programming Challenges
- Amarasinghe, S.¹ Hall, M.² Lethin, R.³ Pingali, K.⁴ Quinlan, D.⁵ Sarkar, V.⁶ Shalf, J.⁷ Lucas, R.⁸ Yelick, K.⁹ Balaji, P.¹⁰ Diniz, P.C.¹¹ Koniges, A.¹² Snir, M.¹³ Sachs, S.R.¹⁴

4
- 84877708113
- Sh, available: (accessed April 02, 2012)
- Sh, "Sh: A metaprogramming language for programmable GPUs. [online]. available: http://www.libsh.org," (accessed April 02, 2012).
- Sh: A Metaprogramming Language for Programmable GPUs. [Online]

5
- 84877609547
- Brook for GPUs: Stream computing on graphics hardware
- New York, NY, USA: ACM
- I. Buck, T. Foley, D. Horn, J. Sugerman, K. Fatahalian, M. Houston, and P. Hanrahan, "Brook for GPUs: stream computing on graphics hardware," in SIGGRAPH '04: ACM SIGGRAPH 2004 Papers. New York, NY, USA: ACM, 2004, pp. 777-786.
- (2004) SIGGRAPH '04: ACM SIGGRAPH 2004 Papers , pp. 777-786
- Buck, I.¹ Foley, T.² Horn, D.³ Sugerman, J.⁴ Fatahalian, K.⁵ Houston, M.⁶ Hanrahan, P.⁷

6
- 57349101237
- Data and computation transformations for brook streaming applications on multiprocessors
- Washington, DC, USA: IEEE Computer Society
- S. wei Liao, Z. Du, G. Wu, and G.-Y. Lueh, "Data and computation transformations for brook streaming applications on multiprocessors," in CGO '06: Proceedings of the International Symposium on Code Generation and Optimization. Washington, DC, USA: IEEE Computer Society, 2006, pp. 196-207.
- (2006) CGO '06: Proceedings of the International Symposium on Code Generation and Optimization , pp. 196-207
- Liao, S.W.¹ Du, Z.² Wu, G.³ Lueh, G.-Y.⁴

7
- 77951558943
- A performance-oriented data parallel virtual machine for GPUs
- New York, NY, USA: ACM
- M. Peercy, M. Segal, and D. Gerstmann, "A performance-oriented data parallel virtual machine for GPUs," in SIGGRAPH '06: ACM SIGGRAPH 2006 Sketches. New York, NY, USA: ACM, 2006, p. 184.
- (2006) SIGGRAPH '06: ACM SIGGRAPH 2006 Sketches , pp. 184
- Peercy, M.¹ Segal, M.² Gerstmann, D.³

8
- 84870766925
- CUDA, available: (accessed April 02, 2012)
- CUDA, "NVIDIA CUDA [online]. available: http://developer.nvidia.com/ category/zone/cuda-zone," 2012, (accessed April 02, 2012).
- (2012) NVIDIA CUDA [Online]

9
- 84870744206
- OpenCL, Available: (accessed April 02, 2012)
- OpenCL, "OpenCL [Online]. Available: http://www.khronos.org/opencl/, " 2012, (accessed April 02, 2012).
- (2012) OpenCL [Online]

10
- 84877712851
- Available: (accessed April 02, 2012)
- OpenMP, "OpenMP [Online]. Available: http://openmp.org/wp/," 2012, (accessed April 02, 2012).
- (2012) OpenMP [Online]

11
- 78649898391
- Hicuda: High-level gpgpu programming
- T. D. Han and T. S. Abdelrahman, "hicuda: High-level gpgpu programming," IEEE Transactions on Parallel and Distributed Systems, vol. 22, no. 1, pp. 78-90, 2011.
- (2011) IEEE Transactions on Parallel and Distributed Systems , vol.22 , Issue.1 , pp. 78-90
- Han, T.D.¹ Abdelrahman, T.S.²

12
- 78650802947
- OpenMPC: Extended OpenMP programming and tuning for GPUs
- IEEE press
- S. Lee and R. Eigenmann, "OpenMPC: Extended OpenMP programming and tuning for GPUs," in SC'10: Proceedings of the 2010 ACM/IEEE conference on Supercomputing. IEEE press, 2010.
- (2010) SC'10: Proceedings of the 2010 ACM/IEEE Conference on Supercomputing
- Lee, S.¹ Eigenmann, R.²

13
- 77952268356
- PGI-Accelerator, Available: (accessed April 02, 2012)
- PGI-Accelerator, "The Portland Group, PGI Fortran and C Accelarator Programming Model [Online]. Available: http://www.pgroup.com/resources/accel. htm," 2009, (accessed April 02, 2012).
- (2009) PGI Fortran and C Accelarator Programming Model [Online]

14
- 84874036290
- HMPP, Available: (accessed April 02, 2012)
- HMPP, "HMPP Workbench, a directive-based compiler for hybrid computing [Online]. Available: www.caps-entreprise.com/hmpp.html," 2009, (accessed April 02, 2012).
- (2009) HMPP Workbench, a Directive-based Compiler for Hybrid Computing [Online]

15
- 77952264175
- A mapping path for multi-GPGPU accelerated computers from a portable high level programming abstraction
- Proceedings of the 3rd Workshop on General-Purpose Computation on Graphics Processing Units, ser. New York, NY, USA: ACM
- A. Leung, N. Vasilache, B. Meister, M. Baskaran, D. Wohlford, C. Bastoul, and R. Lethin, "A mapping path for multi-GPGPU accelerated computers from a portable high level programming abstraction," in Proceedings of the 3rd Workshop on General-Purpose Computation on Graphics Processing Units, ser. GPGPU '10. New York, NY, USA: ACM, 2010, pp. 51-61.
- (2010) GPGPU '10 , pp. 51-61
- Leung, A.¹ Vasilache, N.² Meister, B.³ Baskaran, M.⁴ Wohlford, D.⁵ Bastoul, C.⁶ Lethin, R.⁷

16
- 84867263494
- Available: (accessed April 02, 2012)
- OpenACC, "OpenACC: Directives for Accelerators [Online]. Available: http://www.openacc-standard.org," 2011, (accessed April 02, 2012).
- (2011) OpenACC: Directives for Accelerators [Online]

17
- 79959202540
- OpenMP for Accelerators
- J. C. Beyer, E. J. Stotzer, A. Hart, and B. R. de Supinski, "OpenMP for Accelerators." in IWOMP'11, 2011, pp. 108-121.
- (2011) IWOMP'11 , pp. 108-121
- Beyer, J.C.¹ Stotzer, E.J.² Hart, A.³ De Supinski, B.R.⁴

18
- 84877712802
- Experiences with High-Level Programming Directives for Porting Applications to GPUs
- Springer Berlin Heidelberg
- O. Hernandez, W. Ding, B. Chapman, C. Kartsaklis, R. Sankaran, and R. Graham, "Experiences with High-Level Programming Directives for Porting Applications to GPUs," in Facing the Multicore - Challenge II. Springer Berlin Heidelberg, 2012, pp. 96-107.
- (2012) Facing the Multicore - Challenge II , pp. 96-107
- Hernandez, O.¹ Ding, W.² Chapman, B.³ Kartsaklis, C.⁴ Sankaran, R.⁵ Graham, R.⁶

19
- 70649092154
- Rodinia: A benchmark suite for heterogeneous computing
- S. Che, M. Boyer, J. Meng, D. Tarjan, J. W. Sheaffer, S. ha Lee, and K. Skadron, "Rodinia: A benchmark suite for heterogeneous computing," in Proceedings of the IEEE International Symposium on Workload Characterization (IISWC), 2009.
- Proceedings of the IEEE International Symposium on Workload Characterization (IISWC), 2009
- Che, S.¹ Boyer, M.² Meng, J.³ Tarjan, D.⁴ Sheaffer, J.W.⁵ Lee, S.H.⁶ Skadron, K.⁷

20
- 84877716238
- Available: (accessed April 02, 2012)
- L. L. Pilla, "Hpcgpu Project [Online]. Available: http://hpcgpu.codeplex.com/," 2012, (accessed April 02, 2012).
- (2012) Hpcgpu Project [Online]
- Pilla, L.L.¹

21
- 70350583252
- OpenMP to GPGPU: A compiler framework for automatic translation and optimization
- New York, NY, USA: ACM, Feb.
- S. Lee, S.-J. Min, and R. Eigenmann, "OpenMP to GPGPU: A compiler framework for automatic translation and optimization," in ACM SIG-PLAN Symposium on Principles and Practice of Parallel Programming (PPoPP). New York, NY, USA: ACM, Feb. 2009, pp. 101-110.
- (2009) ACM SIG-PLAN Symposium on Principles and Practice of Parallel Programming (PPoPP) , pp. 101-110
- Lee, S.¹ Min, S.-J.² Eigenmann, R.³

22
- 77956200064
- An effective GPU implementation of breadth-first search
- Proceedings of the 47th Design Automation Conference, ser. New York, NY, USA: ACM
- L. Luo, M. Wong, and W.-m. Hwu, "An effective GPU implementation of breadth-first search," in Proceedings of the 47th Design Automation Conference, ser. DAC '10. New York, NY, USA: ACM, 2010, pp. 52-55.
- (2010) DAC '10 , pp. 52-55
- Luo, L.¹ Wong, M.² Hwu, W.-M.³

23
- 84877693197
- CUDA-reduction, available: (accessed April 02, 2012)
- CUDA-reduction, "NVIDIA CUDA SDK - CUDA Parallel Reduction [online]. available: http://developer.nvidia.com/cuda-cc-sdk-code-samples#reduction, " 2012, (accessed April 02, 2012).
- (2012) NVIDIA CUDA SDK - CUDA Parallel Reduction [Online]

24
- 80054871942
- Performance implications of nonuniform device topologies in scalable heterogeneous architectures
- [Online]. Available
- J. S. Meredith, P. C. Roth, K. L. Spafford, and J. S. Vetter, "Performance implications of nonuniform device topologies in scalable heterogeneous architectures," IEEE Micro, vol. 31, no. 5, pp. 66-75, 2011. [Online]. Available: http://dx.doi.org/10.1109/MM.2011.79
- (2011) IEEE Micro , vol.31 , Issue.5 , pp. 66-75
- Meredith, J.S.¹ Roth, P.C.² Spafford, K.L.³ Vetter, J.S.⁴

25
- 84862695013
- The tradeoffs of fused memory hierarchies in heterogeneous architectures
- Cagliari, Italy: ACM
- K. Spafford, J. S. Meredith, S. Lee, D. Li, P. C. Roth, and J. S. Vetter, "The tradeoffs of fused memory hierarchies in heterogeneous architectures," in ACM Computing Frontiers (CF). Cagliari, Italy: ACM, 2012.
- (2012) ACM Computing Frontiers (CF)
- Spafford, K.¹ Meredith, J.S.² Lee, S.³ Li, D.⁴ Roth, P.C.⁵ Vetter, J.S.⁶

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.