SCOPUS 정보 검색 플랫폼

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volumn 6548 LNCS, Issue , 2011, Pages 151-165

Unified parallel C for GPU clusters: Language extensions and compiler implementation

(8) Chen, Li a Liu, Lei a Tang, Shenglin a Huang, Lei b Jing, Zheng a Xu, Shixiong a Zhang, Dingfei a Shou, Baojiang a

a INSTITUTE OF GEOLOGY AND GEOPHYSICS (China)

b University of Houston ^* (United States)

Author keywords

[No Author keywords available]

Indexed keywords

CODE GENERATION; COMPILE TIME; COMPILER IMPLEMENTATION; DATA MANAGEMENT; EXECUTION MODEL; GPU CLUSTERS; GRAPHICS PROCESSING UNITS; HIERARCHICAL DATA; HIGH PERFORMANCE COMPUTING; LANGUAGE EXTENSIONS; LOOP TILING; MEMORY LAYOUT; MEMORY MODULES; MEMORY OPTIMIZATION; PARALLEL MACHINE; PROGRAMMABILITY; RUNTIME OPTIMIZATION; UNIFIED PARALLEL C;

CACHE MEMORY; COMPUTER SOFTWARE SELECTION AND EVALUATION; DATA TRANSFER; INFORMATION MANAGEMENT; OPTIMIZATION; PARALLEL ARCHITECTURES; SEMANTICS;

PROGRAM COMPILERS;

EID: 79952596877 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-642-19595-2_11 Document Type: Conference Paper

Times cited : (15)

References (17)

1
- 34548207355
- Sequoia: Programming the memory hierarchy
- November
- Fatahalian, K., Knight, T., Houston, M., Erez, M., Horn, D., Leem, L., Park, H., Ren, M., Aiken, A., Dally, W., Hanrahan, P.: Sequoia: Programming the Memory Hierarchy. In: Proceedings of Supercomputing 2006 (November 2006)
- (2006) Proceedings of Supercomputing 2006
- Fatahalian, K.¹ Knight, T.² Houston, M.³ Erez, M.⁴ Horn, D.⁵ Leem, L.⁶ Park, H.⁷ Ren, M.⁸ Aiken, A.⁹ Dally, W.¹⁰ Hanrahan, P.¹¹

2
- 77954395858
- Hierarchical place trees: A portable abstraction for task parallelism and data movement
- Gao, G.R., Pollock, L.L., Cavazos, J., Li, X. (eds.), LCPC 2009,Springer, Heidelberg
- Yan, Y., Zhao, J., Guo, Y., Sarkar, V.: Hierarchical place trees: A portable abstraction for task parallelism and data movement. In: Gao, G.R., Pollock, L.L., Cavazos, J., Li, X. (eds.) LCPC 2009. LNCS, vol. 5898, pp. 172-187. Springer, Heidelberg (2010)
- (2010) LNCS , vol.5898 , pp. 172-187
- Yan, Y.¹ Zhao, J.² Guo, Y.³ Sarkar, V.⁴

3
- 76749086882
- Programming for parallelism and locality with hierarchically tiled arrays
- New York, USA, March
- Bikshandi, G., Guo, J., Hoeflinger, D., Almasi, G., Fraguela, B.B., Garzarán, M.J., Padua, D., von Praun, C.: Programming for parallelism and locality with hierarchically tiled arrays. In: PPoPP, New York, USA, March 29-31 (2006)
- (2006) PPoPP , pp. 29-31
- Bikshandi, G.¹ Guo, J.² Hoeflinger, D.³ Almasi, G.⁴ Fraguela, B.B.⁵ Garzarán, M.J.⁶ Padua, D.⁷ Von Praun, C.⁸

4
- 70350625706
- Performance without pain = productivity: Data layout and collective communication in UPC
- Nishtala, R., Almasi, G., Cascaval, C.: Performance without pain = productivity: data layout and collective communication in UPC. In: Proceedings of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP 2008 (2008)
- (2008) Proceedings of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP 2008
- Nishtala, R.¹ Almasi, G.² Cascaval, C.³

5
- 58449127539
- CUDA-lite: Reducing GPU programming complexity
- Amaral, J.N. (ed.),LCPC 2008,Springer, Heidelberg
- Ueng, S., Lathara, M., Baghsorkhi, S.S., Hwu, W.W.: CUDA-lite: Reducing GPU programming complexity. In: Amaral, J.N. (ed.) LCPC 2008. LNCS, vol. 5335, pp. 1-15. Springer, Heidelberg (2008)
- (2008) LNCS , vol.5335 , pp. 1-15
- Ueng, S.¹ Lathara, M.² Baghsorkhi, S.S.³ Hwu, W.W.⁴

6
- 67650673468
- HiCUDA: A high-level directive-based language for GPU programming
- March
- Han, T.D., Abdelrahman, T.S.: hiCUDA: a high-level directive-based language for GPU programming. In: Workshop on General Purpose Processing on Graphics Processing Units (GPGPU), pp. 52-61 (March 2009)
- (2009) Workshop on General Purpose Processing on Graphics Processing Units (GPGPU) , pp. 52-61
- Han, T.D.¹ Abdelrahman, T.S.²

7
- 70350678845
- JCUDA: A programmer-friendly interface for accelerating java programs with CUDA
- Sips, H., Epema, D., Lin, H.-X. (eds.),Euro-Par 2009,Springer, Heidelberg
- Yan, Y., et al.: JCUDA: a Programmer-Friendly Interface for Accelerating Java Programs with CUDA. In: Sips, H., Epema, D., Lin, H.-X. (eds.) Euro-Par 2009. LNCS, vol. 5704, pp. 887-899. Springer, Heidelberg (2009)
- (2009) LNCS , vol.5704 , pp. 887-899
- Yan, Y.¹

8
- 67650081010
- OpenMP to GPGPU: A compiler framework for automatic translation and optimization
- February
- Lee, S., Min, S.-J., Eigenmann, R.: OpenMP to GPGPU: a compiler framework for automatic translation and optimization. In: ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), pp. 101-110 (February 2009)
- (2009) ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP) , pp. 101-110
- Lee, S.¹ Min, S.-J.² Eigenmann, R.³

9
- 79952578213
- March
- The Portland Group. PGI Fortran &C Accelerator Programming Model (March 2010), http://grape.pgroup.com/lit/whitepapers/ pgi-accel-prog-model-1.2. pdf
- (2010) PGI Fortran &c Accelerator Programming Model

10
- 79952576354
- http://www.caps-entreprise.com/fr/ page/index.php?id=49&p-p=36

11
- 79952589187
- NVIDIA CUDA, China campus programming contest (2009), http://cuda.csdn.net/contest/pro
- (2009) China Campus Programming Contest

12
- 77954691442
- A GPGPU Compiler for memory optimization and parallelism management
- June
- Yang, Y., Xiang, P., Kong, J., Zhou, H.: A GPGPU Compiler for Memory Optimization and Parallelism Management. In: The ACM SIGNPLAN 2010 Conference on Programming Language Design and Implementation, PLDI 2010 (June 2010)
- (2010) The ACM SIGNPLAN 2010 Conference on Programming Language Design and Implementation, PLDI 2010
- Yang, Y.¹ Xiang, P.² Kong, J.³ Zhou, H.⁴

13
- 79952598285
- A UPC specification extension proposal for hierarchical parallelism
- Virginia USA,October
- Serres, O., Kayi, A., Anbar, A., El-Ghazawi, T.: A UPC Specification Extension Proposal for Hierarchical Parallelism. In: The 3rd Conference on Partitioned Global Address Space Programming Models, Virginia, USA (October 2009)
- (2009) The 3rd Conference on Partitioned Global Address Space Programming Models
- Serres, O.¹ Kayi, A.² Anbar, A.³ El-Ghazawi, T.⁴

14
- 79952579235
- Teams for co-array fortran
- Virginia USA,October
- Numrich, R.: Teams for Co-Array Fortran. In: The 3rd Conference on Partitioned Global Address Space Programming Models, Virginia, USA (October 2009)
- (2009) The 3rd Conference on Partitioned Global Address Space Programming Models
- Numrich, R.¹

15
- 33746070421
- Shared memory programming for large scale machines
- Ottawa, Ontario, Canada, June
- Barton, C., Casçaval, C., Almási, G., Zheng, Y., Farreras, M., Chatterje, S., Amaral, J.N.: Shared memory programming for large scale machines. In: Proceedings of the 2006 ACM SIGPLAN Conference on Programming Language Design and Implementation, Ottawa, Ontario, Canada, June 11-14 (2006)
- (2006) Proceedings of the 2006 ACM SIGPLAN Conference on Programming Language Design and Implementation , pp. 11-14
- Barton, C.¹ Casçaval, C.² Almási, G.³ Zheng, Y.⁴ Farreras, M.⁵ Chatterje, S.⁶ Amaral, J.N.⁷

16
- 33745219957
- A performance analysis of the Berkeley UPC compiler
- San Francisco,CA, USA, June
- Husbands, P., Iancu, C., Yelick, K.: A performance analysis of the Berkeley UPC compiler. In: Proceedings of the 17th Annual International Conference on Supercomputing, San Francisco, CA, USA, June 23-26 (2003)
- (2003) Proceedings of the 17th Annual International Conference on Supercomputing , pp. 23-26
- Husbands, P.¹ Iancu, C.² Yelick, K.³

17
- 79952586457
- Bauer, M., Clark, J., Schkufza, E., Aiken, A.: Sequoia++ User Manual, http://sequoia.stanford.edu/
- Sequoia++ User Manual
- Bauer, M.¹ Clark, J.² Schkufza, E.³ Aiken, A.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.