SCOPUS 정보 검색 플랫폼

Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP

Volumn , Issue , 2009, Pages 101-110

OpenMP to GPGPU: A compiler framework for automatic translation and optimization

(3) Lee, Seyong a Min, Seung Jai a Eigenmann, Rudolf a

a PURDUE UNIVERSITY (United States)

Author keywords

Automatic translation; Compiler optimization; CUDA; GPU; OpenMP

Indexed keywords

AUTOMATIC TRANSLATION; COMPILER OPTIMIZATION; CUDA; GPU; OPENMP;

BENCHMARKING; COMPUTATIONAL GRAMMARS; OPTIMIZATION; PROGRAM COMPILERS;

PARALLEL PROGRAMMING;

EID: 67650081010 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/1504176.1504194 Document Type: Conference Paper

Times cited : (198)

References (20)

1
- 0023438847
- AUTOMATIC TRANSLATION OF FORTRAN PROGRAMS TO VECTOR FORM.
- DOI 10.1145/29873.29875
- Randy Allen and Ken Kennedy. Automatic translation of FORTRAN programs to vector form. ACM Transactions on Programming Languages and Systems, 9(4):491-542, October 1987. (Pubitemid 18531687)
- (1987) ACM Transactions on Programming Languages and Systems , vol.9 , Issue.4 , pp. 491-542
- Allen Randy¹ Kennedy Ken²

2
- 57349180412
- A compiler framework for optimization of affine loop nests for GPGPUs
- M. M. Baskaran, U. Bondhugula, S. Krishnamoorthy, J. Ramanujam, A. Rountev, and P. Sadayappan. A compiler framework for optimization of affine loop nests for GPGPUs. ACM International Conference on Supercomputing (ICS), 2008.
- (2008) ACM International Conference on Supercomputing (ICS)
- Baskaran, M.M.¹ Bondhugula, U.² Krishnamoorthy, S.³ Ramanujam, J.⁴ Rountev, A.⁵ Sadayappan, P.⁶

3
- 32844474242
- Towards automatic translation of OpenMP to MPI
- DOI 10.1145/1088149.1088174, ICS05 - Proceedings of the 19th ACM International Conference on Supercomputing
- Ayon Basumallik and Rudolf Eigenmann. Towards automatic translation of OpenMP to MPI. ACM International Conference on Supercomputing (ICS), pages 189-198, 2005. (Pubitemid 43251323)
- (2005) Proceedings of the International Conference on Supercomputing , pp. 189-198
- Basumallik, A.¹ Eigenmann, R.²

4
- 84870629709
- online available
- NVIDIA CUDA [online]. available: http://developer.nvidia.com/object/cuda home.html.
- NVIDIA CUDA

5
- 67650016770
- online. available
- NVIDIA CUDA SDK - Data-Parallel Algorithms: Parallel Reduction [online]. available: http://developer.download.nvidia.com/compute/cuda/1 1/Website/Data-Parallel Algorithms.html.
- NVIDIA CUDA SDK - Data-Parallel Algorithms: Parallel Reduction

6
- 0012453312
- online, available
- Tim Davis. University of Florida Sparse Matrix Collection [online]. available: http://www.cise.ufl.edu/research/sparse/matrices/.
- University of Florida Sparse Matrix Collection
- Davis, T.¹

7
- 34548292052
- A memory model for scientific algorithms on graphics processors
- N. K. Govindaraju, S. Larsen, J. Gray, and D. Manocha. A memory model for scientific algorithms on graphics processors. International Conference for High Performance Computing, Networking, Storage and Analysys (SC), 2006.
- (2006) International Conference for High Performance Computing, Networking, Storage and Analysys (SC)
- Govindaraju, N.K.¹ Larsen, S.² Gray, J.³ Manocha, D.⁴

8
- 26444437628
- Cetus-an extensible compiler infrastructure for source-to-source transformation
- Sang Ik Lee, Troy Johnson, and Rudolf Eigenmann. Cetus - an extensible compiler infrastructure for source-to-source transformation. International Workshop on Languages and Compilers for Parallel Computing (LCPC), 2003.
- (2003) International Workshop on Languages and Compilers for Parallel Computing (LCPC)
- Lee, S.I.¹ Johnson, T.² Eigenmann, R.³

9
- 0026407190
- A comparative study of automatic vectorizing compilers
- David Levine, David Callahan, and Jack Dongarra. A comparative study of automatic vectorizing compilers. Parallel Computing, 17, 1991.
- (1991) Parallel Computing , vol.17
- Levine, D.¹ Callahan, D.² Eongarra, J.³

10
- 0347133221
- Optimizing OpenMP programs on software distributed shared memory systems
- June
- Seung-Jai Min, Ayon Basumallik, and Rudolf Eigenmann. Optimizing OpenMP programs on software distributed shared memory systems. International Journel of Parallel Programming (IJPP), 31:225-249, June 2003.
- (2003) International Journel of Parallel Programming (IJPP) , vol.31 , pp. 225-249
- Min, S.-J.¹ Basumallik, A.² Eigenmann, R.³

11
- 57349170100
- Optimizing irregular sharedmemory applications for clusters
- Seung-Jai Min and Rudolf Eigenmann. Optimizing irregular sharedmemory applications for clusters. ACM International Conference on Supercomputing (ICS), pages 256-265, 2008.
- (2008) ACM International Conference on Supercomputing (ICS) , pp. 256-265
- Min, S.-J.¹ Eigenmann, R.²

12
- 43849085367
- Supporting OpenMP on Cell
- June
- K. O'Brien, K. O'Brien, Z. Sura, T. Chen, and T. Zhang. Supporting OpenMP on Cell. International Journel of Parallel Programming (IJPP), 36(3):289-311, June 2008.
- (2008) International Journel of Parallel Programming (IJPP) , vol.36 , Issue.3 , pp. 289-311
- O'Brien, K.¹ O'Brien, K.² Sura, Z.³ Chen, T.⁴ Zhang, T.⁵

13
- 67650022643
- online, available
- OpenMP [online]. available: http://openmp.org/wp/.
- OpenMP

14
- 79959466764
- Optimization principles and application performance evaluation of a multithreaded GPU using CUDA
- S. Ryoo, C. I. Rodrigues, S. S. Baghsorkhi, S. S. Stone, D. B. Kirk, andW.W. Hwu. Optimization principles and application performance evaluation of a multithreaded GPU using CUDA. ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), pages 73-82, 2008.
- (2008) ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP) , pp. 73-82
- Ryoo, S.¹ Rodrigues, C.I.² Baghsorkhi, S.S.³ Stone, S.S.⁴ Kirk, D.B.⁵ Hwu, W.W.⁶

15
- 43449094719
- Program optimization space pruning for a multithreaded GPU
- DOI 10.1145/1356058.1356084, Proceedings of the 2008 CGO - Sixth International Symposium on Code Generation and Optimization
- S. Ryoo, C. I. Rodrigues, S. S. Stone, S. S. Baghsorkhi, S. Ueng, J. A. Stratton, and W. W. Hwu. Program optimization space pruning for a multithreaded GPU. International Symposium on Code Generation and Optimization (CGO), 2008. (Pubitemid 351667266)
- (2008) Proceedings of the 2008 CGO - Sixth International Symposium on Code Generation and Optimization , pp. 195-204
- Ryoo, S.¹ Rodrigues, C.I.² Stone, S.S.³ Baghsorkhi, S.S.⁴ Ueng, S.-Z.⁵ Stratton, J.A.⁶ Hwu, W.-M.W.⁷

16
- 58449109179
- MCUDA: An efficient implementation of CUDA kernels for multi-core CPUs
- J. A. Stratton, S. S. Stone, and W. W. Hwu. MCUDA: An efficient implementation of CUDA kernels for multi-core CPUs. International Workshop on Languages and Compilers for Parallel Computing (LCPC), 2008.
- (2008) International Workshop on Languages and Compilers for Parallel Computing (LCPC)
- Stratton, J.A.¹ Stone, S.S.² Hwu, W.W.³

17
- 70450029523
- A framework for efficient and scalable execution of domainspecific templates on GPUs
- May
- Narayanan Sundaram, Anand Raghunathan, and Srimat T. Chakradhar. A framework for efficient and scalable execution of domainspecific templates on GPUs. IEEE International Parallel and Dis- tributed Processing Symposium (IPDPS), May 2009.
- (2009) IEEE International Parallel and Dis- tributed Processing Symposium (IPDPS)
- Sundaram, N.¹ Raghunathan, A.² Chakradhar, S.T.³

18
- 58449127539
- CUDA-lite: Reducing GPU programming complexity
- S. Ueng, M. Lathara, S. S. Baghsorkhi, and W. W. Hwu. CUDA-lite: Reducing GPU programming complexity. International Workshop on Languages and Compilers for Parallel Computing (LCPC), 2008.
- (2008) International Workshop on Languages and Compilers for Parallel Computing (LCPC)
- Ueng, S.¹ Lathara, M.² Baghsorkhi, S.S.³ Hwu, W.W.⁴

19
- 67650078822
- Mapping OpenMP to Cell: An effective compiler framework for heterogeneous multi-core chip
- Haitao Wei and Junqing Yu. Mapping OpenMP to Cell: An effective compiler framework for heterogeneous multi-core chip. International Workshop on OpenMP (IWOMP), 2007.
- (2007) International Workshop on OpenMP (IWOMP)
- Wei, H.¹ Yu, J.²

20
- 32844466554
- An integrated simdization framework using virtual vectors
- ICS05 - Proceedings of the 19th ACM International Conference on Supercomputing
- Peng Wu, Alexandre E. Eichenberger, Amy Wang, and Peng Zhao. An integrated simdization framework using virtual vectors. ACM International Conference on Supercomputing (ICS), pages 169-178, 2005. (Pubitemid 43251321)
- (2005) Proceedings of the International Conference on Supercomputing , pp. 169-178
- Wu, P.¹ Eichenberger, A.E.² Wang, A.³ Zhao, P.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.