SCOPUS 정보 검색 플랫폼

ACM SIGPLAN Notices

Volumn 44, Issue 4, 2009, Pages 101-110

Open MP to GPGPU: A compiler framework for automatic translation and optimization

(3) Lee, Seyong a Min, Seung Jai a Eigenmann, Rudolf a

a PURDUE UNIVERSITY (United States)

Author keywords

Automatic translation; Compiler optimization; CUDA; GPU; Open MP

Indexed keywords

AUTOMATIC TRANSLATION; COMPILER OPTIMIZATION; CUDA; GPU; OPEN MP;

COMPUTATIONAL GRAMMARS; COMPUTER GRAPHICS EQUIPMENT; COMPUTER SCIENCE; OPTIMIZATION; PROGRAM COMPILERS;

BENCHMARKING;

EID: 70350583252 PISSN: 15232867 EISSN: None Source Type: Journal
DOI: None Document Type: Article

Times cited : (150)

References (20)

1
- 0023438847
- Automatic translation of FORTRAN programs to vector form
- October
- Randy Allen and Ken Kennedy. Automatic translation of FORTRAN programs to vector form. ACM Transactions on Programming Languages and Systems, 9(4):491-542, October 1987.
- (1987) ACM Transactions on Programming Languages and Systems , vol.9 , Issue.4 , pp. 491-542
- Allen, R.¹ Kennedy, K.²

2
- 57349180412
- A compiler framework for optimization of affine loop nests for GPGPUs
- M. M. Baskaran, U. Bondhugula, S. Krishnamoorthy, J. Ramanujam, A. Rountev, and P. Sadayappan. A compiler framework for optimization of affine loop nests for GPGPUs. ACM International Conference on Supercomputing (ICS), 2008.
- (2008) ACM International Conference on Supercomputing (ICS)
- Baskaran, M.M.¹ Bondhugula, U.² Krishnamoorthy, S.³ Ramanujam, J.⁴ Rountev, A.⁵ Sadayappan, P.⁶

3
- 32844474242
- Towards automatic translation of OpenMP to MPI
- Ayon Basumallik and Rudolf Eigenmann. Towards automatic translation of OpenMP to MPI. ACM International Conference on Supercomputing (ICS), pages 189-198, 2005.
- (2005) ACM International Conference on Supercomputing (ICS) , pp. 189-198
- Basumallik, A.¹ Eigenmann, R.²

4
- 84870629709
- [online]. available
- NVIDIA CUDA [online]. available: http://developer.nvidia.com/object/ cudahome.html.
- NVIDIA CUDA

5
- 67650016770
- [online]. available
- NVIDIA CUDA SDK-Data-Parallel Algorithms: Parallel Reduction [online]. available: http://developer.download.nvidia.com/compute/cuda/11/Website/Data- ParallelAlgorithms.html.
- NVIDIA CUDA SDK-Data-parallel Algorithms: Parallel Reduction

6
- 0012453312
- [online], available
- Tim Davis. University of Florida Sparse Matrix Collection [online]. available: http://www.cise.ufl.edu/research/sparse/matrices/.
- University of Florida Sparse Matrix Collection
- Tim, D.¹

7
- 34548292052
- A memory model for scientific algorithms on graphics processors
- N. K. Govindaraju, S. Larsen, J. Gray, and D. Manocha. A memory model for scientific algorithms on graphics processors. International Conference for High Performance Computing, Networking, Storage and Analysys (SC), 2006.
- (2006) International Conference for High Performance Computing, Networking, Storage and Analysys (SC)
- Govindaraju, N.K.¹ Larsen, S.² Gray, J.³ Manocha, D.⁴

8
- 26444437628
- Cetus-an extensible compiler infrastructure for source-to-source transformation
- Sang Ik Lee, Troy Johnson, and Rudolf Eigenmann. Cetus-an extensible compiler infrastructure for source-to-source transformation. International Workshop on Languages and Compilers for Parallel Computing (LCPC), 2003.
- (2003) International Workshop on Languages and Compilers for Parallel Computing (LCPC)
- Lee, S.I.¹ Johnson, T.² Eigenmann, R.³

9
- 0026407190
- A comparative study of automatic vectorizing compilers
- David Levine, David Callahan, and Jack Dongarra. A comparative study of automatic vectorizing compilers. Parallel Computing, 17, 1991.
- (1991) Parallel Computing , vol.17
- Levine, D.¹ Callahan, D.² Dongarra, J.³

10
- 0347133221
- Optimizing OpenMP programs on software distributed shared memory systems
- June
- Seung-Jai Min, Ayon Basumallik, and Rudolf Eigenmann. Optimizing OpenMP programs on software distributed shared memory systems. International Journel of Parallel Programming (IJPP), 31:225-249, June 2003.
- (2003) International Journel of Parallel Programming (IJPP) , vol.31 , pp. 225-249
- Min, S.-J.¹ Basumallik, A.² Eigenmann, R.³

11
- 57349170100
- Optimizing irregular sharedmemory applications for clusters
- Seung-Jai Min and Rudolf Eigenmann. Optimizing irregular sharedmemory applications for clusters. ACM International Conference on Supercomputing (ICS), pages 256-265, 2008.
- (2008) ACM International Conference on Supercomputing (ICS) , pp. 256-265
- Min, S.-J.¹ Eigenmann, R.²

12
- 43849085367
- Supporting OpenMP on Cell
- June
- K. O'Brien, K. O'Brien, Z. Sura, T. Chen, and T. Zhang. Supporting OpenMP on Cell. International Journel of Parallel Programming (IJPP), 36(3):289-311, June 2008.
- (2008) International Journel of Parallel Programming (IJPP) , vol.36 , Issue.3 , pp. 289-311
- O'Brien, K.¹ O'Brien, K.² Sura, Z.³ Chen, T.⁴ Zhang, T.⁵

13
- 70350615738
- [online], available
- OpenMP [online]. available: http://openmp.org/wp/.
- OpenMP

14
- 79959466764
- Optimization principles and application performance evaluation of a multithreaded GPU using CUDA
- S. Ryoo, C. I. Rodrigues, S. S. Baghsorkhi, S. S. Stone, D. B. Kirk, andW.W. Hwu. Optimization principles and application performance evaluation of a multithreaded GPU using CUDA. ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), pages 73-82, 2008.
- (2008) ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP) , pp. 73-82
- Ryoo, S.¹ Rodrigues, C.I.² Baghsorkhi, S.S.³ Stone, S.S.⁴ Kirk, D.B.⁵ Hwu, W.W.⁶

15
- 43449094719
- Program optimization space pruning for a multithreaded GPU
- S. Ryoo, C. I. Rodrigues, S. S. Stone, S. S. Baghsorkhi, S. Ueng, J. A. Stratton, and W. W. Hwu. Program optimization space pruning for a multithreaded GPU. International Symposium on Code Generation and Optimization (CGO), 2008.
- (2008) International Symposium on Code Generation and Optimization (CGO)
- Ryoo, S.¹ Rodrigues, C.I.² Stone, S.S.³ Baghsorkhi, S.S.⁴ Ueng, S.⁵ Stratton, J.A.⁶ Hwu, W.W.⁷

16
- 58449109179
- MCUDA: An efficient implementation of CUDA kernels for multi-core CPUs
- J. A. Stratton, S. S. Stone, and W. W. Hwu. MCUDA: An efficient implementation of CUDA kernels for multi-core CPUs. International Workshop on Languages and Compilers for Parallel Computing (LCPC), 2008.
- (2008) International Workshop on Languages and Compilers for Parallel Computing (LCPC)
- Stratton, J.A.¹ Stone, S.S.² Hwu, W.W.³

17
- 70450029523
- A framework for efficient and scalable execution of domainspecific templates on GPUs
- May
- Narayanan Sundaram, Anand Raghunathan, and Srimat T. Chakradhar. A framework for efficient and scalable execution of domainspecific templates on GPUs. IEEE International Parallel and Dis-tributed Processing Symposium (IPDPS), May 2009.
- (2009) IEEE International Parallel and Dis-tributed Processing Symposium (IPDPS)
- Narayanan, S.¹ Raghunathan, A.² Chakradhar, S.T.³

18
- 58449127539
- CUDA-lite: Reducing GPU programming complexity
- S. Ueng, M. Lathara, S. S. Baghsorkhi, and W. W. Hwu. CUDA-lite: Reducing GPU programming complexity. International Workshop on Languages and Compilers for Parallel Computing (LCPC), 2008.
- (2008) International Workshop on Languages and Compilers for Parallel Computing (LCPC)
- Ueng, S.¹ Lathara, M.² Baghsorkhi, S.S.³ Hwu, W.W.⁴

19
- 67650078822
- Mapping OpenMP to Cell: An effective compiler framework for heterogeneous multi-core chip
- Haitao Wei and Junqing Yu. Mapping OpenMP to Cell: An effective compiler framework for heterogeneous multi-core chip. International Workshop on OpenMP (IWOMP), 2007.
- (2007) International Workshop on OpenMP (IWOMP)
- Wei, H.¹ Junqing, Yu.²

20
- 32844466554
- An integrated simdization framework using virtual vectors
- Peng Wu, Alexandre E. Eichenberger, Amy Wang, and Peng Zhao. An integrated simdization framework using virtual vectors. ACM International Conference on Supercomputing (ICS), pages 169-178, 2005.
- (2005) ACM International Conference on Supercomputing (ICS) , pp. 169-178
- Peng, Wu.¹ Alexandre, E.² Eichenberger, A.W.³ Peng, Z.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.