-
2
-
-
57349180412
-
A compiler framework for optimization of affine loop nests for GPGPUs
-
M. M. Baskaran, U. Bondhugula, S. Krishnamoorthy, J. Ramanujam, A. Rountev, and P. Sadayappan. A compiler framework for optimization of affine loop nests for GPGPUs. ACM International Conference on Supercomputing (ICS), 2008.
-
(2008)
ACM International Conference on Supercomputing (ICS)
-
-
Baskaran, M.M.1
Bondhugula, U.2
Krishnamoorthy, S.3
Ramanujam, J.4
Rountev, A.5
Sadayappan, P.6
-
4
-
-
84870629709
-
-
[online]. available
-
NVIDIA CUDA [online]. available: http://developer.nvidia.com/object/ cudahome.html.
-
NVIDIA CUDA
-
-
-
7
-
-
34548292052
-
A memory model for scientific algorithms on graphics processors
-
N. K. Govindaraju, S. Larsen, J. Gray, and D. Manocha. A memory model for scientific algorithms on graphics processors. International Conference for High Performance Computing, Networking, Storage and Analysys (SC), 2006.
-
(2006)
International Conference for High Performance Computing, Networking, Storage and Analysys (SC)
-
-
Govindaraju, N.K.1
Larsen, S.2
Gray, J.3
Manocha, D.4
-
9
-
-
0026407190
-
A comparative study of automatic vectorizing compilers
-
David Levine, David Callahan, and Jack Dongarra. A comparative study of automatic vectorizing compilers. Parallel Computing, 17, 1991.
-
(1991)
Parallel Computing
, vol.17
-
-
Levine, D.1
Callahan, D.2
Dongarra, J.3
-
12
-
-
43849085367
-
Supporting OpenMP on Cell
-
June
-
K. O'Brien, K. O'Brien, Z. Sura, T. Chen, and T. Zhang. Supporting OpenMP on Cell. International Journel of Parallel Programming (IJPP), 36(3):289-311, June 2008.
-
(2008)
International Journel of Parallel Programming (IJPP)
, vol.36
, Issue.3
, pp. 289-311
-
-
O'Brien, K.1
O'Brien, K.2
Sura, Z.3
Chen, T.4
Zhang, T.5
-
13
-
-
70350615738
-
-
[online], available
-
OpenMP [online]. available: http://openmp.org/wp/.
-
OpenMP
-
-
-
14
-
-
79959466764
-
Optimization principles and application performance evaluation of a multithreaded GPU using CUDA
-
S. Ryoo, C. I. Rodrigues, S. S. Baghsorkhi, S. S. Stone, D. B. Kirk, andW.W. Hwu. Optimization principles and application performance evaluation of a multithreaded GPU using CUDA. ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), pages 73-82, 2008.
-
(2008)
ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP)
, pp. 73-82
-
-
Ryoo, S.1
Rodrigues, C.I.2
Baghsorkhi, S.S.3
Stone, S.S.4
Kirk, D.B.5
Hwu, W.W.6
-
15
-
-
43449094719
-
Program optimization space pruning for a multithreaded GPU
-
S. Ryoo, C. I. Rodrigues, S. S. Stone, S. S. Baghsorkhi, S. Ueng, J. A. Stratton, and W. W. Hwu. Program optimization space pruning for a multithreaded GPU. International Symposium on Code Generation and Optimization (CGO), 2008.
-
(2008)
International Symposium on Code Generation and Optimization (CGO)
-
-
Ryoo, S.1
Rodrigues, C.I.2
Stone, S.S.3
Baghsorkhi, S.S.4
Ueng, S.5
Stratton, J.A.6
Hwu, W.W.7
-
19
-
-
67650078822
-
Mapping OpenMP to Cell: An effective compiler framework for heterogeneous multi-core chip
-
Haitao Wei and Junqing Yu. Mapping OpenMP to Cell: An effective compiler framework for heterogeneous multi-core chip. International Workshop on OpenMP (IWOMP), 2007.
-
(2007)
International Workshop on OpenMP (IWOMP)
-
-
Wei, H.1
Junqing, Yu.2
-
20
-
-
32844466554
-
An integrated simdization framework using virtual vectors
-
Peng Wu, Alexandre E. Eichenberger, Amy Wang, and Peng Zhao. An integrated simdization framework using virtual vectors. ACM International Conference on Supercomputing (ICS), pages 169-178, 2005.
-
(2005)
ACM International Conference on Supercomputing (ICS)
, pp. 169-178
-
-
Peng, Wu.1
Alexandre, E.2
Eichenberger, A.W.3
Peng, Z.4
|