-
1
-
-
0023438847
-
AUTOMATIC TRANSLATION OF FORTRAN PROGRAMS TO VECTOR FORM.
-
DOI 10.1145/29873.29875
-
Randy Allen and Ken Kennedy. Automatic translation of FORTRAN programs to vector form. ACM Transactions on Programming Languages and Systems, 9(4):491-542, October 1987. (Pubitemid 18531687)
-
(1987)
ACM Transactions on Programming Languages and Systems
, vol.9
, Issue.4
, pp. 491-542
-
-
Allen Randy1
Kennedy Ken2
-
2
-
-
57349180412
-
A compiler framework for optimization of affine loop nests for GPGPUs
-
M. M. Baskaran, U. Bondhugula, S. Krishnamoorthy, J. Ramanujam, A. Rountev, and P. Sadayappan. A compiler framework for optimization of affine loop nests for GPGPUs. ACM International Conference on Supercomputing (ICS), 2008.
-
(2008)
ACM International Conference on Supercomputing (ICS)
-
-
Baskaran, M.M.1
Bondhugula, U.2
Krishnamoorthy, S.3
Ramanujam, J.4
Rountev, A.5
Sadayappan, P.6
-
3
-
-
32844474242
-
Towards automatic translation of OpenMP to MPI
-
DOI 10.1145/1088149.1088174, ICS05 - Proceedings of the 19th ACM International Conference on Supercomputing
-
Ayon Basumallik and Rudolf Eigenmann. Towards automatic translation of OpenMP to MPI. ACM International Conference on Supercomputing (ICS), pages 189-198, 2005. (Pubitemid 43251323)
-
(2005)
Proceedings of the International Conference on Supercomputing
, pp. 189-198
-
-
Basumallik, A.1
Eigenmann, R.2
-
4
-
-
84870629709
-
-
online available
-
NVIDIA CUDA [online]. available: http://developer.nvidia.com/object/cuda home.html.
-
NVIDIA CUDA
-
-
-
7
-
-
34548292052
-
A memory model for scientific algorithms on graphics processors
-
N. K. Govindaraju, S. Larsen, J. Gray, and D. Manocha. A memory model for scientific algorithms on graphics processors. International Conference for High Performance Computing, Networking, Storage and Analysys (SC), 2006.
-
(2006)
International Conference for High Performance Computing, Networking, Storage and Analysys (SC)
-
-
Govindaraju, N.K.1
Larsen, S.2
Gray, J.3
Manocha, D.4
-
9
-
-
0026407190
-
A comparative study of automatic vectorizing compilers
-
David Levine, David Callahan, and Jack Dongarra. A comparative study of automatic vectorizing compilers. Parallel Computing, 17, 1991.
-
(1991)
Parallel Computing
, vol.17
-
-
Levine, D.1
Callahan, D.2
Eongarra, J.3
-
12
-
-
43849085367
-
Supporting OpenMP on Cell
-
June
-
K. O'Brien, K. O'Brien, Z. Sura, T. Chen, and T. Zhang. Supporting OpenMP on Cell. International Journel of Parallel Programming (IJPP), 36(3):289-311, June 2008.
-
(2008)
International Journel of Parallel Programming (IJPP)
, vol.36
, Issue.3
, pp. 289-311
-
-
O'Brien, K.1
O'Brien, K.2
Sura, Z.3
Chen, T.4
Zhang, T.5
-
13
-
-
67650022643
-
-
online, available
-
OpenMP [online]. available: http://openmp.org/wp/.
-
OpenMP
-
-
-
14
-
-
79959466764
-
Optimization principles and application performance evaluation of a multithreaded GPU using CUDA
-
S. Ryoo, C. I. Rodrigues, S. S. Baghsorkhi, S. S. Stone, D. B. Kirk, andW.W. Hwu. Optimization principles and application performance evaluation of a multithreaded GPU using CUDA. ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), pages 73-82, 2008.
-
(2008)
ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP)
, pp. 73-82
-
-
Ryoo, S.1
Rodrigues, C.I.2
Baghsorkhi, S.S.3
Stone, S.S.4
Kirk, D.B.5
Hwu, W.W.6
-
15
-
-
43449094719
-
Program optimization space pruning for a multithreaded GPU
-
DOI 10.1145/1356058.1356084, Proceedings of the 2008 CGO - Sixth International Symposium on Code Generation and Optimization
-
S. Ryoo, C. I. Rodrigues, S. S. Stone, S. S. Baghsorkhi, S. Ueng, J. A. Stratton, and W. W. Hwu. Program optimization space pruning for a multithreaded GPU. International Symposium on Code Generation and Optimization (CGO), 2008. (Pubitemid 351667266)
-
(2008)
Proceedings of the 2008 CGO - Sixth International Symposium on Code Generation and Optimization
, pp. 195-204
-
-
Ryoo, S.1
Rodrigues, C.I.2
Stone, S.S.3
Baghsorkhi, S.S.4
Ueng, S.-Z.5
Stratton, J.A.6
Hwu, W.-M.W.7
-
19
-
-
67650078822
-
Mapping OpenMP to Cell: An effective compiler framework for heterogeneous multi-core chip
-
Haitao Wei and Junqing Yu. Mapping OpenMP to Cell: An effective compiler framework for heterogeneous multi-core chip. International Workshop on OpenMP (IWOMP), 2007.
-
(2007)
International Workshop on OpenMP (IWOMP)
-
-
Wei, H.1
Yu, J.2
-
20
-
-
32844466554
-
An integrated simdization framework using virtual vectors
-
ICS05 - Proceedings of the 19th ACM International Conference on Supercomputing
-
Peng Wu, Alexandre E. Eichenberger, Amy Wang, and Peng Zhao. An integrated simdization framework using virtual vectors. ACM International Conference on Supercomputing (ICS), pages 169-178, 2005. (Pubitemid 43251321)
-
(2005)
Proceedings of the International Conference on Supercomputing
, pp. 169-178
-
-
Wu, P.1
Eichenberger, A.E.2
Wang, A.3
Zhao, P.4
|