[2] M. Arenaz, J. Tourino, and R. Doallo, "XARK: An Extensible Framework for Automatic Recognition of Computational Kernels," ACM Transactions on Programming Languages and Systems (TOPLAS), vol. 30, no. 6, 2008.
[3] M. Harris, S. Sengupta, and J. D. Owens, "Parallel Prefix Sum (Scan) with CUDA," in GPU Gems 3, chapter 39, pp. 851-876, Addison Wesley, August 2007.
[5] M. Gupta, S. Midkiff, E. Schonberg, P. Sweeney, K. Y. Wang, and M. Burke, "PTRAN II - A Compiler for High Performance Fortran," 4th International Workshop on Compilers for Parallel Computers, 1993.
[8] S. Hiranandani, K. Kennedy, and C.-W. Tseng, "Compiling Fortran D for MIMD Distributed-Memory Machines," Communications of the ACM, vol. 35, no. 8, pp. 66-80, August 1992.
[9] S.-W. Liao, "Parallelizing User-Defined and Implicit Reductions Globally on Multiprocessors," Lecture Notes in Computer Science, Springer-Verlag. Also in Proceedings of Annual Asia-Pacific Computer Architecture Conference (ACSAC06), Shanghai, PRC, September 2006.
[12] O. Plata, R. Asenjo, E. Gutierrez, F. Corbera, A. Navarro, and E. L. Zapata, "On the Parallelization of Irregular and Dynamic Programs," Parallel Computing, vol. 31, no. 6, pp. 544-562, June 2005.
[13] B. Pottenger and R. Eigenmann, "Parallelization in the Presence of Generalized Induction and Reduction Variables," Technical Report 1396, Univ. of Illinois at Urbana-Champaign, Cntr. for Supercomputing Res. & Dev., January 1995.
[15] T. Suganuma, H. Komatsu, and T. Nakatani, "Detection and Global Optimization of Reduction Operations for Distributed Parallel Machines," in Proceedings of the 1996 ACM International Conference on Supercomputing, Philadelphia, PA, May 1996.