-
1
-
-
0004072686
-
-
Pearson Education
-
A.V. Aho, Ravi Sethi, and J.D. Ullman. Compilers, Principles, Techniques, & Tools, Pearson Education, 2007.
-
(2007)
Compilers, Principles Techniques & Tools
-
-
Aho, A.V.1
Sethi, R.2
Ullman, J.D.3
-
2
-
-
57349180412
-
A compiler framework for optimization of affine loop nests for GPGPUs
-
M.M. Baskaran, U. Bondhugula, S. Krishnamoorthy, J. Ramanujam, A. Rountev, and P. Sadayappan. A Compiler Framework for Optimization of Affine Loop Nests for GPGPUs. In Proc. International Conference on Supercomputing, 2008.
-
(2008)
Proc. International Conference on Supercomputing
-
-
Baskaran, M.M.1
Bondhugula, U.2
Krishnamoorthy, S.3
Ramanujam, J.4
Rountev, A.5
Sadayappan, P.6
-
3
-
-
79959456077
-
Automatic data movement and computation mapping for multi-level parallel architectures with explicitly managed memories
-
M. Baskaran, U. Bondhugula, S. Krishnamoorthy, J. Ramanujam, A. Rountev, and P. Sadayappan. Automatic Data Movement and Computation Mapping for Multi-level Parallel Architectures with Explicitly Managed Memories. In Proc. ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2008.
-
(2008)
Proc. ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
-
-
Baskaran, M.1
Bondhugula, U.2
Krishnamoorthy, S.3
Ramanujam, J.4
Rountev, A.5
Sadayappan, P.6
-
4
-
-
84968470212
-
An algorithm for the machine calculation of complex Fourier series
-
J. Cooley and J.W. Tukey. An algorithm for the machine calculation of complex Fourier series, In Math. Comput, 1965.
-
(1965)
Math. Comput
-
-
Cooley, J.1
Tukey, J.W.2
-
6
-
-
60849099135
-
High performance discrete Fourier transforms on graphics processors
-
N. Govindaraju, B. Lloyd, Y. Dotsenko, B. Smith, and J. Manferdelli. High performance discrete Fourier transforms on graphics processors. In Proc. Supercomputing, 2008.
-
(2008)
Proc. Supercomputing
-
-
Govindaraju, N.1
Lloyd, B.2
Dotsenko, Y.3
Smith, B.4
Manferdelli, J.5
-
7
-
-
70450231944
-
An analytical model for GPU architecture with memory-level and thread-level parallelism awareness
-
S. Hong and H. Kim. An analytical model for GPU architecture with memory-level and thread-level parallelism awareness. In Proc. International Symposium on Computer Architecture, 2009.
-
(2009)
Proc. International Symposium on Computer Architecture
-
-
Hong, S.1
Kim, H.2
-
11
-
-
34547683700
-
Iterative optimization in the polyhedral mode: Part I, on dimensional time
-
L.-N. Pouchet, C. Bastoul, A. Cohen, and N. Vasilache. Iterative optimization in the polyhedral mode: Part I, on dimensional time. In Proc. International Symposium on Code Generation and Optimization, 2007
-
(2007)
Proc. International Symposium on Code Generation and Optimization
-
-
Pouchet, L.-N.1
Bastoul, C.2
Cohen, A.3
Vasilache, N.4
-
13
-
-
43449094719
-
Optimization space pruning for a multithreaded GPU
-
S. Ryoo, C.I. Rodrigues, S.S. Stone, S.S. Baghsorkhi, S. Ueng, J.A. Stratton, and W.W. Hwu. Optimization space pruning for a multithreaded GPU. In Proc. International Symposium on Code Generation and Optimization, 2008.
-
(2008)
Proc. International Symposium on Code Generation and Optimization
-
-
Ryoo, S.1
Rodrigues, C.I.2
Stone, S.S.3
Baghsorkhi, S.S.4
Ueng, S.5
Stratton, J.A.6
Hwu, W.W.7
-
14
-
-
79959466764
-
Optimization principles and application performance evaluation of a multithreaded GPU using CUDA
-
S. Ryoo, C.I. Rodrigues, S.S. Baghsorkhi, S.S. Stone, D.B. Kirk, and W.W. Hwu. Optimization principles and application performance evaluation of a multithreaded GPU using CUDA. In Proc. ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2008.
-
(2008)
Proc. ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
-
-
Ryoo, S.1
Rodrigues, C.I.2
Baghsorkhi, S.S.3
Stone, S.S.4
Kirk, D.B.5
Hwu, W.W.6
-
15
-
-
77957561221
-
An adaptive performance modling tool for GPU architectures
-
S.S. Baghsorkhi, M. Delahaye, S.J. Patel, W.D. Gropp, and W.W. Hwu. An adaptive performance modling tool for GPU architectures. In Proc. ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2010.
-
(2010)
Proc. ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
-
-
Baghsorkhi, S.S.1
Delahaye, M.2
Patel, S.J.3
Gropp, W.D.4
Hwu, W.W.5
-
19
-
-
77957557473
-
-
NVIDIA CUDA Programming Guide, Version 2.1
-
NVIDIA CUDA Programming Guide, Version 2.1, 2008
-
(2008)
-
-
-
20
-
-
77957551580
-
-
http://code.google.com/p/gpgpucompiler/
-
-
-
|