-
1
-
-
84976766536
-
Scanning polyhedra with do loops
-
C. Ancourt and F. Irigoin. Scanning polyhedra with do loops. In PPoPP'91, pages 39-50, 1991.
-
(1991)
PPoPP'91
, pp. 39-50
-
-
Ancourt, C.1
Irigoin, F.2
-
2
-
-
79959456077
-
Automatic data movement and computation mapping for multi-level parallel architectures with explicitly managed memories
-
Feb
-
M. Baskaran, U. Bondhugula, S. Krishnamoorthy, J. Ramanujam, A. Rountev, and P. Sadayappan. Automatic data movement and computation mapping for multi-level parallel architectures with explicitly managed memories. In A CM SIGPLAN PPoPP 2008, Feb. 2008.
-
(2008)
A CM SIGPLAN PPoPP 2008
-
-
Baskaran, M.1
Bondhugula, U.2
Krishnamoorthy, S.3
Ramanujam, J.4
Rountev, A.5
Sadayappan, P.6
-
3
-
-
10444289646
-
Code generation in the polyhedral model is easier than you think
-
C. Bastoul. Code generation in the polyhedral model is easier than you think. In PACT'04, pages 7-16, 2004.
-
(2004)
PACT'04
, pp. 7-16
-
-
Bastoul, C.1
-
4
-
-
57349145904
-
Automatic transformations for communication-minimized parallelization and locality optimization in the polyhedral model
-
Apr
-
U. Bondhugula, M. Baskaran, S. Krishnamoorthy, J. Ramanujam, A. Rountev, and P. Sadayappan. Automatic transformations for communication-minimized parallelization and locality optimization in the polyhedral model. In International Conference on Compiler Construction (ETAPS CC), Apr. 2008.
-
(2008)
International Conference on Compiler Construction (ETAPS CC)
-
-
Bondhugula, U.1
Baskaran, M.2
Krishnamoorthy, S.3
Ramanujam, J.4
Rountev, A.5
Sadayappan, P.6
-
6
-
-
10644248153
-
Brook for GPUs: Stream computing on graphics hardware
-
I. Buck, T. Foley, D. Horn, J. Sugerman, K. Fatahalian, M. Houston, and P. Hanrahan. Brook for GPUs: stream computing on graphics hardware. In S1GGRAPH'04, pages 777-786, 2004.
-
(2004)
S1GGRAPH'04
, pp. 777-786
-
-
Buck, I.1
Foley, T.2
Horn, D.3
Sugerman, J.4
Fatahalian, K.5
Houston, M.6
Hanrahan, P.7
-
9
-
-
0026109335
-
Dataflow analysis of array and scalar references
-
P. Feautrier. Dataflow analysis of array and scalar references. IJPP, 20(1):23-53, 1991.
-
(1991)
IJPP
, vol.20
, Issue.1
, pp. 23-53
-
-
Feautrier, P.1
-
10
-
-
0026933251
-
Some efficient solutions to the affine scheduling problem, part I: One-dimensional time
-
P. Feautrier. Some efficient solutions to the affine scheduling problem, part I: one-dimensional time. IJPP, 21(5):313-348, 1992.
-
(1992)
IJPP
, vol.21
, Issue.5
, pp. 313-348
-
-
Feautrier, P.1
-
11
-
-
0001448065
-
Some efficient solutions to the affine scheduling problem, part II: Multidimensional time
-
P. Feautrier. Some efficient solutions to the affine scheduling problem, part II: multidimensional time. IJPP, 21(6):389-420, 1992.
-
(1992)
IJPP
, vol.21
, Issue.6
, pp. 389-420
-
-
Feautrier, P.1
-
12
-
-
34548292052
-
A memory model for scientific algorithms on graphics processors
-
N. K. Govindaraju, S. Larsen, J. Gray, and D. Manocha. A memory model for scientific algorithms on graphics processors. In SC'06, 2006.
-
(2006)
SC'06
-
-
Govindaraju, N.K.1
Larsen, S.2
Gray, J.3
Manocha, D.4
-
13
-
-
57349162527
-
-
General-Purpose Computation Using Graphics Hardware. http://www.gpgpu. org/.
-
General-Purpose Computation Using Graphics Hardware. http://www.gpgpu. org/.
-
-
-
-
14
-
-
57349100116
-
-
Automatic Parallelization of Loop Programs for Distributed Memory Architectures. FMI, University of Passau, Habilitation Thesis
-
M. Griebl. Automatic Parallelization of Loop Programs for Distributed Memory Architectures. FMI, University of Passau, 2004. Habilitation Thesis.
-
(2004)
-
-
Griebl, M.1
-
15
-
-
57349101237
-
Data and computation transformations for Brook streaming applications on multiprocessors
-
S.-W. Liao, Z. Du, G. Wu, and G.-Y. Lueh. Data and computation transformations for Brook streaming applications on multiprocessors. In CGO'06, pages 196-207, 2006.
-
(2006)
CGO'06
, pp. 196-207
-
-
Liao, S.-W.1
Du, Z.2
Wu, G.3
Lueh, G.-Y.4
-
17
-
-
0030645995
-
Maximizing parallelism and minimizing synchronization with affine transforms
-
A. W. Lim and M. S. Lam. Maximizing parallelism and minimizing synchronization with affine transforms. In POPL, pages 201-214, 1997.
-
(1997)
POPL
, pp. 201-214
-
-
Lim, A.W.1
Lam, M.S.2
-
18
-
-
57349189733
-
-
NVIDIA CUDA
-
NVIDIA CUDA. http://developer.nvidia.com/object/cuda.html.
-
-
-
-
19
-
-
57349128633
-
-
NVIDIA GeForce 8800. http://www.nvidia.com/page/geforce-8800.html.
-
, vol.8800
-
-
-
21
-
-
34547683700
-
Iterative optimization in the polyhedral model: Part I, one-dimensional time
-
L.-N. Pouchet, C. Bastoul, A. Cohen, and N. Vasilache. Iterative optimization in the polyhedral model: Part I, one-dimensional time. In CGO'07, pages 144-156, 2007.
-
(2007)
CGO'07
, pp. 144-156
-
-
Pouchet, L.-N.1
Bastoul, C.2
Cohen, A.3
Vasilache, N.4
-
22
-
-
84976676720
-
The Omega test: A fast and practical integer programming algorithm for dependence analysis
-
Aug
-
W. Pugh. The Omega test: a fast and practical integer programming algorithm for dependence analysis. Communications of the ACM, 8:102-114, Aug. 1992.
-
(1992)
Communications of the ACM
, vol.8
, pp. 102-114
-
-
Pugh, W.1
-
23
-
-
0034299275
-
Generation of efficient nested loops from polyhedra
-
F. Quilleré, S. V. Rajopadhye, and D. Wilde. Generation of efficient nested loops from polyhedra. IJPP, 28(5):469-498, 2000.
-
(2000)
IJPP
, vol.28
, Issue.5
, pp. 469-498
-
-
Quilleré, F.1
Rajopadhye, S.V.2
Wilde, D.3
-
24
-
-
79959466764
-
Optimization principles and application performance evaluation of a multithreaded GPU using CUDA
-
Feb
-
S. Ryoo, C. Rodrigues, S. Baghsorkhi, S. Stone, D. Kirk, and W. Hwu. Optimization principles and application performance evaluation of a multithreaded GPU using CUDA. In ACM SIGPLAN PPoPP 2008, Feb. 2008.
-
(2008)
ACM SIGPLAN PPoPP 2008
-
-
Ryoo, S.1
Rodrigues, C.2
Baghsorkhi, S.3
Stone, S.4
Kirk, D.5
Hwu, W.6
-
25
-
-
51449106975
-
Program optimization study on a 128-core GPU
-
October
-
S. Ryoo, C. Rodrigues, S. Stone, S. Baghsorkhi, S. Ueng, and W. Hwu. Program optimization study on a 128-core GPU. In The First Workshop on General Purpose Processing on Graphics Processing Units, October 2007.
-
(2007)
The First Workshop on General Purpose Processing on Graphics Processing Units
-
-
Ryoo, S.1
Rodrigues, C.2
Stone, S.3
Baghsorkhi, S.4
Ueng, S.5
Hwu, W.6
-
26
-
-
43449094719
-
-
S. Ryoo, C. Rodrigues, S. Stone, S. Baghsorkhi, S. Ueng, J. Stratton, and W. Hwu. Program optimization space pruning for a multithreaded GPU. In CGO, 2008.
-
S. Ryoo, C. Rodrigues, S. Stone, S. Baghsorkhi, S. Ueng, J. Stratton, and W. Hwu. Program optimization space pruning for a multithreaded GPU. In CGO, 2008.
-
-
-
-
27
-
-
33947595619
-
Accelerator: Using data parallelism to program GPUs for general-purpose uses
-
D. Tarditi, S. Puri, and J. Oglesby. Accelerator: using data parallelism to program GPUs for general-purpose uses. In ASPLOS-XII, pages 325-335, 2006.
-
(2006)
ASPLOS-XII
, pp. 325-335
-
-
Tarditi, D.1
Puri, S.2
Oglesby, J.3
|