-
1
-
-
84976766536
-
Scanning polyhedra with do loops
-
Ancourt, C., Irigoin, F.: Scanning polyhedra with do loops. In: PPoPP 1991, pp. 39-50 (1991)
-
(1991)
PPoPP 1991
, pp. 39-50
-
-
Ancourt, C.1
Irigoin, F.2
-
2
-
-
77951547168
-
A Compiler Framework for Optimization of Affine Loop Nests for GPGPUs
-
June
-
Baskaran, M., Bondhugula, U., Krishnamoorthy, S., Ramanujam, J., Rountev, A., Sadayappan, P.: A Compiler Framework for Optimization of Affine Loop Nests for GPGPUs. In: ACM ICS (June 2008)
-
(2008)
ACM ICS
-
-
Baskaran, M.1
Bondhugula, U.2
Krishnamoorthy, S.3
Ramanujam, J.4
Rountev, A.5
Sadayappan, P.6
-
3
-
-
79959456077
-
Automatic Data Movement and Computation Mapping for Multi-level Parallel Architectures with Explicitly Managed Memories
-
February
-
Baskaran, M., Bondhugula, U., Krishnamoorthy, S., Ramanujam, J., Rountev, A., Sadayappan, P.: Automatic Data Movement and Computation Mapping for Multi-level Parallel Architectures with Explicitly Managed Memories. In: ACM SIGPLAN PPoPP (February 2008)
-
(2008)
ACM SIGPLAN PPoPP
-
-
Baskaran, M.1
Bondhugula, U.2
Krishnamoorthy, S.3
Ramanujam, J.4
Rountev, A.5
Sadayappan, P.6
-
4
-
-
10444289646
-
Code generation in the polyhedral model is easier than you think
-
Bastoul, C.: Code generation in the polyhedral model is easier than you think. In: PACT 2004, pp. 7-16 (2004)
-
PACT 2004
, vol.2004
, pp. 7-16
-
-
Bastoul, C.1
-
5
-
-
47249156196
-
Automatic transformations for communication-minimized parallelization and locality optimization in the polyhedral model
-
Hendren, L. (ed.) CC 2008. Springer, Heidelberg
-
Bondhugula, U., Baskaran, M., Krishnamoorthy, S., Ramanujam, J., Rountev, A., Sadayappan, P.: Automatic transformations for communication-minimized parallelization and locality optimization in the polyhedral model. In: Hendren, L. (ed.) CC 2008. LNCS, vol. 4959, pp. 132-146. Springer, Heidelberg (2008)
-
(2008)
LNCS
, vol.4959
, pp. 132-146
-
-
Bondhugula, U.1
Baskaran, M.2
Krishnamoorthy, S.3
Ramanujam, J.4
Rountev, A.5
Sadayappan, P.6
-
6
-
-
57349139452
-
A practical automatic polyhedral parallelizer and locality optimizer
-
Bondhugula, U., Hartono, A., Ramanujan, J., Sadayappan, P.: A practical automatic polyhedral parallelizer and locality optimizer. In: ACMSIGPLAN Programming Languages Design and Implementation, PLDI 2008 (2008)
-
(2008)
ACMSIGPLAN Programming Languages Design and Implementation, PLDI 2008
-
-
Bondhugula, U.1
Hartono, A.2
Ramanujan, J.3
Sadayappan, P.4
-
7
-
-
77951602079
-
-
CLooG: The Chunky Loop Generator, http://www.cloog.org
-
-
-
-
8
-
-
78651269052
-
Understanding the efficiency of GPU algorithms for matrix-matrix multiplication
-
Fatahalian, K., Sugerman, J., Hanrahan, P.: Understanding the efficiency of GPU algorithms for matrix-matrix multiplication. In: ACM SIGGRAPH/ EUROGRAPHICS Conference on Graphics Hardware, pp. 133-137 (2004)
-
(2004)
ACM SIGGRAPH/EUROGRAPHICS Conference on Graphics Hardware
, pp. 133-137
-
-
Fatahalian, K.1
Sugerman, J.2
Hanrahan, P.3
-
9
-
-
0026109335
-
Dataflow analysis of array and scalar references
-
Feautrier, P.: Dataflow analysis of array and scalar references. IJPP 20(1), 23-53 (1991)
-
(1991)
IJPP
, vol.20
, Issue.1
, pp. 23-53
-
-
Feautrier, P.1
-
10
-
-
0026933251
-
Some efficient solutions to the affine scheduling problem, part I: One-dimensional time
-
Feautrier, P.: Some efficient solutions to the affine scheduling problem, part I: one-dimensional time. IJPP 21(5), 313-348 (1992)
-
(1992)
IJPP
, vol.21
, Issue.5
, pp. 313-348
-
-
Feautrier, P.1
-
11
-
-
84957027384
-
Automatic parallelization in the polytope model
-
Perrin, G.-R., Darte, A. (eds.) The Data Parallel Programming Model. Springer, Heidelberg
-
Feautrier, P.: Automatic parallelization in the polytope model. In: Perrin, G.-R., Darte, A. (eds.) The Data Parallel Programming Model. LNCS, vol. 1132, pp. 79-103. Springer, Heidelberg (1996)
-
(1996)
LNCS
, vol.1132
, pp. 79-103
-
-
Feautrier, P.1
-
12
-
-
77951575756
-
A memory model for scientific algorithms on graphics processors
-
Löwe, W., Südholt, M. (eds.) SC 2006. Springer, Heidelberg
-
Govindaraju, N.K., Larsen, S., Gray, J., Manocha, D.: A memory model for scientific algorithms on graphics processors. In: Löwe, W., Südholt, M. (eds.) SC 2006. LNCS, vol. 4089. Springer, Heidelberg (2006)
-
(2006)
LNCS
, vol.4089
-
-
Govindaraju, N.K.1
Larsen, S.2
Gray, J.3
Manocha, D.4
-
13
-
-
77951581325
-
-
General-Purpose Computation Using Graphics Hardware, http://www.gpgpu. org/
-
-
-
-
17
-
-
67650081010
-
Openmp to gpgpu: A compiler framework for automatic translation and optimization
-
Lee, S., Min, S.-J., Eigenmann, R.: Openmp to gpgpu: A compiler framework for automatic translation and optimization. In: PPoPP 2009, pp. 101-110 (2009)
-
(2009)
PPoPP 2009
, pp. 101-110
-
-
Lee, S.1
Min, S.-J.2
Eigenmann, R.3
-
19
-
-
70450103746
-
A cross-input adaptive framework for gpu programs optimizations
-
May
-
Liu, Y., Zhang, E.Z., Shen, X.: A cross-input adaptive framework for gpu programs optimizations. In: IPDPS (May 2009)
-
(2009)
IPDPS
-
-
Liu, Y.1
Zhang, E.Z.2
Shen, X.3
-
20
-
-
77951584344
-
-
NVIDIA CUDA, http://developer.nvidia.com/object/cuda.html
-
-
-
-
21
-
-
77951530755
-
-
Parboil Benchmark Suite, http://impact.crhc.illinois.edu/parboil.php
-
-
-
-
23
-
-
34547683700
-
Iterative optimization in the polyhedral model: Part I, one-dimensional time
-
DOI 10.1109/CGO.2007.21, 4145111, International Symposium on Code Generation and Optimization, CGO 2007
-
Pouchet, L.-N., Bastoul, C., Cohen, A., Vasilache, N.: Iterative optimization in the polyhedral model: Part I, one-dimensional time. In: CGO 2007, pp. 144-156 (2007) (Pubitemid 47214305)
-
(2007)
International Symposium on Code Generation and Optimization, CGO 2007
, pp. 144-156
-
-
Pouchet, L.-N.1
Bastoul, C.2
Cohen, A.3
Vasilache, N.4
-
24
-
-
84976676720
-
The Omega test: A fast and practical integer programming algorithm for dependence analysis
-
Pugh, W.: The Omega test: a fast and practical integer programming algorithm for dependence analysis. Communications of the ACM 8, 102-114 (1992)
-
(1992)
Communications of the ACM
, vol.8
, pp. 102-114
-
-
Pugh, W.1
-
25
-
-
0034299275
-
Generation of efficient nested loops from polyhedra
-
Quilleré, F., Rajopadhye, S.V., Wilde, D.: Generation of efficient nested loops from polyhedra. IJPP 28(5), 469-498 (2000)
-
(2000)
IJPP
, vol.28
, Issue.5
, pp. 469-498
-
-
Quilleré, F.1
Rajopadhye, S.V.2
Wilde, D.3
-
26
-
-
79959466764
-
Optimization principles and application performance evaluation of a multithreaded GPU using CUDA
-
February
-
Ryoo, S., Rodrigues, C., Baghsorkhi, S., Stone, S., Kirk, D., Hwu, W.: Optimization principles and application performance evaluation of a multithreaded GPU using CUDA. In: ACM SIGPLAN PPoPP 2008 (February 2008)
-
(2008)
ACM SIGPLAN PPoPP 2008
-
-
Ryoo, S.1
Rodrigues, C.2
Baghsorkhi, S.3
Stone, S.4
Kirk, D.5
Hwu, W.6
-
27
-
-
51449106975
-
Program optimization study on a 128-core GPU
-
October
-
Ryoo, S., Rodrigues, C., Stone, S., Baghsorkhi, S., Ueng, S., Hwu, W.: Program optimization study on a 128-core GPU. In: The First Workshop on General Purpose Processing on Graphics Processing Units (October 2007)
-
(2007)
The First Workshop on General Purpose Processing on Graphics Processing Units
-
-
Ryoo, S.1
Rodrigues, C.2
Stone, S.3
Baghsorkhi, S.4
Ueng, S.5
Hwu, W.6
-
28
-
-
43449094719
-
Program optimization space pruning for a multithreaded GPU
-
DOI 10.1145/1356058.1356084, Proceedings of the 2008 CGO - Sixth International Symposium on Code Generation and Optimization
-
Ryoo, S., Rodrigues, C., Stone, S., Baghsorkhi, S., Ueng, S., Stratton, J., Hwu, W.: Program optimization space pruning for a multithreaded GPU. In: CGO (2008) (Pubitemid 351667266)
-
(2008)
Proceedings of the 2008 CGO - Sixth International Symposium on Code Generation and Optimization
, pp. 195-204
-
-
Ryoo, S.1
Rodrigues, C.I.2
Stone, S.S.3
Baghsorkhi, S.S.4
Ueng, S.-Z.5
Stratton, J.A.6
Hwu, W.-M.W.7
-
29
-
-
67650016545
-
Violated dependence analysis
-
June
-
Vasilache, N., Bastoul, C., Girbal, S., Cohen, A.: Violated dependence analysis. In: ACM ICS (June 2006)
-
(2006)
ACM ICS
-
-
Vasilache, N.1
Bastoul, C.2
Girbal, S.3
Cohen, A.4
|