-
2
-
-
10844259103
-
Synthesizing transformations for locality enhancement of imperfectly-nested loops
-
Oct
-
N. Ahmed, N. Mateev, and K. Pingali. Synthesizing transformations for locality enhancement of imperfectly-nested loops. Intl. J. of Parallel Programming, 29(5), Oct. 2001.
-
(2001)
Intl. J. of Parallel Programming
, vol.29
, Issue.5
-
-
Ahmed, N.1
Mateev, N.2
Pingali, K.3
-
5
-
-
0142134964
-
Optimal semi-oblique tiling
-
R. Andonov, S. Balev, S. Rajopadhye, and N. Yanev. Optimal semi-oblique tiling. IEEE Trans. Par. & Dist. Sys., 14(9):944-960, 2003.
-
(2003)
IEEE Trans. Par. & Dist. Sys
, vol.14
, Issue.9
, pp. 944-960
-
-
Andonov, R.1
Balev, S.2
Rajopadhye, S.3
Yanev, N.4
-
6
-
-
10444289646
-
Code generation in the polyhedral model is easier than you think
-
Sept
-
C. Bastoul. Code generation in the polyhedral model is easier than you think. In IEEE PACT, pages 7-16, Sept. 2004.
-
(2004)
IEEE PACT
, pp. 7-16
-
-
Bastoul, C.1
-
8
-
-
57349110181
-
Affine transformations for communication minimal parallelization and locality optimization of arbitrarily-nested loop sequences
-
Technical Report OSU-CISRC-5/07-TR43, The Ohio State University, May
-
U. Bondhugula, M. Baskaran, S. Krishnamoorthy, J. Ramanujam, A. Rountev, and P. Sadayappan. Affine transformations for communication minimal parallelization and locality optimization of arbitrarily-nested loop sequences. Technical Report OSU-CISRC-5/07-TR43, The Ohio State University, May 2007.
-
(2007)
-
-
Bondhugula, U.1
Baskaran, M.2
Krishnamoorthy, S.3
Ramanujam, J.4
Rountev, A.5
Sadayappan, P.6
-
9
-
-
57349145904
-
Automatic transformations for communication-minimized parallelization and locality optimization in the polyhedral model
-
Apr
-
U. Bondhugula, M. Baskaran, S. Krishnamoorthy, J. Ramanujam, A. Rountev, and P. Sadayappan. Automatic transformations for communication-minimized parallelization and locality optimization in the polyhedral model. In Intl. Conf. on Compiler Construction (ETAPS CC), Apr. 2008.
-
(2008)
Intl. Conf. on Compiler Construction (ETAPS CC)
-
-
Bondhugula, U.1
Baskaran, M.2
Krishnamoorthy, S.3
Ramanujam, J.4
Rountev, A.5
Sadayappan, P.6
-
10
-
-
47249137843
-
Pluto: A practical and fully automatic polyhedral parallelizer and locality optimizer
-
Technical Report OSU-CISRC-10/07-TR70, The Ohio State University, Oct
-
U. Bondhugula, J. Ramanujam, and P. Sadayappan. Pluto: A practical and fully automatic polyhedral parallelizer and locality optimizer. Technical Report OSU-CISRC-10/07-TR70, The Ohio State University, Oct. 2007.
-
(2007)
-
-
Bondhugula, U.1
Ramanujam, J.2
Sadayappan, P.3
-
11
-
-
0028482686
-
(Pen)-ultimate tiling?
-
P. Boulet, A. Darte, T. Risset, and Y. Robert. (Pen)-ultimate tiling? Integration, the VLSIJournal, 17(1):33-51, 1994.
-
(1994)
Integration, the VLSIJournal
, vol.17
, Issue.1
, pp. 33-51
-
-
Boulet, P.1
Darte, A.2
Risset, T.3
Robert, Y.4
-
12
-
-
0032066690
-
Loop parallelization algorithms: From parallelism extraction to code generation
-
P. Boulet, A. Darte, G.-A. Silber, and F. Vivien. Loop parallelization algorithms: From parallelism extraction to code generation. Parallel Computing, 24(3-4):421-444, 1998.
-
(1998)
Parallel Computing
, vol.24
, Issue.3-4
, pp. 421-444
-
-
Boulet, P.1
Darte, A.2
Silber, G.-A.3
Vivien, F.4
-
14
-
-
32844473507
-
Facilitating the search for compositions of program transformations
-
June
-
A. Cohen, S. Girbal, D. Parello, M. Sigler, O. Temam, and N. Vasilache. Facilitating the search for compositions of program transformations. In ACM Intl. Conf. on Supercomputing, pages 151-160, June 2005.
-
(2005)
ACM Intl. Conf. on Supercomputing
, pp. 151-160
-
-
Cohen, A.1
Girbal, S.2
Parello, D.3
Sigler, M.4
Temam, O.5
Vasilache, N.6
-
16
-
-
0342782260
-
Combining retiming and scheduling techniques for loop parallelization and loop tiling
-
A. Darte, G.-A. Silber, and F. Vivien. Combining retiming and scheduling techniques for loop parallelization and loop tiling. Parallel Processing Letters, 7(4):379-392, 1997.
-
(1997)
Parallel Processing Letters
, vol.7
, Issue.4
, pp. 379-392
-
-
Darte, A.1
Silber, G.-A.2
Vivien, F.3
-
17
-
-
0031358458
-
Optimal fine and medium grain parallelism detection in polyhedral reduced dependence graphs
-
Dec
-
A. Darte and F. Vivien. Optimal fine and medium grain parallelism detection in polyhedral reduced dependence graphs. Intl. J. Parallel Programming, 25(6):447-496, Dec. 1997.
-
(1997)
Intl. J. Parallel Programming
, vol.25
, Issue.6
, pp. 447-496
-
-
Darte, A.1
Vivien, F.2
-
18
-
-
0001023389
-
Parametric integer programming
-
P. Feautrier. Parametric integer programming. RAIRO Recherche Operationnell1é, 22(3):243-268, 1988.
-
(1988)
RAIRO Recherche Operationnell1é
, vol.22
, Issue.3
, pp. 243-268
-
-
Feautrier, P.1
-
19
-
-
0026109335
-
Dataflow analysis of scalar and array references
-
Feb
-
P. Feautrier. Dataflow analysis of scalar and array references. Intl. J. of Parallel Programming, 20(1):23-53, Feb. 1991.
-
(1991)
Intl. J. of Parallel Programming
, vol.20
, Issue.1
, pp. 23-53
-
-
Feautrier, P.1
-
20
-
-
0026933251
-
Some efficient solutions to the affine scheduling problem: I. one-dimensional time
-
P. Feautrier. Some efficient solutions to the affine scheduling problem: I. one-dimensional time. Intl. J. of Parallel Programming, 21(5):313-348, 1992.
-
(1992)
Intl. J. of Parallel Programming
, vol.21
, Issue.5
, pp. 313-348
-
-
Feautrier, P.1
-
21
-
-
0001448065
-
Some efficient solutions to the affine scheduling problem, part II. multidimensional time
-
P. Feautrier. Some efficient solutions to the affine scheduling problem, part II. multidimensional time. Intl. J. of Parallel Programming, 21(6):389-420, 1992.
-
(1992)
Intl. J. of Parallel Programming
, vol.21
, Issue.6
, pp. 389-420
-
-
Feautrier, P.1
-
22
-
-
33746593747
-
Semi-automatic composition of loop transformations
-
June
-
S. Girbal, N. Vasilache, C. Bastoul, A. Cohen, D. Parello, M. Sigler, and O. Temam. Semi-automatic composition of loop transformations. Intl. J. of Parallel Programming, 34(3):261-317, June 2006.
-
(2006)
Intl. J. of Parallel Programming
, vol.34
, Issue.3
, pp. 261-317
-
-
Girbal, S.1
Vasilache, N.2
Bastoul, C.3
Cohen, A.4
Parello, D.5
Sigler, M.6
Temam, O.7
-
23
-
-
0036756475
-
Code Generation Methods for Tiling Transformations
-
Sep
-
G. Goumas, M. Athanasaki, and N. Koziris. Code Generation Methods for Tiling Transformations. J. of Information Science and Engineering, 18(5):667-691, Sep. 2002.
-
(2002)
J. of Information Science and Engineering
, vol.18
, Issue.5
, pp. 667-691
-
-
Goumas, G.1
Athanasaki, M.2
Koziris, N.3
-
24
-
-
67650089094
-
-
Automatic Parallelization of Loop Programs for Distributed Memory Architectures. University of Passau, Habilitation thesis
-
M. Griebl. Automatic Parallelization of Loop Programs for Distributed Memory Architectures. University of Passau, 2004. Habilitation thesis.
-
(2004)
-
-
Griebl, M.1
-
26
-
-
0036958653
-
On time optimal supernode shape
-
E. Hodzic and W. Shang. On time optimal supernode shape. IEEE Trans. Par. & Dist. Sys., 13(12):1220-1233, 2002.
-
(2002)
IEEE Trans. Par. & Dist. Sys
, vol.13
, Issue.12
, pp. 1220-1233
-
-
Hodzic, E.1
Shang, W.2
-
27
-
-
0032642196
-
Selecting tile shape for minimal execution time
-
K. Hogstedt, L. Carter, and J. Ferrante. Selecting tile shape for minimal execution time. In SPAA, pages 201-211, 1999.
-
(1999)
SPAA
, pp. 201-211
-
-
Hogstedt, K.1
Carter, L.2
Ferrante, J.3
-
29
-
-
34547500808
-
Implicit and explicit optimization for stencil computations
-
S. Kamil, K. Datta, S. Williams, L. Oliker, J. Shalf, and K. Yellick. Implicit and explicit optimization for stencil computations. In ACM SIGPLAN workshop on Memory Systems Perofmance and Correctness, 2006.
-
(2006)
ACM SIGPLAN workshop on Memory Systems Perofmance and Correctness
-
-
Kamil, S.1
Datta, K.2
Williams, S.3
Oliker, L.4
Shalf, J.5
Yellick, K.6
-
30
-
-
0038220747
-
A unifying framework for iteration reordering transformations
-
Technical Report CS-TR-3430, Dept. of Computer Science, University of Maryland, College Park
-
W. Kelly and W. Pugh. A unifying framework for iteration reordering transformations. Technical Report CS-TR-3430, Dept. of Computer Science, University of Maryland, College Park, 1995.
-
(1995)
-
-
Kelly, W.1
Pugh, W.2
-
34
-
-
0028409782
-
A singular loop transformation framework based on non-singular matrices
-
W. Li and K. Pingali. A singular loop transformation framework based on non-singular matrices. Intl. J. of Parallel Programming, 22(2): 183-205, 1994.
-
(1994)
Intl. J. of Parallel Programming
, vol.22
, Issue.2
, pp. 183-205
-
-
Li, W.1
Pingali, K.2
-
35
-
-
0034823777
-
Blocking and array contraction across arbitrarily nested loops using affine partitioning
-
A. Lim, S. Liao, and M. Lam. Blocking and array contraction across arbitrarily nested loops using affine partitioning. In ACM SIGPLAN PPoPP, pages 103-112, 2001.
-
(2001)
ACM SIGPLAN PPoPP
, pp. 103-112
-
-
Lim, A.1
Liao, S.2
Lam, M.3
-
36
-
-
0032662841
-
An affine partitioning algorithm to maximize parallelism and minimize communication
-
A. W. Lim, G. I. Cheong, and M. S. Lam. An affine partitioning algorithm to maximize parallelism and minimize communication. In ACM Intl. Conf. on Supercomputing, pages 228-237, 1999.
-
(1999)
ACM Intl. Conf. on Supercomputing
, pp. 228-237
-
-
Lim, A.W.1
Cheong, G.I.2
Lam, M.S.3
-
37
-
-
0032067773
-
Maximizing parallelism and minimizing synchronization with affine partitions
-
A. W. Lim and M. S. Lam. Maximizing parallelism and minimizing synchronization with affine partitions. Parallel Computing, 24(3-4):445-475, 1998.
-
(1998)
Parallel Computing
, vol.24
, Issue.3-4
, pp. 445-475
-
-
Lim, A.W.1
Lam, M.S.2
-
38
-
-
84869377033
-
-
The LooPo Project, Loop parallelization in the polytope model
-
The LooPo Project - Loop parallelization in the polytope model. http://www.fmi.uni-passau.de/loopo.
-
-
-
-
43
-
-
34547682861
-
GRAPHITE: Loop optimizations based on the polyhedral model for GCC
-
Ottawa, Canada, June
-
S. Pop, A. Cohen, C. Bastoul, S. Girbal, P. Jouvelot, G.-A. Silber, and N. Vasilache. GRAPHITE: Loop optimizations based on the polyhedral model for GCC. In Proc. of the 4th GCC Developper's summit, Ottawa, Canada, June 2006.
-
(2006)
Proc. of the 4th GCC Developper's summit
-
-
Pop, S.1
Cohen, A.2
Bastoul, C.3
Girbal, S.4
Jouvelot, P.5
Silber, G.-A.6
Vasilache, N.7
-
44
-
-
57349167317
-
Iterative optimization in the polyhedral model: Part II, multidimensional time
-
Tucson, Arizona, June
-
L.-N. Pouchet, C. Bastoul, J. Cavazos, and A. Cohen. Iterative optimization in the polyhedral model: Part II, multidimensional time. In PLDI'08, Tucson, Arizona, June 2008.
-
(2008)
PLDI'08
-
-
Pouchet, L.-N.1
Bastoul, C.2
Cavazos, J.3
Cohen, A.4
-
45
-
-
51049121269
-
Iterative optimization in the polyhedral model: Part I, one-dimensional time
-
Mar
-
L.-N. Pouchet, C. Bastoul, A. Cohen, and N. Vasilache. Iterative optimization in the polyhedral model: Part I, one-dimensional time. In ACM CGO, Mar. 2007.
-
(2007)
ACM CGO
-
-
Pouchet, L.-N.1
Bastoul, C.2
Cohen, A.3
Vasilache, N.4
-
46
-
-
84976676720
-
The omega test: A fast and practical integer programming algorithm for dependence analysis
-
Aug
-
W. Pugh. The omega test: a fast and practical integer programming algorithm for dependence analysis. Communications of the ACM, 8:102-114, Aug. 1992.
-
(1992)
Communications of the ACM
, vol.8
, pp. 102-114
-
-
Pugh, W.1
-
47
-
-
0034299275
-
Generation of efficient nested loops from polyhedra
-
F. Quillere, S. V Rajopadhye, and D. Wilde. Generation of efficient nested loops from polyhedra. Intl. J. of Parallel Programming, 28(5):469-498, 2000.
-
(2000)
Intl. J. of Parallel Programming
, vol.28
, Issue.5
, pp. 469-498
-
-
Quillere, F.1
Rajopadhye, S.V.2
Wilde, D.3
-
48
-
-
38249009019
-
Tiling multidimensional iteration spaces for multicomputers
-
J. Ramanujam and P. Sadayappan. Tiling multidimensional iteration spaces for multicomputers. JPDC, 16(2): 108-230, 1992.
-
(1992)
JPDC
, vol.16
, Issue.2
, pp. 108-230
-
-
Ramanujam, J.1
Sadayappan, P.2
-
49
-
-
35448985754
-
Parameterized tiled loops for free
-
L. Renganarayana, D. Kim, S. Rajopadhye, and M. M. Strout. Parameterized tiled loops for free. In PLDI, pages 405-414, 2007.
-
(2007)
PLDI
, pp. 405-414
-
-
Renganarayana, L.1
Kim, D.2
Rajopadhye, S.3
Strout, M.M.4
-
50
-
-
0003929457
-
Automatic blocking of nested loops
-
Technical report, University of Tennessee, Knoxville, TN, Aug
-
R. Schreiber and J. Dongarra. Automatic blocking of nested loops. Technical report, University of Tennessee, Knoxville, TN, Aug. 1990.
-
(1990)
-
-
Schreiber, R.1
Dongarra, J.2
-
52
-
-
0032635362
-
New tiling techniques to improve cache temporal locality
-
Y. Song and Z. Li. New tiling techniques to improve cache temporal locality. In PLDI, pages 215-228, 1999.
-
(1999)
PLDI
, pp. 215-228
-
-
Song, Y.1
Li, Z.2
-
53
-
-
57349127962
-
-
Program Optimization Techniques in the Polyhedral Model. PhD thesis, Universite de Paris-Sud, INRIA, Futurs, Sept
-
N. Vasilache. Scalable Program Optimization Techniques in the Polyhedral Model. PhD thesis, Universite de Paris-Sud, INRIA, Futurs, Sept. 2007.
-
(2007)
Scalable
-
-
Vasilache, N.1
-
56
-
-
0035064349
-
Automated Empirical Optimizations of Software and the ATLAS Project
-
R. Whaley, A. Petitet, and J. Dongarra. Automated Empirical Optimizations of Software and the ATLAS Project. Parallel Computing, 2000.
-
(2000)
Parallel Computing
-
-
Whaley, R.1
Petitet, A.2
Dongarra, J.3
-
57
-
-
0004005802
-
A library for doing polyhedral operations
-
Technical Report RR-2157, IRIS A, 1993
-
D. K. Wilde. A library for doing polyhedral operations. Technical Report RR-2157, IRIS A, 1993.
-
-
-
Wilde, D.K.1
-
58
-
-
84976827033
-
A data locality optimizing algorithm
-
M. Wolf and M. S. Lam. A data locality optimizing algorithm. In ACM SIGPLAN PLDI '91, pages 30-44, 1991.
-
(1991)
ACM SIGPLAN PLDI '91
, pp. 30-44
-
-
Wolf, M.1
Lam, M.S.2
-
59
-
-
0026232450
-
A loop transformation theory and an algorithm to maximize parallelism
-
M. Wolf and M. S. Lam. A loop transformation theory and an algorithm to maximize parallelism. IEEE Trans. Parallel Distrib. Syst., 2(4):452-471, 1991.
-
(1991)
IEEE Trans. Parallel Distrib. Syst
, vol.2
, Issue.4
, pp. 452-471
-
-
Wolf, M.1
Lam, M.S.2
-
60
-
-
0003125942
-
Communication-minimal tiling of uniform dependence loops
-
J. Xue. Communication-minimal tiling of uniform dependence loops. JPDC, 42(1):42-59, 1997.
-
(1997)
JPDC
, vol.42
, Issue.1
, pp. 42-59
-
-
Xue, J.1
-
61
-
-
0442303278
-
-
Kluwer Academic Publishers, Norwell, MA, USA
-
J. Xue. Loop tiling for parallelism. Kluwer Academic Publishers, Norwell, MA, USA, 2000.
-
(2000)
Loop tiling for parallelism
-
-
Xue, J.1
-
62
-
-
1242285116
-
Transforming complex loop nests for locality
-
Q. Yi, K. Kennedy, and V Adve. Transforming complex loop nests for locality. J. of Supercomputing, 27(3):219-264, 2004.
-
(2004)
J. of Supercomputing
, vol.27
, Issue.3
, pp. 219-264
-
-
Yi, Q.1
Kennedy, K.2
Adve, V.3
-
63
-
-
0038378242
-
A comparison of empirical and model-driven optimization
-
K. Yotov, X. Li, G. Ren, M. Cibulskis, G. DeJong, M. Garzaran, D. A. Padua, K. Pingali, P. Stodghill, and P. Wu. A comparison of empirical and model-driven optimization. In PLDI'03, pages 63-76, 2003.
-
(2003)
PLDI'03
, pp. 63-76
-
-
Yotov, K.1
Li, X.2
Ren, G.3
Cibulskis, M.4
DeJong, G.5
Garzaran, M.6
Padua, D.A.7
Pingali, K.8
Stodghill, P.9
Wu, P.10
|