-
2
-
-
57349145904
-
Automatic transformations for communication- minimized parallelization and locality optimization in the polyhedral model
-
April
-
Uday Bondhugula, M. Baskaran, Sriram Krishnamoorthy, J. Ramanujam, A. Rountev, and P. Sadayappan. Automatic transformations for communication- minimized parallelization and locality optimization in the polyhedral model. In International conference on Compiler Construction (ETAPS CC), April 2008.
-
(2008)
International conference on Compiler Construction (ETAPS CC)
-
-
Bondhugula, U.1
Baskaran, M.2
Krishnamoorthy, S.3
Ramanujam, J.4
Rountev, A.5
Sadayappan, P.6
-
4
-
-
0025447908
-
Improving register allocation for subscripted variables
-
New York, NY, USA, ACM Press
-
David Callahan, Steve Carr, and Ken Kennedy. Improving register allocation for subscripted variables. In PLDI '90: Proceedings of the ACM SIGPLAN 1990 conference on Programming language design and implementation, pages 53-65, New York, NY, USA, 1990. ACM Press.
-
(1990)
PLDI '90: Proceedings of the ACM SIGPLAN 1990 conference on Programming language design and implementation
, pp. 53-65
-
-
Callahan, D.1
Carr, S.2
Kennedy, K.3
-
5
-
-
0031380928
-
Unroll-and-jam using uniformly generated sets
-
Washington, DC, USA, IEEE Computer Society
-
Steve Carr and Yiping Guan. Unroll-and-jam using uniformly generated sets. In MICRO 30: Proceedings of the 30th annual ACM/IEEE international symposium on Microarchitecture, pages 349-357, Washington, DC, USA, 1997. IEEE Computer Society.
-
(1997)
MICRO 30: Proceedings of the 30th annual ACM/IEEE international symposium on Microarchitecture
, pp. 349-357
-
-
Carr, S.1
Guan, Y.2
-
6
-
-
0348252150
-
An experimental evaluation of scalar replacement on scientific benchmarks
-
Steve Carr and Philip Sweany. An experimental evaluation of scalar replacement on scientific benchmarks. Software Practice and Experience, 33(15):1419-1445, 2003.
-
(2003)
Software Practice and Experience
, vol.33
, Issue.15
, pp. 1419-1445
-
-
Carr, S.1
Sweany, P.2
-
7
-
-
0002741087
-
Hierarchical tiling: A methodology for high performance
-
Technical Report CS96-508, UCSD, Nov
-
L. Carter, J. Ferrante, F. Hummel, B. Alpern, and K.S. Gatlin. Hierarchical tiling: A methodology for high performance. Technical Report CS96-508, UCSD, Nov. 1996.
-
(1996)
-
-
Carter, L.1
Ferrante, J.2
Hummel, F.3
Alpern, B.4
Gatlin, K.S.5
-
8
-
-
0029235623
-
Hierarchical tiling for improved superscalar performance
-
Washington, DC, USA, IEEE Computer Society
-
Larry Carter, Jeanne Ferrante, and Susan Flynn Hummel. Hierarchical tiling for improved superscalar performance. In IPPS '95: Proceedings of the 9th International Symposium on Parallel Processing, pages 239-245, Washington, DC, USA, 1995. IEEE Computer Society.
-
(1995)
IPPS '95: Proceedings of the 9th International Symposium on Parallel Processing
, pp. 239-245
-
-
Carter, L.1
Ferrante, J.2
Flynn Hummel, S.3
-
9
-
-
32844473507
-
Facilitating the search for compositions of program transformations
-
June
-
Albert Cohen, Sylvain Girbal, David Parello, M. Sigler, Olivier Temam, and Nicolas Vasilache. Facilitating the search for compositions of program transformations. In ACM International conference on Supercomputing, pages 151-160, June 2005.
-
(2005)
ACM International conference on Supercomputing
, pp. 151-160
-
-
Cohen, A.1
Girbal, S.2
David Parello, M.S.3
Temam, O.4
Vasilache, N.5
-
10
-
-
70449702074
-
Parametric multi-level tiling of imperfectly nested loops
-
New York, NY, USA, ACM
-
Albert Hartono, Muthu Manikandan Baskaran, Cédric Bastoul, Albert Cohen, Sriram Krishnamoorthy, Boyana Norris, J. Ramanujam, and P. Sadayappan. Parametric multi-level tiling of imperfectly nested loops. In ICS '09: Proceedings of the 23rd international conference on Supercomputing, pages 147-157, New York, NY, USA, 2009. ACM.
-
(2009)
ICS '09: Proceedings of the 23rd international conference on Supercomputing
, pp. 147-157
-
-
Hartono, A.1
Manikandan Baskaran, M.2
Bastoul, C.3
Cohen, A.4
Krishnamoorthy, S.5
Norris, B.6
Ramanujam, J.7
Sadayappan, P.8
-
13
-
-
0038895757
-
Register tiling in nonrectangular iteration spaces
-
Marta Jiménez, José M. Llabería, and Agustín Fernández. Register tiling in nonrectangular iteration spaces. ACM Trans. Program. Lang. Syst., 24(4):409-453, 2002.
-
(2002)
ACM Trans. Program. Lang. Syst
, vol.24
, Issue.4
, pp. 409-453
-
-
Jiménez, M.1
Llabería, J.M.2
Fernández, A.3
-
14
-
-
0442295621
-
The effect of cache models on iterative compilation for combined tiling and unrolling: Research articles
-
P. M. W. Knijnenburg, T. Kisuki, K. Gallivan, and M. F. P. O'Boyle. The effect of cache models on iterative compilation for combined tiling and unrolling: Research articles. Concurr. Comput. : Pract. Exper., 16(2-3):247-270, 2004.
-
(2004)
Concurr. Comput. : Pract. Exper
, vol.16
, Issue.2-3
, pp. 247-270
-
-
Knijnenburg, P.M.W.1
Kisuki, T.2
Gallivan, K.3
O'Boyle, M.F.P.4
-
15
-
-
0032308685
-
Quantifying the multi-level nature of tiling interactions
-
June
-
N. Mitchell, K. Hogstedt, L. Carter, and J. Ferrante. Quantifying the multi-level nature of tiling interactions. International Journal of Parallel Programming, 26(6):641-670, June 1998.
-
(1998)
International Journal of Parallel Programming
, vol.26
, Issue.6
, pp. 641-670
-
-
Mitchell, N.1
Hogstedt, K.2
Carter, L.3
Ferrante, J.4
-
16
-
-
0034299275
-
Generation of efficient nested loops from polyhedra
-
Fabien Quilleré;, Sanjay Rajopadhye, and Doran Wilde. Generation of efficient nested loops from polyhedra. International Journal Parallel Programming, 28(5):469-498, 2000.
-
(2000)
International Journal Parallel Programming
, vol.28
, Issue.5
, pp. 469-498
-
-
Quilleré, F.1
Rajopadhye, S.2
Wilde, D.3
-
18
-
-
35448985754
-
Parameterized tiled loops for free
-
New York, NY, USA, ACM Press
-
Lakshminarayanan Renganarayanan, DaeGon Kim, Sanjay Rajopadhye, and Michelle Mills Strout. Parameterized tiled loops for free. In PLDI '07: ACM SIGPLAN Conference on Programming Language Design and Implementation, pages 405-414, New York, NY, USA, 2007. ACM Press.
-
(2007)
PLDI '07: ACM SIGPLAN Conference on Programming Language Design and Implementation
, pp. 405-414
-
-
Renganarayanan, L.1
Kim, D.2
Rajopadhye, S.3
Mills Strout, M.4
-
19
-
-
0033705822
-
Optimized unrolling of nested loops
-
New York, NY, USA, ACM Press
-
Vivek Sarkar. Optimized unrolling of nested loops. In ICS '00: Proceedings of the 14th international conference on Supercomputing, pages 153-166, New York, NY, USA, 2000. ACM Press.
-
(2000)
ICS '00: Proceedings of the 14th international conference on Supercomputing
, pp. 153-166
-
-
Sarkar, V.1
-
21
-
-
57349127962
-
-
Program Optimization Techniques in the Polyhedral Model. PhD thesis, Université de Paris-Sud, INRIA Futurs, September
-
Nicolas Vasilache. Scalable Program Optimization Techniques in the Polyhedral Model. PhD thesis, Université de Paris-Sud, INRIA Futurs, September 2007.
-
(2007)
Scalable
-
-
Vasilache, N.1
-
23
-
-
0442303278
-
-
Kluwer Academic Publishers, Norwell, MA, USA
-
Jingling Xue. Loop tiling for parallelism. Kluwer Academic Publishers, Norwell, MA, USA, 2000.
-
(2000)
Loop tiling for parallelism
-
-
Xue, J.1
-
24
-
-
20744459570
-
Is search really necessary to generate high-performance BLAS?
-
K. Yotov, Xiaoming Li, Gang Ren, M. J. S. Garzaran, D. Padua, K. Pingali, and P. Stodghill. Is search really necessary to generate high-performance BLAS? Proceedings of the IEEE, 93:358-386, 2005.
-
(2005)
Proceedings of the IEEE
, vol.93
, pp. 358-386
-
-
Yotov, K.1
Li, X.2
Ren, G.3
Garzaran, M.J.S.4
Padua, D.5
Pingali, K.6
Stodghill, P.7
|