-
2
-
-
0027802136
-
Communication optimization and code generation for distributed memory machines
-
New York, NY, USA, ACM Press
-
S. P. Amarasinghe and M. S. Lam. Communication optimization and code generation for distributed memory machines. In PLDI '93: Proceedings of the ACM SIGPLAN 1993 conference on Programming language design and implementation, pages 126-138, New York, NY, USA, 1993. ACM Press.
-
(1993)
PLDI '93: Proceedings of the ACM SIGPLAN 1993 conference on Programming language design and implementation
, pp. 126-138
-
-
Amarasinghe, S.P.1
Lam, M.S.2
-
5
-
-
33751022080
-
-
Ganesh Bikshandi, Jia Guo, Daniel Hoeflinger, Gheorghe Almasi, Basilio B. Fraguela, Maria J. Garzaran, David Padua, and Christoph von Praun. Programming for parallelism and locality with hierarchically tiled arrays. In PPoPP '06: Proceedings of the eleventh ACM SIGPLAN symposium on Principles and practice of parallel programming, pages 48-57, 2006.
-
-
-
-
6
-
-
0030661485
-
Optimizing matrix multiply using PHiPAC: A portable, high-performance, ANSI C coding methodology
-
ACM Press
-
Jeff Bilmes, Krste Asanovic, Chee-Whye Chin, and Jim Demmel. Optimizing matrix multiply using PHiPAC: a portable, high-performance, ANSI C coding methodology. In Proceedings of the 11th international conference on Supercomputing, pages 340-347. ACM Press, 1997.
-
(1997)
Proceedings of the 11th international conference on Supercomputing
, pp. 340-347
-
-
Bilmes, J.1
Asanovic, K.2
Chin, C.-W.3
Demmel, J.4
-
7
-
-
0029235623
-
Hierarchical tiling for improved superscalar performance
-
Washington, DC, USA, IEEE Computer Society
-
Larry Carter, Jeanne Ferrante, and Susan Flynn Hummel. Hierarchical tiling for improved superscalar performance. In IPPS '95: Proceedings of the 9th International Symposium on Parallel Processing, pages 239-245, Washington, DC, USA, 1995. IEEE Computer Society.
-
(1995)
IPPS '95: Proceedings of the 9th International Symposium on Parallel Processing
, pp. 239-245
-
-
Carter, L.1
Ferrante, J.2
Flynn Hummel, S.3
-
8
-
-
34548207355
-
-
Kayvon Fatahalian, Daniel Reiter Horn, Timothy J. Knight, Larkhoon Leem, Mike Houston, Ji Young Park, Mattan Erez, Manman Ren, Alex Aiken, William J. Dally, and Pat Hanrahan. Sequoia: programming the memory hierarchy. In Proceedings of international conference on Supercomputing SC, page 83, 2006.
-
-
-
-
11
-
-
56749128391
-
-
A. Größlinger, M. Griebl, and C. Lengauer. Introducing non-linear parameters to the polyhedron model. In Michael Gerndt and Edmond Kereku, editors, Proc. 11th Workshop on Compilers for Parallel Computers (CPC 2004), Research Report Series, pages 1-12. LRR-TUM, Technische Universität München, July 2004.
-
-
-
-
15
-
-
0242578180
-
A cost-effective implementation of multilevel tiling
-
Marta Jiménez, José M. Llabería, and Agustin Fernández. A cost-effective implementation of multilevel tiling. IEEE Trans. Parallel Distrib. Syst., 14(10):1006-1020, 2003.
-
(2003)
IEEE Trans. Parallel Distrib. Syst
, vol.14
, Issue.10
, pp. 1006-1020
-
-
Jiménez, M.1
Llabería, J.M.2
Fernández, A.3
-
17
-
-
0034512401
-
-
T. Kisuki, P. M. W. Knijnenburg, and M. F. P. O'Boyle. Combined selection of tile sizes and unroll factors using iterative compilation. In PACT '00: Proceedings of the 2000 International Conference on Parallel Architectures and Compilation Techniques, page 237, Washington, DC, USA, 2000. IEEE Computer Society.
-
-
-
-
18
-
-
84949235179
-
Iterative compilation
-
Springer-Verlag New York, Inc, New York, NY, USA
-
P. M. W. Knijnenburg, T. Kisuki, and M. F. P. O'Boyle. Iterative compilation. In Embedded processor-design challenges: systems, architectures, modeling, and simulation-SAMOS, pages 171-187. Springer-Verlag New York, Inc., New York, NY, USA, 2002.
-
(2002)
Embedded processor-design challenges: Systems, architectures, modeling, and simulation-SAMOS
, pp. 171-187
-
-
Knijnenburg, P.M.W.1
Kisuki, T.2
O'Boyle, M.F.P.3
-
19
-
-
35449000510
-
A data locality optimizing algorithm (with retrospective)
-
M. S. Lam and M. E. Wolf. A data locality optimizing algorithm (with retrospective). In Best of PLDI, pages 442-459, 1991.
-
(1991)
Best of PLDI
, pp. 442-459
-
-
Lam, M.S.1
Wolf, M.E.2
-
20
-
-
56749088230
-
-
H. Le Verge, V. Van Dongen, and D. Wilde. La synthèse de nids de boucles avec la bibliothèque polyédrique. In RenPar'6, Lyon, France, June 1994. English version "Loop Nest Synthesis Using the Polyhedral Library" in IRISA TR 830, May 1994.
-
-
-
-
21
-
-
0342782302
-
Loop nest synthesis using the polyhedral library
-
IRISA, Rennes, France, May, Also published as INRIA Research Report 2288
-
H. Le Verge, V. Van Dongen, and D. Wilde. Loop nest synthesis using the polyhedral library. Technical Report PI 830, IRISA, Rennes, France, May 1994. Also published as INRIA Research Report 2288.
-
(1994)
Technical Report PI
, vol.830
-
-
Le Verge, H.1
Van Dongen, V.2
Wilde, D.3
-
22
-
-
0034207513
-
Accurately selecting block size at runtime in pipelined parallel programs
-
D. K. Lowenthal. Accurately selecting block size at runtime in pipelined parallel programs. Int. J. Parallel Program., 28(3):245-274, 2000.
-
(2000)
Int. J. Parallel Program
, vol.28
, Issue.3
, pp. 245-274
-
-
Lowenthal, D.K.1
-
24
-
-
84976676720
-
Omega test: A practical algorithm for exact array dependency analysis
-
W. Pugh. Omega test: A practical algorithm for exact array dependency analysis. Comm. of the ACM, 35(8):102, 1992.
-
(1992)
Comm. of the ACM
, vol.35
, Issue.8
, pp. 102
-
-
Pugh, W.1
-
25
-
-
19344368072
-
Spiral: Code generation for DSP transforms
-
February
-
M. Püschel, J. M. F. Moura, J. Johnson, D. Padua, M. Veloso, B. Singer, J. Xiong, F. Franchetti, A. Gacic, Y. Voronenko, K. Chen, R. W. Johnson, and N. Rizzolo. Spiral: Code generation for DSP transforms. Proceedings of the IEEE, 93(2):232-275, February 2005.
-
(2005)
Proceedings of the IEEE
, vol.93
, Issue.2
, pp. 232-275
-
-
Püschel, M.1
Moura, J.M.F.2
Johnson, J.3
Padua, D.4
Veloso, M.5
Singer, B.6
Xiong, J.7
Franchetti, F.8
Gacic, A.9
Voronenko, Y.10
Chen, K.11
Johnson, R.W.12
Rizzolo, N.13
-
26
-
-
0034299275
-
Generation of efficient nested loops from polyhedra
-
F. Quilleré, S. Rajopadhye, and D. Wilde. Generation of efficient nested loops from polyhedra. International Journal of Parallel Programming, 28(5):469-498, 2000.
-
(2000)
International Journal of Parallel Programming
, vol.28
, Issue.5
, pp. 469-498
-
-
Quilleré, F.1
Rajopadhye, S.2
Wilde, D.3
-
28
-
-
35448985754
-
Parameterized tiled loops for free
-
New York, NY, USA, ACM Press
-
Lakshminarayanan Renganarayanan, DaeGon Kim, Sanjay Rajopadhye, and Michelle Mills Strout. Parameterized tiled loops for free. In PLDI '07: ACM SIGPLAN Conference on Programming Language Design and Implementation, pages 405-414, New York, NY, USA, 2007. ACM Press.
-
(2007)
PLDI '07: ACM SIGPLAN Conference on Programming Language Design and Implementation
, pp. 405-414
-
-
Renganarayanan, L.1
Kim, D.2
Rajopadhye, S.3
Mills Strout, M.4
-
29
-
-
84950305207
-
Locality optimizations for multi-level caches
-
New York, NY, USA, ACM Press
-
Gabriel Rivera and Chau-Wen Tseng. Locality optimizations for multi-level caches. In Supercomputing '99: Proceedings of the 1999 ACM/IEEE conference on Supercomputing (CDROM), page 2, New York, NY, USA, 1999. ACM Press.
-
(1999)
Supercomputing '99: Proceedings of the 1999 ACM/IEEE conference on Supercomputing (CDROM)
, pp. 2
-
-
Rivera, G.1
Tseng, C.-W.2
-
30
-
-
56749144380
-
-
R. Schreiber and J. Dongarra. Automatic blocking of nested loops. Technical Report 90.38, RIACS, NASA Ames Research Center, August 1990.
-
-
-
-
31
-
-
84943297310
-
Automatically tuned linear algebra software
-
Washington, DC, USA, IEEE Computer Society
-
R. C. Whaley and J. J. Dongarra. Automatically tuned linear algebra software. In Supercomputing '98: Proceedings of the 1998 ACM/IEEE conference on Supercomputing (CDROM), pages 1-27, Washington, DC, USA, 1998. IEEE Computer Society.
-
(1998)
Supercomputing '98: Proceedings of the 1998 ACM/IEEE conference on Supercomputing (CDROM)
, pp. 1-27
-
-
Whaley, R.C.1
Dongarra, J.J.2
-
32
-
-
84976692695
-
SUIF: An infrastructure for research on parallelizing and optimizing compilers
-
R. P. Wilson, R. S. French, C. S. Wilson, S. P. Amarasinghe, J. M. Anderson, S. W. K. Tjiang, S-W. Liao, C-W. Tseng, M. W. Hall, M. S. Lam, and J. L. Hennessy. SUIF: An infrastructure for research on parallelizing and optimizing compilers. SIGPLAN Notices, 29(12):31-37, 1994.
-
(1994)
SIGPLAN Notices
, vol.29
, Issue.12
, pp. 31-37
-
-
Wilson, R.P.1
French, R.S.2
Wilson, C.S.3
Amarasinghe, S.P.4
Anderson, J.M.5
Tjiang, S.W.K.6
Liao, S.-W.7
Tseng, C.-W.8
Hall, M.W.9
Lam, M.S.10
Hennessy, J.L.11