-
2
-
-
84964748976
-
Compiler blockability of numerical algorithms
-
November
-
Steve Carr and Ken Kennedy, Compiler blockability of numerical algorithms, J. Supercomputing, pp. 114-124 (November 1992).
-
(1992)
J. Supercomputing
, pp. 114-124
-
-
Carr, S.1
Kennedy, K.2
-
3
-
-
85009364061
-
Compiler optimizations for improving data locality
-
San Jose, California, Oct.
-
Steve Carr, Kathryn S. McKinley, and Chau-Wen Tseng, Compiler optimizations for improving data locality, Sixth Intl. Conf. Archit. Support Progr. Lang. Oper. Syst., San Jose, California, Oct. 1994.
-
(1994)
Sixth Intl. Conf. Archit. Support Progr. Lang. Oper. Syst.
-
-
Carr, S.1
McKinley, K.S.2
Tseng, C.-W.3
-
4
-
-
0028549474
-
Improving the ratio of memory operations to floatingpoint operations in loops
-
November
-
Steve Carr and Ken Kennedy, Improving the ratio of memory operations to floatingpoint operations in loops, Trans. Progr. Lang. Syst. 16(6):1768-1810 (November 1994).
-
(1994)
Trans. Progr. Lang. Syst.
, vol.16
, Issue.6
, pp. 1768-1810
-
-
Carr, S.1
Kennedy, K.2
-
6
-
-
0026232450
-
A loop transformation theory and an algorithm to maximize parallelism
-
Michael E. Wolf and Monica S. Lam, A loop transformation theory and an algorithm to maximize parallelism, IEEE Trans. Parallel Distrib. Syst. 2(4):452-471 (1991).
-
(1991)
IEEE Trans. Parallel Distrib. Syst.
, vol.2
, Issue.4
, pp. 452-471
-
-
Wolf, M.E.1
Lam, M.S.2
-
7
-
-
0026933251
-
Some efficient solutions to the affine scheduling problem, Part I, one-dimensional time
-
October
-
Paul Feautrier, Some efficient solutions to the affine scheduling problem, Part I, one-dimensional time, IJPP 21(5):xx-xx (October 1992).
-
(1992)
IJPP
, vol.21
, Issue.5
-
-
Feautrier, P.1
-
9
-
-
2342480327
-
Unrolling-based optimizations for modulo scheduling
-
December
-
Daniel Lavery and Wen-mei Hwu, Unrolling-based optimizations for modulo scheduling, 28th Int'l. Symp. Microarchit., pp. 126-141 (December 1995).
-
(1995)
28th Int'l. Symp. Microarchit.
, pp. 126-141
-
-
Lavery, D.1
Hwu, W.-M.2
-
13
-
-
0030379246
-
Combining loop transformations considering caches and scheduling
-
December
-
Michael E. Wolf, Dror Maydan, and Ding-Kai Chen, Combining loop transformations considering caches and scheduling, 29th Int'l. Symp. Microarchit. (December 1996).
-
(1996)
29th Int'l. Symp. Microarchit.
-
-
Wolf, M.E.1
Maydan, D.2
Chen, D.-K.3
-
14
-
-
0002433589
-
Iteration space tiling for memory hierarchies
-
Michael J. Wolfe, Iteration space tiling for memory hierarchies, Parallel Processing for Sci. Comput., pp. 357-361 (1987).
-
(1987)
Parallel Processing for Sci. Comput.
, pp. 357-361
-
-
Wolfe, M.J.1
-
15
-
-
0002238004
-
Tiling multidimensional iteration spaces for nonshared memory machines
-
November
-
J. Ramanujam and P. Sadayappan, Tiling multidimensional iteration spaces for nonshared memory machines, Supercomputing (November 1991).
-
(1991)
Supercomputing
-
-
Ramanujam, J.1
Sadayappan, P.2
-
16
-
-
0022874874
-
Advanced compiler optimizations for supercomputers
-
December
-
David A. Padua and Michael J. Wolfe, Advanced compiler optimizations for supercomputers, Commun. ACM 29(12):1184-1201 (December 1986).
-
(1986)
Commun. ACM
, vol.29
, Issue.12
, pp. 1184-1201
-
-
Padua, D.A.1
Wolfe, M.J.2
-
17
-
-
0001366267
-
Strategies for cache and local memory management by global program transformation
-
October
-
Dennis Gannon, William Jalby, and Kyle Gallivan, Strategies for cache and local memory management by global program transformation, J. Parallel and Distrib. Comput., Vol. 5, No. 5 (October 1988).
-
(1988)
J. Parallel and Distrib. Comput.
, vol.5
, Issue.5
-
-
Gannon, D.1
Jalby, W.2
Gallivan, K.3
-
19
-
-
0024935630
-
More iteration space tiling
-
Michael J. Wolfe, More iteration space tiling, Supercomputing, pp. 655-664 (1989).
-
(1989)
Supercomputing
, pp. 655-664
-
-
Wolfe, M.J.1
-
20
-
-
0026137116
-
The cache performance and optimizations of blocked algorithms
-
Palo Alto, California April
-
Monica S. Lam, Edward E. Rothberg, and Michael E. Wolf, The cache performance and optimizations of blocked algorithms, ASPLOS-IV , Palo Alto, California (April 1991).
-
(1991)
ASPLOS-IV
-
-
Lam, M.S.1
Rothberg, E.E.2
Wolf, M.E.3
-
21
-
-
0003207812
-
Unimodular transformations of double loops
-
Irvine, California August
-
Utpal Banerjee, Unimodular transformations of double loops, in Progr. Lang. Compilers for Parallel Computing, Irvine, California (August 1990).
-
(1990)
Progr. Lang. Compilers for Parallel Computing
-
-
Banerjee, U.1
-
24
-
-
0003934689
-
Automatic partitioning of parallel loops and data arrays for distributed shared memory multiprocessors
-
Anant Agarwal, David Kranz, and Venkat Natarajan, Automatic partitioning of parallel loops and data arrays for distributed shared memory multiprocessors, Int'l. Conf. Parallel Computing (1993).
-
(1993)
Int'l. Conf. Parallel Computing
-
-
Agarwal, A.1
Kranz, D.2
Natarajan, V.3
-
25
-
-
84949655044
-
A general framework for iteration-reordering loop transformations, Technical Summary
-
Vivek Sarkar and Radhika Thekkath, A general framework for iteration-reordering loop transformations, Technical Summary, Progr. Lang. Design and Implementation (1992).
-
(1992)
Progr. Lang. Design and Implementation
-
-
Sarkar, V.1
Thekkath, R.2
-
26
-
-
0029749714
-
Combining optimization for cache and instruction-level parallelism
-
Steve Carr, Combining optimization for cache and instruction-level parallelism, PACT '96, pp. 238-247 (1996).
-
(1996)
PACT '96
, pp. 238-247
-
-
Carr, S.1
-
27
-
-
0001465739
-
Maximizing loop parallelism and improving data locality via loop fusion and distribution
-
Ken Kennedy and Kathryn S. McKinley, Maximizing loop parallelism and improving data locality via loop fusion and distribution, Lang. Compilers for Parallel Computing (1993).
-
(1993)
Lang. Compilers for Parallel Computing
-
-
Kennedy, K.1
McKinley, K.S.2
-
28
-
-
0030661485
-
Optimizing matrix multiply using PHiPAC: A portable, high-performance, ANSI C coding methodology
-
Jeff Bilmes, Krste Asanović, Chee-Whye Chin, and Jim Demmel, Optimizing matrix multiply using PHiPAC: a portable, high-performance, ANSI C coding methodology, Intl. Conf. Supercomputing (1997).
-
(1997)
Intl. Conf. Supercomputing
-
-
Bilmes, J.1
Asanović, K.2
Chin, C.-W.3
Demmel, J.4
-
29
-
-
0002741087
-
Hierarchical tiling: A methodology for high performance
-
UCSD, Department of Computer Science and Engineering November
-
Larry Carter, Jeanne Ferrante, Susan Flynn Hummel, Bowen Alpern, and Kang Su Gatlin, Hierarchical tiling: A methodology for high performance, Technical Report CS96-508, UCSD, Department of Computer Science and Engineering (November 1996).
-
(1996)
Technical Report CS96-508
-
-
Carter, L.1
Ferrante, J.2
Hummel, S.F.3
Alpern, B.4
Gatlin, K.S.5
|