-
1
-
-
20344398112
-
A compiler framework for restructuring data declarations to enhance cache and TLB effectiveness
-
Nov.
-
D. F. Bacon, J. Chow, R. Ju, K. Muthukumar, and V. Sarkar. A compiler framework for restructuring data declarations to enhance cache and TLB effectiveness. In Proc. of the Conference of the Center for Advanced Studies on Collaborative Research, Nov. 1994.
-
(1994)
Proc. of the Conference of the Center for Advanced Studies on Collaborative Research
-
-
Bacon, D.F.1
Chow, J.2
Ju, R.3
Muthukumar, K.4
Sarkar, V.5
-
2
-
-
0030661485
-
Optimizing matrix multiply using PHiPAC: A portable, high-performance, ANSI C coding methodology
-
June
-
J. Bilmes, K. Asanović, C. Chin, and J. Demmel. Optimizing matrix multiply using PHiPAC: a portable, high-performance, ANSI C coding methodology. In Proc. of the International Conference on Supercomputing, June 1997.
-
(1997)
Proc. of the International Conference on Supercomputing
-
-
Bilmes, J.1
Asanović, K.2
Chin, C.3
Demmel, J.4
-
3
-
-
0034268943
-
A portable programming interface for performance evaluation on modern processors
-
Aug.
-
S. Browne, J. Dongarra, N. Garner, G. Ho, and P. Mucci. A portable programming interface for performance evaluation on modern processors. International Journal of High Performance Computing Applications, 14(3):189-204, Aug. 2000.
-
(2000)
International Journal of High Performance Computing Applications
, vol.14
, Issue.3
, pp. 189-204
-
-
Browne, S.1
Dongarra, J.2
Garner, N.3
Ho, G.4
Mucci, P.5
-
4
-
-
17044422621
-
Compiler-directed page coloring for multiprocessors
-
Oct.
-
E. Bugnion, J. M. Anderson, T. C. Mowry, M. Rosenblum, and M. S. Lam. Compiler-directed page coloring for multiprocessors. In Proc. of the International Conference on Architectural Support for Programming Languages and Operating Systems, Oct. 1996.
-
(1996)
Proc. of the International Conference on Architectural Support for Programming Languages and Operating Systems
-
-
Bugnion, E.1
Anderson, J.M.2
Mowry, T.C.3
Rosenblum, M.4
Lam, M.S.5
-
5
-
-
0028549474
-
Improving the ratio of memory operations to floating-point operations in loops
-
Nov.
-
S. Carr and K. Kennedy. Improving the ratio of memory operations to floating-point operations in loops. ACM Transactions on Programming Languages and Systems, 16(6):1768-1810, Nov. 1994.
-
(1994)
ACM Transactions on Programming Languages and Systems
, vol.16
, Issue.6
, pp. 1768-1810
-
-
Carr, S.1
Kennedy, K.2
-
8
-
-
17144430151
-
Optimizing for reduced code space using genetic algorithms
-
May
-
K. D. Cooper, P. J. Schielke, and D. Subramanian. Optimizing for reduced code space using genetic algorithms. In Proc. of the Workshop on Languages, Compilers, and Tools for Embedded Systems, May 1999.
-
(1999)
Proc. of the Workshop on Languages, Compilers, and Tools for Embedded Systems
-
-
Cooper, K.D.1
Schielke, P.J.2
Subramanian, D.3
-
11
-
-
0442295621
-
The effect of cache models on iterative compilation for combined tiling and unrolling
-
P. M. W. Knijnenburg, T. Kisuki, K. Gallivan, and M. F. P. O'Boyle. The effect of cache models on iterative compilation for combined tiling and unrolling. Concurrency and Computation: Practice and Experience, 16(2-3):247-270, 2004.
-
(2004)
Concurrency and Computation: Practice and Experience
, vol.16
, Issue.2-3
, pp. 247-270
-
-
Knijnenburg, P.M.W.1
Kisuki, T.2
Gallivan, K.3
O'Boyle, M.F.P.4
-
14
-
-
0042235298
-
Tiling, block data layout, and memory hierarchy performance
-
July
-
N. Park, B. Hong, and V. K. Prasanna. Tiling, block data layout, and memory hierarchy performance. IEEE Transactions on Parallel and Distributed Systems, 14(7):640-654, July 2003.
-
(2003)
IEEE Transactions on Parallel and Distributed Systems
, vol.14
, Issue.7
, pp. 640-654
-
-
Park, N.1
Hong, B.2
Prasanna, V.K.3
-
15
-
-
34548789419
-
Better tiling and array contraction for compiling scientific programs
-
Nov.
-
G. Pike and P. N. Hilfinger. Better tiling and array contraction for compiling scientific programs. In Proc. of Supercomputing'02, Nov. 2002.
-
(2002)
Proc. of Supercomputing'02
-
-
Pike, G.1
Hilfinger, P.N.2
-
17
-
-
0141696394
-
Stochastic search for signal processing algorithm optimization
-
Nov.
-
B. Singer and M. Veloso. Stochastic search for signal processing algorithm optimization. In Proc. of Supercomputing'01, Nov. 2001.
-
(2001)
Proc. of Supercomputing'01
-
-
Singer, B.1
Veloso, M.2
-
19
-
-
0027764718
-
To copy or not to copy: A compile-time technique for assessing when data copying should be used to eliminate cache conflicts
-
Nov.
-
O. Temam, E. D. Granston, and W. Jalby. To copy or not to copy: A compile-time technique for assessing when data copying should be used to eliminate cache conflicts. In Proc. of Supercomputing'93, Nov. 1993.
-
(1993)
Proc. of Supercomputing'93
-
-
Temam, O.1
Granston, E.D.2
Jalby, W.3
-
21
-
-
0343462141
-
Automated empirical optimization of software and the ATLAS project
-
Jan.
-
R. C. Whaley, A. Petitet, and J. J. Dongarra. Automated empirical optimization of software and the ATLAS project. Parallel Computing, 27(1-2):3-35, Jan. 2001.
-
(2001)
Parallel Computing
, vol.27
, Issue.1-2
, pp. 3-35
-
-
Whaley, R.C.1
Petitet, A.2
Dongarra, J.J.3
-
26
-
-
0038378242
-
A comparison of empirical and model-driven optimization
-
June
-
K. Yotov, X. Li, G. Ren, M. Cibulskis, G. DeJong, M. Garzaran, D. Padua, K. Pingali, P. Stodghill, and P. Wu. A comparison of empirical and model-driven optimization. In Proc. of the Conference on Programming Language Design and Implementation, June 2003.
-
(2003)
Proc. of the Conference on Programming Language Design and Implementation
-
-
Yotov, K.1
Li, X.2
Ren, G.3
Cibulskis, M.4
DeJong, G.5
Garzaran, M.6
Padua, D.7
Pingali, K.8
Stodghill, P.9
Wu, P.10
-
27
-
-
20744459570
-
Is search really necessary to generate high-performance BLAS?
-
Feb.
-
K. Yotov, X. Li, G. Ren, M. Garzaran, D. Padua, K. Pingali, and P. Stodghill. Is search really necessary to generate high-performance BLAS? Proceedings of the IEEE, 93(2), Feb. 2005.
-
(2005)
Proceedings of the IEEE
, vol.93
, Issue.2
-
-
Yotov, K.1
Li, X.2
Ren, G.3
Garzaran, M.4
Padua, D.5
Pingali, K.6
Stodghill, P.7
|