-
1
-
-
84947242665
-
Eco: An empirical-based compilation and optimization system
-
N. Baradaran, J. Chame, C. Chen, P. Diniz, M. Hall, Y.-J. Lee, B. Liu, and R. Lucas. Eco: An empirical-based compilation and optimization system. In International Parallel and Distributed Processing Symposium, 2003.
-
International Parallel and Distributed Processing Symposium, 2003
-
-
Baradaran, N.1
Chame, J.2
Chen, C.3
Diniz, P.4
Hall, M.5
Lee, Y.-J.6
Liu, B.7
Lucas, R.8
-
2
-
-
0030661485
-
Optimizing matrix multiply using PHiPAC: A portable, high-performance, ANSI C coding methodology
-
New York, NY, USA, ACM Press
-
J. Bilmes, K. Asanovic, C.-W. Chin, and J. Demmel. Optimizing matrix multiply using PHiPAC: a portable, high-performance, ANSI C coding methodology. In Proc. the 11th international conference on Supercomputing, pages 340-347, New York, NY, USA, 1997. ACM Press.
-
(1997)
Proc. the 11th International Conference on Supercomputing
, pp. 340-347
-
-
Bilmes, J.1
Asanovic, K.2
Chin, C.-W.3
Demmel, J.4
-
5
-
-
32844473507
-
Facilitating the search for compositions of program transformations
-
New York, NY, USA, ACM
-
A. Cohen, M. Sigler, S. Girbal, O. Temam, D. Parello, and N. Vasilache. Facilitating the search for compositions of program transformations. In ICS '05: Proceedings of the 19th annual international conference on Supercomputing, pages 151-160, New York, NY, USA, 2005. ACM.
-
(2005)
ICS '05: Proceedings of the 19th Annual International Conference on Supercomputing
, pp. 151-160
-
-
Cohen, A.1
Sigler, M.2
Girbal, S.3
Temam, O.4
Parello, D.5
Vasilache, N.6
-
6
-
-
79957526167
-
A language for the compact representation of multiple program versions
-
S. Donadio, J. Brodman, T. Roeder, K. Yotov, D. Barthou, A. Cohen, M. J. Garzarán, D. Padua, and K. Pingali. A language for the compact representation of multiple program versions. In LCPC, October 2005.
-
LCPC, October 2005
-
-
Donadio, S.1
Brodman, J.2
Roeder, T.3
Yotov, K.4
Barthou, D.5
Cohen, A.6
Garzarán, M.J.7
Padua, D.8
Pingali, K.9
-
7
-
-
0031636309
-
FFTW: An Adaptive Software Architecture for the FFT
-
M. Frigo and S. Johnson. FFTW: An Adaptive Software Architecture for the FFT. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP), volume 3, page 1381, 1998.
-
(1998)
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
, vol.3
, pp. 1381
-
-
Frigo, M.1
Johnson, S.2
-
8
-
-
79957487184
-
Loop transformation recipes for code generation and auto-tuning
-
M. Hall, J. Chame, C. Chen, J. Shin, G. Rudy, and M. M. Khan. Loop transformation recipes for code generation and auto-tuning. In LCPC, October 2009.
-
LCPC, October 2009
-
-
Hall, M.1
Chame, J.2
Chen, C.3
Shin, J.4
Rudy, G.5
Khan, M.M.6
-
9
-
-
0002363292
-
Iterative compilation in program optimization
-
T. Kisuki, P. Knijnenburg, M. O'Boyle, and H. Wijshoff. Iterative compilation in program optimization. In Compilers for Parallel Computers, pages 35-44, 2000.
-
(2000)
Compilers for Parallel Computers
, pp. 35-44
-
-
Kisuki, T.1
Knijnenburg, P.2
O'Boyle, M.3
Wijshoff, H.4
-
11
-
-
0030190854
-
Improving Data Locality with Loop Transformations
-
K. McKinley, S. Carr, and C. Tseng. Improving data locality with loop transformations. ACM Transactions on Programming Languages and Systems, 18(4):424-453, July 1996.
-
(1996)
ACM Transactions on Programming Languages and Systems
, vol.18
, Issue.4
, pp. 424-453
-
-
McKinley, K.S.1
Carr, S.2
Tseng, C.-W.3
-
14
-
-
79957534486
-
Better tiling and array contraction for compiling scientific programs
-
G. Pike and P. Hilfinger. Better tiling and array contraction for compiling scientific programs. In SC, Baltimore, MD, USA, November 2002.
-
SC, Baltimore, MD, USA, November 2002
-
-
Pike, G.1
Hilfinger, P.2
-
15
-
-
19344368072
-
SPIRAL: Code generation for DSP transforms
-
M. Püschel, J. M. F. Moura, J. Johnson, D. Padua, M. Veloso, B. W. Singer, J. Xiong, F. Franchetti, A. Gačić, Y. Voronenko, K. Chen, R. W. Johnson, and N. Rizzolo. SPIRAL: Code generation for DSP transforms. IEEE special issue on Program Generation, Optimization, and Adaptation, 93(2), 2005.
-
(2005)
IEEE Special Issue on Program Generation, Optimization, and Adaptation
, vol.93
, Issue.2
-
-
Püschel, M.1
Moura, J.M.F.2
Johnson, J.3
Padua, D.4
Veloso, M.5
Singer, B.W.6
Xiong, J.7
Franchetti, F.8
Gačić, A.9
Voronenko, Y.10
Chen, K.11
Johnson, R.W.12
Rizzolo, N.13
-
16
-
-
33646676076
-
Automatic tuning of whole applications using direct search and a performance-based transformation system
-
DOI 10.1007/s11227-006-7957-2, Computer Science Research Supporting High-Performance Applications
-
A. Qasem, K. Kennedy, and J. Mellor-Crummey. Automatic tuning of whole applications using direct search and a performance-based transformation system. The Journal of Supercomputing, 36(2):183-196, 2006.
-
(2006)
Journal of Supercomputing
, vol.36
, Issue.2
, pp. 183-196
-
-
Qasem, A.1
Kennedy, K.2
Mellor-Crummey, J.3
-
17
-
-
79952975588
-
Automated empirical tuning of scientific codes for performance and power consumption
-
to appear, Heraklion, Greece, Jan
-
S. F. Rahman, J. Guo, and Q. Yi. Automated empirical tuning of scientific codes for performance and power consumption. In HiPEAC: High-Performance and Embedded Architectures and Compilers (to appear), Heraklion, Greece, Jan 2011.
-
(2011)
HiPEAC: High-Performance and Embedded Architectures and Compilers
-
-
Rahman, S.F.1
Guo, J.2
Yi, Q.3
-
18
-
-
33646834588
-
Predicting unroll factors using supervised classification
-
M. Stephenson and S. Amarasinghe. Predicting unroll factors using supervised classification. In CGO, San Jose, CA, USA, March 2005.
-
CGO, San Jose, CA, USA, March 2005
-
-
Stephenson, M.1
Amarasinghe, S.2
-
21
-
-
0343462141
-
Automated empirical optimizations of software and the ATLAS project
-
R. C. Whaley, A. Petitet, and J. Dongarra. Automated empirical optimizations of software and the ATLAS project. Parallel Computing, 27(1):3-25, 2001.
-
(2001)
Parallel Computing
, vol.27
, Issue.1
, pp. 3-25
-
-
Whaley, R.C.1
Petitet, A.2
Dongarra, J.3
-
24
-
-
79957533572
-
Applying loop optimizations to object-oriented abstractions through general classification of array semantics
-
Q. Yi and D. Quinlan. Applying loop optimizations to object-oriented abstractions through general classification of array semantics. In The 17th International Workshop on Languages and Compilers for Parallel Computing, West Lafayette, Indiana, USA, Sep 2004.
-
The 17th International Workshop on Languages and Compilers for Parallel Computing, West Lafayette, Indiana, USA, Sep 2004
-
-
Yi, Q.1
Quinlan, D.2
-
25
-
-
77952410868
-
POET: Parameterized optimizations for empirical tuning
-
Q. Yi, K. Seymour, H. You, R. Vuduc, and D. Quinlan. POET: Parameterized optimizations for empirical tuning. In Workshop on Performance Optimization for High-Level Languages and Libraries, Mar 2007.
-
Workshop on Performance Optimization for High-Level Languages and Libraries, Mar 2007
-
-
Yi, Q.1
Seymour, K.2
You, H.3
Vuduc, R.4
Quinlan, D.5
-
27
-
-
33745128949
-
A comparison of empirical and model-driven optimization
-
K. Yotov, X. Li, G. Ren, M. Garzaran, D. Padua, K. Pingali, and P. Stodghill. A comparison of empirical and model-driven optimization. IEEE special issue on Program Generation, Optimization, and Adaptation, 2005.
-
(2005)
IEEE Special Issue on Program Generation, Optimization, and Adaptation
-
-
Yotov, K.1
Li, X.2
Ren, G.3
Garzaran, M.4
Padua, D.5
Pingali, K.6
Stodghill, P.7