-
1
-
-
70749096103
-
Design of the CodeBoost transformation system for domain-specific optimisation of C++ programs
-
D. Binkley and P. Tonella, editors, Amsterdam, The Netherlands, September, IEEE Computer Society Press
-
O. S. Bagge, K. T. Kalleberg, M. Haveraaen, and E. Visser. Design of the CodeBoost transformation system for domain-specific optimisation of C++ programs. In D. Binkley and P. Tonella, editors, Third International Workshop on Source Code Analysis and Manipulation (SCAM 2003), pages 65-75, Amsterdam, The Netherlands, September 2003. IEEE Computer Society Press.
-
(2003)
Third International Workshop on Source Code Analysis and Manipulation (SCAM 2003)
, pp. 65-75
-
-
Bagge, O.S.1
Kalleberg, K.T.2
Haveraaen, M.3
Visser, E.4
-
2
-
-
20744453223
-
Synthesis of high-performance parallel programs for a class of ab initio quantum chemistry models
-
G. Baumgartner, A. Auer, D. E. Bernholdt, A. Bibireata, V. Choppella, D. Cociorva, X. Gao, R. J. Harrison, S. Hirata, S. Krishnamoorthy, S. Krishnan, C.-C. Lam, Q. Lu, M. Nooijen, R. M. Pitzer, J. Ramanujam, P. Sadayappan, and A. Sibiryakov. Synthesis of high-performance parallel programs for a class of ab initio quantum chemistry models. Proc. IEEE, Special Issue on Program Generation, Optimization, and Adaptation, 93(2), 2005.
-
(2005)
Proc. IEEE, Special Issue on Program Generation, Optimization, and Adaptation
, vol.93
, Issue.2
-
-
Baumgartner, G.1
Auer, A.2
Bernholdt, D.E.3
Bibireata, A.4
Choppella, V.5
Cociorva, D.6
Gao, X.7
Harrison, R.J.8
Hirata, S.9
Krishnamoorthy, S.10
Krishnan, S.11
Lam, C.-C.12
Lu, Q.13
Nooijen, M.14
Pitzer, R.M.15
Ramanujam, J.16
Sadayappan, P.17
Sibiryakov, A.18
-
3
-
-
84947286016
-
Runtime code generation in C++ as a foundation for domain-specific optimisation
-
C. Lengauer, D. Batory, C. Consel, and M. Odersky, editors, Domain-Specific Program Generation, Internationnal Seminar, Dagstuhl Castle, Germany, March 23-28, Revised Papers, 2004
-
O. Beckmann, A. Houghton, M. Mellor, and P. H. J. Kelly. Runtime code generation in C++ as a foundation for domain-specific optimisation. In C. Lengauer, D. Batory, C. Consel, and M. Odersky, editors, Domain-Specific Program Generation, volume LNCS 3016, pages 291-306, Internationnal Seminar, Dagstuhl Castle, Germany, March 23-28, 2003, Revised Papers, 2004.
-
(2003)
LNCS
, vol.3016
, pp. 291-306
-
-
Beckmann, O.1
Houghton, A.2
Mellor, M.3
Kelly, P.H.J.4
-
4
-
-
17644412337
-
The science of deriving dense linear algebra algorithms
-
March
-
P. Bientinesi, J. A. Gunnels, M. E. Myers, E. Quintana-Orti, and R. van de Geijn. The science of deriving dense linear algebra algorithms. ACM Transactions on Mathematical Software, 31(1):1-26, March 2005.
-
(2005)
ACM Transactions on Mathematical Software
, vol.31
, Issue.1
, pp. 21-26
-
-
Bientinesi, P.1
Gunnels, J.A.2
Myers, M.E.3
Quintana-Orti, E.4
Van De Geijn, R.5
-
5
-
-
33646828918
-
Combining models and guided empirical search to optimize for multiple levels of the memory hierarchy
-
San Jose, CA, USA, March
-
C. Chen, J. Chame, and M. Hall. Combining models and guided empirical search to optimize for multiple levels of the memory hierarchy. In CGO, San Jose, CA, USA, March 2005.
-
(2005)
CGO
-
-
Chen, C.1
Chame, J.2
Hall, M.3
-
6
-
-
32844473507
-
Facilitating the search for compositions of program transformations
-
Boston, MA, USA, June
-
A. Cohen, S. Girbal, D. Parello, M. Sigler, O. Temam, and N. Vasilache. Facilitating the search for compositions of program transformations. In ICS, pages 151-160, Boston, MA, USA, June 2005.
-
(2005)
ICS
, pp. 151-160
-
-
Cohen, A.1
Girbal, S.2
Parello, D.3
Sigler, M.4
Temam, O.5
Vasilache, N.6
-
7
-
-
20744452904
-
Self adapting linear algebra algorithms and software
-
J. Demmel, J. Dongarra, V. Eijkhout, E. Fuentes, A. Petitet, R. Vuduc, C. Whaley, and K. Yelick. Self adapting linear algebra algorithms and software. Proc. IEEE, Special Issue on Program Generation, Optimization, and Adaptation, 93(2), 2005.
-
(2005)
Proc. IEEE, Special Issue on Program Generation, Optimization, and Adaptation
, vol.93
, Issue.2
-
-
Demmel, J.1
Dongarra, J.2
Eijkhout, V.3
Fuentes, E.4
Petitet, A.5
Vuduc, R.6
Whaley, C.7
Yelick, K.8
-
8
-
-
34547709622
-
A language for the compact representation of multiple program versions
-
Hawthorne, NY, USA, October
-
S. Donadio, J. Brodman, T. Roeder, K. Yotov, D. Barthou, A. Cohen, M. J. Garzaŕan, D. Padua, and K. Pingali. A language for the compact representation of multiple program versions. In LCPC, Hawthorne, NY, USA, October 2005.
-
(2005)
LCPC
-
-
Donadio, S.1
Brodman, J.2
Roeder, T.3
Yotov, K.4
Barthou, D.5
Cohen, A.6
Garzaŕan, M.J.7
Padua, D.8
Pingali, K.9
-
9
-
-
0029712698
-
C: A language for highlevel, efficient, and machine-independent code generation
-
D. R. Engler, W. Hsieh, and M. Kaashoek. 'C: A language for highlevel, efficient, and machine-independent code generation. In POPL, pages 131-144, 1996. .
-
(1996)
POPL
, pp. 131-144
-
-
Engler, D.R.1
Hsieh, W.2
Kaashoek, M.3
-
10
-
-
0031636309
-
FFTW: An adaptive software architecture for the FFT
-
M. Frigo and S. Johnson. FFTW: An Adaptive Software Architecture for the FFT. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP), volume 3, page 1381, 1998.
-
(1998)
Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
, vol.3
, pp. 1381
-
-
Frigo, M.1
Johnson, S.2
-
11
-
-
20744449792
-
The design and implementation of FFTW3
-
M. Frigo and S. G. Johnson. The design and implementation of FFTW3. Proc. IEEE, Special Issue on Program Generation, Optimization, and Adaptation, 93(2), 2005.
-
(2005)
Proc. IEEE, Special Issue on Program Generation, Optimization, and Adaptation
, vol.93
, Issue.2
-
-
Frigo, M.1
Johnson, S.G.2
-
12
-
-
34548788908
-
A pratical method for quickly evaluating program optimizations
-
November
-
G. Fursin, A. Cohen, M. O'Boyle, and O. Temam. A pratical method for quickly evaluating program optimizations. In HiPEAC, November 2005.
-
(2005)
HiPEAC
-
-
Fursin, G.1
Cohen, A.2
O'Boyle, M.3
Temam, O.4
-
13
-
-
0000052043
-
DyC: An expressive annotation-directed dynamic compiler for C
-
B. Grant, M. Mock, M. Philipose, C. Chambers, and S. J. Eggers. DyC: An expressive annotation-directed dynamic compiler for C. Theoretical Computer Science, 248:147-199, 2000.
-
(2000)
Theoretical Computer Science
, vol.248
, pp. 147-199
-
-
Grant, B.1
Mock, M.2
Philipose, M.3
Chambers, C.4
Eggers, S.J.5
-
15
-
-
0034512401
-
-
PACT, Philadelphia, PA, October
-
T. Kisuki, P. M. Knijnenburg, and M. F. O'Boyle. Combined selection of tile sizes and unroll factors using iterative compilation. In PACT, Philadelphia, PA, October 2000.
-
(2000)
Combined Selection of Tile Sizes and Unroll Factors Using Iterative Compilation
-
-
Kisuki, T.1
Knijnenburg, P.M.2
O'Boyle, M.F.3
-
16
-
-
34548789419
-
-
SC, Baltimore, MD, USA, November
-
G. Pike and P. Hilfinger. Better tiling and array contraction for compiling scientific programs. In SC, Baltimore, MD, USA, November 2002.
-
(2002)
Better Tiling and Array Contraction for Compiling Scientific Programs
-
-
Pike, G.1
Hilfinger, P.2
-
17
-
-
19344368072
-
SPIRAL: Code generation for DSP transforms
-
M. Püschel, J. M. F. Moura, J. Johnson, D. Padua, M. Veloso, B. W. Singer, J. Xiong, F. Franchetti, A. Gäcíc, Y. Voronenko, K. Chen, R. W. Johnson, and N. Rizzolo. SPIRAL: Code generation for DSP transforms. Proc. IEEE, Special Issue on Program Generation, Optimization, and Adaptation, 93(2), 2005.
-
(2005)
Proc. IEEE, Special Issue on Program Generation, Optimization, and Adaptation
, vol.93
, Issue.2
-
-
Püschel, M.1
Moura, J.M.F.2
Johnson, J.3
Padua, D.4
Veloso, M.5
Singer, B.W.6
Xiong, J.7
Franchetti, F.8
Gäcíc, A.9
Voronenko, Y.10
Chen, K.11
Johnson, R.W.12
Rizzolo, N.13
-
18
-
-
34548788365
-
A cache-conscious profitability model for empirical tuning of loop fusion
-
October
-
A. Qasem and K. Kennedy. A cache-conscious profitability model for empirical tuning of loop fusion. In LCPC, October 2005.
-
(2005)
LCPC
-
-
Qasem, A.1
Kennedy, K.2
-
20
-
-
84858900853
-
-
See page for details
-
See page for details. FFTW homepage. http://www.fftw.org/.
-
FFTW Homepage
-
-
-
21
-
-
33646834588
-
Predicting unroll factors using supervised classification
-
San Jose, CA, USA, March
-
M. Stephenson and S. Amarasinghe. Predicting unroll factors using supervised classification. In CGO, San Jose, CA, USA, March 2005.
-
(2005)
CGO
-
-
Stephenson, M.1
Amarasinghe, S.2
-
23
-
-
18244401637
-
A survey of strategies in rule-based program transformation systems
-
Special issue on Reduction Strategies in Rewriting and Programming
-
E. Visser. A survey of strategies in rule-based program transformation systems. J. Symbolic Computation, 40(1):831-873, 2005. Special issue on Reduction Strategies in Rewriting and Programming.
-
(2005)
J. Symbolic Computation
, vol.40
, Issue.1
, pp. 831-873
-
-
Visser, E.1
-
24
-
-
1542710758
-
Statistical models for automatic performance tuning
-
R. Vuduc, J. Demmel, and J. Bilmes. Statistical models for automatic performance tuning. International Journal of High Performance Computing Applications, 18(1):65-94, 2004.
-
(2004)
International Journal of High Performance Computing Applications
, vol.18
, Issue.1
, pp. 65-94
-
-
Vuduc, R.1
Demmel, J.2
Bilmes, J.3
-
26
-
-
13244279577
-
Minimizing development and maintenance costs in supporting persistently optimized BLAS
-
February
-
R. C. Whaley and A. Petitet. Minimizing development and maintenance costs in supporting persistently optimized BLAS. Software: Practice and Experience, 35(2):101-121, February 2005. http://www.cs.utsa.edu/∼whaley/papers/ spercw04.ps.
-
(2005)
Software: Practice and Experience
, vol.35
, Issue.2
, pp. 101-121
-
-
Whaley, R.C.1
Petitet, A.2
-
27
-
-
0343462141
-
Automated empirical optimization of software and the ATLAS project
-
Also available as University of Tennessee LAPACK Working Note #147, UT-CS-00-448, 2000
-
R. C. Whaley, A. Petitet, and J. J. Dongarra. Automated empirical optimization of software and the ATLAS project. Parallel Computing, 27(1-2):3-35, 2001. Also available as University of Tennessee LAPACK Working Note #147, UT-CS-00-448, 2000 (www.netlib.org/lapack/lawns/lawn147.ps).
-
(2001)
Parallel Computing
, vol.27
, Issue.1-2
, pp. 3-35
-
-
Whaley, R.C.1
Petitet, A.2
Dongarra, J.J.3
-
30
-
-
77952410868
-
Poet: Parameterized optimizations for empirical tuning
-
Mar
-
Q. Yi, K. Seymour, H. You, R. Vuduc, and D. Quinlan. Poet: Parameterized optimizations for empirical tuning. In Workshop on Performance Optimization for High-Level Languages and Libraries, Mar 2007.
-
(2007)
Workshop on Performance Optimization for High-Level Languages and Libraries
-
-
Yi, Q.1
Seymour, K.2
You, H.3
Vuduc, R.4
Quinlan, D.5
-
31
-
-
20744459570
-
Is search really necessary to generate high-performance BLAS?
-
K. Yotov, X. Li, G. Ren, M. Garzaran, D. Padua, K. Pingali, and P. Stodghill. Is search really necessary to generate high-performance BLAS? Proc. IEEE, Special Issue on Program Generation, Optimization, and Adaptation, 93(2), 2005.
-
(2005)
Proc. IEEE, Special Issue on Program Generation, Optimization, and Adaptation
, vol.93
, Issue.2
-
-
Yotov, K.1
Li, X.2
Ren, G.3
Garzaran, M.4
Padua, D.5
Pingali, K.6
Stodghill, P.7
-
32
-
-
34547481296
-
Parameterizing loop fusion for automated empirical tuning
-
Center for Applied Scientific Computing Lawrence Livermore National Laboratory December
-
Y. Zhao, Q. Yi, K. Kennedy, D. Quinlan, and R. Vuduc. Parameterizing loop fusion for automated empirical tuning. Technical Report UCRLTR- 217808, Center for Applied Scientific Computing, Lawrence Livermore National Laboratory, December 2005.
-
(2005)
Technical Report UCRLTR- 217808
-
-
Zhao, Y.1
Yi, Q.2
Kennedy, K.3
Quinlan, D.4
Vuduc, R.5
|