[1] S. Browne, J. Dongarra, N. Garner, K. London, and P. Mucci. A scalable cross-platform infrastructure for application performance tuning using hardware counters. In Proc. of Supercomputing (SC '00), page 42, 2000.
[2] M. Frigo. A Fast Fourier Transform Compiler. In Proc. of Programming Language Design and Implementation, 1999.
[3] M. Frigo and S. G. Johnson. The design and implementation of FFTW3. Proceedings of the IEEE, special issue on "Program Generation, Optimization, and Platform Adaptation", 93(2):216-231, 2005.
[6] Q. Ning and G. R. Gao. A novel framework of register allocation for software pipelining. In POPL '93: Proceedings of the 20th ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages, pages 29-42, New York, NY, USA, 1993. ACM Press.
[7] M. Püschel, J. M. F. Moura, J. Johnson, D. Padua, M. Veloso, B. W. Singer, J. Xiong, F. Franchetti, A. Gačić, Y. Voronenko, K. Chen, R. W. Johnson, and N. Rizzolo. SPIRAL: Code generation for DSP transforms. Proceedings of the IEEE, special issue on "Program Generation, Optimization, and Platform Adaptation", 93(2):232-275, 2005.
[9] J. Shin, M. Hall, and J. Chame. Evaluating compiler technology for control-flow optimizations for multimedia extension architectures. In 6th Workshop on Media and Streaming Processors (MSP6), June 2004.
[10] H. V. Sorensen, M. T. Heideman, and C. S. Burrus. On computing the split-radix FFT. IEEE Transactions on Acoustics, Speech, and Signal Processing, ASSP-34:152-156, February 1986.
[11] R. Whaley, A. Petitet, and J. Dongarra. Automated Empirical Optimizations of Software and the ATLAS Project. Parallel Computing, 27(1-2):3-35, 2001.
[12] K. Yotov, X. Li, G. Ren, M. Garzaran, D. Padua, K. Pingali, and P. Stodghill. Is search really necessary to generate high-performance BLAS? Proceedings of the IEEE, special issue on "Program Generation, Optimization, and Platform Adaptation", 93(2), 2005.