-
1
-
-
0000718681
-
Measuring cache and tlb performance and their effect on benchmark runtimes
-
Saavedra AJ, Smith RH. Measuring cache and tlb performance and their effect on benchmark runtimes. Transactions on Computers 1995; 44(10):1223-1235.
-
(1995)
Transactions on Computers
, vol.44
, Issue.10
, pp. 1223-1235
-
-
Saavedra, A.J.1
Smith, R.H.2
-
3
-
-
33244459867
-
Automatic measurement of memory hierarchy parameters
-
ACM: New York, NY, U.S.A
-
Yotov K, Pingali K, Stodghill P. Automatic measurement of memory hierarchy parameters. SIGMETRICS '05: Proceedings of the 2005 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems. ACM: New York, NY, U.S.A., 2005; 181-192.
-
(2005)
SIGMETRICS '05: Proceedings of the 2005 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems
, pp. 181-192
-
-
Yotov, K.1
Pingali, K.2
Stodghill, P.3
-
4
-
-
0030661485
-
Optimizing matrix multiply using PHiPAC: A portable, high-performance, ANSI C coding methodology
-
Vienna, Austria, July
-
Bilmes J, Asanović K, Chin CW, Demmel J. Optimizing matrix multiply using PHiPAC: A portable, high-performance, ANSI C coding methodology. Proceedings of the ACM SIGARC International Conference on SuperComputing, Vienna, Austria, July 1997.
-
(1997)
Proceedings of the ACM SIGARC International Conference on SuperComputing
-
-
Bilmes, J.1
Asanović, K.2
Chin, C.W.3
Demmel, J.4
-
5
-
-
19344363982
-
Efficient utilization of simd extensions
-
Franchetti F, Krai S, Lorenz J, Ueberhuber C. Efficient utilization of simd extensions. Proceedings of the IEEE (Special Issue on Program Generation, Optimization, and Adaptation) 2005; 93(2).
-
(2005)
Proceedings of the IEEE (Special Issue on Program Generation, Optimization, and Adaptation)
, vol.93
, Issue.2
-
-
Franchetti, F.1
Krai, S.2
Lorenz, J.3
Ueberhuber, C.4
-
6
-
-
0003533835
-
The fastest Fourier transform in the west
-
Technical Report MIT-LCS-TR-728, Massachusetts Institute of Technology
-
Frigo M, Johnson SG. The fastest Fourier transform in the west. Technical Report MIT-LCS-TR-728, Massachusetts Institute of Technology, 1997.
-
(1997)
-
-
Frigo, M.1
Johnson, S.G.2
-
7
-
-
0031636309
-
-
Frigo M, Johnson S. FFTW: An adaptive software architecture for the FFT. Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 3, 1998; 1381.
-
Frigo M, Johnson S. FFTW: An adaptive software architecture for the FFT. Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 3, 1998; 1381.
-
-
-
-
8
-
-
57049102173
-
-
Automatically tuned linear algebra software. Technical Report UT-CS-97-366, University of Tennessee, December, Available at
-
Clint Whaley R, Dongarra J. Automatically tuned linear algebra software. Technical Report UT-CS-97-366, University of Tennessee, December 1997. Available at: http://www.netlib.org/lapack/lawns/lawn131.ps.
-
(1997)
-
-
Clint Whaley, R.1
Dongarra, J.2
-
9
-
-
57049107219
-
-
̃whaley/papers/atlas-sc98.ps.
-
̃whaley/papers/atlas-sc98.ps.
-
-
-
-
10
-
-
35348918478
-
Automatically tuned linear algebra software
-
San Antonio, TX, U.S.A, CD-ROM Proceedings
-
Clint Whaley R, Dongarra J. Automatically tuned linear algebra software. Ninth SIAM Conference on Parallel Processing for Scientific Computing, San Antonio, TX, U.S.A., 1999. CD-ROM Proceedings.
-
(1999)
Ninth SIAM Conference on Parallel Processing for Scientific Computing
-
-
Clint Whaley, R.1
Dongarra, J.2
-
11
-
-
0343462141
-
Automated empirical optimization of software and the ATLAS project
-
Clint Whaley R, Petitet A, Dongarra JJ. Automated empirical optimization of software and the ATLAS project. Parallel Computing 2001; 27(1-2):3-35.
-
(2001)
Parallel Computing
, vol.27
, Issue.1-2
, pp. 3-35
-
-
Clint Whaley, R.1
Petitet, A.2
Dongarra, J.J.3
-
12
-
-
57049086253
-
-
home [1 January
-
Clint Whaley R, Petitet A. Atlas homepage, http://math-atlas.sourceforge. net/ [1 January 2008].
-
(2008)
Clint Whaley R, Petitet A. Atlas
-
-
-
13
-
-
19344368072
-
-
Pushel M, Moura J, Johnson J, Padua D, Veloso M, Singer B, Xiong J, Frenchetti F, Cacic A, Voronenko Y, Chen K, Johnson R, Rizzolo N. Spiral: Code generation for dsp transforms. Proceedings of the IEEE (Special Issue on Program Generation, Optimization, and Adaptation) 2005; 93(2).
-
Pushel M, Moura J, Johnson J, Padua D, Veloso M, Singer B, Xiong J, Frenchetti F, Cacic A, Voronenko Y, Chen K, Johnson R, Rizzolo N. Spiral: Code generation for dsp transforms. Proceedings of the IEEE (Special Issue on Program Generation, Optimization, and Adaptation) 2005; 93(2).
-
-
-
-
14
-
-
13244250227
-
Spiral: Automatic implementation of signal processing algorithms
-
MIT Lincoln Laboratories: Boston, MA
-
Moura J, Johnson J, Johnson R, Padua D, Puschel M, Veloso M. Spiral: Automatic implementation of signal processing algorithms. Proceedings of the Conference on High-performance Embedded Computing. MIT Lincoln Laboratories: Boston, MA, 2000.
-
(2000)
Proceedings of the Conference on High-performance Embedded Computing
-
-
Moura, J.1
Johnson, J.2
Johnson, R.3
Padua, D.4
Puschel, M.5
Veloso, M.6
-
15
-
-
24344485098
-
OSKI: A library of automatically tuned sparse matrix kernels
-
San Francisco, CA, U.S.A, June, Institute of Physics Publishing
-
Vuduc R, Demmel JW, Yelick KA. OSKI: A library of automatically tuned sparse matrix kernels. Proceedings of SciDAC 2005, Journal of Physics: Conference Series, San Francisco, CA, U.S.A., June 2005. Institute of Physics Publishing, 2005.
-
(2005)
Proceedings of SciDAC 2005, Journal of Physics: Conference Series
-
-
Vuduc, R.1
Demmel, J.W.2
Yelick, K.A.3
-
16
-
-
0002363292
-
Iterative compilation in program optimization
-
Aussois, France
-
Kisuki T, Knijnenburg P, O'Boyle M, Wijsho H. Iterative compilation in program optimization. Proceedings of the Eighth International Workshop on Compilers for Parallel Computers, Aussois, France, 2000; 35-44.
-
(2000)
Proceedings of the Eighth International Workshop on Compilers for Parallel Computers
, pp. 35-44
-
-
Kisuki, T.1
Knijnenburg, P.2
O'Boyle, M.3
Wijsho, H.4
-
17
-
-
33745158525
-
-
Master's Thesis, Leiden Institute of Advanced Computer Science
-
van der Mark P. Iterative compilation. Master's Thesis, Leiden Institute of Advanced Computer Science, 1999.
-
(1999)
Iterative compilation
-
-
van der Mark, P.1
-
18
-
-
0006095489
-
Using iterative compilation for managing software pipeline - unrolling tradoffs
-
St. Goar, Germany
-
van der Mark P, Rohou E, Bodin F, Chamski Z, Eisenbeis C. Using iterative compilation for managing software pipeline - unrolling tradoffs. SCOPES99, St. Goar, Germany, 1999.
-
(1999)
SCOPES99
-
-
van der Mark, P.1
Rohou, E.2
Bodin, F.3
Chamski, Z.4
Eisenbeis, C.5
-
20
-
-
0004302191
-
-
Morgan Kaufmann Publishers, Inc, San Francisco, CA
-
Hennessy J, Patterson D. Computer Architecture, A Quantitative Approach. Morgan Kaufmann Publishers, Inc.: San Francisco, CA, 1990.
-
(1990)
Computer Architecture, A Quantitative Approach
-
-
Hennessy, J.1
Patterson, D.2
-
21
-
-
0003625523
-
-
7th edn, Prentice-Hall: Upper Saddle River, NJ
-
Walpole R, Myers R, Myers S, Ye K. Probability & Statistics for Engineers & Scientists (7th edn). Prentice-Hall: Upper Saddle River, NJ, 2002.
-
(2002)
Probability & Statistics for Engineers & Scientists
-
-
Walpole, R.1
Myers, R.2
Myers, S.3
Ye, K.4
|