-
4
-
-
56749158843
-
-
Williams S, Oliker L, Vuduc R, Shalf J, Yelick K and Demmel J 2007 Optimization of sparse matrix-vector multiplication on emerging multicore platforms Proc. SC2007: High performance computing, networking, and storage conference
-
(2007)
Optimization of Sparse Matrix-vector Multiplication on Emerging Multicore Platforms
-
-
Williams, S.1
Oliker, L.2
Vuduc, R.3
Shalf, J.4
Yelick, K.5
Demmel, J.6
-
5
-
-
70350771127
-
-
Datta K, Murphy M, Volkov V, Williams S, Carter J, Oliker L, Patterson D, Shalf J and Yelick K 2008 Stencil computation optimization autotuning state-of-the-art multicore architectures Preprint
-
(2008)
Stencil Computation Optimization Autotuning State-of-the-art Multicore Architectures
-
-
Datta, K.1
Murphy, M.2
Volkov, V.3
Williams, S.4
Carter, J.5
Oliker, L.6
Patterson, D.7
Shalf, J.8
Yelick, K.9
-
10
-
-
65649143554
-
-
Williams S and Patterson d 2008 The roofline model: An insightful multicore performance model Preprint
-
(2008)
-
-
Williams, S.1
Patterson, D.2
-
11
-
-
68949198052
-
The roofline model: A pedagogical tool for auto-tuning kernels on multicore architectures
-
Williams S, Patterson D, Oliker L, Shalf J and Yelick K 2008 The roofline model: A pedagogical tool for auto-tuning kernels on multicore architectures Hot Chips 20: Stanford University, stanford, California, August 24-26, 2008 (IEEE MIcro)
-
(2008)
Hot Chips 20: Stanford University, Stanford, California, August 24-26, 2008
-
-
Williams, S.1
Patterson, D.2
Oliker, L.3
Shalf, J.4
Yelick, K.5
-
12
-
-
0343462141
-
Automated empirical optimizations of software and the ATLAS project
-
Whaley R C, Petitet A and Dongarra J 2001 Automated empirical optimizations of software and the ATLAS project Parallel Computing 27 (1) 3-25
-
(2001)
Parallel Computing
, vol.27
, Issue.1-2
, pp. 3-25
-
-
Whaley, R.C.1
Petitet, A.2
Dongarra, J.3
-
13
-
-
24344465959
-
-
Bilmes J, Asanović K, Chin C W and Demmel J 1997 Optimizing matrix multiply using PHiPAC: A portable, high-performance, ANSI C coding methodology Proc. Int. Conf. Supercomputing (Vienna, Austria)
-
(1997)
Optimizing Matrix Multiply Using PHiPAC: A Portable, High-performance, ANSI C Coding Methodology
-
-
Bilmes, J.1
Asanović, K.2
Chin, C.W.3
Demmel, J.4
-
15
-
-
65649101948
-
SPIRAL: Automatic implementation of signal processing algorithms
-
Moura J M F, Johnson J, Johnson R W, Padua D, Prasanna V K, Püschel M and Veloso M 2000 SPIRAL: Automatic implementation of signal processing algorithms High Performance Embedded Computing (HPEC)
-
(2000)
High Performance Embedded Computing (HPEC)
-
-
Moura, J.M.F.1
Johnson, J.2
Johnson, R.W.3
Padua, D.4
Prasanna, V.K.5
Püschel, M.6
Veloso, M.7
-
16
-
-
0024903997
-
Evaluating associativity in CPU caches
-
Hill M D and Smith A J 1989 Evaluating associativity in CPU caches IEEE Trans. Comput. 38 (12) 1612-1630
-
(1989)
IEEE Trans. Comput.
, vol.38
, Issue.12
, pp. 1612-1630
-
-
Hill, M.D.1
Smith, A.J.2
-
17
-
-
33646924323
-
Microarchitectures for systems on a chip in small process geometries
-
Sylvester D and Keutzer K 2001 Microarchitectures for systems on a chip in small process geometries Proc. IEEE 467-489
-
(2001)
Proc. IEEE
, vol.89
, Issue.4
, pp. 467-489
-
-
Sylvester, D.1
Keutzer, K.2
|