-
3
-
-
78650279432
-
Pattern-based sparse matrix representation for memory-efficient SMVM kernels
-
M. Belgin, G. Back, C.J. Ribbens, Pattern-based sparse matrix representation for memory-efficient SMVM kernels, in: Proceedings of the International Conference on Supercomputing, 2009, pp. 100-109.
-
(2009)
Proceedings of the International Conference on Supercomputing
, pp. 100-109
-
-
Belgin, M.1
Back, G.2
Ribbens, C.J.3
-
4
-
-
70350368872
-
Efficient sparse matrix-vector multiplication on CUDA
-
N. Bell, M. Garland, Efficient sparse matrix-vector multiplication on CUDA, Technical report, NVIDIA, 2008.
-
(2008)
Technical Report, NVIDIA
-
-
Bell, N.1
Garland, M.2
-
6
-
-
77953998137
-
Sparse matrix solvers on the GPU: Conjugate gradients and multigrid
-
J. Bolz, I. Farmer, E. Grinspun, P. Schröder, Sparse matrix solvers on the GPU: conjugate gradients and multigrid, in: Proceedings of ACM SIGGRAPH, 2005, pp. 171-178.
-
(2005)
Proceedings of ACM SIGGRAPH
, pp. 171-178
-
-
Bolz, J.1
-
7
-
-
35549013711
-
Performance optimization and modeling of blocked sparse kernels
-
DOI 10.1177/1094342007083801
-
A. Buttari, V. Eijkhout, J. Langou, S. Filippone, Performance optimization and modeling of blocked sparse kernels, International Journal of High Performance Computing Applications 21 (4) (2007) 467-484 (Pubitemid 350011340).
-
(2007)
International Journal of High Performance Computing Applications
, vol.21
, Issue.4
, pp. 467-484
-
-
Buttari, A.1
Eijkhout, V.2
Langou, J.3
Filippone, S.4
-
8
-
-
77749340082
-
Model-driven autotuning of sparse matrix-vector multiply on GPUs
-
J.W. Choi, A. Singh, R. Vuduc, Model-driven autotuning of sparse matrix-vector multiply on GPUs, in: Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2010, pp. 115-126.
-
(2010)
Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
, pp. 115-126
-
-
Choi, J.W.1
Singh, A.2
Vuduc, R.3
-
9
-
-
33645913852
-
Performance comparison of data-reordering algorithms for sparse matrix-vector multiplication in edge-based unstructured grid computations
-
A.L.G.A. Coutinho, M.A.D. Martins, R.M. Sydenstricker, R.N. Elias, Performance comparison of data-reordering algorithms for sparse matrix-vector multiplication in edge-based unstructured grid computations, International Journal for Numerical Methods in Engineering 66 (3) (2006) 431-460.
-
(2006)
International Journal for Numerical Methods in Engineering
, vol.66
, Issue.3
, pp. 431-460
-
-
Coutinho, A.L.G.A.1
Martins, M.A.D.2
Sydenstricker, R.M.3
Elias, R.N.4
-
11
-
-
0013269731
-
University of Florida sparse matrix collection
-
T. Davis, University of Florida sparse matrix collection, NA Digest 97 (23) (1997), <http://www.cise.ufl.edu/research/sparse/matrices>.
-
(1997)
NA Digest
, vol.97
, Issue.23
-
-
Davis, T.1
-
12
-
-
0033189408
-
Memory hierarchy performance prediction for blocked sparse algorithms
-
B.B. Fraguela, R. Doallo, E.L. Zapata, Memory hierarchy performance prediction for blocked sparse algorithms, Parallel Processing Letters 9 (3) (1999) 347-360 (Pubitemid 30568721).
-
(1999)
Parallel Processing Letters
, vol.9
, Issue.3
, pp. 347-360
-
-
Fraguela, B.B.1
-
14
-
-
70749149210
-
A comparative study of blocking storage methods for sparse matrices on multicore architectures
-
V. Karakasis, G. Goumas, N. Koziris, A comparative study of blocking storage methods for sparse matrices on multicore architectures, in: Proceedings of IEEE International Conference on Computational Science and Engineering, 2009, pp. 247-256.
-
(2009)
Proceedings of IEEE International Conference on Computational Science and Engineering
, pp. 247-256
-
-
Karakasis, V.1
Goumas, G.2
Koziris, N.3
-
16
-
-
0029713939
-
Block algorithms for sparse matrix computations on high performance workstations
-
J.J. Navarro, E. García, J.L. Larriba-Pey, T. Juan, Block algorithms for sparse matrix computations on high performance workstations, in: Proceedings IEEE International Conference on Supercomputing, 1996, pp. 301-309.
-
(1996)
Proceedings IEEE International Conference on Supercomputing
, pp. 301-309
-
-
Navarro, J.J.1
-
17
-
-
0036734103
-
Effects of ordering strategies and programming paradigms on sparse matrix computations
-
L. Oliker, X. Li, P. Husbands, R. Biswas, Effects of ordering strategies and programming paradigms on sparse matrix computations, SIAM Review 44 (3) (2002) 373-393.
-
(2002)
SIAM Review
, vol.44
, Issue.3
, pp. 373-393
-
-
Oliker, L.1
Li, X.2
Husbands, P.3
Biswas, R.4
-
18
-
-
25644439819
-
Performance optimization of irregular codes based on the combination of reordering and blocking techniques
-
DOI 10.1016/j.parco.2005.04.012, PII S0167819105000803
-
J.C. Pichel, D.B. Heras, J.C. Cabaleiro, F.F. Rivera, Performance optimization of irregular codes based on the combination of reordering and blocking techniques, Parallel Computing 31 (8-9) (2005) 858-876 (Pubitemid 41383385).
-
(2005)
Parallel Computing
, vol.31
, Issue.8-9
, pp. 858-876
-
-
Pichel, J.C.1
Heras, D.B.2
Cabaleiro, J.C.3
Rivera, F.F.4
-
19
-
-
56349128909
-
Reordering algorithms for increasing locality on multicore processors
-
J.C. Pichel, D.E. Singh, J. Carretero, Reordering algorithms for increasing locality on multicore processors, in: Proceedings of the IEEE International Conference on High Performance Computing and Communications, 2008, pp. 123-130.
-
(2008)
Proceedings of the IEEE International Conference on High Performance Computing and Communications
, pp. 123-130
-
-
Pichel, J.C.1
Singh, D.E.2
Carretero, J.3
-
20
-
-
3042576437
-
Improving performance of sparse matrix-vector multiplication
-
A. Pinar, M. Heath, Improving performance of sparse matrix-vector multiplication, in: Proceedings of Supercomputing, 1999.
-
(1999)
Proceedings of Supercomputing
-
-
Pinar, A.1
Heath, M.2
-
21
-
-
78651284120
-
Scan primitives for GPU computing
-
S. Sengupta, M. Harris, Y. Zhang, J.D. Owens, Scan primitives for GPU computing, in: Proceedings of the 22nd ACM SIGGRAPH/EUROGRAPHICS Symposium on Graphics Hardware, 2007, pp. 97-106.
-
(2007)
Proceedings of the 22nd ACM SIGGRAPH/EUROGRAPHICS Symposium on Graphics Hardware
, pp. 97-106
-
-
Sengupta, S.1
Harris, M.2
Zhang, Y.3
Owens, J.D.4
-
22
-
-
1542710739
-
Sparse tiling for stationary iterative methods
-
M.M. Strout, L. Carter, J. Ferrante, B. Kreaseck, Sparse tiling for stationary iterative methods, International Journal of High Performance Computing Applications 18 (1) (2004) 95-113.
-
(2004)
International Journal of High Performance Computing Applications
, vol.18
, Issue.1
, pp. 95-113
-
-
Strout, M.M.1
Carter, L.2
Ferrante, J.3
Kreaseck, B.4
-
24
-
-
33646389518
-
Fast sparse matrix-vector multiplication by exploiting variable block structure
-
R. Vuduc, H. Moon, Fast sparse matrix-vector multiplication by exploiting variable block structure, in: High Performance Computing and Communications, Lecture Notes in Computer Science, vol. 3726, 2005, pp. 807-816.
-
(2005)
High Performance Computing and Communications, Lecture Notes in Computer Science
, vol.3726
, pp. 807-816
-
-
Vuduc, R.1
Moon, H.2
-
25
-
-
56749158843
-
Optimization of sparse matrix-vector multiply on emerging multicore platforms
-
S. Williams, L. Oliker, R. Vuduc, J. Shalf, K. Yelick, J. Demmel, Optimization of sparse matrix-vector multiply on emerging multicore platforms, in: Proceedings of the ACM/IEEE Conference on Supercomputing, 2007.
-
(2007)
Proceedings of the ACM/IEEE Conference on Supercomputing
-
-
Williams, S.1
Oliker, L.2
Vuduc, R.3
Shalf, J.4
Yelick, K.5
Demmel, J.6