SCOPUS Information Search Platform

Microprocessors and Microsystems

Volume 36, Issue 2, 2012, Pages 65-77

Optimization of sparse matrix-vector multiplication using reordering techniques on GPUs

(4) Pichel, Juan C. a; Rivera, Francisco F. a; Fernández, Marcos b; Rodríguez, Aurelio b

a University of Santiago de Compostela (Spain)

b Centro de Supercomputacion de Galicia (Spain)

Author keywords

GPUs; Optimization; Performance; Reordering; Sparse matrix

Indexed keywords

OPTIMIZATION; PROGRAM PROCESSORS;

COMMON STRATEGY; DOUBLE PRECISION; GPUS; PERFORMANCE; REORDERING; SPARSE MATRICES; SPARSE MATRIX-VECTOR MULTIPLICATION; STORAGE FORMATS;

MATRIX ALGEBRA;

EID: 84857332778 PISSN: 01419331 EISSN: None Source Type: Journal
DOI: 10.1016/j.micpro.2011.05.005 Document Type: Article

Times cited: (51)

References (25)

1
- 0030491606
- An approximate minimum degree ordering algorithm
- P.R. Amestoy, T.A. Davis, I.S. Duff, An approximate minimum degree ordering algorithm, SIAM Journal on Matrix Analysis and Applications 17 (4) (1996) 886-905.
- (1996) SIAM Journal on Matrix Analysis and Applications , vol.17 , Issue.4 , pp. 886-905
- Amestoy, P.R.¹ Davis, T.A.² Duff, I.S.³

2
- 74049163483
- Optimizing sparse matrix-vector multiplication on GPUs
- M.M. Baskaran, R. Bordawekar, Optimizing sparse matrix-vector multiplication on GPUs, Technical report, IBM Research Report RC24704 (W0812-047), 2008.
- (2008) Technical Report, IBM Research Report RC24704 (W0812-047)
- Baskaran, M.M.¹ Bordawekar, R.²

3
- 78650279432
- Pattern-based sparse matrix representation for memory-efficient SMVM kernels
- M. Belgin, G. Back, C.J. Ribbens, Pattern-based sparse matrix representation for memory-efficient SMVM kernels, in: Proceedings of the International Conference on Supercomputing, 2009, pp. 100-109.
- (2009) Proceedings of the International Conference on Supercomputing , pp. 100-109
- Belgin, M.¹ Back, G.² Ribbens, C.J.³

4
- 70350368872
- Efficient sparse matrix-vector multiplication on CUDA
- N. Bell, M. Garland, Efficient sparse matrix-vector multiplication on CUDA, Technical report, NVIDIA, 2008.
- (2008) Technical Report, NVIDIA
- Bell, N.¹ Garland, M.²

5
- 77956260008
- Implementing sparse matrix-vector multiplication on throughput-oriented processors
- N. Bell, M. Garland, Implementing sparse matrix-vector multiplication on throughput-oriented processors, in: Proceedings of ACM/IEEE Conference on Supercomputing (SC), 2009.
- (2009) Proceedings of ACM/IEEE Conference on Supercomputing (SC)
- Bell, N.¹ Garland, M.²

6
- 77953998137
- Sparse matrix solvers on the GPU: Conjugate gradients and multigrid
- J. Bolz, I. Farmer, E. Grinspun, P. Schröder, Sparse matrix solvers on the GPU: conjugate gradients and multigrid, in: Proceedings of ACM SIGGRAPH, 2005, pp. 171-178.
- (2005) Proceedings of ACM SIGGRAPH , pp. 171-178
- Bolz, J.¹

7
- 35549013711
- Performance optimization and modeling of blocked sparse kernels
- DOI 10.1177/1094342007083801
- A. Buttari, V. Eijkhout, J. Langou, S. Filippone, Performance optimization and modeling of blocked sparse kernels, International Journal of High Performance Computing Applications 21 (4) (2007) 467-484. (Pubitemid 350011340)
- (2007) International Journal of High Performance Computing Applications , vol.21 , Issue.4 , pp. 467-484
- Buttari, A.¹ Eijkhout, V.² Langou, J.³ Filippone, S.⁴

8
- 77749340082
- Model-driven autotuning of sparse matrix-vector multiply on GPUs
- J.W. Choi, A. Singh, R. Vuduc, Model-driven autotuning of sparse matrix-vector multiply on GPUs, in: Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2010, pp. 115-126.
- (2010) Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming , pp. 115-126
- Choi, J.W.¹ Singh, A.² Vuduc, R.³

9
- 33645913852
- Performance comparison of data-reordering algorithms for sparse matrix-vector multiplication in edge-based unstructured grid computations
- A.L.G.A. Coutinho, M.A.D. Martins, R.M. Sydenstricker, R.N. Elias, Performance comparison of data-reordering algorithms for sparse matrix-vector multiplication in edge-based unstructured grid computations, International Journal for Numerical Methods in Engineering 66 (3) (2006) 431-460.
- (2006) International Journal for Numerical Methods in Engineering , vol.66 , Issue.3 , pp. 431-460
- Coutinho, A.L.G.A.¹ Martins, M.A.D.² Sydenstricker, R.M.³ Elias, R.N.⁴

10
- 0001803542
- Several strategies for reducing the bandwidth of matrices
- E. Cuthill, J. McKee, Several strategies for reducing the bandwidth of matrices, in: Rose and Willoughby (Eds.), 1972.
- (1972) Several Strategies for Reducing the Bandwidth of Matrices
- Cuthill, E.¹ McKee, J.²

11
- 0013269731
- University of Florida sparse matrix collection
- T. Davis, University of Florida sparse matrix collection, NA Digest 97 (23) (1997). <http://www.cise.ufl.edu/research/sparse/matrices>
- (1997) NA Digest , vol.97 , Issue.23
- Davis, T.¹

12
- 0033189408
- Memory hierarchy performance prediction for blocked sparse algorithms
- B.B. Fraguela, R. Doallo, E.L. Zapata, Memory hierarchy performance prediction for blocked sparse algorithms, Parallel Processing Letters 9 (3) (1999) 347-360. (Pubitemid 30568721)
- (1999) Parallel Processing Letters , vol.9 , Issue.3 , pp. 347-360
- Fraguela, B.B.¹

13
- 1542501019
- SPARSITY: Framework for optimizing sparse matrix-vector multiply
- E.J. Im, K.A. Yelick, R. Vuduc, SPARSITY: framework for optimizing sparse matrix-vector multiply, International Journal of High Performance Computing Applications 18 (1) (2004) 135-158.
- (2004) International Journal of High Performance Computing Applications , vol.18 , Issue.1 , pp. 135-158
- Im, E.J.¹ Yelick, K.A.² Vuduc, R.³

14
- 70749149210
- A comparative study of blocking storage methods for sparse matrices on multicore architectures
- V. Karakasis, G. Goumas, N. Koziris, A comparative study of blocking storage methods for sparse matrices on multicore architectures, in: Proceedings of IEEE International Conference on Computational Science and Engineering, 2009, pp. 247-256.
- (2009) Proceedings of IEEE International Conference on Computational Science and Engineering , pp. 247-256
- Karakasis, V.¹ Goumas, G.² Koziris, N.³

15
- 0003734628
- G. Karypis, V. Kumar, METIS: a software package for partitioning unstructured graphs, partitioning meshes, and computing fill-reducing orderings of sparse matrices, 1997.
- (1997) METIS: A Software Package for Partitioning Unstructured Graphs, Partitioning Meshes, and Computing Fill-reducing Orderings of Sparse Matrices
- Karypis, G.¹ Kumar, V.²

16
- 0029713939
- Block algorithms for sparse matrix computations on high performance workstations
- J.J. Navarro, E. García, J.L. Larriba-Pey, T. Juan, Block algorithms for sparse matrix computations on high performance workstations, in: Proceedings IEEE International Conference on Supercomputing, 1996, pp. 301-309.
- (1996) Proceedings IEEE International Conference on Supercomputing , pp. 301-309
- Navarro, J.J.¹

17
- 0036734103
- Effects of ordering strategies and programming paradigms on sparse matrix computations
- L. Oliker, X. Li, P. Husbands, R. Biswas, Effects of ordering strategies and programming paradigms on sparse matrix computations, SIAM Review 44 (3) (2002) 373-393.
- (2002) SIAM Review , vol.44 , Issue.3 , pp. 373-393
- Oliker, L.¹ Li, X.² Husbands, P.³ Biswas, R.⁴

18
- 25644439819
- Performance optimization of irregular codes based on the combination of reordering and blocking techniques
- DOI 10.1016/j.parco.2005.04.012, PII S0167819105000803
- J.C. Pichel, D.B. Heras, J.C. Cabaleiro, F.F. Rivera, Performance optimization of irregular codes based on the combination of reordering and blocking techniques, Parallel Computing 31 (8-9) (2005) 858-876. (Pubitemid 41383385)
- (2005) Parallel Computing , vol.31 , Issue.8-9 , pp. 858-876
- Pichel, J.C.¹ Heras, D.B.² Cabaleiro, J.C.³ Rivera, F.F.⁴

19
- 56349128909
- Reordering algorithms for increasing locality on multicore processors
- J.C. Pichel, D.E. Singh, J. Carretero, Reordering algorithms for increasing locality on multicore processors, in: Proceedings of the IEEE International Conference on High Performance Computing and Communications, 2008, pp. 123-130.
- (2008) Proceedings of the IEEE International Conference on High Performance Computing and Communications , pp. 123-130
- Pichel, J.C.¹ Singh, D.E.² Carretero, J.³

20
- 3042576437
- Improving performance of sparse matrix-vector multiplication
- A. Pinar, M. Heath, Improving performance of sparse matrix-vector multiplication, in: Proceedings of Supercomputing, 1999.
- (1999) Proceedings of Supercomputing
- Pinar, A.¹ Heath, M.²

21
- 78651284120
- Scan primitives for GPU computing
- S. Sengupta, M. Harris, Y. Zhang, J.D. Owens, Scan primitives for GPU computing, in: Proceedings of the 22nd ACM SIGGRAPH/EUROGRAPHICS Symposium on Graphics Hardware, 2007, pp. 97-106.
- (2007) Proceedings of the 22nd ACM SIGGRAPH/EUROGRAPHICS Symposium on Graphics Hardware , pp. 97-106
- Sengupta, S.¹ Harris, M.² Zhang, Y.³ Owens, J.D.⁴

22
- 1542710739
- Sparse tiling for stationary iterative methods
- M.M. Strout, L. Carter, J. Ferrante, B. Kreaseck, Sparse tiling for stationary iterative methods, International Journal of High Performance Computing Applications 18 (1) (2004) 95-113.
- (2004) International Journal of High Performance Computing Applications , vol.18 , Issue.1 , pp. 95-113
- Strout, M.M.¹ Carter, L.² Ferrante, J.³ Kreaseck, B.⁴

23
- 0039958691
- Improving memory-system performance of sparse matrix-vector multiplication
- S. Toledo, Improving memory-system performance of sparse matrix-vector multiplication, in: Proceedings of the 8th SIAM Conference on Parallel Processing for Scientific Computing, March 1997.
- (1997) Proceedings of the 8th SIAM Conference on Parallel Processing for Scientific Computing
- Toledo, S.¹

24
- 33646389518
- Fast sparse matrix-vector multiplication by exploiting variable block structure
- R. Vuduc, H. Moon, Fast sparse matrix-vector multiplication by exploiting variable block structure, in: High Performance Computing and Communications, Lecture Notes in Computer Science, vol. 3726, 2005, pp. 807-816.
- (2005) High Performance Computing and Communications, Lecture Notes in Computer Science , vol.3726 , pp. 807-816
- Vuduc, R.¹ Moon, H.²

25
- 56749158843
- Optimization of sparse matrix-vector multiply on emerging multicore platforms
- S. Williams, L. Oliker, R. Vuduc, J. Shalf, K. Yelick, J. Demmel, Optimization of sparse matrix-vector multiply on emerging multicore platforms, in: Proceedings of the ACM/IEEE Conference on Supercomputing, 2007.
- (2007) Proceedings of the ACM/IEEE Conference on Supercomputing
- Williams, S.¹ Oliker, L.² Vuduc, R.³ Shalf, J.⁴ Yelick, K.⁵ Demmel, J.⁶

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.