Volume 36, Issue 2, 2012, Pages 65-77

Optimization of sparse matrix-vector multiplication using reordering techniques on GPUs

Author keywords

GPUs; Optimization; Performance; Reordering; Sparse matrix

Indexed keywords

OPTIMIZATION; PROGRAM PROCESSORS;

EID: 84857332778     PISSN: 01419331     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.micpro.2011.05.005     Document Type: Article
Times cited: 51

References (25)
  • 4
    • 70350368872
    • Efficient sparse matrix-vector multiplication on CUDA
    • N. Bell, M. Garland, Efficient sparse matrix-vector multiplication on CUDA, Technical report, NVIDIA, 2008.
    • (2008) Technical Report, NVIDIA
    • Bell, N.1    Garland, M.2
  • 6
    • 77953998137
    • Sparse matrix solvers on the GPU: Conjugate gradients and multigrid
    • J. Bolz, I. Farmer, E. Grinspun, P. Schröder, Sparse matrix solvers on the GPU: conjugate gradients and multigrid, in: Proceedings of ACM SIGGRAPH, 2005, pp. 171-178.
    • (2005) Proceedings of ACM SIGGRAPH , pp. 171-178
    • Bolz, J.1
  • 11
    • 0013269731
    • University of Florida sparse matrix collection
    • T. Davis, University of Florida sparse matrix collection, NA Digest 97 (23) (1997), <http://www.cise.ufl.edu/research/sparse/matrices>
    • (1997) NA Digest , vol.97 , Issue.23
    • Davis, T.1
  • 12
    • 0033189408
    • Memory hierarchy performance prediction for blocked sparse algorithms
    • B.B. Fraguela, R. Doallo, E.L. Zapata, Memory hierarchy performance prediction for blocked sparse algorithms, Parallel Processing Letters 9 (3) (1999) 347-360. (Pubitemid 30568721)
    • (1999) Parallel Processing Letters , vol.9 , Issue.3 , pp. 347-360
    • Fraguela, B.B.1
  • 16
    • 0029713939
    • Block algorithms for sparse matrix computations on high performance workstations
    • J.J. Navarro, E. García, J.L. Larriba-Pey, T. Juan, Block algorithms for sparse matrix computations on high performance workstations, in: Proceedings IEEE International Conference on Supercomputing, 1996, pp. 301-309.
    • (1996) Proceedings IEEE International Conference on Supercomputing , pp. 301-309
    • Navarro, J.J.1
  • 17
    • 0036734103
    • Effects of ordering strategies and programming paradigms on sparse matrix computations
    • L. Oliker, X. Li, P. Husbands, R. Biswas, Effects of ordering strategies and programming paradigms on sparse matrix computations, SIAM Review 44 (3) (2002) 373-393.
    • (2002) SIAM Review , vol.44 , Issue.3 , pp. 373-393
    • Oliker, L.1    Li, X.2    Husbands, P.3    Biswas, R.4
  • 18
    • 25644439819
    • Performance optimization of irregular codes based on the combination of reordering and blocking techniques
    • DOI 10.1016/j.parco.2005.04.012, PII S0167819105000803
    • J.C. Pichel, D.B. Heras, J.C. Cabaleiro, F.F. Rivera, Performance optimization of irregular codes based on the combination of reordering and blocking techniques, Parallel Computing 31 (8-9) (2005) 858-876. (Pubitemid 41383385)
    • (2005) Parallel Computing , vol.31 , Issue.8-9 , pp. 858-876
    • Pichel, J.C.1    Heras, D.B.2    Cabaleiro, J.C.3    Rivera, F.F.4
  • 20
    • 3042576437
    • Improving performance of sparse matrix-vector multiplication
    • A. Pinar, M. Heath, Improving performance of sparse matrix-vector multiplication, in: Proceedings of Supercomputing, 1999.
    • (1999) Proceedings of Supercomputing
    • Pinar, A.1    Heath, M.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.