메뉴 건너뛰기




Volumn 23, Issue 8, 2011, Pages 815-826

A new approach for sparse matrix vector product on NVIDIA GPUs

Author keywords

GPU; high performance computing; sparse matrix vector product

Indexed keywords

COMPUTER GRAPHICS; DIGITAL STORAGE; GRAPHICS PROCESSING UNIT; PROGRAM PROCESSORS;

EID: 79955614550     PISSN: 15320626     EISSN: 15320634     Source Type: Journal    
DOI: 10.1002/cpe.1658     Document Type: Article
Times cited : (113)

References (20)
  • 3
    • 0031269220 scopus 로고    scopus 로고
    • Improving the memory-system performance of sparse-matrix vector multiplication
    • Toledo S,. Improving the memory-system performance of sparse-matrix vector multiplication. IBM Journal of Research and Development 1997; 41 (6): 711-725. (Pubitemid 127557044)
    • (1997) IBM Journal of Research and Development , vol.41 , Issue.6 , pp. 711-725
    • Toledo, S.1
  • 5
    • 60949098907 scopus 로고    scopus 로고
    • Optimization of sparse matrix-vector multiplication on emerging multicore platforms
    • Williams S, Oliker L, Vuduc R, Shalf J, Yelick K, Demmel J,. Optimization of sparse matrix-vector multiplication on emerging multicore platforms. Parallel Computing 2009; 35 (3): 178-194.
    • (2009) Parallel Computing , vol.35 , Issue.3 , pp. 178-194
    • Williams, S.1    Oliker, L.2    Vuduc, R.3    Shalf, J.4    Yelick, K.5    Demmel, J.6
  • 8
    • 79955587157 scopus 로고    scopus 로고
    • NVIDIA, PG-00000-002.V2.1 September 2008. Available at: [June ]
    • NVIDIA. CUDA CUBLAS Library. PG-00000-002.V2.1 September 2008. Available at: [June 2009 ].
    • (2009) CUDA CUBLAS Library
  • 9
    • 77950518538 scopus 로고    scopus 로고
    • A matrix approach to tomographic reconstruction and its implementation on GPUs
    • Vazquez F, Garzon EM, Fernandez JJ,. A matrix approach to tomographic reconstruction and its implementation on GPUs. Journal of Structural Biology 2010; 170: 146-151.
    • (2010) Journal of Structural Biology , vol.170 , pp. 146-151
    • Vazquez, F.1    Garzon, E.M.2    Fernandez, J.J.3
  • 10
    • 60649099576 scopus 로고    scopus 로고
    • Optimizing matrix multiplication for a short-vector SIMD architectureâCELL processor
    • Kurzak J, Alvaro W, Dongarra J,. Optimizing matrix multiplication for a short-vector SIMD architectureâCELL processor. Parallel Computing 2009; 35 (3): 138-150.
    • (2009) Parallel Computing , vol.35 , Issue.3 , pp. 138-150
    • Kurzak, J.1    Alvaro, W.2    Dongarra, J.3
  • 11
    • 34547309668 scopus 로고    scopus 로고
    • Version 2.3 NVIDIA, August 2009. Available at: [August ]
    • NVIDIA. CUDA Programming guide, Version 2.3, August 2009. Available at: [August 2009 ].
    • (2009) CUDA Programming Guide
  • 13
    • 74049143158 scopus 로고    scopus 로고
    • Implementing sparse matrix-vector multiplication on throughput-oriented processors
    • Storage and Analysis Available at: [December ]
    • Bell N, Garland M,. Implementing sparse matrix-vector multiplication on throughput-oriented processors. Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, vol. 18, 2009. Available at: [December 2009 ].
    • (2009) Proceedings of the Conference on High Performance Computing Networking , vol.18
    • Bell, N.1    Garland, M.2
  • 18
    • 77953136155 scopus 로고    scopus 로고
    • Intel, Reference Manual. Available at: [June ]
    • Intel. Math Kernel Library. Reference Manual. Available at: [June 2009 ].
    • (2009) Math Kernel Library
  • 20
    • 0004172155 scopus 로고    scopus 로고
    • Available at: [June ]
    • The Matrix Market. Available at: [June 2009 ].
    • (2009) The Matrix Market


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.