메뉴 건너뛰기




Volumn 4, Issue , 2010, Pages 109-113

Optimizing sparse matrix-vector multiplication on CUDA

Author keywords

CUDA; GPUs; NVIDIA's CUDDPA library; NVIDIA's SpMV library; SpMV

Indexed keywords

PROGRAM PROCESSORS;

EID: 77956072107     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICETC.2010.5529724     Document Type: Article
Times cited : (18)

References (12)
  • 2
    • 70350368872 scopus 로고    scopus 로고
    • Efficient sparse matrix-vector multiplication on CUDA
    • NVIDIA Corporation, Dec.
    • N.Bell and M. Garland. Efficient sparse matrix-vector multiplication on CUDA. NVIDIA Technical Report NVR-2008-3004, NVIDIA Corporation, Dec. 2008.
    • (2008) NVIDIA Technical Report NVR-2008-3004
    • Bell, N.1    Garland, M.2
  • 4
    • 70350356359 scopus 로고    scopus 로고
    • Implementing blocked sparse matrix-vector multiplication on NVIDIA GPUs
    • Springer, Heidelberg
    • Alexander Monakov and Arutyun Avetisyan. "Implementing Blocked Sparse Matrix-Vector Multiplication on NVIDIA GPUs", LNCS, vo1.5657, pp. 289-297. Springer, Heidelberg (2009).
    • (2009) LNCS , vol.5657 , pp. 289-297
    • Monakov, A.1    Avetisyan, A.2
  • 5
    • 78651269052 scopus 로고    scopus 로고
    • Understanding the efficiency of GPU algorithms for matrix-matrix multiplications
    • Aug. 2004
    • FATAHAIAN K., SUGERMAN J., HANRAHAN P.: Understanding the efficiency of GPU algorithms for matrix-matrix multiplications. In Graphics Hardware 2004 (Aug. 2004), pp. 133-138.
    • Graphics Hardware 2004 , pp. 133-138
    • Fatahaian, K.1    Sugerman, J.2    Hanrahan, P.3
  • 8
    • 0030082727 scopus 로고    scopus 로고
    • Automatic data structure selection and transformation for sparse matrix computations
    • February
    • Aart J. C. Bik and Harry A. G. Wijshoff. Automatic data structure selection and transformation for sparse matrix computations. IEEE Transactions on Parallel and Distributed Systems, 7(2): 109-126, February 1996.
    • (1996) IEEE Transactions on Parallel and Distributed Systems , vol.7 , Issue.2 , pp. 109-126
    • Bik, A.J.C.1    Wijshoff, H.A.G.2
  • 10
    • 38149066031 scopus 로고    scopus 로고
    • Concurrent number cruncher: An efficient sparse linear solver on the GPU
    • Perrott, R, Chapman, B.M., Subhlok, J., de Mello, R.F., Yang, L.T. (eds.) HPCC 2007, Springer, Heidelberg
    • Buatois, L., Caumon, G., Levy, B.: Concurrent number cruncher: An efficient sparse linear solver on the GPU. In: Perrott, R, Chapman, B.M., Subhlok, J., de Mello, R.F., Yang, L.T. (eds.) HPCC 2007. LNCS, vol.4782, pp. 358-371. Springer, Heidelberg (2007).
    • (2007) LNCS , vol.4782 , pp. 358-371
    • Buatois, L.1    Caumon, G.2    Levy, B.3
  • 12
    • 77956077880 scopus 로고    scopus 로고
    • http://www.cise.urf.edulresearch/sparse/matrices/


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.