Volume 5952 LNCS, 2010, Pages 111-125

Automatically tuning sparse matrix-vector multiplication for GPU architectures

Author keywords

[No Author keywords available]

Indexed keywords

AUTOMATIC SPECIALIZATION; COMPUTATIONAL POWER; EXPERIMENTAL EVALUATION; GRAPHICS PROCESSOR; LOW MEMORY; MEMORY HIERARCHY; MEMORY REFERENCES; PARAMETER-TUNING; SCIENTIFIC APPLICATIONS; SPARSE MATRICES; SPARSE MATRIX COMPUTATIONS; SPARSE MATRIX-VECTOR MULTIPLICATION; STORAGE FORMATS; UNSTRUCTURED GRID;

EID: 77949577730     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-642-11515-8_10     Document Type: Conference Paper
Times cited: 234

References (11)
  • 1
    • 77949577190
    • Asanovic, K., Bodik, R., Catanzaro, B.C., Gebis, J.J., Husbands, P., Keutzer, K., Patterson, D.A., Plishker, W.L., Shalf, J., Williams, S.W., Yelick, K.A.: The landscape of parallel computing research: A view from Berkeley. Technical Report UCB/EECS-2006-183, EECS Department, University of California, Berkeley (December 2006)
  • 2
    • 74049163483
    • Baskaran, M.M., Bordawekar, R.: Optimizing sparse matrix-vector multiplication on GPUs. Technical report, IBM TJ Watson Research Center (2009)
  • 3
    • 70350368872
    • Bell, N., Garland, M.: Efficient sparse matrix-vector multiplication on CUDA. NVIDIA Technical Report NVR-2008-004 (2008)
  • 4
    • 38149066031
    • Buatois, L., Caumon, G., Lévy, B.: Concurrent number cruncher: An efficient sparse linear solver on the GPU. In: Perrott, R., Chapman, B.M., Subhlok, J., de Mello, R.F., Yang, L.T. (eds.) HPCC 2007. LNCS, vol. 4782, pp. 358-371. Springer, Heidelberg (2007)
  • 5
    • 77949643882
    • Kincaid, D.R., Oppe, T.C., Young, D.M.: ITPACKV 2D User's Guide
  • 7
    • 77949622683
    • NVIDIA Corporation. NVIDIA CUDA Programming Guide 2.2 (2009)
  • 8
    • 77949613205
    • Vázquez, F., Garzón, E.M., Martínez, J.A., Fernández, J.J.: The sparse matrix vector product on GPUs. Technical report, University of Almeria (2009)
  • 11
    • 56749158843
    • Williams, S., Oliker, L., Vuduc, R.W., Shalf, J., Yelick, K.A., Demmel, J.: Optimization of sparse matrix-vector multiplication on emerging multicore platforms. In: SC, p. 38 (2007)


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.