SCOPUS 정보 검색 플랫폼

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volumn 5952 LNCS, Issue , 2010, Pages 111-125

Automatically tuning sparse matrix-vector multiplication for GPU architectures

(3) Monakov, Alexander a Lokhmotov, Anton b Avetisyan, Arutyun a

a INSTITUTE FOR SYSTEM PROGRAMMING (Russian Federation)

b IMPERIAL COLLEGE LONDON (United Kingdom)

Author keywords

[No Author keywords available]

Indexed keywords

AUTOMATIC SPECIALIZATION; COMPUTATIONAL POWER; EXPERIMENTAL EVALUATION; GRAPHICS PROCESSOR; LOW MEMORY; MEMORY HIERARCHY; MEMORY REFERENCES; PARAMETER-TUNING; SCIENTIFIC APPLICATIONS; SPARSE MATRICES; SPARSE MATRIX COMPUTATIONS; SPARSE MATRIX-VECTOR MULTIPLICATION; STORAGE FORMATS; UNSTRUCTURED GRID;

COMPUTATIONAL EFFICIENCY; MATRIX ALGEBRA; TUNING;

PROGRAM COMPILERS;

EID: 77949577730 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-642-11515-8_10 Document Type: Conference Paper

Times cited : (234)

References (11)

1
- 77949577190
- Asanovic, K., Bodik, R., Catanzaro, B.C., Gebis, J.J., Husbands, P., Keutzer, K., Patterson, D.A., Plishker, W.L., Shalf, J., Williams, S.W., Yelick, K.A.: The landscape of parallel computing research: A view from Berkeley. Technical Report UCB/EECS-2006-183, EECS Department, University of California, Berkeley (December 2006)
- Asanovic, K., Bodik, R., Catanzaro, B.C., Gebis, J.J., Husbands, P., Keutzer, K., Patterson, D.A., Plishker, W.L., Shalf, J., Williams, S.W., Yelick, K.A.: The landscape of parallel computing research: A view from Berkeley. Technical Report UCB/EECS-2006-183, EECS Department, University of California, Berkeley (December 2006)

2
- 74049163483
- Optimizing sparse matrix-vector multiplication on GPUs
- Technical report, IBM TJ Watson Research Center
- Baskaran, M.M., Bordawekar, R.: Optimizing sparse matrix-vector multiplication on GPUs. Technical report, IBM TJ Watson Research Center (2009)
- (2009)
- Baskaran, M.M.¹ Bordawekar, R.²

3
- 70350368872
- Efficient sparse matrix-vector multiplication on CUDA
- NVR-2008-004
- Bell, N., Garland, M.: Efficient sparse matrix-vector multiplication on CUDA. NVIDIA Technical Report NVR-2008-004 (2008)
- (2008) NVIDIA Technical Report
- Bell, N.¹ Garland, M.²

4
- 38149066031
- Buatois, L., Caumon, G., Lévy, B.: Concurrent number cruncher: An efficient sparse linear solver on the GPU. In: Perrott, R., Chapman, B.M., Subhlok, J., de Mello, R.F., Yang, L.T. (eds.) HPCC 2007. LNCS, 4782, pp. 358-371. Springer, Heidelberg (2007)
- Buatois, L., Caumon, G., Lévy, B.: Concurrent number cruncher: An efficient sparse linear solver on the GPU. In: Perrott, R., Chapman, B.M., Subhlok, J., de Mello, R.F., Yang, L.T. (eds.) HPCC 2007. LNCS, vol. 4782, pp. 358-371. Springer, Heidelberg (2007)

5
- 77949643882
- Kincaid, D.R., Oppe, T.C., Young, D.M.: ITPACKV 2D User's Guide
- Kincaid, D.R., Oppe, T.C., Young, D.M.: ITPACKV 2D User's Guide

6
- 70350356359
- SAMOS, pp
- Monakov, A., Avetisyan, A.: Implementing blocked sparse matrix-vector multiplication on NVIDIA GPUs. In: SAMOS, pp. 289-297 (2009)
- (2009) Implementing blocked sparse matrix-vector multiplication on NVIDIA GPUs , pp. 289-297
- Monakov, A.¹ Avetisyan, A.²

7
- 77949622683
- NVIDIA Corporation. NVIDIA CUDA Programming Guide 2.2 (2009)
- NVIDIA Corporation. NVIDIA CUDA Programming Guide 2.2 (2009)

8
- 77949613205
- The sparse matrix vector product on GPUs
- Technical report, University of Almeria
- Vázquez, F., Garzón, E.M., Martnez, J.A., Fernández, J.J.: The sparse matrix vector product on GPUs. Technical report, University of Almeria (2009)
- (2009)
- Vázquez, F.¹ Garzón, E.M.² Martnez, J.A.³ Fernández, J.J.⁴

9
- 70350771131
- Benchmarking GPUs to tune dense linear algebra
- IEEE Press, Los Alamitos
- Volkov, V., Demmel, J.W.: Benchmarking GPUs to tune dense linear algebra. In: SC 2008: Proceedings of the 2008 ACM/IEEE conference on Supercomputing, pp. 1-11. IEEE Press, Los Alamitos (2008)
- (2008) SC 2008: Proceedings of the 2008 ACM/IEEE conference on Supercomputing , pp. 1-11
- Volkov, V.¹ Demmel, J.W.²

10
- 10044233808
- PhD thesis, University of California, Berkeley , Chair-Demmel, J.W
- Vuduc, R.W.: Automatic performance tuning of sparse matrix kernels, PhD thesis, University of California, Berkeley (2003); Chair-Demmel, J.W.
- (2003) Automatic performance tuning of sparse matrix kernels
- Vuduc, R.W.¹

11
- 56749158843
- Williams, S., Oliker, L., Vuduc, R.W., Shalf, J., Yelick, K.A., Demmel, J.: Optimization of sparse matrix-vector multiplication on emerging multicore platforms. In: SC, p. 38 (2007)
- Williams, S., Oliker, L., Vuduc, R.W., Shalf, J., Yelick, K.A., Demmel, J.: Optimization of sparse matrix-vector multiplication on emerging multicore platforms. In: SC, p. 38 (2007)

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.