메뉴 건너뛰기




Volumn , Issue , 2013, Pages

Accelerating sparse matrix-vector multiplication on GPUs using bit-representation-optimized schemes

Author keywords

Data compression; GPU; Matrix vector multiplication; Memory bandwidth; Parallelism; Sparse matrix format

Indexed keywords

CLUSTERING ALGORITHMS; DIGITAL STORAGE; MATRIX ALGEBRA; MULTIPROCESSING SYSTEMS; OPTIMIZATION; PROGRAM PROCESSORS;

EID: 84899694907     PISSN: 21674329     EISSN: 21674337     Source Type: Conference Proceeding    
DOI: 10.1145/2503210.2503234     Document Type: Conference Paper
Times cited : (49)

References (27)
  • 1
    • 0030491606 scopus 로고    scopus 로고
    • An approximate minimum degree ordering algorithm
    • Oct.
    • P. R. Amestoy, T. A. Davis, and I. S. Du-. An approximate minimum degree ordering algorithm. SIAM J. Matrix Anal. Appl., 17(4):886-905, Oct. 1996.
    • (1996) SIAM J. Matrix Anal. Appl. , vol.17 , Issue.4 , pp. 886-905
    • Amestoy, P.R.1    Davis, T.A.2    Du, I.S.3
  • 5
    • 84899687916 scopus 로고    scopus 로고
    • Cusp library
    • N. Bell and M. Garland. Cusp library, 2012. http://cusp-library. googlecode. com.
    • (2012)
    • Bell, N.1    Garland, M.2
  • 7
    • 81355161778 scopus 로고    scopus 로고
    • The University of Florida sparse matrix collection
    • Dec.
    • T. A. Davis and Y. Hu. The University of Florida sparse matrix collection. ACM Trans. Math. Softw., 38(1):1:1-1:25, Dec. 2011. http://www. cise. u. edu/research/sparse/matrices/.
    • (2011) ACM Trans. Math. Softw. , vol.38 , Issue.1 , pp. 11-125
    • Davis, T.A.1    Hu, Y.2
  • 11
    • 84864039129 scopus 로고    scopus 로고
    • Automatically generating and tuning GPU code for sparse matrix-vector multiplication from a high-level representation
    • New York, NY, USA
    • D. Grewe and A. Lokhmotov. Automatically generating and tuning GPU code for sparse matrix-vector multiplication from a high-level representation. In Proceedings of the Fourth Workshop on General Purpose Processing on Graphics Processing Units, GPGPU-4, pages 12:1-12:8, New York, NY, USA, 2011.
    • (2011) Proceedings of the Fourth Workshop on General Purpose Processing on Graphics Processing Units , vol.GPGPU-4 , pp. 121-128
    • Grewe, D.1    Lokhmotov, A.2
  • 14
    • 77950369345 scopus 로고    scopus 로고
    • Data clustering: 50 years beyond k-means
    • June
    • A. K. Jain. Data clustering: 50 years beyond k-means. Pattern Recogn. Lett., 31(8):651-666, June 2010.
    • (2010) Pattern Recogn. Lett. , vol.31 , Issue.8 , pp. 651-666
    • Jain, A.K.1
  • 17
    • 2942628343 scopus 로고    scopus 로고
    • Optimizing sparse matrix-vector product computations using unroll and jam
    • May
    • J. Mellor-Crummey and J. Garvin. Optimizing sparse matrix-vector product computations using unroll and jam. Int. J. High Perform. Comput. Appl., 18(2):225-236, May 2004.
    • (2004) Int. J. High Perform. Comput. Appl. , vol.18 , Issue.2 , pp. 225-236
    • Mellor-Crummey, J.1    Garvin, J.2
  • 21
    • 24144467633 scopus 로고    scopus 로고
    • Iterative methods for sparse linear systems
    • Philadelphia, PA, USA, 2nd edition
    • Y. Saad. Iterative Methods for Sparse Linear Systems. Society for Industrial and Applied Mathematics, Philadelphia, PA, USA, 2nd edition, 2003.
    • (2003) Society for Industrial and Applied Mathematics
    • Saad, Y.1
  • 23
    • 79955614550 scopus 로고    scopus 로고
    • A new approach for sparse matrix vector product on NVIDIA GPUs
    • June
    • F. Vazquez, J. J. Fernandez, and E. M. Garzon. A new approach for sparse matrix vector product on NVIDIA GPUs. Concurr. Comput.: Pract. Exper., 23(8):815-826, June 2011.
    • (2011) Concurr. Comput.: Pract. Exper. , vol.23 , Issue.8 , pp. 815-826
    • Vazquez, F.1    Fernandez, J.J.2    Garzon, E.M.3
  • 24
    • 24344485098 scopus 로고    scopus 로고
    • OSKI: A library of automatically tuned sparse matrix kernels
    • R. Vuduc, J. W. Demmel, and K. A. Yelick. OSKI: A library of automatically tuned sparse matrix kernels. In Proc. SciDAC, J. Physics: Conf. Ser., volume 16, pages 521-530, 2005.
    • (2005) Proc. SciDAC, J. Physics: Conf. Ser. , vol.16 , pp. 521-530
    • Vuduc, R.1    Demmel, J.W.2    Yelick, K.A.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.