메뉴 건너뛰기




Volumn 25, Issue 5, 2014, Pages 1112-1123

A performance modeling and optimizationanalysis tool for sparse matrix-vectormultiplication on GPUs

Author keywords

CUDA; GPU; Performance modeling; sparse matrix vector multiplication

Indexed keywords

EXPERIMENTS; OPTIMIZATION; PROGRAM PROCESSORS; TOOLS;

EID: 84898682038     PISSN: 10459219     EISSN: None     Source Type: Journal    
DOI: 10.1109/TPDS.2013.123     Document Type: Article
Times cited : (71)

References (27)
  • 5
    • 0242533311 scopus 로고    scopus 로고
    • Sparse matrixsolvers on the gpu: Conjugate gradients and multigrid
    • J. Bolz, I. Farmer, E. Grinspun, and P. Schroder, "Sparse MatrixSolvers on The GPU: Conjugate Gradients and Multigrid," ACMTrans. Graphics, vol. 22, no. 3, pp. 917-924, 2003.
    • (2003) ACMTrans. Graphics , vol.22 , Issue.3 , pp. 917-924
    • Bolz, J.1    Farmer, I.2    Grinspun, E.3    Schroder, P.4
  • 6
    • 84898683364 scopus 로고    scopus 로고
    • NVIDIA CUDA C Programming Guide, Version 4.0, May 2011
    • NVIDIA CUDA C Programming Guide, Version 4.0, May 2011.
  • 7
    • 60649099576 scopus 로고    scopus 로고
    • Optimizing matrix multiplicationfor a short-vector simd architecture-cell processor
    • J. Kurzak, W. Alvaro, and J. Dongarra, "Optimizing Matrix Multiplicationfor a Short-Vector Simd Architecture-Cell Processor,"J. Parallel Computing, vol. 35, no. 3, pp. 138-150, 2009.
    • (2009) J. Parallel Computing , vol.35 , Issue.3 , pp. 138-150
    • Kurzak, J.1    Alvaro, W.2    Dongarra, J.3
  • 16
    • 84857332778 scopus 로고    scopus 로고
    • Optimization of sparse matrix-vector multiplication using reorderingtechniques on GPUs
    • J.C. Pichel, F.F. Rivera, M. Fernandez, and A. Rodriguez, "Optimization of Sparse Matrix-Vector Multiplication Using ReorderingTechniques on GPUs," Microprocessors and Microsystems,vol. 36, no. 2, pp. 65-77, 2012.
    • (2012) Microprocessors and Microsystems , vol.36 , Issue.2 , pp. 65-77
    • Pichel, J.C.1    Rivera, F.F.2    Fernandez, M.3    Rodriguez, A.4
  • 17
    • 84862123284 scopus 로고    scopus 로고
    • Fast sparsematrix-vector multiplication on GPUs: Implications for graphmining
    • Jan.
    • X. Yang, S. Parthasarathy, and P. Sadayappan, "Fast SparseMatrix-Vector Multiplication on GPUs: Implications for GraphMining," Proc. VLDB Endowment, vol. 4, no. 4, pp. 231-242, Jan.2011.
    • (2011) Proc. VLDB Endowment , vol.4 , Issue.4 , pp. 231-242
    • Yang, X.1    Parthasarathy, S.2    Sadayappan, P.3
  • 21
    • 84886727304 scopus 로고    scopus 로고
    • Performance modeling and optimizationof sparse matrix-vector multiplication on NVIDIA CUDAPlatform
    • S. Xu, W. Xue, and H. Lin, "Performance Modeling and Optimizationof Sparse Matrix-Vector Multiplication on NVIDIA CUDAPlatform," J. Supercomputing, vol. 63, pp. 710-721, 2013.
    • (2013) J. Supercomputing , vol.63 , pp. 710-721
    • Xu, S.1    Xue, W.2    Lin, H.3
  • 24
    • 70450231944 scopus 로고    scopus 로고
    • An analytical model for a gpu architecturewith memory-level and thread-level parallelismawareness
    • S. Hong and H. Kim, "An Analytical Model for a GPU Architecturewith Memory-Level and Thread-Level ParallelismAwareness," Proc. 36th ACM Ann. Int'l Symp. Computer Architecture(ISCA '09), pp. 152-163, 2009.
    • (2009) Proc. 36th ACM Ann. Int'l Symp. Computer Architecture(ISCA '09) , pp. 152-163
    • Hong, S.1    Kim, H.2
  • 26
    • 81355161778 scopus 로고    scopus 로고
    • The university of florida sparse matrixcollection
    • T.A. Davis and Y. Hu, "The University of Florida Sparse MatrixCollection," ACM Trans. Math. Software, vol. 38, no. 1, pp. 1:1-1:25,2011.
    • (2011) ACM Trans. Math. Software , vol.38 , Issue.1 , pp. 11-125
    • Davis, T.A.1    Hu, Y.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.