메뉴 건너뛰기




Volumn , Issue , 2012, Pages 47-56

High-performance sparse matrix-vector multiplication on GPUs for structured grid computations

Author keywords

[No Author keywords available]

Indexed keywords

BLOCK STRUCTURES; COMPRESSED SPARSE ROW; CRITICAL STEPS; DIAGONAL STRUCTURE; DISCRETIZATIONS; GRID NODE; GRID POINTS; HIGHER-DEGREE; ITERATIVE SOLUTIONS; REGULAR STRUCTURE; SPARSE LINEAR SYSTEMS; SPARSE MATRICES; SPARSE MATRIX-VECTOR MULTIPLICATION; STORAGE FORMATS; STRUCTURED GRID; UNIFORM GRIDS;

EID: 84858763464     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2159430.2159436     Document Type: Conference Paper
Times cited : (28)

References (29)
  • 1
    • 0343228755 scopus 로고    scopus 로고
    • Elements of computational fluid dynamics on block structured grids using implicit solvers
    • BADCOCK, K., RICHARDS, B., AND WOODGATE, M. Elements of computational fluid dynamics on block structured grids using implicit solvers. Progress in Aerospace Sciences 36, 5-6 (2000), 351-392.
    • (2000) Progress in Aerospace Sciences , vol.36 , Issue.5-6 , pp. 351-392
    • Badcock, K.1    Richards, B.2    Woodgate, M.3
  • 9
    • 25144499116 scopus 로고    scopus 로고
    • Vectorized sparse matrix multiply for compressed row storage format
    • Computational Science - ICCS 2005, V. Sunderam, G. van Albada, P. Sloot, and J. Dongarra, Eds., Springer Berlin / Heidelberg, 10.1007/11428831-13
    • D'AZEVEDO, E., FAHEY, M., AND MILLS, R. Vectorized sparse matrix multiply for compressed row storage format. In Computational Science - ICCS 2005, V. Sunderam, G. van Albada, P. Sloot, and J. Dongarra, Eds., vol. 3514 of Lecture Notes in Computer Science. Springer Berlin / Heidelberg, 2005, pp. 785-789. 10.1007/11428831-13.
    • (2005) Lecture Notes in Computer Science , vol.3514 , pp. 785-789
    • D'Azevedo, E.1    Fahey, M.2    Mills, R.3
  • 10
    • 0142215175 scopus 로고    scopus 로고
    • An alternative compressed storage format for sparse matrices
    • Computer and Information Sciences - ISCIS 2003, A. Yazici and C. Sener, Eds., Springer Berlin / Heidelberg, 10.1007/978-3-540-39737-3-25
    • EKAMBARAM, A., AND MONTAGNE, E. An alternative compressed storage format for sparse matrices. In Computer and Information Sciences - ISCIS 2003, A. Yazici and C. Sener, Eds., vol. 2869 of Lecture Notes in Computer Science. Springer Berlin / Heidelberg, 2003, pp. 196-203. 10.1007/978-3-540-39737-3-25.
    • (2003) Lecture Notes in Computer Science , vol.2869 , pp. 196-203
    • Ekambaram, A.1    Montagne, E.2
  • 14
    • 35248817438 scopus 로고    scopus 로고
    • The Cactus framework and toolkit: Design and applications
    • Vector and Parallel Processing - VECPAR'2002, 5th International Conference, Berlin, Springer
    • GOODALE, T., ALLEN, G., LANFERMANN, G., MASSÓ, J., RADKE, T., SEIDEL, E., AND SHALF, J.The Cactus framework and toolkit: Design and applications. In Vector and Parallel Processing - VECPAR'2002, 5th International Conference, Lecture Notes in Computer Science (Berlin, 2003), Springer.
    • (2003) Lecture Notes in Computer Science
    • Goodale, T.1    Allen, G.2    Lanfermann, G.3    Massó, J.4    Radke, T.5    Seidel, E.6    Shalf, J.7
  • 16
    • 84949647432 scopus 로고    scopus 로고
    • Optimizing sparse matrix computations for register reuse in sparsity
    • Computational Science - ICCS 2001, V. Alexandrov, J. Dongarra, B. Juliano, R. Renner, and C. Tan, Eds., Springer Berlin / Heidelberg, 10.1007/3-540-45545-0-22
    • IM, E.-J., AND YELICK, K. Optimizing sparse matrix computations for register reuse in sparsity. In Computational Science - ICCS 2001, V. Alexandrov, J. Dongarra, B. Juliano, R. Renner, and C. Tan, Eds., vol. 2073 of Lecture Notes in Computer Science. Springer Berlin / Heidelberg, 2001, pp. 127-136. 10.1007/3-540-45545-0-22.
    • (2001) Lecture Notes in Computer Science , vol.2073 , pp. 127-136
    • Im, E.-J.1    Yelick, K.2
  • 18
    • 70349100958 scopus 로고    scopus 로고
    • KHRONOS OPENCL WORKING GROUP version 1.2
    • KHRONOS OPENCL WORKING GROUP. The OpenCL specification - version 1.2.
    • The OpenCL Specification
  • 20
    • 84858784448 scopus 로고    scopus 로고
    • LOS ALAMOS NATIONAL LABORATORY. PFLOTRAN. http://ees.lanl.gov/source/ orgs/ees/pflotran/index.shtml.
    • Pflotran
  • 21
    • 77949577730 scopus 로고    scopus 로고
    • Automatically tuning sparse matrix-vector multiplication for gpu architectures
    • High Performance Embedded Architectures and Compilers, Y. Patt, P. Foglia, E. Duesterwald, P. Faraboschi, and X. Martorell, Eds., Springer Berlin / Heidelberg 10.1007/978-3-642-11515-8-10
    • MONAKOV, A., LOKHMOTOV, A., AND AVETISYAN, A. Automatically tuning sparse matrix-vector multiplication for gpu architectures. In High Performance Embedded Architectures and Compilers, Y. Patt, P. Foglia, E. Duesterwald, P. Faraboschi, and X. Martorell, Eds., vol. 5952 of Lecture Notes in Computer Science. Springer Berlin / Heidelberg, 2010, pp. 111-125. 10.1007/978-3-642- 11515-8-10.
    • (2010) Lecture Notes in Computer Science , vol.5952 , pp. 111-125
    • Monakov, A.1    Lokhmotov, A.2    Avetisyan, A.3
  • 22
    • 0031999338 scopus 로고    scopus 로고
    • Time-domain (fe/fdtd) technique for solving complex electromagnetic problems
    • IEEE feb
    • MONORCHIO, A., AND MITTRA, R. Time-domain (fe/fdtd) technique for solving complex electromagnetic problems. Microwave and Guided Wave Letters, IEEE 8, 2 (feb 1998), 93-95.
    • (1998) Microwave and Guided Wave Letters , vol.8 , Issue.2 , pp. 93-95
    • Monorchio, A.1    Mittra, R.2
  • 23
    • 1542425156 scopus 로고    scopus 로고
    • An optimal storage format for sparse matrices
    • MONTAGNE, E., AND EKAMBARAM, A. An optimal storage format for sparse matrices. Information Processing Letters 90, 2 (2004), 87-92.
    • (2004) Information Processing Letters , vol.90 , Issue.2 , pp. 87-92
    • Montagne, E.1    Ekambaram, A.2
  • 25
    • 82955212653 scopus 로고    scopus 로고
    • NVIDIA CORPORATION. version 4.0
    • NVIDIA CORPORATION. CUDA C programming guide - version 4.0.
    • CUDA C Programming Guide


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.