SCOPUS 정보 검색 플랫폼

ACM International Conference Proceeding Series

Volumn , Issue , 2012, Pages 47-56

High-performance sparse matrix-vector multiplication on GPUs for structured grid computations

(3) Godwin, Jeswin a Holewinski, Justin a Sadayappan, P a

a The Ohio State University (United States)

Author keywords

[No Author keywords available]

Indexed keywords

BLOCK STRUCTURES; COMPRESSED SPARSE ROW; CRITICAL STEPS; DIAGONAL STRUCTURE; DISCRETIZATIONS; GRID NODE; GRID POINTS; HIGHER-DEGREE; ITERATIVE SOLUTIONS; REGULAR STRUCTURE; SPARSE LINEAR SYSTEMS; SPARSE MATRICES; SPARSE MATRIX-VECTOR MULTIPLICATION; STORAGE FORMATS; STRUCTURED GRID; UNIFORM GRIDS;

LINEAR SYSTEMS; MECHANICS; PARTIAL DIFFERENTIAL EQUATIONS; PROGRAM PROCESSORS;

MATRIX ALGEBRA;

EID: 84858763464 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/2159430.2159436 Document Type: Conference Paper

Times cited : (28)

References (29)

1
- 0343228755
- Elements of computational fluid dynamics on block structured grids using implicit solvers
- BADCOCK, K., RICHARDS, B., AND WOODGATE, M. Elements of computational fluid dynamics on block structured grids using implicit solvers. Progress in Aerospace Sciences 36, 5-6 (2000), 351-392.
- (2000) Progress in Aerospace Sciences , vol.36 , Issue.5-6 , pp. 351-392
- Badcock, K.¹ Richards, B.² Woodgate, M.³

2
- 0003660984
- Tech. Rep. ANL-95/11 - Revision 3.2, Argonne National Laboratory
- BALAY, S., BROWN, J.,, BUSCHELMAN, K., EIJKHOUT, V., GROPP, W. D., KAUSHIK, D., KNEPLEY, M. G., MCINNES, L. C., SMITH, B. F., AND ZHANG, H. PETSc users manual. Tech. Rep. ANL-95/11 - Revision 3.2, Argonne National Laboratory, 2011.
- (2011) PETSc Users Manual
- Balay, S.¹ Brown, J.² Buschelman, K.³ Eijkhout, V.⁴ Gropp, W.D.⁵ Kaushik, D.⁶ Knepley, M.G.⁷ Mcinnes, L.C.⁸ Smith, B.F.⁹ Zhang, H.¹⁰

3
- 74049163483
- Tech. Rep. RC24704 (W0812-047), IBM T. J. Watson Research Center, April
- BASKARAN, M. M., AND BORDAWEKAR, R. Optimizing sparse matrix-vector multiplication on GPUs. Tech. Rep. RC24704 (W0812-047), IBM T. J. Watson Research Center, April 2009.
- (2009) Optimizing Sparse Matrix-vector Multiplication on GPUs
- Baskaran, M.M.¹ Bordawekar, R.²

4
- 70350368872
- Tech. Rep. NVR-2008-004, NVIDIA Corporation, December
- BELL, N., AND GARLAND, M. Efficient sparse matrix-vector multiplication on CUDA. Tech. Rep. NVR-2008-004, NVIDIA Corporation, December 2008.
- (2008) Efficient Sparse Matrix-vector Multiplication on CUDA
- Bell, N.¹ Garland, M.²

5
- 74049143158
- Implementing sparse matrix-vector multiplication on throughput-oriented processors
- ACM
- BELL, N., AND GARLAND, M. Implementing sparse matrix-vector multiplication on throughput-oriented processors. In Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis (New York, NY, USA, 2009), SC '09, ACM, pp. 18:1-18:11.
- Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis (New York, NY, USA, 2009), SC '09
- Bell, N.¹ Garland, M.²

6
- 80051888188
- Version 0.1.0
- BELL, N., AND GARLAND, M. Cusp: Generic parallel algorithms for sparse matrix and graph computations, 2010. Version 0.1.0.
- (2010) Cusp: Generic Parallel Algorithms for Sparse Matrix and Graph Computations
- Bell, N.¹ Garland, M.²

7
- 77953998137
- Sparse matrix solvers on the gpu: Conjugate gradients and multigrid
- ACM
- BOLZ, J., FARMER, I., GRINSPUN, E., AND SCHRÖODER, P. Sparse matrix solvers on the gpu: conjugate gradients and multigrid. In ACM SIGGRAPH 2003 Papers (New York, NY, USA, 2003), SIGGRAPH '03, ACM, pp. 917-924.
- ACM SIGGRAPH 2003 Papers (New York, NY, USA, 2003), SIGGRAPH '03 , pp. 917-924
- Bolz, J.¹ Farmer, I.² Grinspun, E.³ Schröoder, P.⁴

8
- 77749340082
- Model-driven autotuning of sparse matrix-vector multiply on gpus
- ACM
- CHOI, J. W., SINGH, A., AND VUDUC, R. W. Model-driven autotuning of sparse matrix-vector multiply on gpus. In Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (New York, NY, USA, 2010), PPoPP '10, ACM, pp. 115-126.
- Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (New York, NY, USA, 2010), PPoPP '10 , pp. 115-126
- Choi, J.W.¹ Singh, A.² Vuduc, R.W.³

9
- 25144499116
- Vectorized sparse matrix multiply for compressed row storage format
- Computational Science - ICCS 2005, V. Sunderam, G. van Albada, P. Sloot, and J. Dongarra, Eds., Springer Berlin / Heidelberg, 10.1007/11428831-13
- D'AZEVEDO, E., FAHEY, M., AND MILLS, R. Vectorized sparse matrix multiply for compressed row storage format. In Computational Science - ICCS 2005, V. Sunderam, G. van Albada, P. Sloot, and J. Dongarra, Eds., vol. 3514 of Lecture Notes in Computer Science. Springer Berlin / Heidelberg, 2005, pp. 785-789. 10.1007/11428831-13.
- (2005) Lecture Notes in Computer Science , vol.3514 , pp. 785-789
- D'Azevedo, E.¹ Fahey, M.² Mills, R.³

10
- 0142215175
- An alternative compressed storage format for sparse matrices
- Computer and Information Sciences - ISCIS 2003, A. Yazici and C. Sener, Eds., Springer Berlin / Heidelberg, 10.1007/978-3-540-39737-3-25
- EKAMBARAM, A., AND MONTAGNE, E. An alternative compressed storage format for sparse matrices. In Computer and Information Sciences - ISCIS 2003, A. Yazici and C. Sener, Eds., vol. 2869 of Lecture Notes in Computer Science. Springer Berlin / Heidelberg, 2003, pp. 196-203. 10.1007/978-3-540-39737-3-25.
- (2003) Lecture Notes in Computer Science , vol.2869 , pp. 196-203
- Ekambaram, A.¹ Montagne, E.²

11
- 51549093017
- Sparse matrix computations on manycore gpu's
- ACM
- GARLAND, M. Sparse matrix computations on manycore gpu's. In Proceedings of the 45th annual Design Automation Conference (New York, NY, USA, 2008), DAC '08, ACM, pp. 2-6.
- Proceedings of the 45th Annual Design Automation Conference (New York, NY, USA, 2008), DAC '08 , pp. 2-6
- Garland, M.¹

12
- 84858784447
- Efficient finite element geometric multigrid solvers for unstructured grids on GPUs
- GEVELER, M., RIBBROCK, D., GÖDDEKE, D., PETER, Z., AND STEFAN, T. Efficient finite element geometric multigrid solvers for unstructured grids on GPUs. PARENG (April 2011).
- PARENG (April 2011)
- Geveler, M.¹ Ribbrock, D.² Göddeke, D.³ Peter, Z.⁴ Stefan, T.⁵

13
- 84858765971
- Towards a complete FEM-based simulation toolkit on GPUs: Geometric multigrid solvers
- GEVELER, M., RIBBROCK, D., GÖDDEKE, D., PETER, Z., AND STEFAN, T. Towards a complete FEM-based simulation toolkit on GPUs: Geometric multigrid solvers. ParCFD (May 2011).
- ParCFD (May 2011)
- Geveler, M.¹ Ribbrock, D.² Göddeke, D.³ Peter, Z.⁴ Stefan, T.⁵

14
- 35248817438
- The Cactus framework and toolkit: Design and applications
- Vector and Parallel Processing - VECPAR'2002, 5th International Conference, Berlin, Springer
- GOODALE, T., ALLEN, G., LANFERMANN, G., MASSÓ, J., RADKE, T., SEIDEL, E., AND SHALF, J.The Cactus framework and toolkit: Design and applications. In Vector and Parallel Processing - VECPAR'2002, 5th International Conference, Lecture Notes in Computer Science (Berlin, 2003), Springer.
- (2003) Lecture Notes in Computer Science
- Goodale, T.¹ Allen, G.² Lanfermann, G.³ Massó, J.⁴ Radke, T.⁵ Seidel, E.⁶ Shalf, J.⁷

15
- 73349098372
- August
- GRIMES, R., KINCAID, D., AND YOUNG, D. ITPACK 2.0 user's guide, August 1979.
- (1979) ITPACK 2.0 User's Guide
- Grimes, R.¹ Kincaid, D.² Young, D.³

16
- 84949647432
- Optimizing sparse matrix computations for register reuse in sparsity
- Computational Science - ICCS 2001, V. Alexandrov, J. Dongarra, B. Juliano, R. Renner, and C. Tan, Eds., Springer Berlin / Heidelberg, 10.1007/3-540-45545-0-22
- IM, E.-J., AND YELICK, K. Optimizing sparse matrix computations for register reuse in sparsity. In Computational Science - ICCS 2001, V. Alexandrov, J. Dongarra, B. Juliano, R. Renner, and C. Tan, Eds., vol. 2073 of Lecture Notes in Computer Science. Springer Berlin / Heidelberg, 2001, pp. 127-136. 10.1007/3-540-45545-0-22.
- (2001) Lecture Notes in Computer Science , vol.2073 , pp. 127-136
- Im, E.-J.¹ Yelick, K.²

17
- 1542501019
- Sparsity: Optimization framework for sparse matrix kernels
- IM, E.-J., YELICK, K., AND VUDUC, R. Sparsity: Optimization framework for sparse matrix kernels. International Journal of High Performance Computing Applications 18, 1 (2004), 135-158.
- (2004) International Journal of High Performance Computing Applications , vol.18 , Issue.1 , pp. 135-158
- Im, E.-J.¹ Yelick, K.² Vuduc, R.³

18
- 70349100958
- KHRONOS OPENCL WORKING GROUP version 1.2
- KHRONOS OPENCL WORKING GROUP. The OpenCL specification - version 1.2.
- The OpenCL Specification

19
- 77954024744
- Linear algebra operators for gpu implementation of numerical algorithms
- KRÜGER, J., AND WESTERMANN, R. Linear algebra operators for gpu implementation of numerical algorithms. In ACM SIGGRAPH 2005 Courses (New York, NY, USA, 2005), SIGGRAPH '05, ACM.
- ACM SIGGRAPH 2005 Courses (New York, NY, USA, 2005), SIGGRAPH '05, ACM
- Krüger, J.¹ Westermann, R.²

20
- 84858784448
- LOS ALAMOS NATIONAL LABORATORY. PFLOTRAN. http://ees.lanl.gov/source/ orgs/ees/pflotran/index.shtml.
- Pflotran

21
- 77949577730
- Automatically tuning sparse matrix-vector multiplication for gpu architectures
- High Performance Embedded Architectures and Compilers, Y. Patt, P. Foglia, E. Duesterwald, P. Faraboschi, and X. Martorell, Eds., Springer Berlin / Heidelberg 10.1007/978-3-642-11515-8-10
- MONAKOV, A., LOKHMOTOV, A., AND AVETISYAN, A. Automatically tuning sparse matrix-vector multiplication for gpu architectures. In High Performance Embedded Architectures and Compilers, Y. Patt, P. Foglia, E. Duesterwald, P. Faraboschi, and X. Martorell, Eds., vol. 5952 of Lecture Notes in Computer Science. Springer Berlin / Heidelberg, 2010, pp. 111-125. 10.1007/978-3-642- 11515-8-10.
- (2010) Lecture Notes in Computer Science , vol.5952 , pp. 111-125
- Monakov, A.¹ Lokhmotov, A.² Avetisyan, A.³

22
- 0031999338
- Time-domain (fe/fdtd) technique for solving complex electromagnetic problems
- IEEE feb
- MONORCHIO, A., AND MITTRA, R. Time-domain (fe/fdtd) technique for solving complex electromagnetic problems. Microwave and Guided Wave Letters, IEEE 8, 2 (feb 1998), 93-95.
- (1998) Microwave and Guided Wave Letters , vol.8 , Issue.2 , pp. 93-95
- Monorchio, A.¹ Mittra, R.²

23
- 1542425156
- An optimal storage format for sparse matrices
- MONTAGNE, E., AND EKAMBARAM, A. An optimal storage format for sparse matrices. Information Processing Letters 90, 2 (2004), 87-92.
- (2004) Information Processing Letters , vol.90 , Issue.2 , pp. 87-92
- Montagne, E.¹ Ekambaram, A.²

24
- 2442630376
- Sensitivity analysis with the fdtd method on structured grids
- april
- NIKOLOVA, N., TAM, H., AND BAKR, M. Sensitivity analysis with the fdtd method on structured grids. Microwave Theory and Techniques, IEEE Transactions on 52, 4 (april 2004), 1207-1216.
- (2004) Microwave Theory and Techniques, IEEE Transactions on , vol.52 , Issue.4 , pp. 1207-1216
- Nikolova, N.¹ Tam, H.² Bakr, M.³

25
- 82955212653
- NVIDIA CORPORATION. version 4.0
- NVIDIA CORPORATION. CUDA C programming guide - version 4.0.
- CUDA C Programming Guide

26
- 79958244563
- NVIDIA CORPORATION
- NVIDIA CORPORATION. OpenCL programming guide for the CUDA architecture.
- OpenCL Programming Guide for the CUDA Architecture

27
- 0003550735
- version 2
- SAAD, Y. SPARSKIT: A basic toolkit for sparse matrix computations - version 2.
- SPARSKIT: A Basic Toolkit for Sparse Matrix Computations
- Saad, Y.¹

28
- 1842829625
- Society for Industrial Mathematics
- SAAD, Y. Iterative Methods for Sparse Linear Systems. Society for Industrial Mathematics, 2003.
- (2003) Iterative Methods for Sparse Linear Systems
- Saad, Y.¹

29
- 79955614550
- A new approach for sparse matrix vector product on NVIDIA GPUs
- VÁZQUEZ, F., FERNÁNDEZ, J. J., AND GARZÓN, E. M. A new approach for sparse matrix vector product on NVIDIA GPUs. Concurrency and Computation: Practice and Experience 23, 8 (2011), 815-826.
- (2011) Concurrency and Computation: Practice and Experience , vol.23 , Issue.8 , pp. 815-826
- Vázquez, F.¹ Fernández, J.J.² Garzón, E.M.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.