SCOPUS 정보 검색 플랫폼

Journal of Computational and Applied Mathematics

Volumn 257, Issue , 2014, Pages 195-211

Architecting the finite element method pipeline for the GPU

(4) Fu, Zhisong b James Lewis, T b Kirby, Robert M b Whitaker, Ross T a

a University of Utah (United States)

b UNIVERSITY OF UTAH (United States)

Author keywords

Algebraic multigrid (AMG); Finite element method (FEM); Graphical processing units (GPUs)

Indexed keywords

ALGEBRAIC MULTIGRID METHODS; ALGEBRAIC MULTIGRIDS; FINE-GRAINED PARALLELISM; GRAPHICAL PROCESSING UNIT (GPUS); GRAPHICAL PROCESSING UNITS; MANY-CORE ARCHITECTURE; PARTIAL DIFFERENTIAL EQUATIONS (PDES); SCIENCE AND ENGINEERING;

ALGEBRA; ALGORITHMS; COMPUTER ARCHITECTURE; CONJUGATE GRADIENT METHOD; DISTRIBUTED COMPUTER SYSTEMS; FINITE ELEMENT METHOD; LINEAR SYSTEMS; PARALLEL ARCHITECTURES; PARTIAL DIFFERENTIAL EQUATIONS; PIPELINES; PROGRAM PROCESSORS;

COMPUTER GRAPHICS EQUIPMENT;

EID: 84884657209 PISSN: 03770427 EISSN: None Source Type: Journal
DOI: 10.1016/j.cam.2013.09.001 Document Type: Article

Times cited : (65)

References (40)

1
- 0003453209
- Prentice-Hall
- T.J.R. Hughes The Finite Element Method: Linear Static and Dynamic Finite Element Analysis 1987 Prentice-Hall
- (1987) The Finite Element Method: Linear Static and Dynamic Finite Element Analysis
- Hughes, T.J.R.¹

2
- 0342550060
- Spectral/hp element methods for CFD
- Oxford University Press
- G. Karniadakis, and S.J. Sherwin Spectral/hp element methods for CFD Numerical Mathematics and Scientific Computation 1999 Oxford University Press
- (1999) Numerical Mathematics and Scientific Computation
- Karniadakis, G.¹ Sherwin, S.J.²

3
- 84871378617
- NVIDIA, Nvidia cuda programming guide. URL: http://developer.nvidia.com/ nvidia-gpu-computing-documentation.
- NVIDIA, Nvidia Cuda Programming Guide

4
- 84871030508
- Finite element assembly strategies on multi-core and many-core architectures
- G. Markall, A. Slemmer, D. Ham, P. Kelly, C. Cantwell, and S. Sherwin Finite element assembly strategies on multi-core and many-core architectures International Journal for Numerical Methods in Fluids 71 2013 80 97
- (2013) International Journal for Numerical Methods in Fluids , vol.71 , pp. 80-97
- Markall, G.¹ Slemmer, A.² Ham, D.³ Kelly, P.⁴ Cantwell, C.⁵ Sherwin, S.⁶

5
- 84871025463
- Accuracy, memory, and speed strategies in GPU-based finite-element matrix-generation
- A. Dziekonski, P. Sypek, A. Lamecki, and M. Mrozowski Accuracy, memory, and speed strategies in GPU-based finite-element matrix-generation Antennas and Wireless Propagation Letters, IEEE 11 2012 1346 1349
- (2012) Antennas and Wireless Propagation Letters, IEEE , vol.11 , pp. 1346-1349
- Dziekonski, A.¹ Sypek, P.² Lamecki, A.³ Mrozowski, M.⁴

6
- 84865374391
- Finite element matrix generation on a GPU
- A. Dziekonski, P. Sypek, A. Lamecki, and M. Mrozowski Finite element matrix generation on a GPU Progress in Electromagnetics Research 128 2012 249 265
- (2012) Progress in Electromagnetics Research , vol.128 , pp. 249-265
- Dziekonski, A.¹ Sypek, P.² Lamecki, A.³ Mrozowski, M.⁴

7
- 84875625015
- Generation of large finite-element matrices on multiple graphics processors
- A. Dziekonski, P. Sypek, A. Lamecki, and M. Mrozowski Generation of large finite-element matrices on multiple graphics processors International Journal for Numerical Methods in Engineering 94 2 2013 204 220
- (2013) International Journal for Numerical Methods in Engineering , vol.94 , Issue.2 , pp. 204-220
- Dziekonski, A.¹ Sypek, P.² Lamecki, A.³ Mrozowski, M.⁴

8
- 78650691046
- Assembly of finite element methods on graphics processors
- C. Cecka, A.J. Lew, and E. Darve Assembly of finite element methods on graphics processors International Journal for Numerical Methods in Engineering 85 2011 640 669
- (2011) International Journal for Numerical Methods in Engineering , vol.85 , pp. 640-669
- Cecka, C.¹ Lew, A.J.² Darve, E.³

9
- 0003424374
- SIAM: Society for Industrial and Applied Mathematics
- L.N. Trefethen, and D.B. III Numerical Linear Algebra 1997 SIAM: Society for Industrial and Applied Mathematics
- (1997) Numerical Linear Algebra
- Trefethen, L.N.¹

10
- 0003424372
- SIAM: Society for Industrial and Applied Mathematics
- J.W. Demmel Applied Numerical Linear Algebra 1997 SIAM: Society for Industrial and Applied Mathematics
- (1997) Applied Numerical Linear Algebra
- Demmel, J.W.¹

11
- 84866417895
- Exposing fine-grained parallelism in algebraic multigrid methods
- N. Bell, S. Dalton, and L. Olson Exposing fine-grained parallelism in algebraic multigrid methods SIAM Journal on Scientific Computing 34 4 2012 C123 C152
- (2012) SIAM Journal on Scientific Computing , vol.34 , Issue.4
- Bell, N.¹ Dalton, S.² Olson, L.³

12
- 77951541240
- A parallel algebraic multigrid solver on graphics processing units
- W. Zhang, Z. Chen, C.C. Douglas, W. Tong, Lecture Notes in Computer Science Springer
- G. Haase, M. Liebmann, C.C. Douglas, and G. Plank A parallel algebraic multigrid solver on graphics processing units W. Zhang, Z. Chen, C.C. Douglas, W. Tong, HPCA (China) Lecture Notes in Computer Science vol. 5938 2009 Springer 38 47
- (2009) HPCA (China) , vol.5938 VOL. , pp. 38-47
- Haase, G.¹ Liebmann, M.² Douglas, C.C.³ Plank, G.⁴

13
- 33845903366
- A comparison of monodomain and bidomain reaction-diffusion models for action potential propagation in the human heart
- DOI 10.1109/TBME.2006.880875, 5
- M. Potse, B. Dube, J. Richer, A. Vinet, and R. Gulrajani A comparison of monodomain and bidomain reaction-diffusion models for action potential propagation in the human heart IEEE Transactions on Biomedical Engineering 53 12 2006 2425 2435 (Pubitemid 46019529)
- (2006) IEEE Transactions on Biomedical Engineering , vol.53 , Issue.12 , pp. 2425-2435
- Potse, M.¹ Dube, B.² Richer, J.³ Vinet, A.⁴ Gulrajani, R.M.⁵

14
- 0242533311
- Sparse matrix solvers on the GPU: Conjugate gradients and multigrid
- J. Bolz, I. Farmer, E. Grinspun, and P. Schröder Sparse matrix solvers on the GPU: conjugate gradients and multigrid ACM Transactions on Graphics 22 3 2003 917 924
- (2003) ACM Transactions on Graphics , vol.22 , Issue.3 , pp. 917-924
- Bolz, J.¹ Farmer, I.² Grinspun, E.³ Schröder, P.⁴

15
- 84885386532
- J. Rodríguez-Navarro, A.S. Sánchez, Non structured meshes for cloth GPU simulation using FEM, 2006.
- (2006) Non Structured Meshes for Cloth GPU Simulation Using FEM
- Rodríguez-Navarro, J.¹ Sánchez, A.S.²

16
- 69949091119
- Nodal discontinuous Galerkin methods on graphics processors
- A. Klöckner, T. Warburton, J. Bridge, and J.S. Hesthaven Nodal discontinuous Galerkin methods on graphics processors Journal of Computational Physics 228 21 2009 7863 7882
- (2009) Journal of Computational Physics , vol.228 , Issue.21 , pp. 7863-7882
- Klöckner, A.¹ Warburton, T.² Bridge, J.³ Hesthaven, J.S.⁴

17
- 64449087473
- Porting a high-order finite-element earthquake modeling application to NVIDIA graphics cards using CUDA
- D. Komatitsch, D. Michéa, and G. Erlebacher Porting a high-order finite-element earthquake modeling application to NVIDIA graphics cards using CUDA Journal of Parallel and Distributed Computing 69 5 2009 451 460
- (2009) Journal of Parallel and Distributed Computing , vol.69 , Issue.5 , pp. 451-460
- Komatitsch, D.¹ Michéa, D.² Erlebacher, G.³

18
- 77952425955
- From h to p efficiently: Implementing finite and spectral hp element methods to achieve optimal performance for low- and high-order discretisations
- P.E.J. Vos, S.J. Sherwin, and R.M. Kirby From h to p efficiently: implementing finite and spectral hp element methods to achieve optimal performance for low- and high-order discretisations Journal of Computational Physics 229 2010 5161 5181
- (2010) Journal of Computational Physics , vol.229 , pp. 5161-5181
- Vos, P.E.J.¹ Sherwin, S.J.² Kirby, R.M.³

19
- 84856393116
- Parallel realization of the element-by-element fem technique by cuda
- I. Kiss, S. Gyimothy, Z. Badics, and J. Pavo Parallel realization of the element-by-element fem technique by cuda IEEE Transactions on Magnetics 48 2 2012 507 510
- (2012) IEEE Transactions on Magnetics , vol.48 , Issue.2 , pp. 507-510
- Kiss, I.¹ Gyimothy, S.² Badics, Z.³ Pavo, J.⁴

20
- 79952241422
- From h to p efficiently: Strategy selection for operator evaluation on hexahedral and tetrahedral elements
- C. Cantwell, S. Sherwin, R. Kirby, and P. Kelly From h to p efficiently: strategy selection for operator evaluation on hexahedral and tetrahedral elements Computers & Fluids 43 2011 23 28
- (2011) Computers & Fluids , vol.43 , pp. 23-28
- Cantwell, C.¹ Sherwin, S.² Kirby, R.³ Kelly, P.⁴

21
- 1842829625
- second ed. SIAM
- Y. Saad Iterative Methods for Sparse Linear Systems second ed. 2003 SIAM
- (2003) Iterative Methods for Sparse Linear Systems
- Saad, Y.¹

22
- 79957924849
- R. Li, and Y. Saad GPU-accelerated Preconditioned Iterative Linear Solvers, Tech. Rep., Technical Report, University of Minnesota 2010
- (2010) GPU-accelerated Preconditioned Iterative Linear Solvers, Tech. Rep., Technical Report, University of Minnesota
- Li, R.¹ Saad, Y.²

23
- 0004252812
- SIAM Publications
- S.F. McCormick Multigrid Methods 1987 SIAM Publications
- (1987) Multigrid Methods
- McCormick, S.F.¹

24
- 0004056964
- Revised ed., Classics in Applied Mathematics Society for Industrial and Applied Mathematics
- A. Brandt, and O. Livne Multigrid Techniques: 1984 Guide With Applications to Fluid Dynamics Revised ed., Classics in Applied Mathematics 2011 Society for Industrial and Applied Mathematics
- (2011) Multigrid Techniques: 1984 Guide with Applications to Fluid Dynamics
- Brandt, A.¹ Livne, O.²

25
- 0036532956
- BoomerAMG: A parallel algebraic multigrid solver and preconditioner
- DOI 10.1016/S0168-9274(01)00115-5, PII S0168927401001155
- V.E. Henson, and U.M. Yang BoomerAMG: a parallel algebraic multigrid solver and preconditioner Applied Numerical Mathematics: Transactions of IMACS 41 1 2002 155 177 (Pubitemid 34154710)
- (2002) Applied Numerical Mathematics , vol.41 , Issue.1 , pp. 155-177
- Henson, V.E.¹ Yang, U.M.²

26
- 0030388574
- Algebraic multigrid by smoothed aggregation for second and fourth order elliptic problems
- P. Vanek, J. Mandel, and M. Brezina Algebraic multigrid by smoothed aggregation for second and fourth order elliptic problems Computing 56 3 1996 179 196 (Pubitemid 126633416)
- (1996) Computing (Vienna/New York) , vol.56 , Issue.3 , pp. 179-196
- Vanek, P.¹ Mandel, J.² Brezina, M.³

27
- 85015330781
- Parallel multigrid solver for 3D unstructured finite element problems
- ACM SIGARCH and IEEE Portland, OR
- M. Adams, and J. Demmel Parallel multigrid solver for 3D unstructured finite element problems Proceedings of Supercomputing'99 (CD-ROM) 1999 ACM SIGARCH and IEEE Portland, OR
- (1999) Proceedings of Supercomputing'99 (CD-ROM)
- Adams, M.¹ Demmel, J.²

28
- 81555213057
- Multigrid smoothers for ultraparallel computing
- A.H. Baker, R.D. Falgout, T.V. Kolev, and U.M. Yang Multigrid smoothers for ultraparallel computing SIAM Journal on Scientific Computing 33 5 2011 2864 2887
- (2011) SIAM Journal on Scientific Computing , vol.33 , Issue.5 , pp. 2864-2887
- Baker, A.H.¹ Falgout, R.D.² Kolev, T.V.³ Yang, U.M.⁴

29
- 2942526684
- Parallel smoothed aggregation multigrid: Aggregation strategies on massively parallel machines
- R.S. Tuminaro, C. Tong, Parallel smoothed aggregation multigrid: aggregation strategies on massively parallel machines, in: SuperComputing 2000 Proceedings, 2000.
- (2000) SuperComputing 2000 Proceedings
- Tuminaro, R.S.¹ Tong, C.²

30
- 77958566946
- NVIDIA Corporation
- N. Bell, and M. Garland Efficient Sparse Matrix-Vector Multiplication on CUDA, NVIDIA Technical Report NVR-2008-004 2008 NVIDIA Corporation
- (2008) Efficient Sparse Matrix-Vector Multiplication on CUDA, NVIDIA Technical Report NVR-2008-004
- Bell, N.¹ Garland, M.²

31
- 84884636724
- Towards a complete fem-based simulation toolkit on GPUs: Unstructured grid finite element geometric multigrid solvers with strong smoothers based on sparse approximate inverses
- M. Geveler, D. Ribbrock, D. Goddeke, P. Zajac, and S. Turek Towards a complete fem-based simulation toolkit on GPUs: unstructured grid finite element geometric multigrid solvers with strong smoothers based on sparse approximate inverses Computers & Fluids 80 0 2013 327 332
- (2013) Computers & Fluids , vol.80 , Issue.0 , pp. 327-332
- Geveler, M.¹ Ribbrock, D.² Goddeke, D.³ Zajac, P.⁴ Turek, S.⁵

32
- 78651340345
- GPU acceleration of multilevel solvers for analysis of microwave components with finite element method
- A. Dziekonski, A. Lamecki, and M. Mrozowski GPU acceleration of multilevel solvers for analysis of microwave components with finite element method Microwave and Wireless Components Letters, IEEE 21 1 2011 1 3
- (2011) Microwave and Wireless Components Letters, IEEE , vol.21 , Issue.1 , pp. 1-3
- Dziekonski, A.¹ Lamecki, A.² Mrozowski, M.³

33
- 79960133510
- Tuning a hybrid GPU-CPU v-cycle multilevel preconditioner for solving large real and complex systems of fem equations
- A. Dziekonski, A. Lamecki, and M. Mrozowski Tuning a hybrid GPU-CPU v-cycle multilevel preconditioner for solving large real and complex systems of fem equations Antennas and Wireless Propagation Letters, IEEE 10 2011 619 622
- (2011) Antennas and Wireless Propagation Letters, IEEE , vol.10 , pp. 619-622
- Dziekonski, A.¹ Lamecki, A.² Mrozowski, M.³

34
- 0001011699
- A fast and simple randomized parallel algorithm for the maximal independent set problem
- N. Alon, L. Babai, and A. Itai A fast and simple randomized parallel algorithm for the maximal independent set problem Journal of Algorithms 7 1986 567 583
- (1986) Journal of Algorithms , vol.7 , pp. 567-583
- Alon, N.¹ Babai, L.² Itai, A.³

35
- 84870404942
- NVIDIA
- NVIDIA, Nvidias next generation cuda compute architecture: Fermi. URL: http://www.nvidia.com/content/PDF/fermi-white-papers/NVIDIA-Fermi-Compute- Architecture-Whitepaper.pdf.
- Nvidias Next Generation Cuda Compute Architecture: Fermi

36
- 84884666121
- Exploiting mixed precision floating point hardware in scientific computations
- L. Grandinetti, Advances in Parallel Computing IOS Press Amsterdam
- A. Buttari, J. Dongarra, J. Kurzak, J. Langou, J. Langou, P. Luszczek, and S. Tomov Exploiting mixed precision floating point hardware in scientific computations L. Grandinetti, High Performance Computing (HPC) and Grids in Action Advances in Parallel Computing vol. 16 2008 IOS Press Amsterdam 19 36
- (2008) High Performance Computing (HPC) and Grids in Action , vol.16 VOL. , pp. 19-36
- Buttari, A.¹ Dongarra, J.² Kurzak, J.³ Langou, J.⁴ Langou, J.⁵ Luszczek, P.⁶ Tomov, S.⁷

37
- 78249272764
- Mixed-precision AMG as linear equation solver for definite systems
- M. Emans, and A. van der Meer Mixed-precision AMG as linear equation solver for definite systems Procedia CS 1 1 2010 175 183
- (2010) Procedia CS , vol.1 , Issue.1 , pp. 175-183
- Emans, M.¹ Van Der Meer, A.²

38
- 84885369798
- NVIDIA, Cusp library. URL: http://developer.nvidia.com/cusp.
- NVIDIA, Cusp Library

39
- 84885390293
- LLNL, Hypre library. URL: https://computation.llnl.gov/casc/hypre/ software.html.
- LLNL, Hypre Library

40
- 77952611196
- Concurrent number cruncher - A GPU implementation of a general sparse linear solver
- L. Buatois, G. Caumon, and B. Levy Concurrent number cruncher - a GPU implementation of a general sparse linear solver International Journal of Parallel, Emergent and Distributed Systems 24 3 2009 205 223
- (2009) International Journal of Parallel, Emergent and Distributed Systems , vol.24 , Issue.3 , pp. 205-223
- Buatois, L.¹ Caumon, G.² Levy, B.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.