메뉴 건너뛰기




Volumn 257, Issue , 2014, Pages 195-211

Architecting the finite element method pipeline for the GPU

Author keywords

Algebraic multigrid (AMG); Finite element method (FEM); Graphical processing units (GPUs)

Indexed keywords

ALGEBRAIC MULTIGRID METHODS; ALGEBRAIC MULTIGRIDS; FINE-GRAINED PARALLELISM; GRAPHICAL PROCESSING UNIT (GPUS); GRAPHICAL PROCESSING UNITS; MANY-CORE ARCHITECTURE; PARTIAL DIFFERENTIAL EQUATIONS (PDES); SCIENCE AND ENGINEERING;

EID: 84884657209     PISSN: 03770427     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.cam.2013.09.001     Document Type: Article
Times cited : (65)

References (40)
  • 9
    • 0003424374 scopus 로고    scopus 로고
    • SIAM: Society for Industrial and Applied Mathematics
    • L.N. Trefethen, and D.B. III Numerical Linear Algebra 1997 SIAM: Society for Industrial and Applied Mathematics
    • (1997) Numerical Linear Algebra
    • Trefethen, L.N.1
  • 11
    • 84866417895 scopus 로고    scopus 로고
    • Exposing fine-grained parallelism in algebraic multigrid methods
    • N. Bell, S. Dalton, and L. Olson Exposing fine-grained parallelism in algebraic multigrid methods SIAM Journal on Scientific Computing 34 4 2012 C123 C152
    • (2012) SIAM Journal on Scientific Computing , vol.34 , Issue.4
    • Bell, N.1    Dalton, S.2    Olson, L.3
  • 12
    • 77951541240 scopus 로고    scopus 로고
    • A parallel algebraic multigrid solver on graphics processing units
    • W. Zhang, Z. Chen, C.C. Douglas, W. Tong, Lecture Notes in Computer Science Springer
    • G. Haase, M. Liebmann, C.C. Douglas, and G. Plank A parallel algebraic multigrid solver on graphics processing units W. Zhang, Z. Chen, C.C. Douglas, W. Tong, HPCA (China) Lecture Notes in Computer Science vol. 5938 2009 Springer 38 47
    • (2009) HPCA (China) , vol.5938 VOL. , pp. 38-47
    • Haase, G.1    Liebmann, M.2    Douglas, C.C.3    Plank, G.4
  • 13
    • 33845903366 scopus 로고    scopus 로고
    • A comparison of monodomain and bidomain reaction-diffusion models for action potential propagation in the human heart
    • DOI 10.1109/TBME.2006.880875, 5
    • M. Potse, B. Dube, J. Richer, A. Vinet, and R. Gulrajani A comparison of monodomain and bidomain reaction-diffusion models for action potential propagation in the human heart IEEE Transactions on Biomedical Engineering 53 12 2006 2425 2435 (Pubitemid 46019529)
    • (2006) IEEE Transactions on Biomedical Engineering , vol.53 , Issue.12 , pp. 2425-2435
    • Potse, M.1    Dube, B.2    Richer, J.3    Vinet, A.4    Gulrajani, R.M.5
  • 14
    • 0242533311 scopus 로고    scopus 로고
    • Sparse matrix solvers on the GPU: Conjugate gradients and multigrid
    • J. Bolz, I. Farmer, E. Grinspun, and P. Schröder Sparse matrix solvers on the GPU: conjugate gradients and multigrid ACM Transactions on Graphics 22 3 2003 917 924
    • (2003) ACM Transactions on Graphics , vol.22 , Issue.3 , pp. 917-924
    • Bolz, J.1    Farmer, I.2    Grinspun, E.3    Schröder, P.4
  • 17
    • 64449087473 scopus 로고    scopus 로고
    • Porting a high-order finite-element earthquake modeling application to NVIDIA graphics cards using CUDA
    • D. Komatitsch, D. Michéa, and G. Erlebacher Porting a high-order finite-element earthquake modeling application to NVIDIA graphics cards using CUDA Journal of Parallel and Distributed Computing 69 5 2009 451 460
    • (2009) Journal of Parallel and Distributed Computing , vol.69 , Issue.5 , pp. 451-460
    • Komatitsch, D.1    Michéa, D.2    Erlebacher, G.3
  • 18
    • 77952425955 scopus 로고    scopus 로고
    • From h to p efficiently: Implementing finite and spectral hp element methods to achieve optimal performance for low- and high-order discretisations
    • P.E.J. Vos, S.J. Sherwin, and R.M. Kirby From h to p efficiently: implementing finite and spectral hp element methods to achieve optimal performance for low- and high-order discretisations Journal of Computational Physics 229 2010 5161 5181
    • (2010) Journal of Computational Physics , vol.229 , pp. 5161-5181
    • Vos, P.E.J.1    Sherwin, S.J.2    Kirby, R.M.3
  • 19
    • 84856393116 scopus 로고    scopus 로고
    • Parallel realization of the element-by-element fem technique by cuda
    • I. Kiss, S. Gyimothy, Z. Badics, and J. Pavo Parallel realization of the element-by-element fem technique by cuda IEEE Transactions on Magnetics 48 2 2012 507 510
    • (2012) IEEE Transactions on Magnetics , vol.48 , Issue.2 , pp. 507-510
    • Kiss, I.1    Gyimothy, S.2    Badics, Z.3    Pavo, J.4
  • 20
    • 79952241422 scopus 로고    scopus 로고
    • From h to p efficiently: Strategy selection for operator evaluation on hexahedral and tetrahedral elements
    • C. Cantwell, S. Sherwin, R. Kirby, and P. Kelly From h to p efficiently: strategy selection for operator evaluation on hexahedral and tetrahedral elements Computers & Fluids 43 2011 23 28
    • (2011) Computers & Fluids , vol.43 , pp. 23-28
    • Cantwell, C.1    Sherwin, S.2    Kirby, R.3    Kelly, P.4
  • 25
    • 0036532956 scopus 로고    scopus 로고
    • BoomerAMG: A parallel algebraic multigrid solver and preconditioner
    • DOI 10.1016/S0168-9274(01)00115-5, PII S0168927401001155
    • V.E. Henson, and U.M. Yang BoomerAMG: a parallel algebraic multigrid solver and preconditioner Applied Numerical Mathematics: Transactions of IMACS 41 1 2002 155 177 (Pubitemid 34154710)
    • (2002) Applied Numerical Mathematics , vol.41 , Issue.1 , pp. 155-177
    • Henson, V.E.1    Yang, U.M.2
  • 26
    • 0030388574 scopus 로고    scopus 로고
    • Algebraic multigrid by smoothed aggregation for second and fourth order elliptic problems
    • P. Vanek, J. Mandel, and M. Brezina Algebraic multigrid by smoothed aggregation for second and fourth order elliptic problems Computing 56 3 1996 179 196 (Pubitemid 126633416)
    • (1996) Computing (Vienna/New York) , vol.56 , Issue.3 , pp. 179-196
    • Vanek, P.1    Mandel, J.2    Brezina, M.3
  • 27
    • 85015330781 scopus 로고    scopus 로고
    • Parallel multigrid solver for 3D unstructured finite element problems
    • ACM SIGARCH and IEEE Portland, OR
    • M. Adams, and J. Demmel Parallel multigrid solver for 3D unstructured finite element problems Proceedings of Supercomputing'99 (CD-ROM) 1999 ACM SIGARCH and IEEE Portland, OR
    • (1999) Proceedings of Supercomputing'99 (CD-ROM)
    • Adams, M.1    Demmel, J.2
  • 29
    • 2942526684 scopus 로고    scopus 로고
    • Parallel smoothed aggregation multigrid: Aggregation strategies on massively parallel machines
    • R.S. Tuminaro, C. Tong, Parallel smoothed aggregation multigrid: aggregation strategies on massively parallel machines, in: SuperComputing 2000 Proceedings, 2000.
    • (2000) SuperComputing 2000 Proceedings
    • Tuminaro, R.S.1    Tong, C.2
  • 31
    • 84884636724 scopus 로고    scopus 로고
    • Towards a complete fem-based simulation toolkit on GPUs: Unstructured grid finite element geometric multigrid solvers with strong smoothers based on sparse approximate inverses
    • M. Geveler, D. Ribbrock, D. Goddeke, P. Zajac, and S. Turek Towards a complete fem-based simulation toolkit on GPUs: unstructured grid finite element geometric multigrid solvers with strong smoothers based on sparse approximate inverses Computers & Fluids 80 0 2013 327 332
    • (2013) Computers & Fluids , vol.80 , Issue.0 , pp. 327-332
    • Geveler, M.1    Ribbrock, D.2    Goddeke, D.3    Zajac, P.4    Turek, S.5
  • 32
    • 78651340345 scopus 로고    scopus 로고
    • GPU acceleration of multilevel solvers for analysis of microwave components with finite element method
    • A. Dziekonski, A. Lamecki, and M. Mrozowski GPU acceleration of multilevel solvers for analysis of microwave components with finite element method Microwave and Wireless Components Letters, IEEE 21 1 2011 1 3
    • (2011) Microwave and Wireless Components Letters, IEEE , vol.21 , Issue.1 , pp. 1-3
    • Dziekonski, A.1    Lamecki, A.2    Mrozowski, M.3
  • 33
    • 79960133510 scopus 로고    scopus 로고
    • Tuning a hybrid GPU-CPU v-cycle multilevel preconditioner for solving large real and complex systems of fem equations
    • A. Dziekonski, A. Lamecki, and M. Mrozowski Tuning a hybrid GPU-CPU v-cycle multilevel preconditioner for solving large real and complex systems of fem equations Antennas and Wireless Propagation Letters, IEEE 10 2011 619 622
    • (2011) Antennas and Wireless Propagation Letters, IEEE , vol.10 , pp. 619-622
    • Dziekonski, A.1    Lamecki, A.2    Mrozowski, M.3
  • 34
    • 0001011699 scopus 로고
    • A fast and simple randomized parallel algorithm for the maximal independent set problem
    • N. Alon, L. Babai, and A. Itai A fast and simple randomized parallel algorithm for the maximal independent set problem Journal of Algorithms 7 1986 567 583
    • (1986) Journal of Algorithms , vol.7 , pp. 567-583
    • Alon, N.1    Babai, L.2    Itai, A.3
  • 37
    • 78249272764 scopus 로고    scopus 로고
    • Mixed-precision AMG as linear equation solver for definite systems
    • M. Emans, and A. van der Meer Mixed-precision AMG as linear equation solver for definite systems Procedia CS 1 1 2010 175 183
    • (2010) Procedia CS , vol.1 , Issue.1 , pp. 175-183
    • Emans, M.1    Van Der Meer, A.2
  • 39
    • 84885390293 scopus 로고    scopus 로고
    • LLNL, Hypre library. URL: https://computation.llnl.gov/casc/hypre/ software.html.
    • LLNL, Hypre Library


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.