메뉴 건너뛰기




Volumn 228, Issue 21, 2009, Pages 7863-7882

Nodal discontinuous Galerkin methods on graphics processors

Author keywords

Discontinuous Galerkin; GPU; High order; Many core; Maxwell's equations; Parallel computation

Indexed keywords

DIGITAL ARITHMETIC; GALERKIN METHODS; MAXWELL EQUATIONS; NUMERICAL METHODS; PARALLEL PROCESSING SYSTEMS; PROGRAM PROCESSORS;

EID: 69949091119     PISSN: 00219991     EISSN: 10902716     Source Type: Journal    
DOI: 10.1016/j.jcp.2009.06.041     Document Type: Article
Times cited : (270)

References (25)
  • 1
    • 69949104144 scopus 로고    scopus 로고
    • A Streaming Language Implementation of the Discontinuous Galerkin Method
    • Technical Report 20050184165, NASA Ames Research Center
    • Timothy Barth, Timothy Knight, A Streaming Language Implementation of the Discontinuous Galerkin Method, Technical Report 20050184165, NASA Ames Research Center, 2005.
    • (2005)
    • Barth, T.1    Knight, T.2
  • 3
    • 0004075585 scopus 로고
    • Fourth-order 2N-storage Runge-Kutta Schemes
    • Technical report, NASA Langley Research Center
    • M.H. Carpenter, C.A. Kennedy, Fourth-order 2N-storage Runge-Kutta Schemes, Technical report, NASA Langley Research Center, 1994.
    • (1994)
    • Carpenter, M.H.1    Kennedy, C.A.2
  • 4
    • 84966261380 scopus 로고
    • The Runge-Kutta local projection discontinuous Galerkin finite element method for conservation laws. IV: The multidimensional case
    • Cockburn B., Hou S., and Shu C.W. The Runge-Kutta local projection discontinuous Galerkin finite element method for conservation laws. IV: The multidimensional case. Math. Comput. 54 (1990) 545-581
    • (1990) Math. Comput. , vol.54 , pp. 545-581
    • Cockburn, B.1    Hou, S.2    Shu, C.W.3
  • 5
    • 14344259756 scopus 로고    scopus 로고
    • Letter Symbols to be used in Electrical Technology - Part 2: Telecommunications and Electronics
    • International Electrotechnical Commission, Technical report, International Electrotechnical Commission, Geneva, Switzerland, November
    • International Electrotechnical Commission, Letter Symbols to be used in Electrical Technology - Part 2: Telecommunications and Electronics, Technical report, International Electrotechnical Commission, Geneva, Switzerland, November 2000.
    • (2000)
  • 8
    • 85190809010 scopus 로고    scopus 로고
    • The OpenCL 1.0 Specification
    • Khronos OpenCL Working Group, December
    • Khronos OpenCL Working Group, The OpenCL 1.0 Specification. Khronos Group, December 2008.
    • (2008) Khronos Group
  • 9
    • 48149107858 scopus 로고    scopus 로고
    • Fast multipole methods on graphics processors
    • 10.1016/j.jcp.2008.05.023
    • Gumerov N.A., and Duraiswami R. Fast multipole methods on graphics processors. J. Comput. Phys. 227 September (2008) 8290-8313 10.1016/j.jcp.2008.05.023
    • (2008) J. Comput. Phys. , vol.227 , Issue.September , pp. 8290-8313
    • Gumerov, N.A.1    Duraiswami, R.2
  • 10
    • 0036740992 scopus 로고    scopus 로고
    • Nodal high-order methods on unstructured grids: I. Time-domain solution of Maxwell's equations
    • 10.1006/jcph.2002.7118
    • Hesthaven J.S., and Warburton T. Nodal high-order methods on unstructured grids: I. Time-domain solution of Maxwell's equations. J. Comput. Phys. 181 September (2002) 186-221 10.1006/jcph.2002.7118
    • (2002) J. Comput. Phys. , vol.181 , Issue.September , pp. 186-221
    • Hesthaven, J.S.1    Warburton, T.2
  • 14
    • 0032131147 scopus 로고    scopus 로고
    • A fast and high quality multilevel scheme for partitioning irregular graphs
    • Karypis G., and Kumar V. A fast and high quality multilevel scheme for partitioning irregular graphs. SIAM J. Sci. Comput. 20 (1999) 359-392
    • (1999) SIAM J. Sci. Comput. , vol.20 , pp. 359-392
    • Karypis, G.1    Kumar, V.2
  • 15
    • 4444302436 scopus 로고    scopus 로고
    • Acceleration of finite-difference time-domain (FDTD) using graphics processor units (GPU)
    • ISBN 0149-645X. doi:10.1109/MWSYM.2004.1339160
    • S.E. Krakiwsky, L.E. Turner, M.M. Okoniewski, Acceleration of finite-difference time-domain (FDTD) using graphics processor units (GPU), in: 2004 IEEE MTT-S International Microwave Symposium Digest, vol. 2, pp. 1033-1036, 2004. ISBN 0149-645X. doi:10.1109/MWSYM.2004.1339160.
    • (2004) 2004 IEEE MTT-S International Microwave Symposium Digest , vol.2 , pp. 1033-1036
    • Krakiwsky, S.E.1    Turner, L.E.2    Okoniewski, M.M.3
  • 16
    • 0347516395 scopus 로고    scopus 로고
    • Implementing Lattice Boltzmann computation on graphics hardware
    • Li W., Wei X., and Kaufman A. Implementing Lattice Boltzmann computation on graphics hardware. Visual Comput. 19 (2003) 444-456
    • (2003) Visual Comput. , vol.19 , pp. 444-456
    • Li, W.1    Wei, X.2    Kaufman, A.3
  • 17
    • 44849137198 scopus 로고    scopus 로고
    • Nvidia Tesla: a unified graphics and computing architecture
    • 0272-1732 10.1109/MM.2008.31
    • Lindholm E., Nickolls J., Oberman S., and Montrym J. Nvidia Tesla: a unified graphics and computing architecture. Micro. IEEE, 0272-1732 28 (2008) 39-55 10.1109/MM.2008.31
    • (2008) Micro. IEEE , vol.28 , pp. 39-55
    • Lindholm, E.1    Nickolls, J.2    Oberman, S.3    Montrym, J.4
  • 18
    • 85190814056 scopus 로고    scopus 로고
    • NVIDIA CUDA 2.0 Compute Unified Device Architecture Programming Guide, Nvidia Corporation, Santa Clara, USA
    • Nvidia Corporation, June
    • Nvidia Corporation, NVIDIA CUDA 2.0 Compute Unified Device Architecture Programming Guide, Nvidia Corporation, Santa Clara, USA, June 2008.
    • (2008)
  • 19
    • 0003630916 scopus 로고
    • Triangular Mesh Methods for the Neutron Transport Equation
    • Technical report, Los Alamos Scientific Laboratory, Los Alamos
    • W.H. Reed, T.R. Hill, Triangular Mesh Methods for the Neutron Transport Equation, Technical report, Los Alamos Scientific Laboratory, Los Alamos, 1973.
    • (1973)
    • Reed, W.H.1    Hill, T.R.2
  • 20
    • 84878932047 scopus 로고    scopus 로고
    • Meshing piecewise linear complexes by constrained delaunay tetrahedralizations
    • Springer
    • Si H., and Gaertner K. Meshing piecewise linear complexes by constrained delaunay tetrahedralizations. Proceedings of the 14th International Meshing Roundtable (2005), Springer 147-163
    • (2005) Proceedings of the 14th International Meshing Roundtable , pp. 147-163
    • Si, H.1    Gaertner, K.2
  • 21
    • 53749087821 scopus 로고    scopus 로고
    • An Efficient Implementation of CUDA Kernels on Multi-cores
    • MCUDA:, Technical report, University of Illinois at Urbana-Champaign, Urbana-Champaign, IL, USA, March
    • J. Stratton, S. Stone, W. Hwu, MCUDA: An Efficient Implementation of CUDA Kernels on Multi-cores. Technical report, University of Illinois at Urbana-Champaign, Urbana-Champaign, IL, USA, March 2008.
    • (2008)
    • Stratton, J.1    Stone, S.2    Hwu, W.3
  • 22
    • 85190771622 scopus 로고    scopus 로고
    • Various authors, Comparison of Nvidia graphics processing units - Wikipedia, The Free Encyclopedia. , 2008 (accessed 9.11.08).
    • Various authors, Comparison of Nvidia graphics processing units - Wikipedia, The Free Encyclopedia. , 2008 (accessed 9.11.08).
  • 23
    • 33846667672 scopus 로고    scopus 로고
    • An explicit construction of interpolation nodes on the simplex
    • Warburton T. An explicit construction of interpolation nodes on the simplex. J. Eng. Math. 56 (2006) 247-262
    • (2006) J. Eng. Math. , vol.56 , pp. 247-262
    • Warburton, T.1
  • 24
    • 55349094368 scopus 로고    scopus 로고
    • Taming the CFL number for discontinuous Galerkin methods on structured meshes
    • 10.1137/060672601
    • Warburton T., and Hagstrom T. Taming the CFL number for discontinuous Galerkin methods on structured meshes. SIAM J. Numer. Anal. 46 (2008) 3151-3180 10.1137/060672601
    • (2008) SIAM J. Numer. Anal. , vol.46 , pp. 3151-3180
    • Warburton, T.1    Hagstrom, T.2
  • 25
    • 0343462141 scopus 로고    scopus 로고
    • Automated empirical optimizations of software and the ATLAS project
    • Whaley R.C., Petitet A., and Dongarra J.J. Automated empirical optimizations of software and the ATLAS project. Parallel Comput. 27 (2001) 3-35
    • (2001) Parallel Comput. , vol.27 , pp. 3-35
    • Whaley, R.C.1    Petitet, A.2    Dongarra, J.J.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.