



Volume 69, Issue 5, 2009, Pages 451-460

Porting a high-order finite-element earthquake modeling application to NVIDIA graphics cards using CUDA

Author keywords

CUDA; Finite elements; GPGPU; Spectral methods; Speedup

Indexed keywords

CUDA; FINITE ELEMENTS; GPGPU; SPECTRAL METHODS; SPEEDUP;

EID: 64449087473     PISSN: 07437315     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.jpdc.2009.01.006     Document Type: Article
Times cited : (149)

References (30)
  • 2
    • 41249087856
    • General purpose molecular dynamics simulations fully implemented on graphics processing units
    • Anderson J.A., Lorenz C.D., and Travesset A. General purpose molecular dynamics simulations fully implemented on graphics processing units. J. Comput. Phys. 227 10 (2008) 5342-5359
    • (2008) J. Comput. Phys. , vol.227 , Issue.10 , pp. 5342-5359
    • Anderson, J.A.1    Lorenz, C.D.2    Travesset, A.3
  • 3
    • 64449083379
    • A mesh coloring method for efficient MIMD processing in finite element problems
    • ICPP'82, August 24-27, 1982, Bellaire, Michigan, USA, IEEE Computer Society
    • Berger P., Brouaye P., and Syre J.C. A mesh coloring method for efficient MIMD processing in finite element problems. Proceedings of the International Conference on Parallel Processing. ICPP'82, August 24-27, 1982, Bellaire, Michigan, USA (1982), IEEE Computer Society 41-46
    • (1982) Proceedings of the International Conference on Parallel Processing , pp. 41-46
    • Berger, P.1    Brouaye, P.2    Syre, J.C.3
  • 4
    • 38349000703
    • Acceleration of a two-dimensional Euler flow solver using commodity graphics hardware
    • Proceedings of the Institution of Mechanical Engineers
    • Brandvik T., and Pullan G. Acceleration of a two-dimensional Euler flow solver using commodity graphics hardware. Proceedings of the Institution of Mechanical Engineers, Part C: J. Mech. Eng. Sci. 221 12 (2007) 1745-1748
    • (2007) Proceedings of the Institution of Mechanical Engineers, Part C: J. Mech. Eng. Sci. , vol.221 , Issue.12 , pp. 1745-1748
    • Brandvik, T.1    Pullan, G.2
  • 5
    • 64349125096
    • I. Buck, GeForce 8800 and NVIDIA CUDA: A new architecture for computing on the GPU, in: Proceedings of the Supercomputing'06 Workshop on "General-Purpose GPU Computing: Practice and Experience", 2006. URL www.gpgpu.org/sc2006/workshop/presentations/Buck_NVIDIA_Cuda.pdf
  • 6
    • 64449083791
    • Dr. Dobb's Portal web site (March 2008). URL www.ddj.com/hpc-high-performance-computing/207200659
  • 8
    • 0024606944
    • A general approach to nonlinear finite-element computations on shared-memory multiprocessors
    • Farhat C., and Crivelli L. A general approach to nonlinear finite-element computations on shared-memory multiprocessors. Comput. Methods Appl. Mech. Engrg. 72 2 (1989) 153-171
    • (1989) Comput. Methods Appl. Mech. Engrg. , vol.72 , Issue.2 , pp. 153-171
    • Farhat, C.1    Crivelli, L.2
  • 10
    • 33947588604
    • Performance and accuracy of hardware-oriented native-, emulated- and mixed-precision solvers in FEM simulations
    • Göddeke D., Strzodka R., and Turek S. Performance and accuracy of hardware-oriented native-, emulated- and mixed-precision solvers in FEM simulations. Internat. J. Parallel Emerg. Distrib. Syst. 22 4 (2007) 221-256
    • (2007) Internat. J. Parallel Emerg. Distrib. Syst. , vol.22 , Issue.4 , pp. 221-256
    • Göddeke, D.1    Strzodka, R.2    Turek, S.3
  • 11
    • 35948931417
    • Cache-efficient numerical algorithms using graphics hardware
    • Govindaraju N.K., and Manocha D. Cache-efficient numerical algorithms using graphics hardware. Parallel Comput. 33 (2007) 663-684
    • (2007) Parallel Comput. , vol.33 , pp. 663-684
    • Govindaraju, N.K.1    Manocha, D.2
  • 12
    • 0023314898
    • Large-scale vectorized implicit calculations in solid mechanics on a Cray X-MP/48 utilizing EBE preconditioned conjugate gradients
    • Hughes T.J.R., Ferencz R.M., and Hallquist J.O. Large-scale vectorized implicit calculations in solid mechanics on a Cray X-MP/48 utilizing EBE preconditioned conjugate gradients. Comput. Methods Appl. Mech. Engrg. 61 2 (1987) 215-248
    • (1987) Comput. Methods Appl. Mech. Engrg. , vol.61 , Issue.2 , pp. 215-248
    • Hughes, T.J.R.1    Ferencz, R.M.2    Hallquist, J.O.3
  • 14
    • 77950377383
    • T. Kim, Hardware-aware analysis and optimization of 'Stable Fluids', in: Proceedings of the ACM Symposium on Interactive 3D Graphics and Games, 2008
  • 15
    • 58349102183
    • A simulation of seismic wave propagation at high resolution in the inner core of the Earth on 2166 processors of MareNostrum
    • Komatitsch D., Labarta J., and Michéa D. A simulation of seismic wave propagation at high resolution in the inner core of the Earth on 2166 processors of MareNostrum. Lecture Notes in Computer Science vol. 5336 (2008) 364-377
    • (2008) Lecture Notes in Computer Science , vol.5336 , pp. 364-377
    • Komatitsch, D.1    Labarta, J.2    Michéa, D.3
  • 16
    • 0033400861
    • Introduction to the spectral-element method for 3-D seismic wave propagation
    • URL www.geodynamics.org/cig/software/packages/seismo
    • Komatitsch D., and Tromp J. Introduction to the spectral-element method for 3-D seismic wave propagation. Geophys. J. Int. 139 3 (1999) 806-822. URL www.geodynamics.org/cig/software/packages/seismo
    • (1999) Geophys. J. Int. , vol.139 , Issue.3 , pp. 806-822
    • Komatitsch, D.1    Tromp, J.2
  • 19
    • 12144275095
    • Spectral-element moment-tensor inversions for earthquakes in Southern California
    • Liu Q., Polet J., Komatitsch D., and Tromp J. Spectral-element moment-tensor inversions for earthquakes in Southern California. Bull. Seismol. Soc. Amer. 94 5 (2004) 1748-1761
    • (2004) Bull. Seismol. Soc. Amer. , vol.94 , Issue.5 , pp. 1748-1761
    • Liu, Q.1    Polet, J.2    Komatitsch, D.3    Tromp, J.4
  • 22
    • 64449084366
    • NVIDIA, Version 1.1, NVIDIA Corporation, Santa Clara, CA, USA, 143 pages November
    • NVIDIA, CUDA (Compute Unified Device Architecture) Programming Guide Version 1.1, NVIDIA Corporation, Santa Clara, CA, USA, 143 pages (November 2007)
    • (2007) CUDA (Compute Unified Device Architecture) Programming Guide
  • 23
    • 64449084184
    • NVIDIA GeForce GTX 200 GPU architectural overview, second-generation unified GPU architecture for visual computing
    • Tech. Rep, NVIDIA, 2008. URL
    • NVIDIA, NVIDIA GeForce GTX 200 GPU architectural overview, second-generation unified GPU architecture for visual computing, Tech. Rep., NVIDIA, 2008. URL www.nvidia.com/docs/IO/55506/GeForce_GTX_200_GPU_Technical_Brief.pdf
  • 24
    • 44849094749
    • Fast N-body simulation with CUDA
    • Addison-Wesley Professional (Chapter 31)
    • Nyland L., Harris M., and Prins J. Fast N-body simulation with CUDA. GPU Gems 3 (2007), Addison-Wesley Professional 677-695 (Chapter 31)
    • (2007) GPU Gems 3 , pp. 677-695
    • Nyland, L.1    Harris, M.2    Prins, J.3
  • 27
    • 43049153024
    • High-speed nonlinear finite element analysis for surgical simulation using Graphics Processing Units
    • Taylor Z.A., Cheng M., and Ourselin S. High-speed nonlinear finite element analysis for surgical simulation using Graphics Processing Units. IEEE Trans. Med. Imaging 27 5 (2008) 650-663
    • (2008) IEEE Trans. Med. Imaging , vol.27 , Issue.5 , pp. 650-663
    • Taylor, Z.A.1    Cheng, M.2    Ourselin, S.3
  • 29
    • 23444434540
    • A hybrid condensed finite element model with GPU acceleration for interactive 3D soft tissue cutting
    • Wu W., and Heng P.A. A hybrid condensed finite element model with GPU acceleration for interactive 3D soft tissue cutting. Comput. Animat. Virtual Worlds 15 3-4 (2004) 219-227
    • (2004) Comput. Animat. Virtual Worlds , vol.15 , Issue.3-4 , pp. 219-227
    • Wu, W.1    Heng, P.A.2
  • 30
    • 24944437464
    • An improved scheme of an interactive finite element model for 3D soft-tissue cutting and deformation
    • Wu W., and Heng P.A. An improved scheme of an interactive finite element model for 3D soft-tissue cutting and deformation. Vis. Comput. 21 8-10 (2005) 707-717
    • (2005) Vis. Comput. , vol.21 , Issue.8-10 , pp. 707-717
    • Wu, W.1    Heng, P.A.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.