SCOPUS 정보 검색 플랫폼

Journal of Parallel and Distributed Computing

Volumn 69, Issue 5, 2009, Pages 451-460

Porting a high-order finite-element earthquake modeling application to NVIDIA graphics cards using CUDA

(3) Komatitsch, Dimitri a,b Michéa, David a Erlebacher, Gordon c

a UNIVERSITÉ DE PAU ET DES PAYS DE L'ADOUR (France)

b INSTITUT UNIVERSITAIRE DE FRANCE (France)

c FLORIDA STATE UNIVERSITY (United States)

Author keywords

CUDA; Finite elements; GPGPU; Spectral methods; Speedup

Indexed keywords

CUDA; FINITE ELEMENTS; GPGPU; SPECTRAL METHODS; SPEEDUP;

APPLICATIONS; PROGRAM PROCESSORS; SEISMIC WAVES; SPECTRUM ANALYSIS;

EARTHQUAKES;

EID: 64449087473 PISSN: 07437315 EISSN: None Source Type: Journal
DOI: 10.1016/j.jpdc.2009.01.006 Document Type: Article

Times cited : (149)

References (30)

1
- 64449085151
- Master's Thesis, ENSEIRB, Bordeaux, France
- R. Abdelkhalek, Évaluation des accélérateurs de calcul GPGPU pour la modélisation sismique, Master's Thesis, ENSEIRB, Bordeaux, France, 2007
- (2007) Évaluation des accélérateurs de calcul GPGPU pour la modélisation sismique
- Abdelkhalek, R.¹

2
- 41249087856
- General purpose molecular dynamics simulations fully implemented on graphics processing units
- Anderson J.A., Lorenz C.D., and Travesset A. General purpose molecular dynamics simulations fully implemented on graphics processing units. J. Comput. Phys. 227 10 (2008) 5342-5359
- (2008) J. Comput. Phys. , vol.227 , Issue.10 , pp. 5342-5359
- Anderson, J.A.¹ Lorenz, C.D.² Travesset, A.³

3
- 64449083379
- A mesh coloring method for efficient MIMD processing in finite element problems
- ICPP'82, August 24-27, 1982, Bellaire, Michigan, USA, IEEE Computer Society
- Berger P., Brouaye P., and Syre J.C. A mesh coloring method for efficient MIMD processing in finite element problems. Proceedings of the International Conference on Parallel Processing. ICPP'82, August 24-27, 1982, Bellaire, Michigan, USA (1982), IEEE Computer Society 41-46
- (1982) Proceedings of the International Conference on Parallel Processing , pp. 41-46
- Berger, P.¹ Brouaye, P.² Syre, J.C.³

4
- 38349000703
- Acceleration of a two-dimensional Euler flow solver using commodity graphics hardware
- Proceedings of the Institute of Mechanical Engineers
- Brandvik T., and Pullan G. Acceleration of a two-dimensional Euler flow solver using commodity graphics hardware. Proceedings of the Institute of Mechanical Engineers. Part C: J. Mech. Eng., Part C: J. Mech. Eng. Sci. 221 12 (2007) 1745-1748
- (2007) Part C: J. Mech. Eng., Part C: J. Mech. Eng. Sci. , vol.221 , Issue.12 , pp. 1745-1748
- Brandvik, T.¹ Pullan, G.²

5
- 64349125096
- I. Buck, GeForce 8800 and NVIDIA CUDA: A new architecture for computing on the GPU, in: Proceedings of the Supercomputing'06 Workshop on General-Purpose GPU Computing: Practice and Experience, 2006. URL www.gpgpu.org/sc2006/workshop/presentations/Buck_NVIDIA_Cuda.pdf
- I. Buck, GeForce 8800 and NVIDIA CUDA: A new architecture for computing on the GPU, in: Proceedings of the Supercomputing'06 Workshop on "General-Purpose GPU Computing: Practice and Experience", 2006. URL www.gpgpu.org/sc2006/workshop/presentations/Buck_NVIDIA_Cuda.pdf

6
- 64449083791
- March, URL
- D. Dobb's, Dr. Dobb's Portal web site (March 2008). URL www.ddj.com/hpc-high-performance-computing/207200659
- (2008) Dobb's Portal web site
- Dobb's, D.¹ Dr²

7
- 68249112512
- A hybrid multi-core parallel programming environment
- HMPP:, Boston, MA, USA, URL
- R. Dolbeau, S. Bihan, F. Bodin, HMPP: A hybrid multi-core parallel programming environment, in: Proceedings of the Workshop on General Purpose Processing on Graphics Processing Units, GPGPU'2007, Boston, MA, USA, 2007. URL www.irisa.fr/caps/projects/Astex
- (2007) Proceedings of the Workshop on General Purpose Processing on Graphics Processing Units, GPGPU
- Dolbeau, R.¹ Bihan, S.² Bodin, F.³

8
- 0024606944
- A general approach to nonlinear finite-element computations on shared-memory multiprocessors
- Farhat C., and Crivelli L. A general approach to nonlinear finite-element computations on shared-memory multiprocessors. Comput. Methods Appl. Mech. Engrg. 72 2 (1989) 153-171
- (1989) Comput. Methods Appl. Mech. Engrg. , vol.72 , Issue.2 , pp. 153-171
- Farhat, C.¹ Crivelli, L.²

9
- 35748969304
- Exploring weak scalability for FEM calculations on a GPU-enhanced cluster
- Göddeke D., Strzodka R., Mohd-Yusof J., McCormick P., Buijssen S.H.M., Grajewski M., and Turek S. Exploring weak scalability for FEM calculations on a GPU-enhanced cluster. Parallel Comput. 33 10-11 (2007) 685-699
- (2007) Parallel Comput. , vol.33 , Issue.10-11 , pp. 685-699
- Göddeke, D.¹ Strzodka, R.² Mohd-Yusof, J.³ McCormick, P.⁴ Buijssen, S.H.M.⁵ Grajewski, M.⁶ Turek, S.⁷

10
- 33947588604
- Performance and accuracy of hardware-oriented native-, emulated- and mixed-precision solvers in FEM simulations
- Göddeke D., Strzodka R., and Turek S. Performance and accuracy of hardware-oriented native-, emulated- and mixed-precision solvers in FEM simulations. Internat. J. Parallel Emerg. Distrib. Syst. 22 4 (2007) 221-256
- (2007) Internat. J. Parallel Emerg. Distrib. Syst. , vol.22 , Issue.4 , pp. 221-256
- Göddeke, D.¹ Strzodka, R.² Turek, S.³

11
- 35948931417
- Cache-efficient numerical algorithms using graphics hardware
- Govindaraju N.K., and Manocha D. Cache-efficient numerical algorithms using graphics hardware. Parallel Comput. 33 (2007) 663-684
- (2007) Parallel Comput. , vol.33 , pp. 663-684
- Govindaraju, N.K.¹ Manocha, D.²

12
- 0023314898
- Large-scale vectorized implicit calculations in solid mechanics on a Cray X-MP/48 utilizing EBE preconditioned conjugate gradients
- Hughes T.J.R., Ferencz R.M., and Hallquist J.O. Large-scale vectorized implicit calculations in solid mechanics on a Cray X-MP/48 utilizing EBE preconditioned conjugate gradients. Comput. Methods Appl. Mech. Engrg. 61 2 (1987) 215-248
- (1987) Comput. Methods Appl. Mech. Engrg. , vol.61 , Issue.2 , pp. 215-248
- Hughes, T.J.R.¹ Ferencz, R.M.² Hallquist, J.O.³

13
- 84947292580
- Performance analysis of multilevel parallel applications on shared memory architectures
- Nice, France, URL
- G. Jost, H. Jin, J. Labarta, J. Giménez, J. Caubet, Performance analysis of multilevel parallel applications on shared memory architectures, in: Proceedings of the IPDPS'2003 International Parallel and Distributed Processing Symposium, Nice, France, 2003. URL www.cepba.upc.es/paraver
- (2003) Proceedings of the IPDPS'2003 International Parallel and Distributed Processing Symposium
- Jost, G.¹ Jin, H.² Labarta, J.³ Giménez, J.⁴ Caubet, J.⁵

14
- 77950377383
- T. Kim, Hardware-aware analysis and optimization of 'Stable Fluids', in: Proceedings of the ACM Symposium on Interactive 3D Graphics and Games, 2008
- T. Kim, Hardware-aware analysis and optimization of 'Stable Fluids', in: Proceedings of the ACM Symposium on Interactive 3D Graphics and Games, 2008

15
- 58349102183
- A simulation of seismic wave propagation at high resolution in the inner core of the Earth on 2166 processors of MareNostrum
- Komatitsch D., Labarta J., and Michéa D. A simulation of seismic wave propagation at high resolution in the inner core of the Earth on 2166 processors of MareNostrum. Lecture Notes in Computer Science vol. 5336 (2008) 364-377
- (2008) Lecture Notes in Computer Science , vol.5336 , pp. 364-377
- Komatitsch, D.¹ Labarta, J.² Michéa, D.³

16
- 0033400861
- Introduction to the spectral-element method for 3-D seismic wave propagation
- URL www.geodynamics.org/cig/software/packages/seismo
- Komatitsch D., and Tromp J. Introduction to the spectral-element method for 3-D seismic wave propagation. Geophys. J. Int. 139 3 (1999) 806-822. http://www.geodynamics.org/cig/software/packages/seismo URL www.geodynamics.org/cig/software/packages/seismo
- (1999) Geophys. J. Int. , vol.139 , Issue.3 , pp. 806-822
- Komatitsch, D.¹ Tromp, J.²

17
- 84877030797
- A 14.6 billion degrees of freedom, 5 teraflops, 2.5 terabyte earthquake simulation on the Earth Simulator
- D. Komatitsch, S. Tsuboi, C. Ji, J. Tromp, A 14.6 billion degrees of freedom, 5 teraflops, 2.5 terabyte earthquake simulation on the Earth Simulator, in: Proceedings of the ACM/IEEE Supercomputing SC'2003 Conference, 2003, pp. 4-11
- (2003) Proceedings of the ACM/IEEE Supercomputing SC'2003 Conference , pp. 4-11
- Komatitsch, D.¹ Tsuboi, S.² Ji, C.³ Tromp, J.⁴

18
- 78649804288
- Acceleration of time-domain finite element method (TD-FEM) using Graphics Processor Units (GPU)
- Guilin, China
- K. Liu, X.B. Wang, Y. Zhang, C. Liao, Acceleration of time-domain finite element method (TD-FEM) using Graphics Processor Units (GPU), in: Proceedings of the 7th International Symposium on Antennas, Propagation & EM Theory, ISAPE '06, Guilin, China, 2006
- (2006) Proceedings of the 7th International Symposium on Antennas, Propagation & EM Theory, ISAPE '06
- Liu, K.¹ Wang, X.B.² Zhang, Y.³ Liao, C.⁴

19
- 12144275095
- Spectral-element moment-tensor inversions for earthquakes in Southern California
- Liu Q., Polet J., Komatitsch D., and Tromp J. Spectral-element moment-tensor inversions for earthquakes in Southern California. Bull. Seismol. Soc. Amer. 94 5 (2004) 1748-1761
- (2004) Bull. Seismol. Soc. Amer. , vol.94 , Issue.5 , pp. 1748-1761
- Liu, Q.¹ Polet, J.² Komatitsch, D.³ Tromp, J.⁴

20
- 35948940866
- Scout: A data-parallel programming language for graphics processors
- McCormick P., Inman J., Ahrens J., Mohd-Yusof J., Roth G., and Cummins S. Scout: A data-parallel programming language for graphics processors. Parallel Comput. 33 (2007) 648-662
- (2007) Parallel Comput. , vol.33 , pp. 648-662
- McCormick, P.¹ Inman, J.² Ahrens, J.³ Mohd-Yusof, J.⁴ Roth, G.⁵ Cummins, S.⁶

21
- 50649094257
- GPULib: GPU computing in high-level languages
- Messmer P., Mullowney P.J., and Granger B.E. GPULib: GPU computing in high-level languages. Comput. Sci. Engrg. 10 5 (2008) 70-73
- (2008) Comput. Sci. Engrg. , vol.10 , Issue.5 , pp. 70-73
- Messmer, P.¹ Mullowney, P.J.² Granger, B.E.³

22
- 64449084366
- NVIDIA, Version 1.1, NVIDIA Corporation, Santa Clara, CA, USA, 143 pages November
- NVIDIA, CUDA (Compute Unified Device Architecture) Programming Guide Version 1.1, NVIDIA Corporation, Santa Clara, CA, USA, 143 pages (November 2007)
- (2007) CUDA (Compute Unified Device Architecture) Programming Guide

23
- 64449084184
- NVIDIA GeForce GTX 200 GPU architectural overview, second-generation unified GPU architecture for visual computing
- Tech. Rep, NVIDIA, 2008. URL
- NVIDIA, NVIDIA GeForce GTX 200 GPU architectural overview, second-generation unified GPU architecture for visual computing, Tech. Rep., NVIDIA, 2008. URL www.nvidia.com/docs/IO/55506/GeForce_GTX_200_GPU_Technical_Brief.pdf

24
- 44849094749
- Fast N-body simulation with CUDA
- Addison-Wesley Professional (Chapter 31)
- Nyland L., Harris M., and Prins J. Fast N-body simulation with CUDA. GPU Gems 3 (2007), Addison-Wesley Professional 677-695 (Chapter 31)
- (2007) GPU Gems 3 , pp. 677-695
- Nyland, L.¹ Harris, M.² Prins, J.³

25
- 33947588048
- A survey of general-purpose computation on graphics hardware
- Owens J.D., Luebke D., Govindaraju N., Harris M., Krüger J., Lefohn A.E., and Purcell T.J. A survey of general-purpose computation on graphics hardware. Comput. Graph. Forum 26 1 (2007) 80-113
- (2007) Comput. Graph. Forum , vol.26 , Issue.1 , pp. 80-113
- Owens, J.D.¹ Luebke, D.² Govindaraju, N.³ Harris, M.⁴ Krüger, J.⁵ Lefohn, A.E.⁶ Purcell, T.J.⁷

26
- 77953967887
- Extracting threads using traces for system on a chip
- La Coruña, Spain
- E. Petit, F. Bodin, Extracting threads using traces for system on a chip, in: Proceedings of the Compilers for Parallel Computers, CPC'2006, La Coruña, Spain, 2006
- (2006) Proceedings of the Compilers for Parallel Computers, CPC
- Petit, E.¹ Bodin, F.²

27
- 43049153024
- High-speed nonlinear finite element analysis for surgical simulation using Graphics Processing Units
- Taylor Z.A., Cheng M., and Ourselin S. High-speed nonlinear finite element analysis for surgical simulation using Graphics Processing Units. IEEE Trans. Med. Imaging 27 5 (2008) 650-663
- (2008) IEEE Trans. Med. Imaging , vol.27 , Issue.5 , pp. 650-663
- Taylor, Z.A.¹ Cheng, M.² Ourselin, S.³

28
- 48349115111
- High-level programming of graphics hardware to increase performance of electromagnetics simulation
- M. Woolsey, W.E. Hutchcraft, R.K. Gordon, High-level programming of graphics hardware to increase performance of electromagnetics simulation, in: Proceedings of the 2007 IEEE International Symposium on Antennas and Propagation, 2007
- (2007) Proceedings of the 2007 IEEE International Symposium on Antennas and Propagation
- Woolsey, M.¹ Hutchcraft, W.E.² Gordon, R.K.³

29
- 23444434540
- A hybrid condensed finite element model with GPU acceleration for interactive 3D soft tissue cutting: Research articles
- Wu W., and Heng P.A. A hybrid condensed finite element model with GPU acceleration for interactive 3D soft tissue cutting: Research articles. Comput. Animat. Virtual Worlds Archive. 15 3-4 (2004) 219-227
- (2004) Comput. Animat. Virtual Worlds Archive. , vol.15 , Issue.3-4 , pp. 219-227
- Wu, W.¹ Heng, P.A.²

30
- 24944437464
- An improved scheme of an interactive finite element model for 3D soft-tissue cutting and deformation
- Wu W., and Heng P.A. An improved scheme of an interactive finite element model for 3D soft-tissue cutting and deformation. Vis. Comput. 21 8-10 (2005) 707-717
- (2005) Vis. Comput. , vol.21 , Issue.8-10 , pp. 707-717
- Wu, W.¹ Heng, P.A.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.