SCOPUS Information Search Platform

Concurrency and Computation: Practice and Experience

Volume 27, Issue 13, 2015, Pages 3205-3219

Parallel resolution of the 3D Helmholtz equation based on multi-graphics processing unit clusters

(5) Ortega, Gloria a; Lobera, Julia b; García, Inmaculada c; Pilar Arroyo, M d; Garzón, Ester M a

a UNIVERSITY OF ALMERÍA (Spain)

b CENTRO UNIVERSITARIO DE LA DEFENSA (Spain)

c UNIVERSITY OF MÁLAGA (Spain)

d UNIVERSITY OF ZARAGOZA (Spain)

Author keywords

biconjugate gradient method; Helmholtz equation; MPI paradigm; multi GPU

Indexed keywords

COMPUTER GRAPHICS EQUIPMENT; GRADIENT METHODS; HELMHOLTZ EQUATION; MESSAGE PASSING; PROGRAM PROCESSORS; THREE DIMENSIONAL COMPUTER GRAPHICS;

BI-CONJUGATE GRADIENT METHODS; HIGH-PERFORMANCE COMPUTING RESOURCES; LARGE SPARSE MATRIX; MESSAGE PASSING INTERFACE; MPI PARADIGM; MULTI-GPU; SPARSE MATRIX VECTOR PRODUCTS; TECHNOLOGICAL APPLICATIONS;

GRAPHICS PROCESSING UNIT;

EID: 84939529782 PISSN: 15320626 EISSN: 15320634 Source Type: Journal
DOI: 10.1002/cpe.3212 Document Type: Article

Times cited : (6)

References (43)

1
- 0003542431
- Springer-Verlag: Berlin.
- Dautray R, Lions JL. Mathematical Analysis and Numerical Methods for Science and Technology. Springer-Verlag: Berlin, 1990.
- (1990) Mathematical Analysis and Numerical Methods for Science and Technology
- Dautray, R.¹ Lions, J.L.²

2
- 0004079177
- (Second Edition). The MIT Press: Cambridge, Massachusetts.
- Junger MC, Feit D. Sound, Structures, and Their Interaction (Second Edition). The MIT Press: Cambridge, Massachusetts, 1986.
- (1986) Sound, Structures, and Their Interaction
- Junger, M.C.¹ Feit, D.²

3
- 23244462384
- Fast fluid dynamics simulation on the GPU
- Courses, New York, NY, USA.
- Harris M. Fast fluid dynamics simulation on the GPU. ACM SIGGRAPH 2005 Courses, New York, NY, USA, 2005; 637-665.
- (2005) ACM SIGGRAPH 2005 , pp. 637-665
- Harris, M.¹

4
- 0004250217
- (Second Edition). CRC Press: Boca Raton.
- Sadiku MNO. Numerical Techniques in Electromagnetics (Second Edition). CRC Press: Boca Raton, 2001.
- (2001) Numerical Techniques in Electromagnetics
- Sadiku, M.N.O.¹

5
- 33646473348
- Elsevier Science: Amsterdam, London.
- Gumerov NA, Duraiswami R. Fast Multipole Methods for the Helmholtz Equation in Three Dimensions. Elsevier Science: Amsterdam, London, 2004.
- (2004) Fast Multipole Methods for the Helmholtz Equation in Three Dimensions
- Gumerov, N.A.¹ Duraiswami, R.²

6
- 0031383340
- Solution of Helmholtz problems by knowledge-based FEM
- Ihlenburg F, Babuska I. Solution of Helmholtz problems by knowledge-based FEM. CAMES 1997; 4: 397-415.
- (1997) CAMES , vol.4 , pp. 397-415
- Ihlenburg, F.¹ Babuska, I.²

7
- 0141780468
- Is the pollution effect of the FEM avoidable for the Helmholtz equation considering high wave numbers?
- Babuska IM, Sauter SA. Is the pollution effect of the FEM avoidable for the Helmholtz equation considering high wave numbers? SIAM Review 2000; 42 (3): 451-484. DOI: 10.1137/S0036142994269186.
- (2000) SIAM Review , vol.42 , Issue.3 , pp. 451-484
- Babuska, I.M.¹ Sauter, S.A.²

8
- 1442315655
- Numerical solution of the Helmholtz equation with high wave numbers
- Bao G, Wei GW, Zhao S. Numerical solution of the Helmholtz equation with high wave numbers. International Journal for Numerical Methods in Engineering 2004; 59: 389-408.
- (2004) International Journal for Numerical Methods in Engineering , vol.59 , pp. 389-408
- Bao, G.¹ Wei, G.W.² Zhao, S.³

9
- 46749093802
- Optical diffraction tomography in fluid velocimetry: The use of a priori information
- Lobera J, Coupland JM. Optical diffraction tomography in fluid velocimetry: The use of a priori information. Measurement Science and Technology 2008; 19 (7): 074013.
- (2008) Measurement Science and Technology , vol.19 , Issue.7 , pp. 074013
- Lobera, J.¹ Coupland, J.M.²

10
- 84870399830
- Available from: [Accessed on 21 January 2014].
- TOP 500 supercomputing site. Available from: http://www.top500.org/ [Accessed on 21 January 2014].
- TOP 500 supercomputing site

11
- 82555200279
- Available from: [Accessed on 21 January 2014].
- Jacobsen DA, Thibault JC, Senocak I. An MPI-CUDA implementation for massively parallel incompressible flow computations on multi-GPU clusters, 2010. Available from: http://scholarworks.boisestate.edu/cgi/viewcontent.cgi?article=1004&context=mecheng-facpubs [Accessed on 21 January 2014].
- (2010) An MPI-CUDA implementation for massively parallel incompressible flow computations on multi-GPU clusters
- Jacobsen, D.A.¹ Thibault, J.C.² Senocak, I.³

12
- 84866991798
- High performance computing for optical diffraction Tomography
- (HPCS 2012).
- Ortega G, Lobera J, Arroyo MP, García I, Garzón EM. High performance computing for optical diffraction Tomography. Proceedings of The 2012 International Conference on High Performance Computing & Simulation (HPCS 2012), 2012; 195-201.
- (2012) Proceedings of The 2012 International Conference on High Performance Computing & Simulation , pp. 195-201
- Ortega, G.¹ Lobera, J.² Arroyo, M.P.³ García, I.⁴ Garzón, E.M.⁵

13
- 1842829625
- (Second Edition). SIAM: Philadelphia.
- Saad Y. Iterative Methods for Sparse Linear Systems (Second Edition). SIAM: Philadelphia, 2003.
- (2003) Iterative Methods for Sparse Linear Systems
- Saad, Y.¹

14
- 0003473816
- (2nd Edition). SIAM: Philadelphia, PA.
- Barrett R, Berry M, Chan TF, Demmel J, Donato J, Dongarra J, Eijkhout V, Pozo R, Romine C, Van der Vorst H. Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods (2nd Edition). SIAM: Philadelphia, PA, 1994.
- (1994) Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods
- Barrett, R.¹ Berry, M.² Chan, T.F.³ Demmel, J.⁴ Donato, J.⁵ Dongarra, J.⁶ Eijkhout, V.⁷ Pozo, R.⁸ Romine, C.⁹ Van der Vorst, H.¹⁰

15
- 3042707652
- Cambridge monographs on applied and computational mathematics, Cambridge University Press: Cambridge, UK, New York, Available from: [Accessed on 21 January 2014].
- Van der Vorst HA. Iterative Krylov Methods for Large Linear Systems, Cambridge monographs on applied and computational mathematics, Cambridge University Press: Cambridge, UK, New York, 2003. Available from: http://opac.inria.fr/record=b1100090 [Accessed on 21 January 2014].
- (2003) Iterative Krylov Methods for Large Linear Systems
- Van Der Vorst, H.A.¹

16
- 0003470761
- Society for Industrial and Applied Mathematics: Philadelphia, PA, USA.
- Greenbaum A. Iterative Methods for Solving Linear Systems. Society for Industrial and Applied Mathematics: Philadelphia, PA, USA, 1997.
- (1997) Iterative Methods for Solving Linear Systems
- Greenbaum, A.¹

17
- 0034399114
- Real valued iterative methods for solving complex symmetric linear systems
- Axelsson O, Kucherov A. Real valued iterative methods for solving complex symmetric linear systems. Numerical Linear Algebra with Applications 2000; 7 (4): 197-218, DOI: 10.1002/1099-1506(200005)7:4197::AID-NLA1943.0.CO;2-S.
- (2000) Numerical Linear Algebra with Applications , vol.7 , Issue.4 , pp. 197-218
- Axelsson, O.¹ Kucherov, A.²

18
- 0025401744
- A Petrov-Galerkin type method for solving Axk=b, where A is symmetric complex
- Van der Vorst HA, Melissen JBM. A Petrov-Galerkin type method for solving Axk=b, where A is symmetric complex. IEEE Transactions on Magnetics 1990; 26 (2): 706-708, DOI: 10.1109/20.106415.
- (1990) IEEE Transactions on Magnetics , vol.26 , Issue.2 , pp. 706-708
- Van Der Vorst, H.A.¹ Melissen, J.B.M.²

19
- 84857040838
- Available from: [Accessed on 21 January 2014].
- Balay S, et al. PETSc Users Manual. Revision 3.3. Available from: http://www.mcs.anl.gov/petsc/petsc-current/docs/manual.pdf [Accessed on 21 January 2014].
- PETSc Users Manual. Revision 3.3
- Balay, S.¹

20
- 84880331064
- A scalable Helmholtz solver in GRAPES over large-scale multicore cluster
- Li L, Xue W, Ranjan R, Jin Z. A scalable Helmholtz solver in GRAPES over large-scale multicore cluster. Concurrency and Computation: Practice and Experience 2013; 25 (12): 1722-1737. DOI: 10.1002/cpe.2979.
- (2013) Concurrency and Computation: Practice and Experience , vol.25 , Issue.12 , pp. 1722-1737
- Li, L.¹ Xue, W.² Ranjan, R.³ Jin, Z.⁴

21
- 84939243711
- 3D Helmholtz Krylov solver preconditioned by a shifted Laplace multigrid method on multi-GPUs
- In, Cangiani A. Davidchack R. Georgoulis E. Gorban A. Levesley J. Tretyakov M. (eds). Springer: Berlin Heidelberg.
- Knibbe H, Oosterlee CW, Vuik C. 3D Helmholtz Krylov solver preconditioned by a shifted Laplace multigrid method on multi-GPUs. In Numerical Mathematics and Advanced Applications 2011, Cangiani A, Davidchack R, Georgoulis E, Gorban A, Levesley J, Tretyakov M (eds). Springer: Berlin Heidelberg, 2013; 653-661.
- (2013) Numerical Mathematics and Advanced Applications 2011 , pp. 653-661
- Knibbe, H.¹ Oosterlee, C.W.² Vuik, C.³

22
- 84939564599
- Parallelization on heterogeneous multicore and multi-GPU systems of the fast multipole method for the Helmholtz equation using a runtime system
- Barcelone, Espagne, September. Available from: [Accessed on 21 January 2014].
- Bordage C. Parallelization on heterogeneous multicore and multi-GPU systems of the fast multipole method for the Helmholtz equation using a runtime system. ADVCIMP12, Barcelone, Espagne, September 2012; 90-95. Available from: http://hal.inria.fr/hal-00773114 [Accessed on 21 January 2014].
- (2012) ADVCIMP12 , pp. 90-95
- Bordage, C.¹

23
- 42449098971
- Numerical performance of a parallel solution method for a heterogeneous 2D Helmholtz equation
- April
- Kononov AV, Riyanti CD, de Leeuw SW, Oosterlee CW, Vuik C. Numerical performance of a parallel solution method for a heterogeneous 2D Helmholtz equation. Computing and Visualization in Science April 2008; 11 (3): 139-146. DOI: 10.1007/s00791-007-0069-6.
- (2008) Computing and Visualization in Science , vol.11 , Issue.3 , pp. 139-146
- Kononov, A.V.¹ Riyanti, C.D.² De Leeuw, S.W.³ Oosterlee, C.W.⁴ Vuik, C.⁵

24
- 60949098907
- Optimization of sparse matrix-vector multiplication on emerging multicore platforms
- Williams S, Oliker L, Vuduc R, Shalf J, Yelick K, Demmel J. Optimization of sparse matrix-vector multiplication on emerging multicore platforms. Parallel Computing 2009; 35 (3): 178-194. DOI: 10.1016/j.parco.2008.12.006.
- (2009) Parallel Computing , vol.35 , Issue.3 , pp. 178-194
- Williams, S.¹ Oliker, L.² Vuduc, R.³ Shalf, J.⁴ Yelick, K.⁵ Demmel, J.⁶

25
- 84939569307
- Math kernel library
- Available from: [Accessed on 21 January 2014].
- INTEL. Math kernel library, 2013. Available from: http://software.intel.com/en-us/articles/intel-math-kernel-library-documentation [Accessed on 21 January 2014].
- (2013) INTEL

26
- 60649099576
- Optimizing matrix multiplication for a short-vector SIMD architecture - CELL processor
- Kurzak J, Alvaro W, Dongarra J. Optimizing matrix multiplication for a short-vector SIMD architecture - CELL processor. Parallel Computing 2009; 35 (3): 138-150. DOI: 10.1016/j.parco.2008.12.010.
- (2009) Parallel Computing , vol.35 , Issue.3 , pp. 138-150
- Kurzak, J.¹ Alvaro, W.² Dongarra, J.³

27
- 74049143158
- Implementing sparse matrix-vector multiplication on throughput-oriented processors
- New York, NY, USA.
- Bell N, Garland M. Implementing sparse matrix-vector multiplication on throughput-oriented processors. SC '09: Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, New York, NY, USA, 2009; 1-11.
- (2009) SC '09: Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis , pp. 1-11
- Bell, N.¹ Garland, M.²

28
- 77952611196
- Concurrent number cruncher - a GPU implementation of a general sparse linear solver
- Buatois L, Caumon G, Lévy B. Concurrent number cruncher - a GPU implementation of a general sparse linear solver. International Journal of Parallel Emergent and Distributed Systems 2009; 24 (3): 205-223.
- (2009) International Journal of Parallel Emergent and Distributed Systems , vol.24 , Issue.3 , pp. 205-223
- Buatois, L.¹ Caumon, G.² Lévy, B.³

29
- 77949577730
- Automatically tuning sparse matrix-vector multiplication for GPU architectures
- Pisa, Italy.
- Monakov A, Lokhmotov A, Avetisyan A. Automatically tuning sparse matrix-vector multiplication for GPU architectures. Proceedings of HiPEAC 2010, LNCS 5952, Pisa, Italy, 2010; 111-125.
- (2010) Proceedings of HiPEAC 2010, LNCS 5952 , pp. 111-125
- Monakov, A.¹ Lokhmotov, A.² Avetisyan, A.³

30
- 79955614550
- A new approach for sparse matrix vector product on NVIDIA GPUs
- Vázquez F, Fernández JJ, Garzón EM. A new approach for sparse matrix vector product on NVIDIA GPUs. Concurrency and Computation: Practice and Experience 2011; 23 (8): 815-826. DOI: 10.1002/cpe.1658.
- (2011) Concurrency and Computation: Practice and Experience , vol.23 , Issue.8 , pp. 815-826
- Vázquez, F.¹ Fernández, J.J.² Garzón, E.M.³

31
- 78249244772
- Improving the performance of the sparse matrix vector product with GPUs
- CIT 2010.
- Vázquez F, Ortega G, Fernández JJ, Garzón EM. Improving the performance of the sparse matrix vector product with GPUs. 10th IEEE International Conference on Computer and Information Technology. CIT 2010, 2010; 1146-1151.
- (2010) 10th IEEE International Conference on Computer and Information Technology , pp. 1146-1151
- Vázquez, F.¹ Ortega, G.² Fernández, J.J.³ Garzón, E.M.⁴

32
- 84939568897
- Cusparse library
- Available from: [Accessed on 21 January 2014].
- NVIDIA. Cusparse library, V5.5, 2013. Available from: http://docs.nvidia.com/cuda/cusparse/ [Accessed on 21 January 2014].
- (2013) NVIDIA. V5.5

33
- 0004168818
- (3rd Edition). The Johns Hopkins University Press.
- Golub GH, Van Loan CF. Matrix Computations (Johns Hopkins Studies in Mathematical Sciences) (3rd Edition). The Johns Hopkins University Press, 1996.
- (1996) Matrix Computations (Johns Hopkins Studies in Mathematical Sciences)
- Golub, G.H.¹ Van Loan, C.F.²

34
- 77950518538
- A matrix approach to tomographic reconstruction and its implementation on GPUs
- Vázquez F, Garzón EM, Fernández JJ. A matrix approach to tomographic reconstruction and its implementation on GPUs. Journal of Structural Biology 2010; 170 (1): 146-151.
- (2010) Journal of Structural Biology , vol.170 , Issue.1 , pp. 146-151
- Vázquez, F.¹ Garzón, E.M.² Fernández, J.J.³

35
- 0001201384
- Finite element solution of the Helmholtz equation with high wave number part I: The h-version of the FEM
- Ihlenburg F, Babuska I. Finite element solution of the Helmholtz equation with high wave number part I: The h-version of the FEM. Computers & Mathematics with Applications 1995; 30 (9): 9-37. DOI: 10.1016/0898-1221(95)00144-N.
- (1995) Computers & Mathematics with Applications , vol.30 , Issue.9 , pp. 9-37
- Ihlenburg, F.¹ Babuska, I.²

36
- 28444496198
- Generalized finite element methods- main ideas, results and perspective
- Babuska I, Banerjee U, Osborn JE. Generalized finite element methods - main ideas, results and perspective. International Journal of Computational Methods 2004; 1: 153-156.
- (2004) International Journal of Computational Methods , vol.1 , pp. 153-156
- Babuska, I.¹ Banerjee, U.² Osborn, J.E.³

37
- 0031145041
- Preconditioned CG methods for sparse matrices on massively parallel machines
- Basermann A, Reichel B, Schelthoff C. Preconditioned CG methods for sparse matrices on massively parallel machines. Parallel Computing 1997; 23 (3): 381-398.
- (1997) Parallel Computing , vol.23 , Issue.3 , pp. 381-398
- Basermann, A.¹ Reichel, B.² Schelthoff, C.³

38
- 4744357005
- Oxford University Press: New York, USA.
- Bisseling RH. Parallel Scientific Computation. Oxford University Press: New York, USA, 2004.
- (2004) Parallel Scientific Computation
- Bisseling, R.H.¹

39
- 84875238105
- The biconjugate gradient method on GPUs
- Ortega G, Garzón EM, Vázquez F, García I. The biconjugate gradient method on GPUs. The Journal of Supercomputing 2013; 64: 49-58. DOI: 10.1007/s11227-012-0761-2.
- (2013) The Journal of Supercomputing , vol.64 , pp. 49-58
- Ortega, G.¹ Garzón, E.M.² Vázquez, F.³ García, I.⁴

40
- 0003719406
- (Revised 4th Edition), The Morgan Kaufmann Series in Computer Architecture and Design, Academic Press: Amsterdam.
- Patterson DA, Hennessy JL. Computer Organization and Design - The Hardware / Software Interface (Revised 4th Edition), The Morgan Kaufmann Series in Computer Architecture and Design, Academic Press: Amsterdam, 2012.
- (2012) Computer Organization and Design - The Hardware / Software Interface
- Patterson, D.A.¹ Hennessy, J.L.²

41
- 0003710740
- (2nd. (Revised)). MIT Press: Cambridge, MA, USA.
- Snir M, Otto S, Huss-Lederman S, Walker D, Dongarra J. MPI - The Complete Reference, Volume 1: The MPI Core (2nd. (Revised)). MIT Press: Cambridge, MA, USA, 1998.
- (1998) MPI-The Complete Reference, Volume 1: The MPI Core
- Snir, M.¹ Otto, S.² Huss-Lederman, S.³ Walker, D.⁴ Dongarra, J.⁵

42
- 84939562114
- NVIDIA Corporation, 2701 San Tomas Expressway. CUDA C Best Practices Guide. Available from: [Accessed on 21 January 2014].
- NVIDIA Corporation, 2701 San Tomas Expressway, Santa Clara 95050, USA. CUDA C Best Practices Guide, 2013. Available from: http://docs.nvidia.com/cuda/cuda-c-best-practices-guide/index.html [Accessed on 21 January 2014].
- (2013) Santa Clara 95050, USA

43
- 84868121117
- Optimization of power consumption in the iterative solution of sparse linear systems on graphics processors
- Anzt H, Castillo M, Fernández JC, Heuveline V, Igual FD, Mayo R, Quintana-Ortí ES. Optimization of power consumption in the iterative solution of sparse linear systems on graphics processors. Computer Science - R&D 2012; 27 (4): 299-307.
- (2012) Computer Science - R&D , vol.27 , Issue.4 , pp. 299-307
- Anzt, H.¹ Castillo, M.² Fernández, J.C.³ Heuveline, V.⁴ Igual, F.D.⁵ Mayo, R.⁶ Quintana-Ortí, E.S.⁷

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.