메뉴 건너뛰기




Volumn 27, Issue 13, 2015, Pages 3205-3219

Parallel resolution of the 3D Helmholtz equation based on multi-graphics processing unit clusters

Author keywords

biconjugate gradient method; Helmholtz equation; MPI paradigm; multi GPU

Indexed keywords

COMPUTER GRAPHICS EQUIPMENT; GRADIENT METHODS; HELMHOLTZ EQUATION; MESSAGE PASSING; PROGRAM PROCESSORS; THREE DIMENSIONAL COMPUTER GRAPHICS;

EID: 84939529782     PISSN: 15320626     EISSN: 15320634     Source Type: Journal    
DOI: 10.1002/cpe.3212     Document Type: Article
Times cited : (6)

References (43)
  • 3
    • 23244462384 scopus 로고    scopus 로고
    • Fast fluid dynamics simulation on the GPU
    • Courses, New York, NY, USA.
    • Harris M,. Fast fluid dynamics simulation on the GPU, ACM SIGGRAPH 2005 Courses, New York, NY, USA, 2005; 637-665.
    • (2005) ACM SIGGRAPH 2005 , pp. 637-665
    • Harris, M.1
  • 6
    • 0031383340 scopus 로고    scopus 로고
    • Solution of Helmholtz problems by knowledge-based FEM
    • Ihlenburg F, Babuska I,. Solution of Helmholtz problems by knowledge-based FEM. CAMES 1997; 4: 397-415.
    • (1997) CAMES , vol.4 , pp. 397-415
    • Ihlenburg, F.1    Babuska, I.2
  • 7
    • 0141780468 scopus 로고    scopus 로고
    • Is the pollution effect of the FEM avoidable for the Helmholtz equation considering high wave numbers?
    • Babuska IM, Sauter SA,. Is the pollution effect of the FEM avoidable for the Helmholtz equation considering high wave numbers? SIAM Review 2000; 42 (3): 451-484. DOI: 10.1137/S0036142994269186.
    • (2000) SIAM Review , vol.42 , Issue.3 , pp. 451-484
    • Babuska, I.M.1    Sauter, S.A.2
  • 9
    • 46749093802 scopus 로고    scopus 로고
    • Optical diffraction tomography in fluid velocimetry: The use of a priori information
    • Lobera J, Coupland JM,. Optical diffraction tomography in fluid velocimetry: The use of a priori information. Measurement Science and Technology 2008; 19 (7): 074013.
    • (2008) Measurement Science and Technology , vol.19 , Issue.7 , pp. 074013
    • Lobera, J.1    Coupland, J.M.2
  • 10
    • 84870399830 scopus 로고    scopus 로고
    • Available form: [Accessed on 21 january 2014].
    • TOP 500 supercomputing site. Available form: http://www.top500.org/ [Accessed on 21 january 2014].
    • TOP 500 supercomputing site
  • 15
    • 3042707652 scopus 로고    scopus 로고
    • Cambridge monographs on applied and computational mathematics, Cambridge University Press: Cambridge, UK, New York, Available from: [Accessed on 21 january 2014].
    • Van der Vorst HA,. Iterative Krylov Methods for Large Linear Systems, Cambridge monographs on applied and computational mathematics, Cambridge University Press: Cambridge, UK, New York, 2003. Available from: http://opac.inria.fr/record=b1100090 [Accessed on 21 january 2014].
    • (2003) Iterative Krylov Methods for Large Linear Systems
    • Van Der Vorst, H.A.1
  • 17
    • 0034399114 scopus 로고    scopus 로고
    • Real valued iterative methods for solving complex symmetric linear systems
    • Axelsson O, Kucherov A,. Real valued iterative methods for solving complex symmetric linear systems. Numerical Linear Algebra with Applications 2000; 7 (4): 197-218, DOI: 10.1002/1099-1506(200005)7:4197::AID-NLA1943.0.CO;2-S.
    • (2000) Numerical Linear Algebra with Applications , vol.7 , Issue.4 , pp. 197-218
    • Axelsson, O.1    Kucherov, A.2
  • 18
    • 0025401744 scopus 로고
    • A Petrov-Galerkin type method for solving Axk=b, where A is symmetric complex
    • Van der Vorst HA, Melissen JBM,. A Petrov-Galerkin type method for solving Axk=b, where A is symmetric complex. IEEE Transactions on Magnetics 1990; 26 (2): 706-708, DOI: 10.1109/20.106415.
    • (1990) IEEE Transactions on Magnetics , vol.26 , Issue.2 , pp. 706-708
    • Van Der Vorst, H.A.1    Melissen, J.B.M.2
  • 19
    • 84857040838 scopus 로고    scopus 로고
    • Available from: [Accessed on 21 january 2014].
    • Balay S, et al,. PETSc Users Manual. Revision 3.3. Available from: http://www.mcs.anl.gov/petsc/petsc-current/docs/manual.pdf [Accessed on 21 january 2014].
    • PETSc Users Manual. Revision 3.3
    • Balay, S.1
  • 20
    • 84880331064 scopus 로고    scopus 로고
    • A scalable Helmholtz solver in GRAPES over large-scale multicore cluster
    • Li L, Xue W, Ranjan R, Jin Z,. A scalable Helmholtz solver in GRAPES over large-scale multicore cluster. Concurrency and Computation: Practice and Experience 2013; 25 (12): 1722-1737. DOI: 10.1002/cpe.2979.
    • (2013) Concurrency and Computation: Practice and Experience , vol.25 , Issue.12 , pp. 1722-1737
    • Li, L.1    Xue, W.2    Ranjan, R.3    Jin, Z.4
  • 21
    • 84939243711 scopus 로고    scopus 로고
    • 3D Helmholtz Krylov solver preconditioned by a shifted Laplace multigrid method on multi-GPUS
    • In, Cangiani A. Davidchack R. Georgoulis E. Gorban A. Levesley J. Tretyakov M. (eds). Springer: Berlin Heidelberg.
    • Knibbe H, Oosterlee CW, Vuik C,. 3D Helmholtz Krylov solver preconditioned by a shifted Laplace multigrid method on multi-GPUs. In Numerical Mathematics and Advanced Applications 2011, Cangiani A, Davidchack R, Georgoulis E, Gorban A, Levesley J, Tretyakov M, (eds). Springer: Berlin Heidelberg, 2013; 653-661.
    • (2013) Numerical Mathematics and Advanced Applications 2011 , pp. 653-661
    • Knibbe, H.1    Oosterlee, C.W.2    Vuik, C.3
  • 22
    • 84939564599 scopus 로고    scopus 로고
    • Parallelization on heterogeneous multicore and multi-GPU systems of the fast multipole method for the Helmholtz equation using a runtime system
    • Barcelone, Espagne, September;. Available from: [Accessed on 21 january 2014].
    • Bordage C,. Parallelization on heterogeneous multicore and multi-GPU systems of the fast multipole method for the Helmholtz equation using a runtime system. ADVCIMP12, Barcelone, Espagne, September 2012; 90-95. Available from: http://hal.inria.fr/hal-00773114 [Accessed on 21 january 2014].
    • (2012) ADVCIMP12 , pp. 90-95
    • Bordage, C.1
  • 23
    • 42449098971 scopus 로고    scopus 로고
    • Numerical performance of a parallel solution method for a heterogeneous 2D Helmholtz equation
    • April
    • Kononov AV, Riyanti CD, de Leeuw SW, Oosterlee CW, Vuik C,. Numerical performance of a parallel solution method for a heterogeneous 2D Helmholtz equation. Computing and Visualization in Science April 2008; 11 (3): 139-146. DOI: 10.1007/s00791-007-0069-6.
    • (2008) Computing and Visualization in Science , vol.11 , Issue.3 , pp. 139-146
    • Kononov, A.V.1    Riyanti, C.D.2    De Leeuw, S.W.3    Oosterlee, C.W.4    Vuik, C.5
  • 24
    • 60949098907 scopus 로고    scopus 로고
    • Optimization of sparse matrix-vector multiplication on emerging multicore platforms
    • Williams S, Oliker L, Vuduc R, Shalf J, Yelick K, Demmel J,. Optimization of sparse matrix-vector multiplication on emerging multicore platforms. Parallel Computing 2009; 35 (3): 178-194. DOI: http://dx.doi.org/ 10.1016/j.parco.2008.12.006.
    • (2009) Parallel Computing , vol.35 , Issue.3 , pp. 178-194
    • Williams, S.1    Oliker, L.2    Vuduc, R.3    Shalf, J.4    Yelick, K.5    Demmel, J.6
  • 25
    • 84939569307 scopus 로고    scopus 로고
    • Math kernel library
    • . Available from: [Accessed on 21 january 2014].
    • INTEL. Math kernel library, 2013. Available from: http://software.intel.com/en-us/articles/intel-math-kernel-library-documentation [Accessed on 21 january 2014].
    • (2013) INTEL
  • 26
    • 60649099576 scopus 로고    scopus 로고
    • Optimizing matrix multiplication for a short-vector SIMD architecture - CELL processor
    • Kurzak J, Alvaro W, Dongarra J,. Optimizing matrix multiplication for a short-vector SIMD architecture-CELL processor. Parallel Computing 2009; 35 (3): 138-150. DOI: http://dx.doi.org/ 10.1016/j.parco.2008.12.010.
    • (2009) Parallel Computing , vol.35 , Issue.3 , pp. 138-150
    • Kurzak, J.1    Alvaro, W.2    Dongarra, J.3
  • 29
    • 77949577730 scopus 로고    scopus 로고
    • Automatically tuning sparse matrix-vector multiplication for GPU architectures
    • Pisa, Italy.
    • Monakov A, Lokhmotov A, Avetisyan A,. Automatically tuning sparse matrix-vector multiplication for GPU architectures. Proceedings of HiPEAC 2010, LNCS 5952, Pisa, Italy, 2010; 111-125.
    • (2010) Proceedings of HiPEAC 2010, LNCS 5952 , pp. 111-125
    • Monakov, A.1    Lokhmotov, A.2    Avetisyan, A.3
  • 32
    • 84939568897 scopus 로고    scopus 로고
    • Cusparse library
    • Available from: [Accessed on 21 january 2014].
    • NVIDIA. Cusparse library, V5.5, 2013. Available from: http://docs.nvidia.com/cuda/cusparse/ [Accessed on 21 january 2014].
    • (2013) NVIDIA. V5.5
  • 34
    • 77950518538 scopus 로고    scopus 로고
    • A matrix approach to tomographic reconstruction and its implementation on GPUS
    • Vázquez F, Garzõn EM, Fernández JJ,. A matrix approach to tomographic reconstruction and its implementation on GPUs. Journal of Structural Biology 2010; 170 (1): 146-151.
    • (2010) Journal of Structural Biology , vol.170 , Issue.1 , pp. 146-151
    • Vázquez, F.1    Garzõn, E.M.2    Fernández, J.J.3
  • 35
    • 0001201384 scopus 로고
    • Finite element solution of the Helmholtz equation with high wave number part I: The h-version of the FEM
    • Ihlenburg F, Babuska I,. Finite element solution of the Helmholtz equation with high wave number part I: The h-version of the FEM. Computers & Mathematics with Applications 1995; 30 (9): 9-37. DOI: 10.1016/0898-1221(95)00144-N.
    • (1995) Computers & Mathematics with Applications , vol.30 , Issue.9 , pp. 9-37
    • Ihlenburg, F.1    Babuska, I.2
  • 37
    • 0031145041 scopus 로고    scopus 로고
    • Preconditioned CG methods for sparse matrices on massively parallel machines
    • Basermann A, Reichel B, Schelthoff C,. Preconditioned CG methods for sparse matrices on massively parallel machines. Parallel Computing 1997; 23 (3): 381-398.
    • (1997) Parallel Computing , vol.23 , Issue.3 , pp. 381-398
    • Basermann, A.1    Reichel, B.2    Schelthoff, C.3
  • 42
    • 84939562114 scopus 로고    scopus 로고
    • NVIDIA Corporation 2701 San Tomas Expressway. CUDA C Best Practices Guide., Available from: [Accessed on 21 january 2014].
    • NVIDIA Corporation 2701 San Tomas Expressway. Santa Clara 95050, USA. CUDA C Best Practices Guide., 2013. Available from: http://docs.nvidia.com/cuda/cuda-c-best-practices-guide/index.html [Accessed on 21 january 2014].
    • (2013) Santa Clara 95050, USA


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.