Volume 23, Issue 1, 2011, Pages 22-32

Cyclic reduction tridiagonal solvers on GPUs applied to mixed-precision multigrid

Author keywords

cyclic reduction; finite elements; GPU Computing; mixed precision iterative refinement; multigrid; NVIDIA CUDA; tridiagonal solvers

Indexed keywords

CYCLIC REDUCTION; FINITE ELEMENT; GPU COMPUTING; ITERATIVE REFINEMENT; MULTI-GRID; NVIDIA CUDA; TRI-DIAGONAL SOLVER

EID: 78649807974     PISSN: 10459219     EISSN: None     Source Type: Journal    
DOI: 10.1109/TPDS.2010.61     Document Type: Article
Times cited: 96

References (34)
  • 4
    • 44849137198
    • NVIDIA Tesla: A unified graphics and computing architecture
    • Mar./Apr.
    • E. Lindholm, J. Nickolls, S. Oberman, and J. Montrym, "NVIDIA Tesla: A Unified Graphics and Computing Architecture," IEEE Micro, vol. 28, no. 2, pp. 39-55, Mar./Apr. 2008.
    • (2008) IEEE Micro , vol.28 , Issue.2 , pp. 39-55
    • Lindholm, E.1    Nickolls, J.2    Oberman, S.3    Montrym, J.4
  • 5
    • 78651550268
    • Scalable parallel programming with CUDA
    • Mar./Apr.
    • J. Nickolls, I. Buck, M. Garland, and K. Skadron, "Scalable Parallel Programming with CUDA," ACM Queue, vol. 6, no. 2, pp. 40-53, Mar./Apr. 2008.
    • (2008) ACM Queue , vol.6 , Issue.2 , pp. 40-53
    • Nickolls, J.1    Buck, I.2    Garland, M.3    Skadron, K.4
  • 6
    • 77953998137
    • Sparse matrix solvers on the GPU: Conjugate gradients and multigrid
    • July
    • J. Bolz, I. Farmer, E. Grinspun, and P. Schröder, "Sparse Matrix Solvers on the GPU: Conjugate Gradients and Multigrid," ACM Trans. Graphics, vol. 22, no. 3, pp. 917-924, July 2003.
    • (2003) ACM Trans. Graphics , vol.22 , Issue.3 , pp. 917-924
    • Bolz, J.1    Farmer, I.2    Grinspun, E.3    Schröder, P.4
  • 7
    • 11144277251
    • A multigrid solver for boundary value problems using programmable graphics hardware
    • M. Doggett, W. Heidrich, W.R. Mark, and A. Schilling, eds. July
    • N. Goodnight, C. Woolley, G. Lewin, D.P. Luebke, and G. Humphreys, "A Multigrid Solver for Boundary Value Problems Using Programmable Graphics Hardware," Proc. Conf. Graphics Hardware, M. Doggett, W. Heidrich, W.R. Mark, and A. Schilling, eds., pp. 102-111, July 2003.
    • (2003) Proc. Conf. Graphics Hardware , pp. 102-111
    • Goodnight, N.1    Woolley, C.2    Lewin, G.3    Luebke, D.P.4    Humphreys, G.5
  • 8
    • 10644295769
    • Image registration by a regularized gradient flow - A streaming implementation in DX9 graphics hardware
    • Nov.
    • R. Strzodka, M. Droske, and M. Rumpf, "Image Registration by a Regularized Gradient Flow - a Streaming Implementation in DX9 Graphics Hardware," Computing, vol. 73, no. 4, pp. 373-389, Nov. 2004.
    • (2004) Computing , vol.73 , Issue.4 , pp. 373-389
    • Strzodka, R.1    Droske, M.2    Rumpf, M.3
  • 9
    • 33947588604
    • Performance and accuracy of hardware-oriented native-, emulated- and mixed-precision solvers in FEM simulations
    • Jan.
    • D. Göddeke, R. Strzodka, and S. Turek, "Performance and Accuracy of Hardware-Oriented Native-, Emulated- and Mixed-Precision Solvers in FEM Simulations," Int'l J. Parallel, Emergent and Distributed Systems, vol. 22, no. 4, pp. 221-256, Jan. 2007.
    • (2007) Int'l J. Parallel, Emergent and Distributed Systems , vol.22 , Issue.4 , pp. 221-256
    • Göddeke, D.1    Strzodka, R.2    Turek, S.3
  • 10
    • 49249134702
    • Streaming multigrid for gradient-domain operations on large images
    • Aug.
    • M. Kazhdan and H. Hoppe, "Streaming Multigrid for Gradient-Domain Operations on Large Images," ACM Trans. Graphics, vol. 27, no. 3, pp. 1-10, Aug. 2008.
    • (2008) ACM Trans. Graphics , vol.27 , Issue.3 , pp. 1-10
    • Kazhdan, M.1    Hoppe, H.2
  • 12
    • 54249162842
    • Large calculation of the flow over a hypersonic vehicle using a GPU
    • Dec.
    • E. Elsen, P. LeGresley, and E. Darve, "Large Calculation of the Flow over a Hypersonic Vehicle Using a GPU," J. Computational Physics, vol. 227, no. 24, pp. 10148-10161, Dec. 2008.
    • (2008) J. Computational Physics , vol.227 , Issue.24 , pp. 10148-10161
    • Elsen, E.1    LeGresley, P.2    Darve, E.3
  • 15
    • 84932220767
    • A fast direct solution of Poisson's equation using Fourier analysis
    • Jan.
    • R.W. Hockney, "A Fast Direct Solution of Poisson's Equation Using Fourier Analysis," J. ACM, vol. 12, no. 1, pp. 95-113, Jan. 1965.
    • (1965) J. ACM , vol.12 , Issue.1 , pp. 95-113
    • Hockney, R.W.1
  • 17
    • 84976729385
    • An efficient parallel algorithm for the solution of a tridiagonal linear system of equations
    • Jan.
    • H.S. Stone, "An Efficient Parallel Algorithm for the Solution of a Tridiagonal Linear System of Equations," J. ACM, vol. 20, no. 1, pp. 27-38, Jan. 1973.
    • (1973) J. ACM , vol.20 , Issue.1 , pp. 27-38
    • Stone, H.S.1
  • 19
    • 67649528185
    • Mathematical and numerical analysis of a robust and efficient grid deformation method in the finite element context
    • Nov.
    • M. Grajewski, M. Köster, and S. Turek, "Mathematical and Numerical Analysis of a Robust and Efficient Grid Deformation Method in the Finite Element Context," SIAM J. Scientific Computing, vol. 31, no. 2, pp. 1539-1557, Nov. 2008.
    • (2008) SIAM J. Scientific Computing , vol.31 , Issue.2 , pp. 1539-1557
    • Grajewski, M.1    Köster, M.2    Turek, S.3
  • 20
    • 26444596160
    • Hardware-oriented numerics and concepts for PDE software
    • Feb.
    • S. Turek, C. Becker, and S. Kilian, "Hardware-Oriented Numerics and Concepts for PDE Software," Future Generation Computer Systems, vol. 22, nos. 1/2, pp. 217-238, Feb. 2004.
    • (2004) Future Generation Computer Systems , vol.22 , Issue.1-2 , pp. 217-238
    • Turek, S.1    Becker, C.2    Kilian, S.3
  • 29
    • 0012065017
    • Iterative refinement of the solution of a positive definite system of equations
    • May
    • R.S. Martin, G. Peters, and J.H. Wilkinson, "Iterative Refinement of the Solution of a Positive Definite System of Equations," Numerische Mathematik, vol. 8, no. 3, pp. 203-216, May 1966.
    • (1966) Numerische Mathematik , vol.8 , Issue.3 , pp. 203-216
    • Martin, R.S.1    Peters, G.2    Wilkinson, J.H.3
  • 30
    • 0012066965
    • Solution of real and complex systems of linear equations
    • May
    • H.J. Bowdler, R.S. Martin, G. Peters, and J.H. Wilkinson, "Solution of Real and Complex Systems of Linear Equations," Numerische Mathematik, vol. 8, no. 3, pp. 217-234, May 1966.
    • (1966) Numerische Mathematik , vol.8 , Issue.3 , pp. 217-234
    • Bowdler, H.J.1    Martin, R.S.2    Peters, G.3    Wilkinson, J.H.4
  • 31
    • 0001467517
    • Iterative refinement in floating point
    • Apr.
    • C.B. Moler, "Iterative Refinement in Floating Point," J. ACM, vol. 14, no. 2, pp. 316-321, Apr. 1967.
    • (1967) J. ACM , vol.14 , Issue.2 , pp. 316-321
    • Moler, C.B.1
  • 33
    • 0003237190
    • Elliptic problems in linear difference equations over a network
    • Columbia Univ.
    • L.H. Thomas, "Elliptic Problems in Linear Difference Equations over a Network," Watson Scientific Computing Laboratory Report, Columbia Univ., 1949.
    • (1949) Watson Scientific Computing Laboratory Report
    • Thomas, L.H.1
  • 34
    • 0002058827
    • The numerical solution of parabolic and elliptic differential equations
    • Mar.
    • D.W. Peaceman and H.H. Rachford Jr., "The Numerical Solution of Parabolic and Elliptic Differential Equations," J. Soc. for Industrial and Applied Math., vol. 3, no. 1, pp. 28-41, Mar. 1955.
    • (1955) J. Soc. for Industrial and Applied Math. , vol.3 , Issue.1 , pp. 28-41
    • Peaceman, D.W.1    Rachford Jr., H.H.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.