메뉴 건너뛰기




Volumn 64, Issue 10-12, 2010, Pages 1254-1273

Conjugate gradients on multiple GPUs

Author keywords

Communication computation overlapping; Conjugate Gradients; GPGPU; Mixed precision; Poisson's equation

Indexed keywords

COMPUTATIONAL ASPECTS; CONDITION NUMBERS; CONJUGATE GRADIENT; CONJUGATE-GRADIENT SOLVERS; DATA FORMAT; DOUBLE PRECISION; GPGPU; ILL-CONDITIONED; ITERATIVE REFINEMENT; LOW-BANDWIDTH; MATRIX VECTOR MULTIPLICATION; MIXED PRECISION; NUMERICAL CHARACTERISTICS; ORDER OF MAGNITUDE; POISSON'S EQUATION; POWER EFFICIENCY; SINGLE PRECISION; TIME SPENT;

EID: 78649682552     PISSN: 02712091     EISSN: 10970363     Source Type: Journal    
DOI: 10.1002/fld.2462     Document Type: Article
Times cited : (33)

References (16)
  • 2
    • 78649685685 scopus 로고    scopus 로고
    • Automatic performance tuning of sparse matrix kernels. Ph.D. Thesis, University of California at Berkeley.
    • Vuduc R. Automatic performance tuning of sparse matrix kernels. Ph.D. Thesis, University of California at Berkeley, 2003.
    • (2003)
    • Vuduc, R.1
  • 3
    • 78649644902 scopus 로고    scopus 로고
    • NVIDIA Corporation. CUDA Programming Guide, version 2.0, NVIDIA Corporation.
    • NVIDIA Corporation. CUDA Programming Guide, version 2.0, NVIDIA Corporation, 2008.
    • (2008)
  • 5
    • 33947588048 scopus 로고    scopus 로고
    • A survey of general-purpose computation on graphics hardware
    • Owens D et al. A survey of general-purpose computation on graphics hardware. Computer Graphics Forum 2007; 26:80-113.
    • (2007) Computer Graphics Forum , vol.26 , pp. 80-113
    • Owens, D.1
  • 6
    • 77953998137 scopus 로고    scopus 로고
    • Sparse matrix solvers on the GPU: conjugate gradients and multigrid
    • Bolz J et al. Sparse matrix solvers on the GPU: conjugate gradients and multigrid. ACM Transactions on Graphics 2003; 22:917-924.
    • (2003) ACM Transactions on Graphics , vol.22 , pp. 917-924
    • Bolz, J.1
  • 7
    • 38149066031 scopus 로고    scopus 로고
    • Concurrent Number Cruncher: An Efficient Sparse Linear Solver on the GPU
    • Lecture Notes in Computer Science, Springer: Berlin.
    • Buatois L et al. Concurrent Number Cruncher: An Efficient Sparse Linear Solver on the GPU. Lecture Notes in Computer Science, vol. 4782. Springer: Berlin, 2007; 358-371.
    • (2007) , vol.4782 , pp. 358-371
    • Buatois, L.1
  • 9
    • 78649679954 scopus 로고    scopus 로고
    • Implementation of an efficient Conjugate Gradients algorithm for Poisson Solutions on Graphic Processors. Proceedings of CFD2007, 2007.
    • Menon S, Perot JB. Implementation of an efficient Conjugate Gradients algorithm for Poisson Solutions on Graphic Processors. Proceedings of CFD2007, 2007.
    • Menon, S.1    Perot, J.B.2
  • 10
    • 78649651720 scopus 로고    scopus 로고
    • University of Florida Sparse Matrix Collection. Available from.
    • Davis T. University of Florida Sparse Matrix Collection. Available from.
    • Davis, T.1
  • 12
    • 54449089617 scopus 로고    scopus 로고
    • Deflated preconditioned conjugate gradient solvers for the Pressure-Poisson equation
    • Aubry R et al. Deflated preconditioned conjugate gradient solvers for the Pressure-Poisson equation. Journal of Computational Physics 2008; 227:10196-10208.
    • (2008) Journal of Computational Physics , vol.227 , pp. 10196-10208
    • Aubry, R.1
  • 13
    • 1842829625 scopus 로고    scopus 로고
    • Iterative Methods for Sparse Linear Systems
    • (2nd edn). SIAM: Philadelphia, PA.
    • Saad Y. Iterative Methods for Sparse Linear Systems (2nd edn). SIAM: Philadelphia, PA, 2003.
    • (2003)
    • Saad, Y.1
  • 14
    • 34548206782 scopus 로고    scopus 로고
    • Exploiting the performance of 32 bit floating point arithmetic in obtaining 64 bit accuracy. Proceedings of the 2006 ACM/IEEE Conference on Supercomputing, Tampa, FL, 2006.
    • Langou J et al. Exploiting the performance of 32 bit floating point arithmetic in obtaining 64 bit accuracy. Proceedings of the 2006 ACM/IEEE Conference on Supercomputing, Tampa, FL, 2006.
    • Langou, J.1
  • 15
    • 78649646259 scopus 로고    scopus 로고
    • Efficient sparse matrix-vector multiplication on CUDA. Techreport, NVIDIA Corporation.
    • Bell N, Garland M. Efficient sparse matrix-vector multiplication on CUDA. Techreport, NVIDIA Corporation, 2008.
    • (2008)
    • Bell, N.1    Garland, M.2
  • 16
    • 0003851784 scopus 로고    scopus 로고
    • Numerical Linear Algebra for High Performance Computers
    • SIAM: Philadelphia, PA.
    • Dongarra J et al. Numerical Linear Algebra for High Performance Computers. SIAM: Philadelphia, PA, 1998; 166-168.
    • (1998) , pp. 166-168
    • Dongarra, J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.