메뉴 건너뛰기




Volumn , Issue , 2013, Pages

GPU-accelerated scalable solver for banded linear systems

Author keywords

[No Author keywords available]

Indexed keywords

ADVECTION-DIFFUSION EQUATION; AMAZON EC2; COMMUNICATION OVERHEADS; ETHERNET NETWORKS; GPU-ACCELERATED; MATRIX DECOMPOSITION; MULTIPLE GPUS; SCIENTIFIC AND ENGINEERING APPLICATIONS;

EID: 84893617896     PISSN: 15525244     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/CLUSTER.2013.6702612     Document Type: Conference Paper
Times cited : (10)

References (39)
  • 1
    • 41249094535 scopus 로고    scopus 로고
    • A versatile sharp interface immersed boundary method for incompressible flows with complex boundaries
    • R. Mittal et al., "A versatile sharp interface immersed boundary method for incompressible flows with complex boundaries." Journal of Computational Physics, 2008.
    • (2008) Journal of Computational Physics
    • Mittal, R.1
  • 2
    • 83155193227 scopus 로고    scopus 로고
    • Scaling lattice qcd beyond 100 gpus
    • R. Babich et al., "Scaling lattice qcd beyond 100 gpus." in SC. 2011.
    • (2011) SC
    • Babich, R.1
  • 4
    • 84877702106 scopus 로고    scopus 로고
    • A scalable, numerically stable, high-performance tridiagonal solver using gpus
    • L.-w. Chang et al., "A scalable, numerically stable, high-performance tridiagonal solver using gpus", in SC, 2012.
    • (2012) SC
    • Chang, L.-.1
  • 5
    • 33750298089 scopus 로고    scopus 로고
    • Spike: A parallel environment for solving banded linear systems
    • E. Polizzi et al., "Spike: A parallel environment for solving banded linear systems", Computers & Fluids, 2007.
    • (2007) Computers & Fluids
    • Polizzi, E.1
  • 6
    • 77952958084 scopus 로고    scopus 로고
    • Modeling the propagation of elastic waves using spectral elements on a cluster of 192 gpus
    • D. Komatitsch et al., "Modeling the propagation of elastic waves using spectral elements on a cluster of 192 gpus", Computer Science-Research and Development, 2010.
    • (2010) Computer Science-research and Development
    • Komatitsch, D.1
  • 7
    • 84877699080 scopus 로고    scopus 로고
    • Forward and adjoint simulations of seismic wave propagation on emerging large-scale gpu architectures
    • M. Rietmann et al., "Forward and adjoint simulations of seismic wave propagation on emerging large-scale gpu architectures", in SC, 2012.
    • (2012) SC
    • Rietmann, M.1
  • 8
    • 84870692280 scopus 로고    scopus 로고
    • Using 1000+ gpus and 10000+ cpus for sedimentary basin simulations
    • M. Wen et al., "Using 1000+ gpus and 10000+ cpus for sedimentary basin simulations", in CLUSTER, 2012.
    • (2012) CLUSTER
    • Wen, M.1
  • 9
    • 84877706293 scopus 로고    scopus 로고
    • Scalable multi-gpu 3-d fft for tsubame 2.0 supercomputer
    • A. Nukada et al., "Scalable multi-gpu 3-d fft for tsubame 2.0 supercomputer", in SC, 2012.
    • (2012) SC
    • Nukada, A.1
  • 10
    • 84893633244 scopus 로고    scopus 로고
    • NVIDIA CUSP, "http://developer.nvidia.com/cuda/cusp"
  • 11
    • 0242533310 scopus 로고    scopus 로고
    • Linear algebra operators for gpu implementation of numerical agorithms
    • J. Krüger et al., "Linear algebra operators for gpu implementation of numerical agorithms", in TOG, 2003.
    • (2003) TOG
    • Krüger, J.1
  • 12
    • 77952662514 scopus 로고    scopus 로고
    • A parallel preconditioned conjugate gradient solver for the poisson problem on a mUlti-gpu platform
    • M. Ament et al., "A parallel preconditioned conjugate gradient solver for the poisson problem on a mUlti-gpu platform", in PDP, 2010.
    • (2010) PDP
    • Ament, M.1
  • 13
    • 79952800023 scopus 로고    scopus 로고
    • A cg-based poisson solver on a gpu-cluster
    • G. Knittel, "A cg-based poisson solver on a gpu-cluster", in HiPC, 2010.
    • (2010) HiPC
    • Knittel, G.1
  • 15
    • 0242533311 scopus 로고    scopus 로고
    • Sparse matrix solvers on the gpu: Conjugate gradients and multigrid
    • J. Bolz et al., "Sparse matrix solvers on the gpu: conjugate gradients and multigrid", in TOG, 2003.
    • (2003) TOG
    • Bolz, J.1
  • 16
    • 0022850316 scopus 로고
    • Multigrid methods for elliptic problems: A review
    • S. Fulton et al., "Multigrid methods for elliptic problems: A review", Mon. Wea. Rev, 1986.
    • (1986) Mon. Wea. Rev
    • Fulton, S.1
  • 17
    • 84867642413 scopus 로고    scopus 로고
    • Block-asynchronous multigrid smoothers for gpuaccelerated systems
    • H. Anzt et al., "Block-asynchronous multigrid smoothers for gpuaccelerated systems", Technical report, Tech. Rep., 2011.
    • (2011) Technical Report, Tech. Rep.
    • Anzt, H.1
  • 19
    • 0000048673 scopus 로고
    • Gmres: A generalized minimal residual algorithm for solving nonsymmetric linear systems
    • Y. Saad et al., "Gmres: A generalized minimal residual algorithm for solving nonsymmetric linear systems." SIAM J. Sci. Stat. Comput., 1986.
    • (1986) SIAM J. Sci. Stat. Comput.
    • Saad, Y.1
  • 20
    • 0001845470 scopus 로고
    • Bicgstab (1) for linear equations involving unsymmetric matrices with complex spectrum
    • G. Sleijpen et al., "Bicgstab (1) for linear equations involving unsymmetric matrices with complex spectrum", Electronic Transactions on Numerical Analysis, 1993.
    • (1993) Electronic Transactions on Numerical Analysis
    • Sleijpen, G.1
  • 21
    • 84876512127 scopus 로고    scopus 로고
    • Matrix decomposition based conjugate gradient solver for poisson equation
    • H. Liu et al., "Matrix decomposition based conjugate gradient solver for poisson equation", in SC, 2012.
    • (2012) SC
    • Liu, H.1
  • 22
    • 33745869834 scopus 로고    scopus 로고
    • Flow simulation with complex boundaries
    • W Li et al., "Flow simulation with complex boundaries", GPU Gems, 2005.
    • (2005) GPU Gems
    • Li, W.1
  • 23
    • 84877709628 scopus 로고    scopus 로고
    • Toward real-time modeling of human heart ventricles at cellular resolution: Simulation of drug-induced arrhythmias
    • A. A. Mirin et al., "Toward real-time modeling of human heart ventricles at cellular resolution: simulation of drug-induced arrhythmias", in SC, 2012.
    • (2012) SC
    • Mirin, A.A.1
  • 24
    • 80053140672 scopus 로고    scopus 로고
    • Perfomlance of hybrid programming models for multiscale cardiac simulations: Preparing for petascale computation
    • B. J. Pope et al., "Perfomlance of hybrid programming models for multiscale cardiac simulations: Preparing for petascale computation", Biomedical Engineering, IEEE Transaclions on, 2011.
    • (2011) Biomedical Engineering, IEEE Transaclions on
    • Pope, B.J.1
  • 25
    • 84864199775 scopus 로고    scopus 로고
    • Accelerating cardiac bidomain simulations using graphics processing units
    • A. Neic et al., "Accelerating cardiac bidomain simulations using graphics processing units", Biomedical Engineering, 2012.
    • (2012) Biomedical Engineering
    • Neic, A.1
  • 26
    • 84860392008 scopus 로고    scopus 로고
    • Simulating human cardiac electrophysiology on clinical time-scales
    • S. Niederer et al, "Simulating human cardiac electrophysiology on clinical time-scales", Frontiers in Physiology, 2011.
    • (2011) Frontiers in Physiology
    • Niederer, S.1
  • 28
    • 31044454001 scopus 로고    scopus 로고
    • A parallel hybrid banded system solver: The spike algorithm
    • E. Polizzi et al., "A parallel hybrid banded system solver: the spike algorithm", Parallel computing, 2006.
    • (2006) Parallel Computing
    • Polizzi, E.1
  • 30
    • 0025557020 scopus 로고
    • A parallel preconditioned conjugate gradient method using domain decomposition and inexact solvers on each subdomain
    • A. Meyer, "A parallel preconditioned conjugate gradient method using domain decomposition and inexact solvers on each subdomain", Computing, 1990.
    • (1990) Computing
    • Meyer, A.1
  • 31
    • 84893542081 scopus 로고
    • Bi-cgstab: A fast and smoothly converging variant of bicg in the presence of rounding errors
    • H. Van der Vorst, "Bi-cgstab: A fast and smoothly converging variant of bicg in the presence of rounding errors", J. Sci. Slatisl. Comput, 1992.
    • (1992) J. Sci. Slatisl. Comput
    • Van Der Vorst, H.1
  • 33
    • 70350368872 scopus 로고    scopus 로고
    • Efficient sparse matrix-vector multiplication on cuda
    • N. Bell et al., "Efficient sparse matrix-vector multiplication on cuda", NVIDIA Technical Report, 2008.
    • (2008) NVIDIA Technical Report
    • Bell, N.1
  • 34
    • 77952579552 scopus 로고    scopus 로고
    • Demystifying gpu microarchitecture through microbenchmarking
    • H. Wong et al., "Demystifying gpu microarchitecture through microbenchmarking", in ISPASS, 2010.
    • (2010) ISPASS
    • Wong, H.1
  • 35
    • 84893528724 scopus 로고    scopus 로고
    • M. Market, "http://math.nist.gov/matrixmarket/."
    • Market, M.1
  • 37
    • 79952426001 scopus 로고    scopus 로고
    • Perfomnance analysis of high performance computing applications on the amazon web services cloud
    • K. R. Jackson et al., "Perfomnance analysis of high performance computing applications on the amazon web services cloud", in CloudCom, 2010.
    • (2010) CloudCom
    • Jackson, K.R.1
  • 38
    • 84870704125 scopus 로고    scopus 로고
    • Optimized strategies for mapping three-dimensional ffts onto cuda gpus
    • J. Wu et al, "Optimized strategies for mapping three-dimensional ffts onto cuda gpus", in InPar, 2012.
    • (2012) InPar
    • Wu, J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.