-
1
-
-
41249094535
-
A versatile sharp interface immersed boundary method for incompressible flows with complex boundaries
-
R. Mittal et al., "A versatile sharp interface immersed boundary method for incompressible flows with complex boundaries." Journal of Computational Physics, 2008.
-
(2008)
Journal of Computational Physics
-
-
Mittal, R.1
-
2
-
-
83155193227
-
Scaling lattice qcd beyond 100 gpus
-
R. Babich et al., "Scaling lattice qcd beyond 100 gpus." in SC. 2011.
-
(2011)
SC
-
-
Babich, R.1
-
4
-
-
84877702106
-
A scalable, numerically stable, high-performance tridiagonal solver using gpus
-
L.-w. Chang et al., "A scalable, numerically stable, high-performance tridiagonal solver using gpus", in SC, 2012.
-
(2012)
SC
-
-
Chang, L.-.1
-
5
-
-
33750298089
-
Spike: A parallel environment for solving banded linear systems
-
E. Polizzi et al., "Spike: A parallel environment for solving banded linear systems", Computers & Fluids, 2007.
-
(2007)
Computers & Fluids
-
-
Polizzi, E.1
-
6
-
-
77952958084
-
Modeling the propagation of elastic waves using spectral elements on a cluster of 192 gpus
-
D. Komatitsch et al., "Modeling the propagation of elastic waves using spectral elements on a cluster of 192 gpus", Computer Science-Research and Development, 2010.
-
(2010)
Computer Science-research and Development
-
-
Komatitsch, D.1
-
7
-
-
84877699080
-
Forward and adjoint simulations of seismic wave propagation on emerging large-scale gpu architectures
-
M. Rietmann et al., "Forward and adjoint simulations of seismic wave propagation on emerging large-scale gpu architectures", in SC, 2012.
-
(2012)
SC
-
-
Rietmann, M.1
-
8
-
-
84870692280
-
Using 1000+ gpus and 10000+ cpus for sedimentary basin simulations
-
M. Wen et al., "Using 1000+ gpus and 10000+ cpus for sedimentary basin simulations", in CLUSTER, 2012.
-
(2012)
CLUSTER
-
-
Wen, M.1
-
9
-
-
84877706293
-
Scalable multi-gpu 3-d fft for tsubame 2.0 supercomputer
-
A. Nukada et al., "Scalable multi-gpu 3-d fft for tsubame 2.0 supercomputer", in SC, 2012.
-
(2012)
SC
-
-
Nukada, A.1
-
10
-
-
84893633244
-
-
NVIDIA CUSP, "http://developer.nvidia.com/cuda/cusp"
-
-
-
-
11
-
-
0242533310
-
Linear algebra operators for gpu implementation of numerical agorithms
-
J. Krüger et al., "Linear algebra operators for gpu implementation of numerical agorithms", in TOG, 2003.
-
(2003)
TOG
-
-
Krüger, J.1
-
12
-
-
77952662514
-
A parallel preconditioned conjugate gradient solver for the poisson problem on a mUlti-gpu platform
-
M. Ament et al., "A parallel preconditioned conjugate gradient solver for the poisson problem on a mUlti-gpu platform", in PDP, 2010.
-
(2010)
PDP
-
-
Ament, M.1
-
13
-
-
79952800023
-
A cg-based poisson solver on a gpu-cluster
-
G. Knittel, "A cg-based poisson solver on a gpu-cluster", in HiPC, 2010.
-
(2010)
HiPC
-
-
Knittel, G.1
-
15
-
-
0242533311
-
Sparse matrix solvers on the gpu: Conjugate gradients and multigrid
-
J. Bolz et al., "Sparse matrix solvers on the gpu: conjugate gradients and multigrid", in TOG, 2003.
-
(2003)
TOG
-
-
Bolz, J.1
-
16
-
-
0022850316
-
Multigrid methods for elliptic problems: A review
-
S. Fulton et al., "Multigrid methods for elliptic problems: A review", Mon. Wea. Rev, 1986.
-
(1986)
Mon. Wea. Rev
-
-
Fulton, S.1
-
17
-
-
84867642413
-
Block-asynchronous multigrid smoothers for gpuaccelerated systems
-
H. Anzt et al., "Block-asynchronous multigrid smoothers for gpuaccelerated systems", Technical report, Tech. Rep., 2011.
-
(2011)
Technical Report, Tech. Rep.
-
-
Anzt, H.1
-
19
-
-
0000048673
-
Gmres: A generalized minimal residual algorithm for solving nonsymmetric linear systems
-
Y. Saad et al., "Gmres: A generalized minimal residual algorithm for solving nonsymmetric linear systems." SIAM J. Sci. Stat. Comput., 1986.
-
(1986)
SIAM J. Sci. Stat. Comput.
-
-
Saad, Y.1
-
20
-
-
0001845470
-
Bicgstab (1) for linear equations involving unsymmetric matrices with complex spectrum
-
G. Sleijpen et al., "Bicgstab (1) for linear equations involving unsymmetric matrices with complex spectrum", Electronic Transactions on Numerical Analysis, 1993.
-
(1993)
Electronic Transactions on Numerical Analysis
-
-
Sleijpen, G.1
-
21
-
-
84876512127
-
Matrix decomposition based conjugate gradient solver for poisson equation
-
H. Liu et al., "Matrix decomposition based conjugate gradient solver for poisson equation", in SC, 2012.
-
(2012)
SC
-
-
Liu, H.1
-
22
-
-
33745869834
-
Flow simulation with complex boundaries
-
W Li et al., "Flow simulation with complex boundaries", GPU Gems, 2005.
-
(2005)
GPU Gems
-
-
Li, W.1
-
23
-
-
84877709628
-
Toward real-time modeling of human heart ventricles at cellular resolution: Simulation of drug-induced arrhythmias
-
A. A. Mirin et al., "Toward real-time modeling of human heart ventricles at cellular resolution: simulation of drug-induced arrhythmias", in SC, 2012.
-
(2012)
SC
-
-
Mirin, A.A.1
-
24
-
-
80053140672
-
Perfomlance of hybrid programming models for multiscale cardiac simulations: Preparing for petascale computation
-
B. J. Pope et al., "Perfomlance of hybrid programming models for multiscale cardiac simulations: Preparing for petascale computation", Biomedical Engineering, IEEE Transaclions on, 2011.
-
(2011)
Biomedical Engineering, IEEE Transaclions on
-
-
Pope, B.J.1
-
25
-
-
84864199775
-
Accelerating cardiac bidomain simulations using graphics processing units
-
A. Neic et al., "Accelerating cardiac bidomain simulations using graphics processing units", Biomedical Engineering, 2012.
-
(2012)
Biomedical Engineering
-
-
Neic, A.1
-
26
-
-
84860392008
-
Simulating human cardiac electrophysiology on clinical time-scales
-
S. Niederer et al, "Simulating human cardiac electrophysiology on clinical time-scales", Frontiers in Physiology, 2011.
-
(2011)
Frontiers in Physiology
-
-
Niederer, S.1
-
28
-
-
31044454001
-
A parallel hybrid banded system solver: The spike algorithm
-
E. Polizzi et al., "A parallel hybrid banded system solver: the spike algorithm", Parallel computing, 2006.
-
(2006)
Parallel Computing
-
-
Polizzi, E.1
-
30
-
-
0025557020
-
A parallel preconditioned conjugate gradient method using domain decomposition and inexact solvers on each subdomain
-
A. Meyer, "A parallel preconditioned conjugate gradient method using domain decomposition and inexact solvers on each subdomain", Computing, 1990.
-
(1990)
Computing
-
-
Meyer, A.1
-
31
-
-
84893542081
-
Bi-cgstab: A fast and smoothly converging variant of bicg in the presence of rounding errors
-
H. Van der Vorst, "Bi-cgstab: A fast and smoothly converging variant of bicg in the presence of rounding errors", J. Sci. Slatisl. Comput, 1992.
-
(1992)
J. Sci. Slatisl. Comput
-
-
Van Der Vorst, H.1
-
33
-
-
70350368872
-
Efficient sparse matrix-vector multiplication on cuda
-
N. Bell et al., "Efficient sparse matrix-vector multiplication on cuda", NVIDIA Technical Report, 2008.
-
(2008)
NVIDIA Technical Report
-
-
Bell, N.1
-
34
-
-
77952579552
-
Demystifying gpu microarchitecture through microbenchmarking
-
H. Wong et al., "Demystifying gpu microarchitecture through microbenchmarking", in ISPASS, 2010.
-
(2010)
ISPASS
-
-
Wong, H.1
-
35
-
-
84893528724
-
-
M. Market, "http://math.nist.gov/matrixmarket/."
-
-
-
Market, M.1
-
37
-
-
79952426001
-
Perfomnance analysis of high performance computing applications on the amazon web services cloud
-
K. R. Jackson et al., "Perfomnance analysis of high performance computing applications on the amazon web services cloud", in CloudCom, 2010.
-
(2010)
CloudCom
-
-
Jackson, K.R.1
-
38
-
-
84870704125
-
Optimized strategies for mapping three-dimensional ffts onto cuda gpus
-
J. Wu et al, "Optimized strategies for mapping three-dimensional ffts onto cuda gpus", in InPar, 2012.
-
(2012)
InPar
-
-
Wu, J.1
|