-
2
-
-
49049088756
-
GPU computing
-
May
-
J.D. Owens, M. Houston, D.P. Luebke, S. Green, J.E. Stone, and J.C. Phillips, "GPU Computing," Proc. IEEE, vol. 96, no. 5, pp. 879-899, May 2008.
-
(2008)
Proc. IEEE
, vol.96
, Issue.5
, pp. 879-899
-
-
Owens, J.D.1
Houston, M.2
Luebke, D.P.3
Green, S.4
Stone, J.E.5
Phillips, J.C.6
-
3
-
-
53749092570
-
Parallel computing experiences with CUDA
-
July
-
M. Garland, S.L. Grand, J. Nickolls, J.A. Anderson, J. Hardwick, S. Morton, E.H. Phillips, Y. Zhang, and V. Volkov, "Parallel Computing Experiences with CUDA," IEEE Micro, vol. 28, no. 4, pp. 13-27, July 2008.
-
(2008)
IEEE Micro
, vol.28
, Issue.4
, pp. 13-27
-
-
Garland, M.1
Grand, S.L.2
Nickolls, J.3
Anderson, J.A.4
Hardwick, J.5
Morton, S.6
Phillips, E.H.7
Zhang, Y.8
Volkov, V.9
-
4
-
-
44849137198
-
NVIDIA Tesla: A unified graphics and computing architecture
-
Mar./Apr.
-
E. Lindholm, J. Nickolls, S. Oberman, and J. Montrym, "NVIDIA Tesla: A Unified Graphics and Computing Architecture," IEEE Micro, vol. 28, no. 2, pp. 39-55, Mar./Apr. 2008.
-
(2008)
IEEE Micro
, vol.28
, Issue.2
, pp. 39-55
-
-
Lindholm, E.1
Nickolls, J.2
Oberman, S.3
Montrym, J.4
-
5
-
-
78651550268
-
Scalable parallel programming with CUDA
-
Mar./Apr.
-
J. Nickolls, I. Buck, M. Garland, and K. Skadron, "Scalable Parallel Programming with CUDA," ACM Queue, vol. 6, no. 2, pp. 40-53, Mar./Apr. 2008.
-
(2008)
ACM Queue
, vol.6
, Issue.2
, pp. 40-53
-
-
Nickolls, J.1
Buck, I.2
Garland, M.3
Skadron, K.4
-
6
-
-
77953998137
-
Sparse matrix solvers on the GPU: Conjugate gradients and multigrid
-
July
-
J. Bolz, I. Farmer, E. Grinspun, and P. Schröder, "Sparse Matrix Solvers on the GPU: Conjugate Gradients and Multigrid," ACM Trans. Graphics, vol. 22, no. 3, pp. 917-924, July 2003.
-
(2003)
ACM Trans. Graphics
, vol.22
, Issue.3
, pp. 917-924
-
-
Bolz, J.1
Farmer, I.2
Grinspun, E.3
Schröder, P.4
-
7
-
-
11144277251
-
A multigrid solver for boundary value problems using programmable graphics hardware
-
July
-
N. Goodnight, C. Woolley, G. Lewin, D.P. Luebke, and G. Humphreys, "A Multigrid Solver for Boundary Value Problems Using Programmable Graphics Hardware," Proc. Conf. Graphics Hardware, M. Doggett, W. Heidrich, W.R. Mark, and A. Schilling, eds., pp. 102-111, July 2003.
-
(2003)
Proc. Conf. Graphics Hardware
, pp. 102-111
-
-
Goodnight, N.1
Woolley, C.2
Lewin, G.3
Luebke, D.P.4
Humphreys, G.5
-
8
-
-
10644295769
-
Image registration by a regularized gradient flow - A streaming implementation in DX9 graphics hardware
-
Nov.
-
R. Strzodka, M. Droske, and M. Rumpf, "Image Registration by a Regularized Gradient Flow - a Streaming Implementation in DX9 Graphics Hardware," Computing, vol. 73, no. 4, pp. 373-389, Nov. 2004.
-
(2004)
Computing
, vol.73
, Issue.4
, pp. 373-389
-
-
Strzodka, R.1
Droske, M.2
Rumpf, M.3
-
9
-
-
33947588604
-
Performance and accuracy of hardware-oriented native-, emulated- and mixed-precision solvers in FEM simulations
-
Jan.
-
D. Göddeke, R. Strzodka, and S. Turek, "Performance and Accuracy of Hardware-Oriented Native-, Emulated- and Mixed-Precision Solvers in FEM Simulations," Int'l J. Parallel, Emergent and Distributed Systems, vol. 22, no. 4, pp. 221-256, Jan. 2007.
-
(2007)
Int'l J. Parallel, Emergent and Distributed Systems
, vol.22
, Issue.4
, pp. 221-256
-
-
Göddeke, D.1
Strzodka, R.2
Turek, S.3
-
10
-
-
49249134702
-
Streaming multigrid for gradient-domain operations on large images
-
Aug.
-
M. Kazhdan and H. Hoppe, "Streaming Multigrid for Gradient-Domain Operations on Large Images," ACM Trans. Graphics, vol. 27, no. 3, pp. 1-10, Aug. 2008.
-
(2008)
ACM Trans. Graphics
, vol.27
, Issue.3
, pp. 1-10
-
-
Kazhdan, M.1
Hoppe, H.2
-
12
-
-
54249162842
-
Large calculation of the flow over a hypersonic vehicle using a GPU
-
Dec.
-
E. Elsen, P. LeGresley, and E. Darve, "Large Calculation of the Flow over a Hypersonic Vehicle Using a GPU," J. Computational Physics, vol. 227, no. 24, pp. 10148-10161, Dec. 2008.
-
(2008)
J. Computational Physics
, vol.227
, Issue.24
, pp. 10148-10161
-
-
Elsen, E.1
LeGresley, P.2
Darve, E.3
-
13
-
-
70449768671
-
Interactive depth of field using simulated diffusion
-
Jan.
-
M. Kass, A.E. Lefohn, and J.D. Owens, "Interactive Depth of Field Using Simulated Diffusion," Technical Report 06-01, Pixar Animation Studios, Jan. 2006.
-
(2006)
Technical Report 06-01, Pixar Animation Studios
-
-
Kass, M.1
Lefohn, A.E.2
Owens, J.D.3
-
14
-
-
78651284120
-
Scan primitives for GPU computing
-
Aug.
-
S. Sengupta, M.J. Harris, Y. Zhang, and J.D. Owens, "Scan Primitives for GPU Computing," Proc. Conf. Graphics Hardware, T. Aila and M. Segal, eds., pp. 97-106, Aug. 2007.
-
(2007)
Proc. Conf. Graphics Hardware
, pp. 97-106
-
-
Sengupta, S.1
Harris, M.J.2
Zhang, Y.3
Owens, J.D.4
-
15
-
-
84932220767
-
A fast direct solution of Poisson's equation using Fourier analysis
-
Jan.
-
R.W. Hockney, "A Fast Direct Solution of Poisson's Equation Using Fourier Analysis," J. ACM, vol. 12, no. 1, pp. 95-113, Jan. 1965.
-
(1965)
J. ACM
, vol.12
, Issue.1
, pp. 95-113
-
-
Hockney, R.W.1
-
17
-
-
84976729385
-
An efficient parallel algorithm for the solution of a tridiagonal linear system of equations
-
Jan.
-
H.S. Stone, "An Efficient Parallel Algorithm for the Solution of a Tridiagonal Linear System of Equations," J. ACM, vol. 20, no. 1, pp. 27-38, Jan. 1973.
-
(1973)
J. ACM
, vol.20
, Issue.1
, pp. 27-38
-
-
Stone, H.S.1
-
18
-
-
77749337487
-
Fast tridiagonal solvers on the GPU
-
Jan.
-
Y. Zhang, J. Cohen, and J.D. Owens, "Fast Tridiagonal Solvers on the GPU," Proc. 15th ACM SIGPLAN Symp. Principles and Practice of Parallel Programming (PPoPP '10), pp. 127-136, Jan. 2010.
-
(2010)
Proc. 15th ACM SIGPLAN Symp. Principles and Practice of Parallel Programming (PPoPP '10)
, pp. 127-136
-
-
Zhang, Y.1
Cohen, J.2
Owens, J.D.3
-
19
-
-
67649528185
-
Mathematical and numerical analysis of a robust and efficient grid deformation method in the finite element context
-
Nov.
-
M. Grajewski, M. Köster, and S. Turek, "Mathematical and Numerical Analysis of a Robust and Efficient Grid Deformation Method in the Finite Element Context," SIAM J. Scientific Computing, vol. 31, no. 2, pp. 1539-1557, Nov. 2008.
-
(2008)
SIAM J. Scientific Computing
, vol.31
, Issue.2
, pp. 1539-1557
-
-
Grajewski, M.1
Köster, M.2
Turek, S.3
-
20
-
-
26444596160
-
Hardware-oriented numerics and concepts for PDE software
-
Feb.
-
S. Turek, C. Becker, and S. Kilian, "Hardware-Oriented Numerics and Concepts for PDE Software," Future Generation Computer Systems, vol. 22, nos. 1/2, pp. 217-238, Feb. 2004.
-
(2004)
Future Generation Computer Systems
, vol.22
, Issue.1-2
, pp. 217-238
-
-
Turek, S.1
Becker, C.2
Kilian, S.3
-
21
-
-
79958771380
-
FEAST - Realisation of hardware-oriented numerics for HPC simulations with finite elements
-
Feb. doi:10.1002/cpe.1584
-
S. Turek, D. Göddeke, C. Becker, S.H. Buijssen, and H. Wobker, "FEAST - Realisation of Hardware-Oriented Numerics for HPC Simulations with Finite Elements," Concurrency and Computation: Practice and Experience, special issue Proc. ISC 2008, Feb. 2010, doi:10.1002/cpe.1584.
-
(2010)
Concurrency and Computation: Practice and Experience, Special Issue Proc. ISC 2008
-
-
Turek, S.1
Göddeke, D.2
Becker, C.3
Buijssen, S.H.4
Wobker, H.5
-
22
-
-
78649827068
-
-
S. Turek, C. Becker, S. Kilian, S.H.M. Buijssen, D. Göddeke, and H. Wobker, "FEAST - Finite Element Analysis and Solution Tools," http://www.feast.tu-dortmund.de, 2008.
-
(2008)
FEAST - Finite Element Analysis and Solution Tools
-
-
Turek, S.1
Becker, C.2
Kilian, S.3
Buijssen, S.H.M.4
Göddeke, D.5
Wobker, H.6
-
23
-
-
70749140797
-
Co-processor acceleration of an unmodified parallel solid mechanics code with FEASTGPU
-
Oct.
-
D. Göddeke, H. Wobker, R. Strzodka, J. Mohd-Yusof, P.S. McCormick, and S. Turek, "Co-Processor Acceleration of an Unmodified Parallel Solid Mechanics Code with FEASTGPU," Int'l J. Computational Science and Eng., vol. 4, no. 4, pp. 254-269, Oct. 2009.
-
(2009)
Int'l J. Computational Science and Eng.
, vol.4
, Issue.4
, pp. 254-269
-
-
Göddeke, D.1
Wobker, H.2
Strzodka, R.3
Mohd-Yusof, J.4
McCormick, P.S.5
Turek, S.6
-
24
-
-
70449488175
-
GPU acceleration of an unmodified parallel finite element Navier-Stokes solver
-
June
-
D. Göddeke, S.H. Buijssen, H. Wobker, and S. Turek, "GPU Acceleration of an Unmodified Parallel Finite Element Navier-Stokes Solver," Proc. IEEE Int'l Conf. High Performance Computing and Simulation (HPCS '09), pp. 12-21, June 2009.
-
(2009)
Proc. IEEE Int'l Conf. High Performance Computing and Simulation (HPCS '09)
, pp. 12-21
-
-
Göddeke, D.1
Buijssen, S.H.2
Wobker, H.3
Turek, S.4
-
26
-
-
27344435504
-
The design and implementation of a first-generation CELL processor
-
Feb.
-
D.C. Pham, S. Asano, M. Bolliger, M.N. Day, H.P. Hofstee, C.R. Johns, J.A. Kahle, A. Kameyama, J. Keaty, Y. Masubuchi, M. Riley, D. Shippy, D.L. Stasiak, M. Suzuoki, M. Wang, J. Warnock, S. Weitzel, D. Wendel, T. Yamazaki, and K. Yazawa, "The Design and Implementation of a First-Generation CELL Processor," Proc. Int'l Solid-State Circuits Conf. (ISSCC '05), Digest of Technical Papers, vol. 1, pp. 184-592, Feb. 2005.
-
(2005)
Proc. Int'l Solid-State Circuits Conf. (ISSCC '05), Digest of Technical Papers
, vol.1
, pp. 184-592
-
-
Pham, D.C.1
Asano, S.2
Bolliger, M.3
Day, M.N.4
Hofstee, H.P.5
Johns, C.R.6
Kahle, J.A.7
Kameyama, A.8
Keaty, J.9
Masubuchi, Y.10
Riley, M.11
Shippy, D.12
Stasiak, D.L.13
Suzuoki, M.14
Wang, M.15
Warnock, J.16
Weitzel, S.17
Wendel, D.18
Yamazaki, T.19
Yazawa, K.20
-
29
-
-
0012065017
-
Iterative refinement of the solution of a positive definite system of equations
-
May
-
R.S. Martin, G. Peters, and J.H. Wilkinson, "Iterative Refinement of the Solution of a Positive Definite System of Equations," Numerische Mathematik, vol. 8, no. 3, pp. 203-216, May 1966.
-
(1966)
Numerische Mathematik
, vol.8
, Issue.3
, pp. 203-216
-
-
Martin, R.S.1
Peters, G.2
Wilkinson, J.H.3
-
30
-
-
0012066965
-
Solution of real and complex systems of linear equations
-
May
-
H.J. Bowdler, R.S. Martin, G. Peters, and J.H. Wilkinson, "Solution of Real and Complex Systems of Linear Equations," Numerische Mathematik, vol. 8, no. 3, pp. 217-234, May 1966.
-
(1966)
Numerische Mathematik
, vol.8
, Issue.3
, pp. 217-234
-
-
Bowdler, H.J.1
Martin, R.S.2
Peters, G.3
Wilkinson, J.H.4
-
31
-
-
0001467517
-
Iterative refinement in floating point
-
Apr.
-
C.B. Moler, "Iterative Refinement in Floating Point," J. ACM, vol. 14, no. 2, pp. 316-321, Apr. 1967.
-
(1967)
J. ACM
, vol.14
, Issue.2
, pp. 316-321
-
-
Moler, C.B.1
-
33
-
-
0003237190
-
Elliptic problems in linear difference equations over a network
-
Columbia Univ.
-
L.H. Thomas, "Elliptic Problems in Linear Difference Equations over a Network," Watson Scientific Computing Laboratory Report, Columbia Univ., 1949.
-
(1949)
Watson Scientific Computing Laboratory Report
-
-
Thomas, L.H.1
-
34
-
-
0002058827
-
The numerical solution of parabolic and elliptic differential equations
-
Mar.
-
D.W. Peaceman and H.H. Rachford Jr., "The Numerical Solution of Parabolic and Elliptic Differential Equations," J. Soc. for Industrial and Applied Math., vol. 3, no. 1, pp. 28-41, Mar. 1955.
-
(1955)
J. Soc. for Industrial and Applied Math.
, vol.3
, Issue.1
, pp. 28-41
-
-
Peaceman, D.W.1
Rachford Jr., H.H.2