-
2
-
-
77749253327
-
-
NVIDIA CUDA compute unified device architecture, programming guide, 2009. Version 2.0.
-
-
-
-
3
-
-
84963568066
-
Cyclic reduction on distributed shared memory machines
-
S. Allmann, T. Rauber, and G. Runger. Cyclic reduction on distributed shared memory machines. Euromicro Conference on Parallel, Distributed, and Network-Based Processing, pages 290-297, 2001.
-
(2001)
Euromicro Conference on Parallel, Distributed, and Network-Based Processing
, pp. 290-297
-
-
Allmann, S.1
Rauber, T.2
Runger, G.3
-
4
-
-
0025536635
-
LAPACK: A portable linear algebra library for high-performance computers
-
IEEE Computer Society Press
-
E. Anderson, Z. Bai, C. Bischof, J. Demmel, J. Dongarra, J. DuCroz, A. Greenbaum, S. Hammarling, A. McKenney, and D. Sorensen. LAPACK: A portable linear algebra library for high-performance computers. In Proceedings of Supercomputing '90, pages 2-11. IEEE Computer Society Press, 1990.
-
(1990)
Proceedings of Supercomputing '90
, pp. 2-11
-
-
Anderson, E.1
Bai, Z.2
Bischof, C.3
Demmel, J.4
Dongarra, J.5
DuCroz, J.6
Greenbaum, A.7
Hammarling, S.8
McKenney, A.9
Sorensen, D.10
-
5
-
-
0002924004
-
Prefix sums and their applications
-
Technical Report CMU-CS-90-190, School of Computer Science, Carnegie Mellon University, Nov
-
G. E. Blelloch. Prefix sums and their applications. Technical Report CMU-CS-90-190, School of Computer Science, Carnegie Mellon University, Nov. 1990.
-
(1990)
-
-
Blelloch, G.E.1
-
6
-
-
0000161555
-
On direct methods for solving Poisson's equations
-
B. L. Buzbee, G. H. Golub, and C. W. Nielson. On direct methods for solving Poisson's equations. SIAM Journal on Numerical Analysis, 7(4):627-656, 1970.
-
(1970)
SIAM Journal on Numerical Analysis
, vol.7
, Issue.4
, pp. 627-656
-
-
Buzbee, B.L.1
Golub, G.H.2
Nielson, C.W.3
-
7
-
-
10044249026
-
An over-lapped two-way method for solving tridiagonal linear systems in a BSP computer
-
J.-J. Climent, C. Perea, L. Tortosa, and A. Zamora. An over-lapped two-way method for solving tridiagonal linear systems in a BSP computer. Applied Mathematics and Computation, 161(2):475-500, 2005.
-
(2005)
Applied Mathematics and Computation
, vol.161
, Issue.2
, pp. 475-500
-
-
Climent, J.-J.1
Perea, C.2
Tortosa, L.3
Zamora, A.4
-
8
-
-
57349184047
-
Fast scan algorithms on graphics processors
-
ACM, June
-
Y. Dotsenko, N. K. Govindaraju, P.-P. J. Sloan, C. Boyd, and J. Manferdelli. Fast scan algorithms on graphics processors. In Proceedings of the 22nd Annual International Conference on Supercomputing, pages 205-213. ACM, June 2008.
-
(2008)
Proceedings of the 22nd Annual International Conference on Supercomputing
, pp. 205-213
-
-
Dotsenko, Y.1
Govindaraju, N.K.2
Sloan, P.-P.J.3
Boyd, C.4
Manferdelli, J.5
-
9
-
-
0042458763
-
An analysis of the recursive doubling algorithm
-
D. Kuck, D. Lawrie, and A. Sameh, editors, Academic Press, New York, NY
-
P. Dubois and G. Rodrigue. An analysis of the recursive doubling algorithm. In D. Kuck, D. Lawrie, and A. Sameh, editors, High Speed Computer and Algorithm Organization, pages 299-305. Academic Press, New York, NY, 1977.
-
(1977)
High Speed Computer and Algorithm Organization
, pp. 299-305
-
-
Dubois, P.1
Rodrigue, G.2
-
10
-
-
0040098094
-
A recursive doubling algorithm for solution of tridiagonal systems on hypercube multiprocessors
-
Ö. Eǧecioǧlu, C. K. Koc, and A. J. Laub. A recursive doubling algorithm for solution of tridiagonal systems on hypercube multiprocessors. Journal of Computational and Applied Mathematics, 27:95-108, 1989.
-
(1989)
Journal of Computational and Applied Mathematics
, vol.27
, pp. 95-108
-
-
Eǧecioǧlu, Ö.1
Koc, C.K.2
Laub, A.J.3
-
11
-
-
77749253328
-
-
D. Göddeke and R. Strzodka. Accurate mixed-precision GPU-multigrid solvers on anisotropic grids. Submitted to IEEE Transactions on Parallel and Distributed Systems, Special Issue: High Performance Computing with Accelerators.
-
-
-
-
13
-
-
2342522154
-
Evaluation of vertical coordinate and vertical mixing algorithms in the HYbrid-Coordinate Ocean Model (HYCOM)
-
G. R. Halliwell. Evaluation of vertical coordinate and vertical mixing algorithms in the HYbrid-Coordinate Ocean Model (HYCOM). Ocean Modelling, 7:285-322, 2004.
-
(2004)
Ocean Modelling
, vol.7
, pp. 285-322
-
-
Halliwell, G.R.1
-
15
-
-
0000490624
-
Optimizing tridiagonal solvers for alternating direction methods on boolean cube multiprocessors
-
C. T. Ho and S. L. Johnsson. Optimizing tridiagonal solvers for alternating direction methods on boolean cube multiprocessors. SIAM Journal on Scientific and Statistical Computing, 11(3):563-592, 1990.
-
(1990)
SIAM Journal on Scientific and Statistical Computing
, vol.11
, Issue.3
, pp. 563-592
-
-
Ho, C.T.1
Johnsson, S.L.2
-
16
-
-
84932220767
-
A fast direct solution of Poisson's equation using Fourier analysis
-
Jan
-
R. W. Hockney. A fast direct solution of Poisson's equation using Fourier analysis. Journal of the ACM, 12(1):95-113, Jan. 1965.
-
(1965)
Journal of the ACM
, vol.12
, Issue.1
, pp. 95-113
-
-
Hockney, R.W.1
-
19
-
-
70449768671
-
Interactive depth of field using simulated diffusion
-
Technical Report 06-01, Pixar Animation Studios, Jan
-
M. Kass, A. Lefohn, and J. D. Owens. Interactive depth of field using simulated diffusion. Technical Report 06-01, Pixar Animation Studios, Jan. 2006.
-
(2006)
-
-
Kass, M.1
Lefohn, A.2
Owens, J.D.3
-
21
-
-
84976719982
-
The solution of tridiagonal linear systems on the CDC STAR-100 computer
-
J. J. Lambiotte and R. G. Voigt. The solution of tridiagonal linear systems on the CDC STAR-100 computer. ACM Trans. Math. Software, 1(4):308-329, 1975.
-
(1975)
ACM Trans. Math. Software
, vol.1
, Issue.4
, pp. 308-329
-
-
Lambiotte, J.J.1
Voigt, R.G.2
-
22
-
-
0026170724
-
A method to parallelize tridiagonal solvers
-
S. M. Müller and D. Sheerer. A method to parallelize tridiagonal solvers. Parallel Computing, 17:181-188, 1991.
-
(1991)
Parallel Computing
, vol.17
, pp. 181-188
-
-
Müller, S.M.1
Sheerer, D.2
-
23
-
-
78651550268
-
Scalable parallel programming with CUDA
-
Mar
-
J. Nickolls, I. Buck, M. Garland, and K. Skadron. Scalable parallel programming with CUDA. ACM Queue: Tomorrow's Computing Today, 6(2):40-53, Mar. 2008.
-
(2008)
ACM Queue: Tomorrow's Computing Today
, vol.6
, Issue.2
, pp. 40-53
-
-
Nickolls, J.1
Buck, I.2
Garland, M.3
Skadron, K.4
-
24
-
-
0035216418
-
Parallel multigrid for anisotropic elliptic equations
-
M. Prieto, R. Santiago, D. Espadas, I. M. Llorente, and F. Tirado. Parallel multigrid for anisotropic elliptic equations. J. Parallel Distrib. Comput., 61(1):96-114, 2001.
-
(2001)
J. Parallel Distrib. Comput
, vol.61
, Issue.1
, pp. 96-114
-
-
Prieto, M.1
Santiago, R.2
Espadas, D.3
Llorente, I.M.4
Tirado, F.5
-
25
-
-
78651284120
-
Scan primitives for GPU computing
-
Aug
-
S. Sengupta, M. Harris, Y. Zhang, and J. D. Owens. Scan primitives for GPU computing. In Graphics Hardware 2007, pages 97-106, Aug. 2007.
-
(2007)
Graphics Hardware 2007
, pp. 97-106
-
-
Sengupta, S.1
Harris, M.2
Zhang, Y.3
Owens, J.D.4
-
27
-
-
84976729385
-
An efficient parallel algorithm for the solution of a tridiagonal linear system of equations
-
Jan
-
H. S. Stone. An efficient parallel algorithm for the solution of a tridiagonal linear system of equations. Journal of the ACM, 20(1):27-38, Jan. 1973.
-
(1973)
Journal of the ACM
, vol.20
, Issue.1
, pp. 27-38
-
-
Stone, H.S.1
-
28
-
-
0026825865
-
Efficient tridiagonal solvers on multicomputers
-
Mar
-
X.-H. Sun, H. Zhang, and L. M. Ni. Efficient tridiagonal solvers on multicomputers. IEEE Transactions on Computers, C-41(3):286-296, Mar. 1992.
-
(1992)
IEEE Transactions on Computers
, vol.C-41
, Issue.3
, pp. 286-296
-
-
Sun, X.-H.1
Zhang, H.2
Ni, L.M.3
-
29
-
-
1342282168
-
A parallel two-level hybrid method for tridiagonal systems and its application to fast Poisson solvers
-
Feb
-
X.-H. Sun and W. Zhang. A parallel two-level hybrid method for tridiagonal systems and its application to fast Poisson solvers. IEEE Transactions on Parallel and Distributed Systems, PDS-15(2):97-106, Feb. 2004.
-
(2004)
IEEE Transactions on Parallel and Distributed Systems
, vol.PDS-15
, Issue.2
, pp. 97-106
-
-
Sun, X.-H.1
Zhang, W.2
-
31
-
-
84856252478
-
-
Department of Computer Science, University of Tennessee, Knoxville, Jan
-
V. Volkov and J. W. Demmel. Using GPUs to accelerate the bisection algorithm for finding eigenvalues of symmetric tridiagonal matrices. LAPACK Working Note 197, Department of Computer Science, University of Tennessee, Knoxville, Jan. 2008.
-
(2008)
Using GPUs to accelerate the bisection algorithm for finding eigenvalues of symmetric tridiagonal matrices. LAPACK Working Note
, vol.197
-
-
Volkov, V.1
Demmel, J.W.2
-
32
-
-
0019575493
-
A parallel method for tridiagonal equations
-
H. H. Wang. A parallel method for tridiagonal equations. ACM Trans. Math. Software, 7:170-183, 1981.
-
(1981)
ACM Trans. Math. Software
, vol.7
, pp. 170-183
-
-
Wang, H.H.1
-
33
-
-
65949107549
-
Roofline: An insightful visual performance model for multicore architectures
-
S. Williams, A. Waterman, and D. Patterson. Roofline: an insightful visual performance model for multicore architectures. Commun. ACM, 52(4):65-76, 2009.
-
(2009)
Commun. ACM
, vol.52
, Issue.4
, pp. 65-76
-
-
Williams, S.1
Waterman, A.2
Patterson, D.3