2. M. Harris, S. Sengupta, J.D. Owens, Parallel prefix sum (scan) with CUDA, in: H. Nguyen (Ed.), GPU Gems 3 (Chapter 39), Addison Wesley, New York, 2007, pp. 851-876.
4. Y. Dotsenko, N.K. Govindaraju, P.-P. Sloan, C. Boyd, J. Manferdelli, Fast scan algorithms on graphics processors, in: Proceedings of the 22nd Annual International Conference on Supercomputing, 2008, pp. 205-213. http://portal.acm.org/citation.cfm?id=1375527&picked=prox
5. S. Sengupta, M. Harris, M. Garland, Efficient Parallel scan algorithms for GPUs, Technical Report NVR-2008-003, NVIDIA Corporation, 2008.
7. D. Merrill, A. Grimshaw, Parallel Scan for Stream Architectures, Technical Report CS2009-14, Department of Computer Science, University of Virginia, 2009.
8. S. Sengupta, M. Harris, M. Garland, J.D. Owens, Efficient parallel scan algorithms for many-core GPUs, in: J. Dongarra, D.A. Bader, J. Kurzak (Eds.), Scientific Computing with Multicore and Accelerators, Chapman & Hall/CRC Computational Science (Chapter 19), Taylor & Francis, Boca Raton, FL, 2011, pp. 413-442.
10. D. Merrill, A. Grimshaw, Revisiting Sorting for GPGPU Stream Architectures, Technical Report CS2010-03, Department of Computer Science, University of Virginia, 2010.
11. NVIDIA Corporation, NVIDIA CUDA Programming Guide, version 4.0, 2010 (accessed 27.07.11). http://developer.download.nvidia.com/compute/cuda/4_0/toolkit/docs/CUDA_C_Programming_Guide.pdf
12. M. Harris, Optimizing Parallel Reduction in CUDA, 2007 (accessed 27.07.11). http://developer.download.nvidia.com/compute/DevZone/C/html/C/src/reduction/doc/reduction.pdf