-
1
-
-
78651550268
-
Scalable parallel programming with CUDA
-
J. Nickolls et al.,"Scalable Parallel Programming with CUDA," ACM Queue, vol.6, no.2, 2008, pp. 40-53.
-
(2008)
ACM Queue
, vol.6
, Issue.2
, pp. 40-53
-
-
Nickolls, J.1
-
2
-
-
77951194616
-
-
NVIDIA, NVIDIA CUDA Programming Guide, 2009; http://developer.download.nvidia.com/compute/cuda/2-3/toolkit/docs/NVIDIA-CUDA-Programming-Guide-2.3.pdf.
-
(2009)
-
-
-
3
-
-
77951154206
-
DirectCompute: Capturing the teraflop
-
C. Boyd,"DirectCompute: Capturing the Teraflop," Microsoft Professional Developers Conf., 2009; http://ecn.channel9.msdn.com/o9/pdc09/ppt/CL03.pptx.
-
(2009)
Microsoft Professional Developers Conf
-
-
Boyd, C.1
-
4
-
-
77951169432
-
-
Khronos, The OpenCL Specification, 2009; http://www.khronos.org/OpenCL.
-
(2009)
-
-
-
5
-
-
20344381571
-
The GeForce 6800
-
J. Montrym and H. Moreton,"The GeForce 6800," IEEE Micro, vol.25, no.2, 2005, pp. 41-51.
-
(2005)
IEEE Micro
, vol.25
, Issue.2
, pp. 41-51
-
-
Montrym, J.1
Moreton, H.2
-
6
-
-
77953983400
-
Cg: A system for programming graphics hardware in a c-like language
-
ACM Press
-
W.R. Mark et al.,"Cg: A System for Programming Graphics Hardware in a C-like Language," Proc. Special Interest Group on Computer Graphics (Siggraph), ACM Press, 2003, pp. 896-907.
-
(2003)
Proc. Special Interest Group on Computer Graphics (Siggraph)
, pp. 896-907
-
-
Mark, W.R.1
-
7
-
-
44849137198
-
NVIDIA Tesla: A unified graphics and computing architecture
-
DOI 10.1109/MM.2008.31
-
E. Lindholm et al.,"NVIDIA Tesla: A Unified Graphics and Computing Architecture," IEEE Micro, vol.28, no.2, 2008, pp. 39-55.
-
(2008)
IEEE Micro
, vol.28
, Issue.2
, pp. 39-55
-
-
Lindholm, E.1
Nickolls, J.2
Oberman, S.3
Montrym, J.4
-
8
-
-
77951148621
-
Graphics and computing GPUs
-
D.A. Patterson and J.L. Hennessy, 4th ed., Morgan Kaufmann
-
J. Nickolls and D. Kirk,"Graphics and Computing GPUs," Computer Organization and Design: The Hardware/Software Interface, D.A. Patterson and J.L. Hennessy, 4th ed., Morgan Kaufmann, 2009, pp. A2-A77.
-
(2009)
Computer Organization and Design: The Hardware/Software Interface
-
-
Nickolls, J.1
Kirk, D.2
-
9
-
-
77951181287
-
-
NVIDIA,"Fermi: NVIDIA's Next Generation CUDA Compute Architecture," 2009; http://www.nvidia.com/content/PDF/fermi-white-papers/NVIDIA-Fermi-Compute-Architecture-Whitepaper.pdf.
-
(2009)
-
-
-
10
-
-
53749092570
-
Parallel computing experiences with CUDA
-
M. Garland et al.,"Parallel Computing Experiences with CUDA," IEEE Micro, vol.28, no.4, 2008, pp. 13-27.
-
(2008)
IEEE Micro
, vol.28
, Issue.4
, pp. 13-27
-
-
Garland, M.1
-
12
-
-
51649102178
-
Quantum chemistry on graphical processing units. 1. Strategies for two-electron integral evaluation
-
I.S. Ufimtsev and T.J. Martinez,"Quantum Chemistry on Graphical Processing Units. 1. Strategies for Two-Electron Integral Evaluation," J. Chemical Theory and Computation, vol.4, no.2, 2008, pp. 222-231.
-
(2008)
J. Chemical Theory and Computation
, vol.4
, Issue.2
, pp. 222-231
-
-
Ufimtsev, I.S.1
Martinez, T.J.2
-
13
-
-
64649105762
-
Accelerating molecular dynamic simulation on graphics processing units
-
M.S. Friedrichs et al.,"Accelerating Molecular Dynamic Simulation on Graphics Processing Units," J. Computational Chemistry, vol.30, no.6, 2009, pp. 864-872.
-
(2009)
J. Computational Chemistry
, vol.30
, Issue.6
, pp. 864-872
-
-
Friedrichs, M.S.1
-
14
-
-
48349119234
-
TeraFLOP computing on a desktop PC with GPUs for 3D CFD
-
J. Tölke and M. Krafczyk,"TeraFLOP Computing on a Desktop PC with GPUs for 3D CFD," Int'l J. Computational Fluid Dynamics, vol.22, no.7, 2008, pp. 443-456.
-
(2008)
Int'l J. Computational Fluid Dynamics
, vol.22
, Issue.7
, pp. 443-456
-
-
Tölke, J.1
Krafczyk, M.2
-
16
-
-
77951188394
-
Solving lattice QCD systems of equations using mixed precision solvers on GPUs
-
M.A. Clark et al.,"Solving Lattice QCD Systems of Equations Using Mixed Precision Solvers on GPUs," Computer Physics Comm., 2009; http://arxiv.org/abs/0911.3191v2.
-
(2009)
Computer Physics Comm
-
-
Clark, M.A.1
-
17
-
-
70350618275
-
Ergebnisberichte des Instituts für Angewandte Mathematik
-
Dortmund Univ. of Technology
-
D. Göddeke and R. Strzodka,"Performance and Accuracy of Hardware-Oriented Native-, Emulated-, and Mixed-Precision Solvers in FEM Simulations (Part 2: Double Precision GPUs)," Ergebnisberichte des Instituts für Angewandte Mathematik [Reports on Findings of the Inst. for Applied Mathematics], Dortmund Univ. of Technology, no.370, 2008; http://www.mathematik.uni-dortmund.de/~goeddeke/pubs/GTX280-mixedprecision.pdf.
-
(2008)
Reports on Findings of the Inst. for Applied Mathematics
, Issue.370
-
-
Göddeke, D.1
Strzodka, R.2
-
18
-
-
35148867733
-
High performance direct gravitational n-body simulations on graphics processing units II: An implementation in CUDA
-
R.G. Belleman, J. Bedorf, and S.P. Zwart,"High Performance Direct Gravitational N-body Simulations on Graphics Processing Units II: An Implementation in CUDA," New Astronomy, vol.13, no.2, 2008, pp. 103-112.
-
(2008)
New Astronomy
, vol.13
, Issue.2
, pp. 103-112
-
-
Belleman, R.G.1
Bedorf, J.2
Zwart, S.P.3
-
19
-
-
71049182306
-
MSACUDA: Multiple sequence alignment on graphics processing units with CUDA
-
IEEE CS Press
-
Y. Liu, B. Schmidt, and D.L. Maskell,"MSACUDA: Multiple Sequence Alignment on Graphics Processing Units with CUDA," Proc. 20th IEEE Int'l Conf. Application-Specific Systems, Architectures and Processors, IEEE CS Press, 2009, pp. 121-128.
-
(2009)
Proc. 20th IEEE Int'l Conf. Application-Specific Systems, Architectures and Processors
, pp. 121-128
-
-
Liu, Y.1
Schmidt, B.2
Maskell, D.L.3
-
20
-
-
85127260940
-
Efficient, high-quality image contour detection
-
IEEE CS Press
-
B. Catanzaro et al.,"Efficient, High-Quality Image Contour Detection," Proc. IEEE Int'l Conf. Computer Vision, IEEE CS Press, 2009; http://www.cs.berkeley.edu/~catanzar/Damascene/iccv2009.pdf.
-
(2009)
Proc. IEEE Int'l Conf. Computer Vision
-
-
Catanzaro, B.1
-
21
-
-
70450121292
-
Data-parallel large vocabulary continuous speech recognition on graphics processors
-
Univ. of California at Berkeley
-
J. Chong et al.,"Data-Parallel Large Vocabulary Continuous Speech Recognition on Graphics Processors," tech. report UCB/EECS-2008-69, Univ. of California at Berkeley, 2008; http://www.eecs.berkeley.edu/Pubs/TechRpts/2008/EECS-2008-69.pdf.
-
(2008)
Tech. Report UCB/EECS-2008-69
-
-
Chong, J.1
-
22
-
-
66749118424
-
Feasibility of GPU-assisted iterative image reconstruction for mobile C-Arm CT
-
SPIE
-
Y. Pan et al.,"Feasibility of GPU-Assisted Iterative Image Reconstruction for Mobile C-Arm CT," Proc. Int'l Soc. for Photonics and Optonics (SPIE), vol.7258, SPIE, 2009; http://www.sci.utah.edu/~ypan/Pan-SPIE2009.pdf.
-
(2009)
Proc. Int'l Soc. for Photonics and Optonics (SPIE)
, vol.7258
-
-
Pan, Y.1
-
23
-
-
77951179434
-
2009: The GPU computing tipping point
-
J.H. Huang,"2009: The GPU Computing Tipping Point," Proc. IEEE Hot Chips 21, 2009; http://www.hotchips.org/archives/hc21.
-
(2009)
Proc. IEEE Hot Chips 21
-
-
Huang, J.H.1