Volume 30, Issue 2, 2010, Pages 56-69

The GPU computing era

Author keywords

CUDA; Fermi GPU architecture; GPU computing; GPU coprocessing; Heterogeneous CPU+; NVIDIA; Scalable parallel computing; Tesla GPU architecture

Indexed keywords

COPROCESSING; CUDA; FERMI GPU ARCHITECTURE; GPU COMPUTING; PARALLEL COMPUTING

EID: 77951154340     PISSN: 02721732     EISSN: None     Source Type: Journal    
DOI: 10.1109/MM.2010.41     Document Type: Article
Times cited : (766)

References (23)
  • 1
    • 78651550268
    • Scalable parallel programming with CUDA
    • J. Nickolls et al., "Scalable Parallel Programming with CUDA," ACM Queue, vol. 6, no. 2, 2008, pp. 40-53.
    • (2008) ACM Queue , vol.6 , Issue.2 , pp. 40-53
    • Nickolls, J.1
  • 2
    • 77951194616
    • NVIDIA, NVIDIA CUDA Programming Guide, 2009; http://developer.download.nvidia.com/compute/cuda/2-3/toolkit/docs/NVIDIA-CUDA-Programming-Guide-2.3.pdf.
    • (2009)
  • 3
    • 77951154206
    • DirectCompute: Capturing the teraflop
    • C. Boyd, "DirectCompute: Capturing the Teraflop," Microsoft Professional Developers Conf., 2009; http://ecn.channel9.msdn.com/o9/pdc09/ppt/CL03.pptx.
    • (2009) Microsoft Professional Developers Conf
    • Boyd, C.1
  • 4
    • 77951169432
    • Khronos, The OpenCL Specification, 2009; http://www.khronos.org/OpenCL.
    • (2009)
  • 5
    • 20344381571
    • The GeForce 6800
    • J. Montrym and H. Moreton, "The GeForce 6800," IEEE Micro, vol. 25, no. 2, 2005, pp. 41-51.
    • (2005) IEEE Micro , vol.25 , Issue.2 , pp. 41-51
    • Montrym, J.1    Moreton, H.2
  • 6
    • 77953983400
    • Cg: A system for programming graphics hardware in a C-like language
    • ACM Press
    • W.R. Mark et al., "Cg: A System for Programming Graphics Hardware in a C-like Language," Proc. Special Interest Group on Computer Graphics (Siggraph), ACM Press, 2003, pp. 896-907.
    • (2003) Proc. Special Interest Group on Computer Graphics (Siggraph) , pp. 896-907
    • Mark, W.R.1
  • 7
    • 44849137198
    • NVIDIA Tesla: A unified graphics and computing architecture
    • DOI 10.1109/MM.2008.31
    • E. Lindholm et al., "NVIDIA Tesla: A Unified Graphics and Computing Architecture," IEEE Micro, vol. 28, no. 2, 2008, pp. 39-55. (Pubitemid 351796170)
    • (2008) IEEE Micro , vol.28 , Issue.2 , pp. 39-55
    • Lindholm, E.1    Nickolls, J.2    Oberman, S.3    Montrym, J.4
  • 9
    • 77951181287
    • NVIDIA, "Fermi: NVIDIA's Next Generation CUDA Compute Architecture," 2009; http://www.nvidia.com/content/PDF/fermi-white-papers/NVIDIA-Fermi-Compute-Architecture-Whitepaper.pdf.
    • (2009)
  • 10
    • 53749092570
    • Parallel computing experiences with CUDA
    • M. Garland et al., "Parallel Computing Experiences with CUDA," IEEE Micro, vol. 28, no. 4, 2008, pp. 13-27.
    • (2008) IEEE Micro , vol.28 , Issue.4 , pp. 13-27
    • Garland, M.1
  • 12
    • 51649102178
    • Quantum chemistry on graphical processing units. 1. Strategies for two-electron integral evaluation
    • I.S. Ufimtsev and T.J. Martinez, "Quantum Chemistry on Graphical Processing Units. 1. Strategies for Two-Electron Integral Evaluation," J. Chemical Theory and Computation, vol. 4, no. 2, 2008, pp. 222-231.
    • (2008) J. Chemical Theory and Computation , vol.4 , Issue.2 , pp. 222-231
    • Ufimtsev, I.S.1    Martinez, T.J.2
  • 13
    • 64649105762
    • Accelerating molecular dynamic simulation on graphics processing units
    • M.S. Friedrichs et al., "Accelerating Molecular Dynamic Simulation on Graphics Processing Units," J. Computational Chemistry, vol. 30, no. 6, 2009, pp. 864-872.
    • (2009) J. Computational Chemistry , vol.30 , Issue.6 , pp. 864-872
    • Friedrichs, M.S.1
  • 14
    • 48349119234
    • TeraFLOP computing on a desktop PC with GPUs for 3D CFD
    • J. Tölke and M. Krafczyk, "TeraFLOP Computing on a Desktop PC with GPUs for 3D CFD," Int'l J. Computational Fluid Dynamics, vol. 22, no. 7, 2008, pp. 443-456.
    • (2008) Int'l J. Computational Fluid Dynamics , vol.22 , Issue.7 , pp. 443-456
    • Tölke, J.1    Krafczyk, M.2
  • 16
    • 77951188394
    • Solving lattice QCD systems of equations using mixed precision solvers on GPUs
    • M.A. Clark et al., "Solving Lattice QCD Systems of Equations Using Mixed Precision Solvers on GPUs," Computer Physics Comm., 2009; http://arxiv.org/abs/0911.3191v2.
    • (2009) Computer Physics Comm
    • Clark, M.A.1
  • 17
    • 70350618275
    • Performance and accuracy of hardware-oriented native-, emulated-, and mixed-precision solvers in FEM simulations (Part 2: Double precision GPUs)
    • Dortmund Univ. of Technology
    • D. Göddeke and R. Strzodka, "Performance and Accuracy of Hardware-Oriented Native-, Emulated-, and Mixed-Precision Solvers in FEM Simulations (Part 2: Double Precision GPUs)," Ergebnisberichte des Instituts für Angewandte Mathematik [Reports on Findings of the Inst. for Applied Mathematics], Dortmund Univ. of Technology, no. 370, 2008; http://www.mathematik.uni-dortmund.de/~goeddeke/pubs/GTX280-mixedprecision.pdf.
    • (2008) Reports on Findings of the Inst. for Applied Mathematics , Issue.370
    • Göddeke, D.1    Strzodka, R.2
  • 18
    • 35148867733
    • High performance direct gravitational N-body simulations on graphics processing units II: An implementation in CUDA
    • R.G. Belleman, J. Bedorf, and S.P. Zwart, "High Performance Direct Gravitational N-body Simulations on Graphics Processing Units II: An Implementation in CUDA," New Astronomy, vol. 13, no. 2, 2008, pp. 103-112.
    • (2008) New Astronomy , vol.13 , Issue.2 , pp. 103-112
    • Belleman, R.G.1    Bedorf, J.2    Zwart, S.P.3
  • 20
    • 85127260940
    • Efficient, high-quality image contour detection
    • IEEE CS Press
    • B. Catanzaro et al., "Efficient, High-Quality Image Contour Detection," Proc. IEEE Int'l Conf. Computer Vision, IEEE CS Press, 2009; http://www.cs.berkeley.edu/~catanzar/Damascene/iccv2009.pdf.
    • (2009) Proc. IEEE Int'l Conf. Computer Vision
    • Catanzaro, B.1
  • 21
    • 70450121292
    • Data-parallel large vocabulary continuous speech recognition on graphics processors
    • Univ. of California at Berkeley
    • J. Chong et al., "Data-Parallel Large Vocabulary Continuous Speech Recognition on Graphics Processors," tech. report UCB/EECS-2008-69, Univ. of California at Berkeley, 2008; http://www.eecs.berkeley.edu/Pubs/TechRpts/2008/EECS-2008-69.pdf.
    • (2008) Tech. Report UCB/EECS-2008-69
    • Chong, J.1
  • 22
    • 66749118424
    • Feasibility of GPU-assisted iterative image reconstruction for mobile C-Arm CT
    • SPIE
    • Y. Pan et al., "Feasibility of GPU-Assisted Iterative Image Reconstruction for Mobile C-Arm CT," Proc. Int'l Soc. for Optics and Photonics (SPIE), vol. 7258, SPIE, 2009; http://www.sci.utah.edu/~ypan/Pan-SPIE2009.pdf.
    • (2009) Proc. Int'l Soc. for Optics and Photonics (SPIE) , vol.7258
    • Pan, Y.1
  • 23
    • 77951179434
    • 2009: The GPU computing tipping point
    • J.H. Huang, "2009: The GPU Computing Tipping Point," Proc. IEEE Hot Chips 21, 2009; http://www.hotchips.org/archives/hc21.
    • (2009) Proc. IEEE Hot Chips 21
    • Huang, J.H.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.