메뉴 건너뛰기




Volumn 28, Issue 4, 2008, Pages 13-27

Parallel computing experiences with CUDA

Author keywords

Computational modeling; Computer architecture; Kernel; Object oriented modeling; Parallel processing; Program processors; Programming

Indexed keywords

PARALLEL PROCESSING SYSTEMS;

EID: 53749092570     PISSN: 02721732     EISSN: None     Source Type: Journal    
DOI: 10.1109/MM.2008.57     Document Type: Article
Times cited : (428)

References (22)
  • 1
    • 78651550268 scopus 로고    scopus 로고
    • Scalable Parallel Programming with CUDA
    • Mar./Apr
    • J. Nickolls et al., "Scalable Parallel Programming with CUDA," ACM Queue, vol. 6, no. 2, Mar./Apr. 2008, pp. 40-53.
    • (2008) ACM Queue , vol.6 , Issue.2 , pp. 40-53
    • Nickolls, J.1
  • 2
    • 44849137198 scopus 로고    scopus 로고
    • NVIDIA Tesla:A Unified Graphics and Computing Architecture
    • Mar./Apr
    • E. Lindholm et al., "NVIDIA Tesla:A Unified Graphics and Computing Architecture," IEEE Micro, vol. 28, no. 2, Mar./Apr. 2008, pp. 39-55.
    • (2008) IEEE Micro , vol.28 , Issue.2 , pp. 39-55
    • Lindholm, E.1
  • 3
    • 53749089739 scopus 로고    scopus 로고
    • Fast Support Vector Machine Training and Classification on Graphics Processors
    • Omnipress
    • B. Catanzaro, N. Sundaram, and K. Keutzer, "Fast Support Vector Machine Training and Classification on Graphics Processors," Proc. 25th Ann. Int'l Conf. Machine Learning, Omnipress, 2008, pp. 104-111.
    • (2008) Proc. 25th Ann. Int'l Conf. Machine Learning , pp. 104-111
    • Catanzaro, B.1    Sundaram, N.2    Keutzer, K.3
  • 4
    • 54749089017 scopus 로고    scopus 로고
    • Relational Joins on Graphics Processors
    • ACM Press
    • B. He et al., "Relational Joins on Graphics Processors," Proc. ACM SIGMOD 2008, ACM Press, 2008; www.cse.ust.hk/catalac/papers/gpujoin_sigmod08.pdf.
    • (2008) Proc. ACM SIGMOD 2008
    • He, B.1
  • 5
    • 38849131252 scopus 로고    scopus 로고
    • High-Throughput Sequence Alignment Using Graphics Processing Units
    • M. Schatz et al., "High-Throughput Sequence Alignment Using Graphics Processing Units," BMC Bioinformatics, vol. 8, no. 1, 2007, p. 474; http://dx.doi.org/l0.1186/1471-2105-8-474.
    • (2007) BMC Bioinformatics , vol.8 , Issue.1 , pp. 474
    • Schatz, M.1
  • 6
    • 43349092363 scopus 로고    scopus 로고
    • CUDA Compatible GPU Cards as Efficient Hardware Accelerators for Smith-Waterman Sequence Alignment
    • S. Manavski and G. Valle, "CUDA Compatible GPU Cards as Efficient Hardware Accelerators for Smith-Waterman Sequence Alignment," BMC Bioinformatics, vol. 9, suppl. 2, 2008, p. S10; http://dx.doi.org/ 10.1186/1471-2105-9-S2-S10.
    • (2008) BMC Bioinformatics , vol.9 , Issue.SUPPL. 2
    • Manavski, S.1    Valle, G.2
  • 8
    • 35548967122 scopus 로고    scopus 로고
    • Molecular Simulations, Academic Press
    • D. Frenkel and B. Smit, Understanding Molecular Simulations, Academic Press, 2002.
    • (2002) Understanding
    • Frenkel, D.1    Smit, B.2
  • 9
    • 43949100299 scopus 로고    scopus 로고
    • Micellar Crystals in Solution from Molecular Dynamics Simulations
    • J.A. Anderson, C.D. Lorenz, and A. Travesset, "Micellar Crystals in Solution from Molecular Dynamics Simulations," J. Chemical Physics vol. 128, 2008, pp. 184906-184916.
    • (2008) J. Chemical Physics , vol.128 , pp. 184906-184916
    • Anderson, J.A.1    Lorenz, C.D.2    Travesset, A.3
  • 10
    • 41249087856 scopus 로고    scopus 로고
    • General Purpose Molecular Dynamics Simulations Fully Implemented on Graphics Processing Units
    • May
    • J.A. Anderson, C.D. Lorenz, and A. Travesset, "General Purpose Molecular Dynamics Simulations Fully Implemented on Graphics Processing Units," J. Computational Physics, vol. 227, no. 10, May 2008, pp. 5342-5359.
    • (2008) J. Computational Physics , vol.227 , Issue.10 , pp. 5342-5359
    • Anderson, J.A.1    Lorenz, C.D.2    Travesset, A.3
  • 11
    • 35948963714 scopus 로고    scopus 로고
    • Accelerating Molecular Modeling Applications with Graphics Processors
    • J.E. Stone et al., "Accelerating Molecular Modeling Applications with Graphics Processors," J. Computational Chemistry, vol. 28, no. 16, 2007, pp. 2618-2640.
    • (2007) J. Computational Chemistry , vol.28 , Issue.16 , pp. 2618-2640
    • Stone, J.E.1
  • 12
    • 53749106683 scopus 로고    scopus 로고
    • GPU Acceleration of Cutoff Pair Potentials for Molecular Modeling Applications
    • ACM Press
    • C.I. Rodrigues et al., "GPU Acceleration of Cutoff Pair Potentials for Molecular Modeling Applications," Proc. 2008 Conf. Computing Frontiers (CF 08), ACM Press, 2008, pp. 273-282.
    • (2008) Proc. 2008 Conf. Computing Frontiers (CF 08) , pp. 273-282
    • Rodrigues, C.I.1
  • 13
    • 0002467378 scopus 로고
    • Fast Parallel Algorithms for Short-Range Molecular Dynamics
    • S. Plimpton, "Fast Parallel Algorithms for Short-Range Molecular Dynamics," J. Computational Physics, vol. 117, no. 1, 1995, pp. 1-19.
    • (1995) J. Computational Physics , vol.117 , Issue.1 , pp. 1-19
    • Plimpton, S.1
  • 14
    • 0034623787 scopus 로고    scopus 로고
    • Screen Savers of The World Unite
    • M. Shirts and V.S. Pande, "Screen Savers of The World Unite," Science, vol. 290, no. 5498, 2000, pp. 1903-1904.
    • (2000) Science , vol.290 , Issue.5498 , pp. 1903-1904
    • Shirts, M.1    Pande, V.S.2
  • 15
    • 53749087169 scopus 로고    scopus 로고
    • V. Volkov and J.W. Demmel, LU, QR and Cholesky Factorizations Using Vector Capabilities of GPUs, tech. report UCB/EECS-2008-49, EECS Dept., Univ. of Calif., Berkeley, 2008.
    • V. Volkov and J.W. Demmel, "LU, QR and Cholesky Factorizations Using Vector Capabilities of GPUs," tech. report UCB/EECS-2008-49, EECS Dept., Univ. of Calif., Berkeley, 2008.
  • 16
    • 79959466764 scopus 로고    scopus 로고
    • Optimization Principles and Application Performance Evaluation of a Multithreaded GPU using CUDA
    • ACM Press
    • S. Ryoo et al., "Optimization Principles and Application Performance Evaluation of a Multithreaded GPU using CUDA," Proc. 13th ACM SIGPLAN Symp. Principles and Practice of Parallel Programming, ACM Press, 2008, pp. 73-82.
    • (2008) Proc. 13th ACM SIGPLAN Symp. Principles and Practice of Parallel Programming , pp. 73-82
    • Ryoo, S.1
  • 17
    • 53749100057 scopus 로고    scopus 로고
    • Apparatus and Method for Imaging Objects with Wavefields
    • US patent 6,636,584, Patent and Trademark Office, 2003
    • S.A. Johnson et al., Apparatus and Method for Imaging Objects with Wavefields, US patent 6,636,584, Patent and Trademark Office, 2003.
    • Johnson, S.A.1
  • 18
    • 53749097239 scopus 로고
    • A Multiple Grid Scheme for Solving the Euler Equations
    • AIAA Press
    • R.H. Ni, "A Multiple Grid Scheme for Solving the Euler Equations," Proc. AIAA 5th Computational Fluid Dynamics Conf., AIAA Press, 1981, pp. 257-264.
    • (1981) Proc. AIAA 5th Computational Fluid Dynamics Conf , pp. 257-264
    • Ni, R.H.1
  • 19
    • 53749092877 scopus 로고    scopus 로고
    • A Multi-Grid Solver for the 2D Compressible Euler Equations on a GPU Cluster,
    • ECE-CE-2008-2, Computer Eng. Research Lab, Univ. of California, Davis
    • E.H. Phillips et al., "A Multi-Grid Solver for the 2D Compressible Euler Equations on a GPU Cluster," tech. report ECE-CE-2008-2, Computer Eng. Research Lab., Univ. of California, Davis, 2008; www.ece.ucdavis.edu/cerl/techreports/2008-2.
    • (2008) tech. report
    • Phillips, E.H.1
  • 21
    • 84976827056 scopus 로고
    • Parallel Tridiagonal Equation Solvers
    • Dec
    • H.S. Stone, "Parallel Tridiagonal Equation Solvers," ACM Trans. Mathematical Software, vol. 1, no. 4, Dec. 1975, pp. 289-307, http://doi.acm.org/l0.1145/355656.355657.
    • (1975) ACM Trans. Mathematical Software , vol.1 , Issue.4 , pp. 289-307
    • Stone, H.S.1
  • 22
    • 53749100179 scopus 로고    scopus 로고
    • Interactive Depth of Field Using Simulated Diffusion on a GPU,
    • 06-01, Pixar Animation Studios
    • M. Kass, A. Lefohn, and J. Owens, "Interactive Depth of Field Using Simulated Diffusion on a GPU," tech. report 06-01, Pixar Animation Studios, 2006; http://graphics.pixar.com/DepthOfField/.
    • (2006) tech. report
    • Kass, M.1    Lefohn, A.2    Owens, J.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.