메뉴 건너뛰기




Volumn 30, Issue , 2011, Pages 82-89

Topical perspective on massive threading and parallelism

Author keywords

Computer architecture; CUDA; GPU; OpenCL; Parallel computing

Indexed keywords

APPLICATION PROGRAMS; COMPUTER HARDWARE; COMPUTER WORKSTATIONS; GRAPHICS PROCESSING UNIT; PARALLEL PROCESSING SYSTEMS; PROGRAM PROCESSORS; SOFTWARE DESIGN; SUPERCOMPUTERS;

EID: 84855334881     PISSN: 10933263     EISSN: 18734243     Source Type: Journal    
DOI: 10.1016/j.jmgm.2011.06.007     Document Type: Article
Times cited : (18)

References (61)
  • 1
    • 44949253519 scopus 로고    scopus 로고
    • A 30 year retrospective on Dennard's MOSFET scaling paper
    • IEEE
    • M. Bohr, A 30 year retrospective on Dennard's MOSFET scaling paper, IEEE SSCS 12 (2007).
    • (2007) SSCS , vol.12
    • Bohr, M.1
  • 2
    • 0016116644 scopus 로고    scopus 로고
    • Design of ion-implanted MOSFETs with very small physical dimensions
    • R.H. Dennard et al., Design of Ion-implanted MOSFETs with Very Small Physical Dimensions, IEEE J. Solid-State Circuits 9, 0018-9200.
    • IEEE J. Solid-State Circuits , vol.9 , pp. 0018-9200
    • Dennard, R.H.1
  • 3
    • 85030486166 scopus 로고    scopus 로고
    • NVIDIA, CUDA Zone, Online
    • NVIDIA, CUDA Zone, Online: http://www.nvidia.com/cuda.
  • 4
    • 85030494532 scopus 로고    scopus 로고
    • Group, Khronos, Online
    • Group, Khronos, Online: http://www.khronos.org/.
  • 5
    • 85030495446 scopus 로고    scopus 로고
    • Multiscalelab Online:
    • Multiscalelab, Online: http://www.multiscalelab.org/swan.
  • 6
    • 58449109179 scopus 로고    scopus 로고
    • MCUDA: An efficient implementation of CUDA Kernels for multi-core CPUs
    • A. Josà (Ed.), Springer, Berlin/Heidelberg, (doi:10.1007/978-3-540- 89740-8 2)
    • Stratton, John, Stone, Sam, Hwu, Wen-mei, MCUDA: an efficient implementation of CUDA Kernels for multi-core CPUs, in: A. Josà (Ed.), Languages and Compilers for Parallel Computing, vol. 5335, Springer, Berlin/Heidelberg, 2008, pp. 16-30 (doi:10.1007/978-3-540-89740-8 2).
    • (2008) Languages and Compilers for Parallel Computing , vol.5335 , pp. 16-30
    • John, S.1    Sam, S.2    Wen-mei, H.3
  • 7
    • 85030492009 scopus 로고    scopus 로고
    • Gpuocelot Online:
    • Gpuocelot, Online: http://www.code.google.com/p/gpuocelot/.
  • 10
    • 84882056790 scopus 로고    scopus 로고
    • Redefining what is possible
    • R.M. Farber, Redefining what is possible, Sci. Comput. (2011), http://www.scientificcomputing.com/articles-HPC-GPGPU-Redefining-What-is- Possible-010711.aspx.
    • (2011) Sci. Comput.
    • Farber, R.M.1
  • 11
    • 79953779756 scopus 로고    scopus 로고
    • Morgan Kaufmann, ISBN 978- 0-12-384988-5
    • Hwu, W. Wen-mei, GPU Computing Gems, Morgan Kaufmann, 2011, ISBN 978- 0-12-384988-5.
    • (2011) GPU Computing Gems
    • Wen-Mei, H.W.1
  • 12
    • 77954995885 scopus 로고    scopus 로고
    • Debunking the 100X GPU vs. CPU myth: An evaluation of throughput computing on CPU and GPU
    • V.W. Lee, et al., Debunking the 100X GPU vs. CPU myth: an evaluation of throughput computing on CPU and GPU, SIGARCH Comput. Archit. News 38 (2010) 451-460.
    • SIGARCH Comput. Archit. News , vol.38 , Issue.2010 , pp. 451-460
    • Lee, V.W.1
  • 13
    • 58149151095 scopus 로고    scopus 로고
    • Accelerating density functional calculations with graphics processing unit
    • K. Yasuda, Accelerating density functional calculations with graphics processing unit, J. Chem. Theor. Comput. 4 (2008) 1230-1236.
    • (2008) J. Chem. Theor. Comput. , vol.4 , pp. 1230-1236
    • Yasuda, K.1
  • 14
    • 79953697389 scopus 로고    scopus 로고
    • Niumerical precision: How much is enough?
    • R. Farber, Niumerical precision: how much is enough? Sci. Comput. (2009) 14.
    • (2009) Sci. Comput. , vol.14
    • Farber, R.1
  • 16
    • 62449169653 scopus 로고    scopus 로고
    • Parallel computing with graphics processing units for high-speed Monte Carlo simulation of photon migration
    • doi:10.1117/1.3041496
    • Alerstam Erik, Svensson Tomas, Andersson-Engels Stefan, Parallel computing with graphics processing units for high-speed Monte Carlo simulation of photon migration, J. Biomed. Opt. 13 (2008), doi:10.1117/1.3041496.
    • (2008) J. Biomed. Opt. , vol.13
    • Erik, A.1    Tomas, S.2    Stefan, A.-E.3
  • 17
    • 70749119824 scopus 로고    scopus 로고
    • Monte Carlo simulation of photon migration in 3D turbid media accelerated by graphics processing units
    • Q. Fang, D.A. Boas, Monte Carlo simulation of photon migration in 3D turbid media accelerated by graphics processing units, Opt. Express 17 (2009) 20178-20190.
    • (2009) Opt. Express , vol.17 , pp. 20178-20190
    • Fang, Q.1    Boas, D.A.2
  • 18
    • 84855338953 scopus 로고    scopus 로고
    • Fast calculation of DNMR spectra on CUDA-enabled graphics card
    • Z. Szalay, J. Rohonczy, Fast calculation of DNMR spectra on CUDA-enabled graphics card, J. Comput. Chem. (2010).
    • (2010) J. Comput. Chem.
    • Szalay, Z.1    Rohonczy, J.2
  • 20
    • 78649831026 scopus 로고    scopus 로고
    • Comparing hardware accelerators in scientific applications: A case study
    • Rick Weber, et al., Comparing hardware accelerators in scientific applications: a case study, IEEE Trans. Parallel Distrib. Syst. 22 (2011) 58-68.
    • (2011) IEEE Trans. Parallel Distrib. Syst. , vol.22 , pp. 58-68
    • Weber, R.1
  • 23
    • 77955516707 scopus 로고    scopus 로고
    • Understanding GPU programming for statistical computation: Studies in massively parallel massive mixtures
    • M.A. Suchard, et al., Understanding GPU programming for statistical computation: studies in massively parallel massive mixtures, J. Comput. Graph. Stat. 19 (2010) 419-438.
    • J. Comput. Graph. Stat. , vol.19 , Issue.2010 , pp. 419-438
    • Suchard, M.A.1
  • 24
    • 78751637391 scopus 로고    scopus 로고
    • Graphics processing units and highdimensional optimization
    • H. Zhou, L. Kenneth, A. Suchard, Graphics processing units and highdimensional optimization, Stat. Sci. 25 (2010) 311-324.
    • (2010) Stat. Sci. , vol.25 , pp. 311-324
    • Zhou, H.1    Kenneth, L.2    Suchard, A.3
  • 26
    • 85030488242 scopus 로고    scopus 로고
    • Online
    • NVIDIA, Computational Chemistry, 2011 Online: http://www.nvidia.com/ object/computationalchemistry.html.
    • (2011) NVIDIA Computational Chemistry
  • 27
    • 51649102178 scopus 로고    scopus 로고
    • Quantum chemistry on graphical processing units. 1. Strategies for two-electron integral evaluation
    • I.S. Ufimtsev, T.J. Martinez, Quantum chemistry on graphical processing units. 1. Strategies for two-electron integral evaluation, J. Chem. Theory Comput. 4 (2008) 222-231.
    • (2008) J. Chem. Theory Comput. , vol.4 , pp. 222-231
    • Ufimtsev, I.S.1    Martinez, T.J.2
  • 28
    • 65249137652 scopus 로고    scopus 로고
    • Quantumchemistry on graphical processing units. 2. Direct self-consistent-field implementation
    • I.S. Ufimtsev, T.J. Martinez,Quantumchemistry on graphical processing units. 2. Direct self-consistent-field implementation, J. Chem. Theory Comput. 5 (2009) 1004-1015.
    • (2009) J. Chem. Theory Comput. , vol.5 , pp. 1004-1015
    • Ufimtsev, I.S.1    Martinez, T.J.2
  • 29
    • 73949083571 scopus 로고    scopus 로고
    • Quantumchemistry on graphical processing units.3. Analytical energy gradients, geometry optimization, and first principles molecular dynamics
    • I.S. Ufimtsev, T.J. Martinez,Quantumchemistry on graphical processing units. 3. Analytical energy gradients, geometry optimization, and first principles molecular dynamics, J. Chem. Theory Comput. 5 (2009) 2619-2628.
    • (2009) J. Chem. Theory Comput. , vol.5 , pp. 2619-2628
    • Ufimtsev, I.S.1    Martinez, T.J.2
  • 30
    • 85030498575 scopus 로고    scopus 로고
    • BigDFT Institut Nanosciences et Cryogénie. Online
    • BigDFT, Institut Nanosciences et Cryogénie. Online: http://www.inac.cea.fr/LSim/BigDFT/.
  • 35
    • 77950551363 scopus 로고    scopus 로고
    • Efficient nonbonded interactions for molecular dynamics on a graphics processing unit
    • P. Eastman, V.S. Pande, Efficient nonbonded interactions for molecular dynamics on a graphics processing unit, J. Comput. Chem. 31 (2010) 1268-1272.
    • (2010) J. Comput. Chem. , vol.31 , pp. 1268-1272
    • Eastman, P.1    Pande, V.S.2
  • 37
    • 0042415783 scopus 로고    scopus 로고
    • NAMD2: Greater scalability for parallel molecular dynamics
    • K. Laxmikant, et al.,NAMD2:greater scalability for parallel molecular dynamics, J. Comput. Phys. 151 (1999) 283-312.
    • (1999) J. Comput. Phys. , vol.151 , pp. 283-312
    • Laxmikant, K.1
  • 38
    • 85030494108 scopus 로고    scopus 로고
    • NAMD, Theoretical and Computational Biophysics Grpup, University of Illinois at Urbana, Champaign
    • NAMD, Theoretical and Computational Biophysics Grpup, University of Illinois at Urbana, Champaign, http://www.ks.uiuc.edu/Research/namd/.
  • 39
    • 77956286977 scopus 로고    scopus 로고
    • GPU-accelerated molecular modeling coming of age
    • J.E. Stone, et al., GPU-accelerated molecular modeling coming of age, J. Mol. Graph. Modell. 29 (2010) 116-125.
    • (2010) J. Mol. Graph. Modell. , vol.29 , pp. 116-125
    • Stone, J.E.1
  • 40
    • 41249087856 scopus 로고    scopus 로고
    • General purpose molecular dynamics simulations fully implemented on graphics processing units
    • doi:10.1016/j.jcp.2008.01.047 (10, San Diego, CA)
    • J.A. Anderson, C.D. Lorenz, A. Travesset, General purpose molecular dynamics simulations fully implemented on graphics processing units, J. Comput. Phys. 227 (2008) 5342-5359, doi:10.1016/j.jcp.2008.01.047 (10, San Diego, CA).
    • (2008) J. Comput. Phys. , vol.227 , pp. 5342-5359
    • Anderson, J.A.1    Lorenz, C.D.2    Travesset, A.3
  • 41
    • 85030489126 scopus 로고    scopus 로고
    • NCSA, NCSA deploys GPU-enabled TeraChem software on Lincoln cluster, 2010 Online,(accessed 22.04.11) (http://w ww.illinois.ed u/lb/artic le/2101/464 46/page=1/l ist=list)
    • NCSA, NCSA deploys GPU-enabled TeraChem software on Lincoln cluster, 2010 Online: http://www.illinois.edu (accessed 22.04.11) (http://www.illinois.edu/ lb/article/2101/46446/page=1/list=list).
  • 42
    • 54849440125 scopus 로고    scopus 로고
    • Graphical processing units for quantum chemistry
    • doi:10.1109/MCSE. 2008.148
    • I.S. Ufimtsev, T.J. Martinez, Graphical processing units for quantum chemistry, Comput. Sci. Eng. 10 (2008), doi:10.1109/MCSE. 2008.148.
    • (2008) Comput. Sci. Eng. , vol.10
    • Ufimtsev, I.S.1    Martinez, T.J.2
  • 44
    • 77951978448 scopus 로고    scopus 로고
    • Accelerating electrostatic surface potential calculation with multi-scale approximationongraphics processing units
    • R. Anandakrishnan, et al., Accelerating electrostatic surface potential calculation with multi-scale approximationongraphics processing units, J. Mol. Graph. Modell. 28 (2010) 904-910.
    • (2010) J. Mol. Graph. Modell. , vol.28 , pp. 904-910
    • Anandakrishnan, R.1
  • 45
    • 77953129081 scopus 로고    scopus 로고
    • Real-time optical micro-manipulation using optimized holograms generated on the GPU
    • S. Bianchi, R. Di Leonardo, Real-time optical micro-manipulation using optimized holograms generated on the GPU, Comput. Phys. Commun. 181 (2010) 1444-1448.
    • (2010) Comput. Phys. Commun. , vol.181 , pp. 1444-1448
    • Bianchi, S.1    Di Leonardo, R.2
  • 46
    • 78650777505 scopus 로고    scopus 로고
    • Immersive molecular visualization and interactive modeling with commodity hardware
    • B. George, et al. (Eds.), Springer, Berlin/Heidelberg, (doi: 10.1007/978- 3-642-17274-838)
    • J. Stone, et al., Immersive molecular visualization and interactive modeling with commodity hardware, in: B. George, et al. (Eds.), Advances in Visual Computing, vol. 6454, Springer, Berlin/Heidelberg, 2010, pp. 382-393 (doi: 10.1007/978- 3-642-17274-8 38).
    • (2010) Advances in Visual Computing , vol.6454 , pp. 382-393
    • Stone, J.1
  • 47
    • 77649218601 scopus 로고    scopus 로고
    • Toward discovery science of human brain function
    • U S A
    • B.B. Biswal, et al., Toward discovery science of human brain function, Proc. Natl. Acad. Sci. U S A 107 (2010) 4734-4739.
    • (2010) Proc. Natl. Acad. Sci. , vol.107 , pp. 4734-4739
    • Biswal, B.B.1
  • 48
    • 77951870906 scopus 로고    scopus 로고
    • Ssecrett and neurotrace: Interactive visualization and analysis tools for large-scale neuroscience data sets
    • W.-K. Jeong, et al., Ssecrett and neurotrace: interactive visualization and analysis tools for large-scale neuroscience data sets, IEEE M CGA 30 (2010) 58-70.
    • (2010) IEEE M CGA , vol.30 , pp. 58-70
    • Jeong, W.-K.1
  • 49
    • 66249148685 scopus 로고    scopus 로고
    • Semi-automated reconstruction of neural processes from large numbers of fluorescence images
    • Ju Lu, Semi-automated reconstruction of neural processes from large numbers of fluorescence images, PLoS One 4 (2009) e5655.
    • (2009) PLoS One , vol.4
    • Ju, Lu.1
  • 51
    • 77953889259 scopus 로고    scopus 로고
    • GPU computing for systems biology
    • L. Dematte, D. Prandi, GPU computing for systems biology, Brief. Bioinform. 11 (2010) 323-333.
    • (2010) Brief. Bioinform. , vol.11 , pp. 323-333
    • Dematte, L.1    Prandi, D.2
  • 52
    • 77955283080 scopus 로고    scopus 로고
    • Integrative multicellular biological modeling: A case study of 3D epidermal development using GPU algorithms
    • S. Christley, et al., Integrative multicellular biological modeling: a case study of 3D epidermal development using GPU algorithms, BMC Syst. Biol. 4 (2010) 107.
    • (2010) BMC Syst. Biol. , vol.4 , pp. 107
    • Christley, S.1
  • 53
    • 77952004390 scopus 로고    scopus 로고
    • SIML: A fast SIMD algorithm for calculating LINGO chemical similarities on GPUs and CPUs
    • I.S. Haque, V.S. Pande, W.P. Walters, SIML: a fast SIMD algorithm for calculating LINGO chemical similarities on GPUs and CPUs, J. Chem. Inf. Model. 50 (2010) 560-564.
    • (2010) J. Chem. Inf. Model. , vol.50 , pp. 560-564
    • Haque, I.S.1    Pande, V.S.2    Walters, W.P.3
  • 54
    • 77956481825 scopus 로고    scopus 로고
    • Fast and accurate protein substructure searching with simulated annealing and GPUs
    • A. Stivala, P. Stuckey, A. Wirth, Fast and accurate protein substructure searching with simulated annealing and GPUs, BMC Bioinformatics 11 (2010) 446.
    • (2010) BMC Bioinformatics , vol.11 , pp. 446
    • Stivala, A.1    Stuckey, P.2    Wirth, A.3
  • 55
    • 85030487573 scopus 로고    scopus 로고
    • MAGMA. Innovative Computing Laboratory, The University of Tennessee. Online
    • MAGMA. Innovative Computing Laboratory, The University of Tennessee. Online: http://icl.cs.utk.edu/magma/.
  • 60
    • 85030497250 scopus 로고    scopus 로고
    • Practical lock-free data structures, Computer Laboratory. University of Cambridge. Online, (accessed 10.01.11)
    • Practical lock-free data structures, Computer Laboratory. University of Cambridge. Online: http://www.cl.cam.ac.uk/research/srg/netos/lock-free/, 2011 (accessed 10.01.11).
    • (2011)


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.