메뉴 건너뛰기




Volumn 55, Issue 2, 2012, Pages 138-153

On the acceleration of wavefront applications using distributed many-core architectures

Author keywords

CUDA; GPU; many core computing; optimization; performance modelling; wavefront

Indexed keywords

APPLICATION PERFORMANCE; CUDA; DISTRIBUTED GRAPHICS; FUTURE PERFORMANCE; GPU; HIGH-PERFORMANCE COMPUTING; MANY-CORE ARCHITECTURE; MANY-CORE COMPUTING; NAS PARALLEL BENCHMARKS; PERFORMANCE MODEL; PERFORMANCE MODELLING; SCIENTIFIC AND ENGINEERING APPLICATIONS; THEORETICAL PERFORMANCE;

EID: 84856898868     PISSN: 00104620     EISSN: 14602067     Source Type: Journal    
DOI: 10.1093/comjnl/bxr073     Document Type: Article
Times cited : (18)

References (27)
  • 1
    • 0003605996 scopus 로고
    • RNR-94-007. NASA Ames Research Center, Moffet Field, CA
    • RNR-94-007. (1994) The NAS Parallel Benchmarks. NASA Ames Research Center, Moffet Field, CA.
    • (1994) The NAS Parallel Benchmarks
  • 2
    • 51049124075 scopus 로고    scopus 로고
    • A plugand-Play model for evaluating wavefront computations on parallel architectures
    • Miami, FL, April. IEEE Computer Society, Los Alamitos, CA
    • Mudalige, G.R., Vernon, M.K. and Jarvis, S.A. (2008) A Plugand-Play Model for Evaluating Wavefront Computations on Parallel Architectures. Proc. IEEE Int. Parallel and Distributed Processing Symp., Miami, FL, April 14-18. IEEE Computer Society, Los Alamitos, CA.
    • (2008) Proc. IEEE Int. Parallel and Distributed Processing Symp. , pp. 14-18
    • Mudalige, G.R.1    Vernon, M.K.2    Jarvis, S.A.3
  • 3
    • 84922896495 scopus 로고    scopus 로고
    • WARPP: A toolkit for simulating high-performance parallel scientific codes
    • Rome, Italy, March 2-6,. Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering, Brussels, Belgium
    • Hammond, S.D., Mudalige, G.R., Smith, J.A., Jarvis, S.A., Herdman, J.A. and Vadgama, A. (2009) WARPP: A Toolkit for Simulating High-Performance Parallel Scientific Codes. Proc. Int. Conf. Simulation Tools and Techniques, Rome, Italy, March 2-6, pp. 19:1-19:10. Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering, Brussels, Belgium.
    • (2009) Proc. Int. Conf. Simulation Tools and Techniques , pp. 191-1910
    • Hammond, S.D.1    Mudalige, G.R.2    Smith, J.A.3    Jarvis, S.A.4    Herdman, J.A.5    Vadgama, A.6
  • 4
    • 84949489562 scopus 로고    scopus 로고
    • A general predictive performance model for wavefront algorithms on clusters of SMPs
    • Toronto, Canada, August 21-24, IEEE Computer Society, Los Alamitos, CA
    • Hoisie, A., Lubeck, O.,Wasserman, H., Petrini, F. and Alme, H. (2000) A General Predictive Performance Model for Wavefront Algorithms on Clusters of SMPs. Proc. Int. Conf. Parallel Processing, Toronto, Canada, August 21-24, pp. 219-228. IEEE Computer Society, Los Alamitos, CA.
    • (2000) Proc. Int. Conf. Parallel Processing , pp. 219-228
    • Hoisie, A.1    Lubeck, O.2    Wasserman, H.3    Petrini, F.4    Alme, H.5
  • 6
    • 67549093800 scopus 로고    scopus 로고
    • Design and implementation of the smith-waterman algorithm on the CUDA-compatible gPU
    • Athens, Greece, October 8-10, IEEE Computer Society, Los Alamitos, CA
    • Munekawa, Y., Ino, F. and Hagihara, K. (2008) Design and Implementation of the Smith-Waterman Algorithm on the CUDA-Compatible GPU. Proc. IEEE Int. Conf. Bioinformatics and Bioengineering, Athens, Greece, October 8-10, pp. 1-6. IEEE Computer Society, Los Alamitos, CA.
    • (2008) Proc. IEEE Int. Conf. Bioinformatics and Bioengineering , pp. 1-6
    • Munekawa, Y.1    Ino, F.2    Hagihara, K.3
  • 7
    • 43349092363 scopus 로고    scopus 로고
    • CUDA Compatible GPU cards as efficient hardware accelerators for Smith-Waterman sequence alignment
    • Manavski, S. andValle, G. (2008) CUDA Compatible GPU cards as efficient hardware accelerators for Smith-Waterman sequence alignment. BMC Bioinf., 9, S10.
    • (2008) BMC Bioinf. , vol.9
    • Manavski, S.1    Valle, G.2
  • 10
    • 84856917433 scopus 로고
    • Los Alamos National Laboratory. (accessed May 12, 2011)
    • (1995) The ASCI Sweep 3D Benchmark. Los Alamos National Laboratory. http://www.c3.lanl.gov/pal/software/sweep3d/sweep3d-readme.html (accessed May 12, 2011).
    • (1995) The ASCI Sweep 3D Benchmark
  • 11
    • 70450059008 scopus 로고    scopus 로고
    • Accelerating leukocyte tracking using CUDA:A case study in leveragingmanycore coprocessors
    • Rome, Italy, May. IEEE Computer Society, Los Alamitos, CA
    • Boyer, M., Tarjan, D., Acton, S.T. and Skadron, K. (2009) Accelerating Leukocyte Tracking using CUDA:A Case Study in LeveragingManycore Coprocessors. Proc. IEEE Int.Parallel and Distributed Processing Symp., Rome, Italy, May 23-29. IEEE Computer Society, Los Alamitos, CA.
    • (2009) Proc. IEEE Int.Parallel and Distributed Processing Symp. , pp. 23-29
    • Boyer, M.1    Tarjan, D.2    Acton, S.T.3    Skadron, K.4
  • 12
    • 70350754502 scopus 로고    scopus 로고
    • High performance discrete fourier transforms on graphics processors
    • Austin, TX, November 15-21, IEEE Press Piscataway, NJ
    • Govindaraju, N.K., Lloyd, B., Dotsenko, Y., Smith, B. and Manferdelli, J. (2008) High Performance Discrete Fourier Transforms on Graphics Processors. Proc. ACM/IEEE Conf. Supercomputing, Austin, TX, November 15-21, pp. 2:1-2:12. IEEE Press Piscataway, NJ.
    • (2008) Proc. ACM/IEEE Conf. Supercomputing , pp. 21-212
    • Govindaraju, N.K.1    Lloyd, B.2    Dotsenko, Y.3    Smith, B.4    Manferdelli, J.5
  • 14
    • 78649859889 scopus 로고    scopus 로고
    • An MPICUDA implementation for massively parallel incompressible flow computations on multi-GPU clusters
    • Orlando, FL, January. American Institute of Aeronautics and Astronautics, Reston,VA
    • Jacobsen, D.A., Thibault, J.C. and Senocak, I. (2010) An MPICUDA Implementation for Massively Parallel Incompressible Flow Computations on Multi-GPU Clusters. Proc. 48th AIAA Aerospace Sciences Meeting, Orlando, FL, January 4-7. American Institute of Aeronautics and Astronautics, Reston,VA.
    • (2010) Proc. 48th AIAA Aerospace Sciences Meeting , pp. 4-7
    • Jacobsen, D.A.1    Thibault, J.C.2    Senocak, I.3
  • 15
    • 77954995885 scopus 로고    scopus 로고
    • Debunking the 100X GPU vs. CPU Myth: An evaluation of throughput computing on CPU and GPU
    • Saint-Malo, France, June 21-23,. ACM NewYork, NY
    • Lee,V.W. et al. (2010) Debunking the 100X GPU vs. CPU Myth: An Evaluation of Throughput Computing on CPU and GPU. Proc. ACM/IEEE Int. Symp. Computer Architecture, Saint-Malo, France, June 21-23, pp. 451-460. ACM NewYork, NY.
    • (2010) Proc. ACM/IEEE Int. Symp. Computer Architecture , pp. 451-460
    • Lee, V.W.1
  • 20
    • 84856919010 scopus 로고    scopus 로고
    • HPC Wire. (accessed November 4, 2010)
    • Lazou, C. (2010) Should I Buy GPGPUs or Blue Gene- HPC Wire. http://www.hpcwire.com/hpcwire/2010-11-04/should-i-buy-gpgpus-or-blue-gene.html (accessed November 4, 2010).
    • (2010) Should I buy GPGPUs or Blue Gene
    • Lazou, C.1
  • 23
    • 0016026944 scopus 로고
    • The parallel execution of DOloops
    • Lamport, L. (1974) The parallel execution of DOloops. Commun. ACM, 17, 83-93.
    • (1974) Commun. ACM , vol.17 , pp. 83-93
    • Lamport, L.1
  • 26
    • 84856919926 scopus 로고    scopus 로고
    • Lawrence Livermore National Laboratory. (accessed May 12, 2011
    • (2010) Livermore Computing Systems Summary. Lawrence Livermore National Laboratory. https://computing.llnl.gov/resources/systems-summary.pdf (accessed May 12, 2011).
    • (2010) Livermore Computing Systems Summary
  • 27
    • 23244465694 scopus 로고    scopus 로고
    • A performance comparison between the Earth Simulator and other terascale systems on a characteristic ASCI workload
    • DOI 10.1002/cpe.891
    • Kerbyson, D.J., Hoisie, A. and Wasserman, H. (2005) A performance comparison between the earth simulator and other terascale systems on a characteristic ASCI workload. Concurrency Comput.: Pract. Exp., 17, 1219-1238. (Pubitemid 41092969)
    • (2005) Concurrency Computation Practice and Experience , vol.17 , Issue.10 , pp. 1219-1238
    • Kerbyson, D.J.1    Hoisie, A.2    Wasserman, H.J.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.