메뉴 건너뛰기




Volumn , Issue , 2008, Pages 73-82

Optimization principles and application performance evaluation of a multithreaded GPU using CUDA

Author keywords

GPU computing; Parallel computing

Indexed keywords

BANDWIDTH; COMPUTER HARDWARE; MULTITASKING; PARALLEL PROCESSING SYSTEMS; PARALLEL PROGRAMMING; PROGRAM PROCESSORS;

EID: 79959466764     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1345206.1345220     Document Type: Conference Paper
Times cited : (706)

References (28)
  • 1
    • 56749110402 scopus 로고    scopus 로고
    • AMD Stream Processor, http://ati.amd.com/products/ streamprocessor/index. html.
    • AMD Stream Processor
  • 2
    • 79959464460 scopus 로고    scopus 로고
    • CUDA benchmark suite.
    • CUDA benchmark suite. http://www.crhc.uiuc.edu/impact/cudabench.html.
  • 3
    • 79959387698 scopus 로고    scopus 로고
    • NVIDIA CUDA.
    • NVIDIA CUDA. http://developer.nvidia.com/object/cuda.html.
  • 4
    • 79952798755 scopus 로고    scopus 로고
    • The PeakStream platform: High productivity software development for multi-core processors
    • The PeakStream platform: High productivity software development for multi-core processors. Technical report, 2006.
    • (2006) Technical Report
  • 5
    • 79959428942 scopus 로고    scopus 로고
    • ECE 498AL1: Programming massively parallel processors
    • ECE 498AL1: Programming massively parallel processors, Fall 2007. http://courses.ece.uiuc.edu/ece498/al1/.
    • (2007) Fall
  • 7
    • 0023438847 scopus 로고
    • AUTOMATIC TRANSLATION OF FORTRAN PROGRAMS TO VECTOR FORM
    • DOI 10.1145/29873.29875
    • R. Allen and K. Kennedy. Automatic translation of Fortran programs to vector form. ACM Transactions on Programming Langugages and Systems, 9(4): 491-542, 1987. (Pubitemid 18531687)
    • (1987) ACM Transactions on Programming Languages and Systems , vol.9 , Issue.4 , pp. 491-542
    • Allen, R.1    Kennedy, K.2
  • 10
    • 3142766244 scopus 로고    scopus 로고
    • Improving register allocation for subscripted variables
    • D. Callahan, S. Carr, and K. Kennedy. Improving register allocation for subscripted variables. ACM SIGPLAN Notices, 9(4): 328-342, 2004.
    • (2004) ACM SIGPLAN Notices , vol.9 , Issue.4 , pp. 328-342
    • Callahan, D.1    Carr, S.2    Kennedy, K.3
  • 20
    • 30744455496 scopus 로고    scopus 로고
    • Streaming architectures and technology trends
    • March
    • J. Owens. Streaming architectures and technology trends. GPU Gems 2, pages 457-470, March 2005.
    • (2005) GPU Gems , vol.2 , pp. 457-470
    • Owens, J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.