메뉴 건너뛰기




Volumn , Issue , 2009, Pages 292-299

Program optimization of array-intensive SPEC2k benchmarks on multithreaded GPU using CUDA and brook+

Author keywords

Brook+; CUDA; GPGPU; mgrid; Optimization; Swim

Indexed keywords

BROOK+; COMPUTING CAPACITY; DATA LOCALITY; DATA PARALLEL; ELIMINATION TECHNOLOGY; EQUILIBRIUM POINT; GENERAL PURPOSE; GRAPHIC PROCESSING UNITS; HARDWARE AND SOFTWARE; LONG MEMORIES; MULTI-LEVEL MEMORY HIERARCHY; MULTITHREADED; PARALLEL COMPUTING; PROGRAM OPTIMIZATION; SOFTWARE PLATFORMS;

EID: 77949647837     PISSN: 15219097     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICPADS.2009.12     Document Type: Conference Paper
Times cited : (12)

References (15)
  • 1
    • 79959466764 scopus 로고    scopus 로고
    • S. Ryoo, C. I. Rodrigues, S. S. Baghsorkhi, S. S. Stone, D. B. Kirk, and W. mei W. Hwu, Optimization principles and application performance evaluation of a multithreaded GPU using CUDA, in Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (23th PPOPP'2008). Salt Lake City, UT: ACM SIGPLAN, Feb. 2008, pp. 73-82.
    • S. Ryoo, C. I. Rodrigues, S. S. Baghsorkhi, S. S. Stone, D. B. Kirk, and W. mei W. Hwu, "Optimization principles and application performance evaluation of a multithreaded GPU using CUDA," in Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (23th PPOPP'2008). Salt Lake City, UT: ACM SIGPLAN, Feb. 2008, pp. 73-82.
  • 2
    • 51449118065 scopus 로고    scopus 로고
    • A performance study of general-purpose applications on graphics processors using CUDA
    • S. Che, M. Boyer, J. Meng, D. Tarjan, J. W. Sheaffer, and K. Skadron, "A performance study of general-purpose applications on graphics processors using CUDA," J. Parallel Distrib. Comput, vol. 68, no. 10, pp. 1370-1380, 2008.
    • (2008) J. Parallel Distrib. Comput , vol.68 , Issue.10 , pp. 1370-1380
    • Che, S.1    Boyer, M.2    Meng, J.3    Tarjan, D.4    Sheaffer, J.W.5    Skadron, K.6
  • 3
    • 67650021816 scopus 로고    scopus 로고
    • G. Quintana-Ort?́, F. D. Igual, E. S. Quintana-Ort?́, and R. A. van de Geijn, Solving dense linear systems on platforms with multiple hardware accelerators, in PPOPP, D. A. Reed and V. Sarkar, Eds. ACM, 2009, pp. 121-130.
    • G. Quintana-Ort?́, F. D. Igual, E. S. Quintana-Ort?́, and R. A. van de Geijn, "Solving dense linear systems on platforms with multiple hardware accelerators," in PPOPP, D. A. Reed and V. Sarkar, Eds. ACM, 2009, pp. 121-130.
  • 4
    • 35948931417 scopus 로고    scopus 로고
    • Cache-efficient numerical algorithms using graphics hardware
    • N. K. Govindaraju and D. Manocha, "Cache-efficient numerical algorithms using graphics hardware," Parallel Comput., vol. 33, no. 10-11, pp. 663-684, 2007.
    • (2007) Parallel Comput , vol.33 , Issue.10-11 , pp. 663-684
    • Govindaraju, N.K.1    Manocha, D.2
  • 8
    • 24644456455 scopus 로고    scopus 로고
    • Automatic tiling of iterative stencil loops
    • Z. Li and Y. Song, "Automatic tiling of iterative stencil loops," ACM Trans. Program. Lang. Syst, vol. 26, no. 6, pp. 975-1028, 2004.
    • (2004) ACM Trans. Program. Lang. Syst , vol.26 , Issue.6 , pp. 975-1028
    • Li, Z.1    Song, Y.2
  • 13
    • 43449094719 scopus 로고    scopus 로고
    • S. Ryoo, C. I. Rodrigues, S. S. Stone, S. S. Baghsorkhi, S.-Z. Ueng, J. A. Stratton, and W. mei W. Hwu, Program optimization space pruning for a multithreaded gpu, in CGO, M. L. Soffa and E. Duesterwald, Eds. ACM, 2008, pp. 195-204.
    • S. Ryoo, C. I. Rodrigues, S. S. Stone, S. S. Baghsorkhi, S.-Z. Ueng, J. A. Stratton, and W. mei W. Hwu, "Program optimization space pruning for a multithreaded gpu," in CGO, M. L. Soffa and E. Duesterwald, Eds. ACM, 2008, pp. 195-204.
  • 14
    • 67650784628 scopus 로고    scopus 로고
    • Feedback-driven threading: Power-efficient and high-performance execution of multi-threaded workloads on cmps
    • M. A. Suleman, M. K. Qureshi, and Y. N. Patt, "Feedback-driven threading: power-efficient and high-performance execution of multi-threaded workloads on cmps," SIGARCH Comput. Archit. News, vol. 36, no. 1, pp. 277-286, 2008.
    • (2008) SIGARCH Comput. Archit. News , vol.36 , Issue.1 , pp. 277-286
    • Suleman, M.A.1    Qureshi, M.K.2    Patt, Y.N.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.