2012, Pages 49-60

Simultaneous branch and warp interweaving for sustained GPU performance

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER ARCHITECTURE; COMPUTER GRAPHICS; LOCKS (FASTENERS); PROGRAM PROCESSORS;

EID: 84864834311     PISSN: 10636897     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2366231.2337166     Document Type: Conference Paper
Times cited: 68

References (31)
  • 8
    • 84856515692
    • PEPSC: A power-efficient processor for scientific computing
    • G. Dasika, A. Sethia, T. Mudge, and S. Mahlke. PEPSC: A power-efficient processor for scientific computing. In PACT, 2011.
    • (2011) PACT
    • Dasika, G.1    Sethia, A.2    Mudge, T.3    Mahlke, S.4
  • 11
    • 70449647744
    • CASH: Revisiting hardware sharing in single-chip parallel processor
    • R. Dolbeau and A. Seznec. CASH: Revisiting hardware sharing in single-chip parallel processor. Journal of Instruction-Level Parallelism, 6:1-16, 2004.
    • (2004) Journal of Instruction-Level Parallelism , vol.6 , pp. 1-16
    • Dolbeau, R.1    Seznec, A.2
  • 15
    • 68549096107
    • Dynamic warp formation: Efficient MIMD control flow on SIMD graphics hardware
    • July
    • W. W. L. Fung, I. Sham, G. Yuan, and T. M. Aamodt. Dynamic warp formation: Efficient MIMD control flow on SIMD graphics hardware. ACM Trans. Archit. Code Optim., 6:7:1-7:37, July 2009.
    • (2009) ACM Trans. Archit. Code Optim. , vol.6 , pp. 7:1-7:37
    • Fung, W.W.L.1    Sham, I.2    Yuan, G.3    Aamodt, T.M.4
  • 22
    • 44849137198
    • NVIDIA Tesla: A unified graphics and computing architecture
    • J. E. Lindholm, J. Nickolls, S. Oberman, and J. Montrym. NVIDIA Tesla: A unified graphics and computing architecture. IEEE Micro, 28(2):39-55, 2008.
    • (2008) IEEE Micro , vol.28 , Issue.2 , pp. 39-55
    • Lindholm, J.E.1    Nickolls, J.2    Oberman, S.3    Montrym, J.4
  • 24
    • 77954976292
    • Dynamic warp subdivision for integrated branch and memory divergence tolerance
    • J. Meng, D. Tarjan, and K. Skadron. Dynamic warp subdivision for integrated branch and memory divergence tolerance. SIGARCH Comput. Archit. News, 38(3):235-246, 2010.
    • (2010) SIGARCH Comput. Archit. News , vol.38 , Issue.3 , pp. 235-246
    • Meng, J.1    Tarjan, D.2    Skadron, K.3
  • 25
    • 84864829539
    • Scheduler in multi-threaded processor prioritizing instructions passing qualification rule
    • US Patent 7949855, May
    • P. C. Mills, J. E. Lindholm, B. W. Coon, G. M. Tarolli, and J. M. Burgess. Scheduler in multi-threaded processor prioritizing instructions passing qualification rule. US Patent 7949855, May 2011.
    • (2011)
    • Mills, P.C.1    Lindholm, J.E.2    Coon, B.W.3    Tarolli, G.M.4    Burgess, J.M.5
  • 27
    • 77951154340
    • The GPU computing era
    • March
    • J. Nickolls and W. J. Dally. The GPU computing era. IEEE Micro, 30:56-69, March 2010.
    • (2010) IEEE Micro , vol.30 , pp. 56-69
    • Nickolls, J.1    Dally, W.J.2
  • 28
    • 85184640695
    • NVIDIA CUDA SDK, 2010. http://www.nvidia.com/cuda/.
    • (2010)
  • 29
    • 33644661238
    • Content-addressable memory (CAM) circuits and architectures: A tutorial and survey
    • March
    • K. Pagiamtzis and A. Sheikholeslami. Content-addressable memory (CAM) circuits and architectures: a tutorial and survey. IEEE Journal of Solid-State Circuits, 41(3):712-727, March 2006.
    • (2006) IEEE Journal of Solid-State Circuits , vol.41 , Issue.3 , pp. 712-727
    • Pagiamtzis, K.1    Sheikholeslami, A.2
  • 31
    • 0029183524
    • Simultaneous multithreading: Maximizing on-chip parallelism
    • May
    • D. M. Tullsen, S. J. Eggers, and H. M. Levy. Simultaneous multithreading: maximizing on-chip parallelism. SIGARCH Comput. Archit. News, 23:392-403, May 1995.
    • (1995) SIGARCH Comput. Archit. News , vol.23 , pp. 392-403
    • Tullsen, D.M.1    Eggers, S.J.2    Levy, H.M.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.