메뉴 건너뛰기




Volumn , Issue , 2012, Pages 72-83

iGPU: Exception support and speculative execution on GPUs

Author keywords

[No Author keywords available]

Indexed keywords

CONTEXT SWITCH; DYNAMIC COMPILATION; ENTRY POINT; EXCEPTION HANDLING; GPU COMPUTING; GPU PROGRAMS; HARDWARE DESIGN; HARDWARE EXTENSION; LOW OVERHEAD; SPECULATIVE EXECUTION; VERTEX SHADER; VIRTUAL MEMORY;

EID: 84864857149     PISSN: 10636897     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ISCA.2012.6237007     Document Type: Conference Paper
Times cited : (62)

References (43)
  • 3
    • 33847113086 scopus 로고    scopus 로고
    • Cost reduction and evaluation of a temporary faults detecting technique
    • L. Anghel and M. Nicolaidis. Cost reduction and evaluation of a temporary faults detecting technique. In DATE '00.
    • DATE '00
    • Anghel, L.1    Nicolaidis, M.2
  • 4
    • 84892495526 scopus 로고    scopus 로고
    • DIVA: A reliable substrate for deep submicron microarchitecturedesign
    • T. Austin. DIVA: A Reliable Substrate for Deep Submicron MicroarchitectureDesign. In MICRO '99.
    • MICRO '99
    • Austin, T.1
  • 7
    • 77952374697 scopus 로고    scopus 로고
    • GPU technology trends and future requirements
    • J. Chen. GPU technology trends and future requirements. In IEDM '09.
    • IEDM '09
    • Chen, J.1
  • 10
    • 84863433943 scopus 로고    scopus 로고
    • Static analysis and compiler design for idempotent processing
    • M. de Kruijf, K. Sankaralingam, and S. Jha. Static analysis and compiler design for idempotent processing. In PLDI '12.
    • PLDI '12
    • De Kruijf, M.1    Sankaralingam, K.2    Jha, S.3
  • 11
    • 85051053411 scopus 로고    scopus 로고
    • Ocelot: A dynamic compiler for bulk-synchronous applications in heterogeneous systems
    • G. Diamos, A. Kerr, S. Yalamanchili, and N. Clark. Ocelot: A dynamic compiler for bulk-synchronous applications in heterogeneous systems. In PACT '10.
    • PACT '10
    • Diamos, G.1    Kerr, A.2    Yalamanchili, S.3    Clark, N.4
  • 17
    • 57749207483 scopus 로고    scopus 로고
    • Decor: A delayed commit and rollback mechanism for handling inductive noise in processors
    • M. Gupta, K. Rangan, M. Smith, G.-Y. Wei, and D. Brooks. Decor: A delayed commit and rollback mechanism for handling inductive noise in processors. In HPCA '08.
    • HPCA '08
    • Gupta, M.1    Rangan, K.2    Smith, M.3    Wei, G.-Y.4    Brooks, D.5
  • 18
    • 34547478253 scopus 로고    scopus 로고
    • Implementing virtual memory in a vector processor with software restart markers
    • M. Hampton and K. Asanović. Implementing virtual memory in a vector processor with software restart markers. In ICS '06.
    • ICS '06
    • Hampton, M.1    Asanović, K.2
  • 20
    • 84864837561 scopus 로고    scopus 로고
    • id. id tech 5 challenges: From texture virtualization to massive vparallelization
    • id. id tech 5 challenges: From texture virtualization to massive vparallelization. In SIGGRAPH '09.
    • SIGGRAPH '09
  • 21
    • 85086812274 scopus 로고    scopus 로고
    • Overcoming the limitations of conventional vector processors
    • C. Kozyrakis and D. Patterson. Overcoming the limitations of conventional vector processors. In ISCA '03.
    • ISCA '03
    • Kozyrakis, C.1    Patterson, D.2
  • 24
    • 77954976292 scopus 로고    scopus 로고
    • Dynamic warp subdivision for int rated branch and memory divergence tolerance
    • J. Meng, D. Tarjan, and K. Skadron. Dynamic warp subdivision for int rated branch and memory divergence tolerance. In ISCA '10.
    • ISCA '10
    • Meng, J.1    Tarjan, D.2    Skadron, K.3
  • 25
    • 84864863031 scopus 로고    scopus 로고
    • Register renaming and dynamic speculation: An alternative approach
    • M. Moudgill, K. Pingali, and S. Vassiliadis. Register renaming and dynamic speculation: an alternative approach. In MICRO '93.
    • MICRO '93
    • Moudgill, M.1    Pingali, K.2    Vassiliadis, S.3
  • 28
    • 0024013815 scopus 로고
    • The IBM System/370 vector architecture: Design considerations
    • May
    • A. Padegs, B. Moore, R. Smith, and W. Buchholz. The IBM System/370 vector architecture: design considerations. Computers, IEEE Transactions on, 37(5):509-520, May 1988.
    • (1988) Computers, IEEE Transactions on , vol.37 , Issue.5 , pp. 509-520
    • Padegs, A.1    Moore, B.2    Smith, R.3    Buchholz, W.4
  • 29
  • 30
    • 84864863030 scopus 로고    scopus 로고
    • Dual use of superscalar datapath for transient-fault detection and recovery
    • J. Ray, J. Hoe, and B. Falsafi. Dual use of superscalar datapath for transient-fault detection and recovery. In MICRO '01.
    • MICRO '01
    • Ray, J.1    Hoe, J.2    Falsafi, B.3
  • 34
    • 84864837565 scopus 로고    scopus 로고
    • Microprocessor with reduced context switching and overhead and corresponding method
    • United States Patent 6,314,510, November
    • A. Saulsbury and D. Rice. Microprocessor with reduced context switching and overhead and corresponding method. United States Patent 6,314,510, November 2001.
    • (2001)
    • Saulsbury, A.1    Rice, D.2
  • 35
    • 56349149338 scopus 로고    scopus 로고
    • A hardware redundancy and recovery mechanism for reliable scientific computation on graphics processors
    • J. W. Sheaffer, D. P. Luebke, and K. Skadron. A hardware redundancy and recovery mechanism for reliable scientific computation on graphics processors. In EUROGRAPHICS '07
    • EUROGRAPHICS '07
    • Sheaffer, J.W.1    Luebke, D.P.2    Skadron, K.3
  • 36
    • 0024013595 scopus 로고
    • Implementing precise interrupts in pipelined processors
    • May
    • J. E. Smith and A. R. Pleszkun. Implementing precise interrupts in pipelined processors. IEEE Transactions on Computers, 37:562-573, May 1988.
    • (1988) IEEE Transactions on Computers , vol.37 , pp. 562-573
    • Smith, J.E.1    Pleszkun, A.R.2
  • 37
    • 0029251909 scopus 로고
    • Fast context switches: Compiler and architectural support for preemptive scheduling
    • J. S. Snyder, D. B. Whalley, and T. P. Baker. Fast context switches: Compiler and architectural support for preemptive scheduling. Microprocessors and Microsystems, 19(1):35-42, 1995.
    • (1995) Microprocessors and Microsystems , vol.19 , Issue.1 , pp. 35-42
    • Snyder, J.S.1    Whalley, D.B.2    Baker, T.P.3
  • 38
    • 85084722295 scopus 로고    scopus 로고
    • Instruction issue logic for high-performance, interruptable pipelined processors
    • G. S. Sohi and S. Vajapeyam. Instruction issue logic for high-performance, interruptable pipelined processors. In ISCA '87.
    • ISCA '87
    • Sohi, G.S.1    Vajapeyam, S.2
  • 39
    • 0027271463 scopus 로고
    • Interrupt handling for out-of-order execution processors
    • H. Torng and M. Day. Interrupt handling for out-of-order execution processors. Computers, IEEE Transactions on, 42(1), 1993.
    • (1993) Computers, IEEE Transactions on , vol.42 , Issue.1
    • Torng, H.1    Day, M.2
  • 41
    • 0030129806 scopus 로고    scopus 로고
    • The MIPS R10000 superscalar microprocessor
    • K. C. Yeager. The MIPS R10000 superscalar microprocessor. IEEE Micro, 16(2):28-40, 1996.
    • (1996) IEEE Micro , vol.16 , Issue.2 , pp. 28-40
    • Yeager, K.C.1
  • 42
    • 84864863034 scopus 로고    scopus 로고
    • Two-level adaptive training branch prediction
    • T.-Y. Yeh and Y. N. Patt. Two-level adaptive training branch prediction. In MICRO '91.
    • MICRO '91
    • Yeh, T.-Y.1    Patt, Y.N.2
  • 43
    • 85165845488 scopus 로고    scopus 로고
    • Rapid and low-cost context-switch through embedded processor customization for real-time and control applications
    • X. Zhou and P. Petrov. Rapid and low-cost context-switch through embedded processor customization for real-time and control applications. In DAC '06.
    • DAC '06
    • Zhou, X.1    Petrov, P.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.