메뉴 건너뛰기




Volumn , Issue , 2008, Pages 393-404

Supporting highly-decoupled thread-level redundancy for parallel programs

Author keywords

[No Author keywords available]

Indexed keywords

ARCHITECTURAL SUPPORTS; CIRCUIT ERRORS; CRITICAL CHARGES; CRITICAL PATHS; DESIGN GOALS; DESIGN ISSUES; DEVICE DIMENSIONS; MEMORY SUB SYSTEMS; MICROARCHITECTURE; NOISE TOLERANCES; ON THE FLIES; OPERATING VOLTAGES; PARALLEL CODES; PARALLEL EXECUTIONS; PARALLEL PROGRAMS; PERFORMANCE DEGRADATIONS; PROGRAM EXECUTIONS; THROUGHPUT LOSSES;

EID: 57749181987     PISSN: 15300897     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/HPCA.2008.4658655     Document Type: Conference Paper
Times cited : (30)

References (62)
  • 1
    • 0030083764 scopus 로고    scopus 로고
    • TreadMarks: Shared Memory Computing on Networks of Workstations
    • Feb
    • C. Amza et al. TreadMarks: Shared Memory Computing on Networks of Workstations. IEEE Computer, Vol. 29(2):18-28, Feb. 1996.
    • (1996) IEEE Computer , vol.29 , Issue.2 , pp. 18-28
    • Amza, C.1
  • 2
    • 10744221866 scopus 로고    scopus 로고
    • A 1.3-GHz Fifth-Generation SPARC64 Microprocessor
    • Nov
    • H. Ando et al. A 1.3-GHz Fifth-Generation SPARC64 Microprocessor. IEEE J. Solid-State Circuits, Vol. 38(11):1896-1905, Nov. 2003.
    • (2003) IEEE J. Solid-State Circuits , vol.38 , Issue.11 , pp. 1896-1905
    • Ando, H.1
  • 3
    • 0033321638 scopus 로고    scopus 로고
    • DIVA: A Reliable Substrate for Deep Submicron Microarchitecture Design
    • Nov
    • T. Austin. DIVA: A Reliable Substrate for Deep Submicron Microarchitecture Design. In Int. Symp. Microarchitecture. Nov. 1999.
    • (1999) Int. Symp. Microarchitecture
    • Austin, T.1
  • 5
    • 0023963250 scopus 로고
    • Sequoia: A Fault-tolerant Tightly Coupled Multiprocessor for Transaction Processing
    • Feb
    • P. Bernstein. Sequoia: A Fault-tolerant Tightly Coupled Multiprocessor for Transaction Processing. IEEE Computer, Vol. 21(2):37-45, Feb. 1988.
    • (1988) IEEE Computer , vol.21 , Issue.2 , pp. 37-45
    • Bernstein, P.1
  • 7
    • 57749190640 scopus 로고    scopus 로고
    • D. Burger and T. Austin. The SimpleScalar Tool Set, Version 2.0. Technical report 1342, Computer Sciences Department, University of Wisconsin-Madison, Jun. 1997
    • D. Burger and T. Austin. The SimpleScalar Tool Set, Version 2.0. Technical report 1342, Computer Sciences Department, University of Wisconsin-Madison, Jun. 1997.
  • 8
    • 29344433137 scopus 로고    scopus 로고
    • Fault Tolerant 32-bit RISC Processor: Implementation and Radiation test Results
    • Apr
    • D. Chardonnereau et al. Fault Tolerant 32-bit RISC Processor: Implementation and Radiation test Results. In Single-Event Effects Symposium. Apr. 2002.
    • (2002) Single-Event Effects Symposium
    • Chardonnereau, D.1
  • 14
    • 0028273364 scopus 로고
    • Parallelization of General Linkage Analysis Problems
    • S. Dwarkadas et al. Parallelization of General Linkage Analysis Problems. Human Heredity, Vol. 44:127-141, 1994.
    • (1994) Human Heredity , vol.44 , pp. 127-141
    • Dwarkadas, S.1
  • 15
    • 29344440163 scopus 로고    scopus 로고
    • An Efficient BICS Design for SEUs Detection and Correction in Semiconductor Memories
    • Mar
    • B. Gill et al. An Efficient BICS Design for SEUs Detection and Correction in Semiconductor Memories. In Design, Automation and Test in Europe. Mar. 2005.
    • (2005) Design, Automation and Test in Europe
    • Gill, B.1
  • 16
    • 34147172769 scopus 로고    scopus 로고
    • TRUSS: A Reliable, Scalable Server Architecture
    • Nov./Dec
    • B. Gold et al. TRUSS: A Reliable, Scalable Server Architecture. IEEE Micro, Vol. 25(6):51-59, Nov./Dec. 2005.
    • (2005) IEEE Micro , vol.25 , Issue.6 , pp. 51-59
    • Gold, B.1
  • 17
    • 29244467743 scopus 로고    scopus 로고
    • Transient-Fault Recovery for Chip Multiprocessors
    • Jun
    • M. Gomaa et al. Transient-Fault Recovery for Chip Multiprocessors. In Int. Symp. Computer Architecture. Jun. 2003.
    • (2003) Int. Symp. Computer Architecture
    • Gomaa, M.1
  • 18
    • 0034789870 scopus 로고    scopus 로고
    • Impact of CMOS Process Scaling and SOI on the Soft Error Rates of Logic Processes
    • Jun
    • S. Hareland et al. Impact of CMOS Process Scaling and SOI on the Soft Error Rates of Logic Processes. In IEEE Symp. VLSI Technology. Jun. 2001.
    • (2001) IEEE Symp. VLSI Technology
    • Hareland, S.1
  • 19
    • 4444365711 scopus 로고    scopus 로고
    • Measurements and Analysis of SER-Tolerant Latch in a 90-nm Dual-Vt CMOS Process
    • Sep
    • P. Hazucha et al. Measurements and Analysis of SER-Tolerant Latch in a 90-nm Dual-Vt CMOS Process. IEEE J. Solid-State Circuits, Vol. 39(9):1536-1543, Sep. 2004.
    • (2004) IEEE J. Solid-State Circuits , vol.39 , Issue.9 , pp. 1536-1543
    • Hazucha, P.1
  • 20
    • 84944403811 scopus 로고    scopus 로고
    • Single-ISA Heterogeneous Multi-Core Architectures: The Potential for Processor Power Reduction
    • Dec
    • R. Kumar et al. Single-ISA Heterogeneous Multi-Core Architectures: The Potential for Processor Power Reduction. In Int. Symp. Microarchitecture. Dec. 2003.
    • (2003) Int. Symp. Microarchitecture
    • Kumar, R.1
  • 22
    • 0021382125 scopus 로고
    • Design and Evaluation of a Fault-Tolerant Multiprocessor Using Hardware Recovery Blocks
    • Feb
    • Y. Lee and K. Shin. Design and Evaluation of a Fault-Tolerant Multiprocessor Using Hardware Recovery Blocks. IEEE Transactions on Computers, Vol. 33(2):113-124, Feb. 1984.
    • (1984) IEEE Transactions on Computers , vol.33 , Issue.2 , pp. 113-124
    • Lee, Y.1    Shin, K.2
  • 23
    • 57749200348 scopus 로고    scopus 로고
    • Designing A Single Board Computer For Space Using The Most Advanced Processor and Mitigation Technologies
    • Sep
    • L. Longden et al. Designing A Single Board Computer For Space Using The Most Advanced Processor and Mitigation Technologies. In European Space Components Conference, ESCCON 2002. Sep. 2002.
    • (2002) European Space Components Conference, ESCCON 2002
    • Longden, L.1
  • 24
    • 57749181168 scopus 로고    scopus 로고
    • Nov. 2000
    • D. Lyons. Sun Screen. Forbes, Nov. 2000. http://www.forbes.com/ global/2000/1113/0323026a.html.
    • Forbes
    • Lyons, D.1    Sun, S.2
  • 25
    • 0036082034 scopus 로고    scopus 로고
    • Soft Error Rate Mitigation Techniques for Modern Microcircuits
    • Apr
    • D. Mavis and P. Eaton. Soft Error Rate Mitigation Techniques for Modern Microcircuits. In Int. Reliability Physics Symp. Apr. 2002.
    • (2002) Int. Reliability Physics Symp
    • Mavis, D.1    Eaton, P.2
  • 26
    • 15044363155 scopus 로고    scopus 로고
    • Robust System Design with Build-in Soft-Error Resilience
    • S. Mitra et al. Robust System Design with Build-in Soft-Error Resilience. IEEE Computer, Vol. 38(2):43-52, 2005.
    • (2005) IEEE Computer , vol.38 , Issue.2 , pp. 43-52
    • Mitra, S.1
  • 29
    • 0003133883 scopus 로고
    • Probabilistic Logics and the Synthesis of Reliable Organisms form Unreliable Components
    • C. Shannon and J. McCarthy, editors, Princeton University Press
    • J. Neumann. Probabilistic Logics and the Synthesis of Reliable Organisms form Unreliable Components. In C. Shannon and J. McCarthy, editors, Automata Studies, pp. 43-98. Princeton University Press, 1956.
    • (1956) Automata Studies , pp. 43-98
    • Neumann, J.1
  • 30
    • 0032684765 scopus 로고    scopus 로고
    • Time Redundancy Based Soft-Error Tolerance to Rescue Nanometer Technologies
    • Apr
    • M. Nicolaidis. Time Redundancy Based Soft-Error Tolerance to Rescue Nanometer Technologies. In IEEE VLSI Test Symp. Apr. 1999.
    • (1999) IEEE VLSI Test Symp
    • Nicolaidis, M.1
  • 31
    • 4644301626 scopus 로고    scopus 로고
    • A Complexity-Effective Approach to ALU Bandwidth Enhancement for Instruction-Level Temporal Redundancy
    • Jun
    • A. Parashar et al. A Complexity-Effective Approach to ALU Bandwidth Enhancement for Instruction-Level Temporal Redundancy. In Int. Symp. Computer Architecture. Jun. 2004.
    • (2004) Int. Symp. Computer Architecture
    • Parashar, A.1
  • 32
    • 0020735124 scopus 로고
    • Concurrent Error Detection in Multiply and Divide Arrays
    • J. Patel and L. Fung. Concurrent Error Detection in Multiply and Divide Arrays. IEEE Transactions on Computers, Vol. 32:417-422, 1983.
    • (1983) IEEE Transactions on Computers , vol.32 , pp. 417-422
    • Patel, J.1    Fung, L.2
  • 34
    • 0036290620 scopus 로고    scopus 로고
    • ReVive: Cost-Effective Architectural Support for Rollback Recovery in Shared-Memory Multiprocessors
    • May
    • M. Prvulovic, Z. Zhang, and J. Torrellas. ReVive: Cost-Effective Architectural Support for Rollback Recovery in Shared-Memory Multiprocessors. In Int. Symp. Computer Architecture. May 2002.
    • (2002) Int. Symp. Computer Architecture
    • Prvulovic, M.1    Zhang, Z.2    Torrellas, J.3
  • 35
    • 33746693677 scopus 로고    scopus 로고
    • Exploiting Coarse-Grain Verification Parallelism for Power-Efficient Fault Tolerance
    • Sep
    • M. Rashid et al. Exploiting Coarse-Grain Verification Parallelism for Power-Efficient Fault Tolerance. In Int. Conf. Parallel Architectures and Compilation Techniques. Sep. 2005.
    • (2005) Int. Conf. Parallel Architectures and Compilation Techniques
    • Rashid, M.1
  • 36
    • 0035691556 scopus 로고    scopus 로고
    • Dual Use of Superscalar Datapath for Transient-Fault Detection and Recovery
    • Dec
    • J. Ray et al. Dual Use of Superscalar Datapath for Transient-Fault Detection and Recovery. In Int. Symp. Microarchitecture. Dec. 2001.
    • (2001) Int. Symp. Microarchitecture
    • Ray, J.1
  • 38
    • 0015765055 scopus 로고
    • STAREX Self-Repair Routines: Software Recovery in the JPL-STAR Computer
    • J. Rohr. STAREX Self-Repair Routines: Software Recovery in the JPL-STAR Computer. In Int. Symp. Fault-Tolerant Computing. 1973.
    • (1973) Int. Symp. Fault-Tolerant Computing
    • Rohr, J.1
  • 39
    • 0032597692 scopus 로고    scopus 로고
    • AR-SMT: A Microarchitectural Approach to Fault Tolerance in Microprocessors
    • Jun
    • E. Rotenberg. AR-SMT: A Microarchitectural Approach to Fault Tolerance in Microprocessors. In Int. Symp. Fault-Tolerant Computing. Jun. 1999.
    • (1999) Int. Symp. Fault-Tolerant Computing
    • Rotenberg, E.1
  • 40
    • 0242636503 scopus 로고    scopus 로고
    • A 1.5-Ghz 130-nm Itanium® 2 Processor With 6-MB On-die L3 Cache
    • Nov
    • S. Rusu et al. A 1.5-Ghz 130-nm Itanium® 2 Processor With 6-MB On-die L3 Cache. IEEE J. Solid-State Circuits, Vol. 38(11):1887-1895, Nov. 2003.
    • (2003) IEEE J. Solid-State Circuits , vol.38 , Issue.11 , pp. 1887-1895
    • Rusu, S.1
  • 41
    • 84856104715 scopus 로고    scopus 로고
    • Historical Trend in Alpha-Particle induced Soft Error Rates of the Alpha™ Microprocessor
    • Apr
    • N. Seifert et al. Historical Trend in Alpha-Particle induced Soft Error Rates of the Alpha™ Microprocessor. In Int. Reliability Physics Symp. Apr. 2001.
    • (2001) Int. Reliability Physics Symp
    • Seifert, N.1
  • 42
    • 0036931372 scopus 로고    scopus 로고
    • Modeling the Effect of Technology Trends on the Soft Error Rate of Combinational Logic
    • Jun
    • P. Shivakuma et al. Modeling the Effect of Technology Trends on the Soft Error Rate of Combinational Logic. In Int. Conf. Dependable Systems and Networks. Jun. 2002.
    • (2002) Int. Conf. Dependable Systems and Networks
    • Shivakuma, P.1
  • 43
    • 25844437046 scopus 로고    scopus 로고
    • POWER5 System Microarchitecture
    • Sep
    • B. Sinharoy et al. POWER5 System Microarchitecture. IBM Journal of Research and Development, Vol. 49(4/5):505-521, Sep. 2005.
    • (2005) IBM Journal of Research and Development , vol.49 , Issue.4-5 , pp. 505-521
    • Sinharoy, B.1
  • 44
    • 0032667728 scopus 로고    scopus 로고
    • IBM's S/390 G5 Microprocessor Design
    • Mar./Apr
    • T. Slegel et al. IBM's S/390 G5 Microprocessor Design. IEEE Micro, Vol. 19(2):12-23, Mar./Apr. 1999.
    • (1999) IEEE Micro , vol.19 , Issue.2 , pp. 12-23
    • Slegel, T.1
  • 45
    • 21644470906 scopus 로고    scopus 로고
    • Efficient Resource Sharing in Concurrent Error Detecting Superscalar Microarchitecture
    • Nov
    • J. Smolens et al. Efficient Resource Sharing in Concurrent Error Detecting Superscalar Microarchitecture. In Int. Symp. Microarchitecture. Nov. 2004.
    • (2004) Int. Symp. Microarchitecture
    • Smolens, J.1
  • 47
    • 40349114890 scopus 로고    scopus 로고
    • Reunion: Complexity-Effective Multicore Redundancy
    • Dec
    • J. Smolens et al. Reunion: Complexity-Effective Multicore Redundancy. In Int. Symp. Microarchitecture. Dec. 2006.
    • (2006) Int. Symp. Microarchitecture
    • Smolens, J.1
  • 48
    • 0036292677 scopus 로고    scopus 로고
    • SafetyNet: Improving the Availability of Shared Memory Multiprocessors with Global Checkpoint/Recovery
    • May
    • D. Sorin et al. SafetyNet: Improving the Availability of Shared Memory Multiprocessors with Global Checkpoint/Recovery. In Int. Symp. Computer Architecture. May 2002.
    • (2002) Int. Symp. Computer Architecture
    • Sorin, D.1
  • 49
    • 0343564640 scopus 로고
    • IBM's ES/9000 Model 982's fault-tolerant design for consolidation
    • Feb
    • L. Spainhower et al. IBM's ES/9000 Model 982's fault-tolerant design for consolidation. IEEE Micro, Vol. 14(3):48-59, Feb. 1994.
    • (1994) IEEE Micro , vol.14 , Issue.3 , pp. 48-59
    • Spainhower, L.1
  • 50
    • 0024925797 scopus 로고
    • Application-transparent Process-level Error Recovery for Multicomputers
    • Jan
    • Y. Tamir and T. Frazier. Application-transparent Process-level Error Recovery for Multicomputers. In Hawaii Int. Conf. System Sciences. Jan. 1989.
    • (1989) Hawaii Int. Conf. System Sciences
    • Tamir, Y.1    Frazier, T.2
  • 51
    • 0024142151 scopus 로고
    • The Implementation and Application of Micro Rollback in Fault-Tolerant VLSI Systems
    • Jun
    • Y. Tamir, M. Tremblay, and D. Rennels. The Implementation and Application of Micro Rollback in Fault-Tolerant VLSI Systems. In Int. Symp. Fault-Tolerant Computing. Jun. 1988.
    • (1988) Int. Symp. Fault-Tolerant Computing
    • Tamir, Y.1    Tremblay, M.2    Rennels, D.3
  • 52
  • 53
    • 0036290674 scopus 로고    scopus 로고
    • Transient-Fault Recovery via Simultaneous Multithreading
    • May
    • T. Vijaykumar et al. Transient-Fault Recovery via Simultaneous Multithreading. In Int. Symp. Computer Architecture. May 2002.
    • (2002) Int. Symp. Computer Architecture
    • Vijaykumar, T.1
  • 54
    • 4544282186 scopus 로고    scopus 로고
    • Characterizing the Effects of Transient Faults on a High-Performance Processor Pipeline
    • Jun
    • N. Wang et al. Characterizing the Effects of Transient Faults on a High-Performance Processor Pipeline. In Int. Conf. Dependable Systems and Networks. Jun. 2004.
    • (2004) Int. Conf. Dependable Systems and Networks
    • Wang, N.1
  • 55
    • 0024013816 scopus 로고
    • Modulo 3 Residue Checker: New Results on Performance and Cost
    • J. Watterson and J. Hallenbeck. Modulo 3 Residue Checker: New Results on Performance and Cost. IEEE Transactions on Computers, Vol. 37(5):608-612, 1988.
    • (1988) IEEE Transactions on Computers , vol.37 , Issue.5 , pp. 608-612
    • Watterson, J.1    Hallenbeck, J.2
  • 56
    • 4644320531 scopus 로고    scopus 로고
    • Techniques to Reduce the Soft Error Rate of a High-Performance Microprocessor
    • Jun
    • C. Weaver et al. Techniques to Reduce the Soft Error Rate of a High-Performance Microprocessor. In Int. Symp. Computer Architecture. Jun. 2004.
    • (2004) Int. Symp. Computer Architecture
    • Weaver, C.1
  • 57
    • 0029179077 scopus 로고
    • The SPLASH-2 Programs: Characterization and Methodological Considerations
    • Jun
    • S. Woo et al. The SPLASH-2 Programs: Characterization and Methodological Considerations. In Int. Symp. Computer Architecture. Jun. 1995.
    • (1995) Int. Symp. Computer Architecture
    • Woo, S.1
  • 58
    • 0025419124 scopus 로고
    • Error Recovery in Shared Memory Multiprocessors Using Private Caches
    • Apr
    • K. Wu, W. Fuchs, and J. Patel. Error Recovery in Shared Memory Multiprocessors Using Private Caches. IEEE Trans. Parallel and Distributed Systems, Vol. 1(2):231-240, Apr. 1990.
    • (1990) IEEE Trans. Parallel and Distributed Systems , vol.1 , Issue.2 , pp. 231-240
    • Wu, K.1    Fuchs, W.2    Patel, J.3
  • 59
    • 0038684791 scopus 로고    scopus 로고
    • A "Flight Data Recorder" for Enabling Full-system Multiprocessor Deterministic Replay
    • Jun
    • M. Xu, R. Bodik, and M. Hill. A "Flight Data Recorder" for Enabling Full-system Multiprocessor Deterministic Replay. In Int. Symp. Computer Architecture. Jun. 2003.
    • (2003) Int. Symp. Computer Architecture
    • Xu, M.1    Bodik, R.2    Hill, M.3
  • 61
    • 0030129806 scopus 로고    scopus 로고
    • The MIPS R10000 Superscalar Microprocessor
    • Apr
    • K. Yeager. The MIPS R10000 Superscalar Microprocessor. IEEE Micro, Vol. 16(2):28-40, Apr. 1996.
    • (1996) IEEE Micro , vol.16 , Issue.2 , pp. 28-40
    • Yeager, K.1
  • 62
    • 0018716817 scopus 로고
    • Effect of Cosmic Rays on Computer Memories
    • Nov
    • J. Ziegler and W. Lanford. Effect of Cosmic Rays on Computer Memories. Science, Vol. 206(16):776-788, Nov. 1979.
    • (1979) Science , vol.206 , Issue.16 , pp. 776-788
    • Ziegler, J.1    Lanford, W.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.