메뉴 건너뛰기




Volumn 12, Issue 4, 2015, Pages

FluidCheck: A redundant threading-based approach for reliable execution in manycore processors

Author keywords

Checker architectures; Redundant multithreading; Reliability

Indexed keywords

COMPLEX NETWORKS; DISTRIBUTED COMPUTER SYSTEMS; ENERGY EFFICIENCY; FAULT TOLERANT COMPUTER SYSTEMS; MICROPROCESSOR CHIPS; MULTITASKING; NETWORK ARCHITECTURE; PROGRAM PROCESSORS; RADIATION HARDENING; RECONFIGURABLE HARDWARE; RELIABILITY; VLSI CIRCUITS;

EID: 84954315984     PISSN: 15443566     EISSN: 15443973     Source Type: Journal    
DOI: 10.1145/2842620     Document Type: Article
Times cited : (3)

References (36)
  • 1
  • 2
    • 0033321638 scopus 로고    scopus 로고
    • DIVA: A reliable substrate for deep submicron microarchitecture design
    • T. M. Austin. 1999. DIVA: A reliable substrate for deep submicron microarchitecture design. In Micro.
    • (1999) Micro
    • Austin, T.M.1
  • 5
    • 33847095957 scopus 로고    scopus 로고
    • Compatible phase co-scheduling on a CMP of multi-threaded processors
    • A. El-Moursy, R. Garg, D. H. Albonesi, and S. Dwarkadas. 2006. Compatible phase co-scheduling on a CMP of multi-threaded processors. In IPDPS.
    • (2006) IPDPS
    • El-Moursy, A.1    Garg, R.2    Albonesi, D.H.3    Dwarkadas, S.4
  • 6
    • 84887462279 scopus 로고    scopus 로고
    • L1-bandwidth aware thread allocation in multicore SMT processors
    • J. Feliu, J. Sahuquillo, S. Petit, and J. Duato. 2013. L1-bandwidth aware thread allocation in multicore SMT processors. In PACT.
    • (2013) PACT
    • Feliu, J.1    Sahuquillo, J.2    Petit, S.3    Duato, J.4
  • 7
    • 27544468225 scopus 로고    scopus 로고
    • Opportunistic transient-fault detection
    • (May 2005)
    • M. A. Gomaa and T.N. Vijaykumar. 2005. Opportunistic transient-fault detection. SIGARCH Comput. Archit. News 33, 2 (May 2005), 172-183.
    • (2005) SIGARCH Comput. Archit. News , vol.33 , Issue.2 , pp. 172-183
    • Gomaa, M.A.1    Vijaykumar, T.N.2
  • 8
    • 79961056148 scopus 로고    scopus 로고
    • Scaling with design constraints: Predicting the future of big chips
    • (2011)
    • W. Huang, K. Rajamani, M. R. Stan, and K. Skadron. 2011. Scaling with design constraints: Predicting the future of big chips. Micro 31, 4 (2011), 16-29.
    • (2011) Micro , vol.31 , Issue.4 , pp. 16-29
    • Huang, W.1    Rajamani, K.2    Stan, M.R.3    Skadron, K.4
  • 9
    • 84946688764 scopus 로고    scopus 로고
    • An evaluation of lazy fault detection based on adaptive redundant multithreading
    • S. Hukerikar, K. Teranishi, P. C. Diniz, and R. F. Lucas. 2014. An evaluation of lazy fault detection based on adaptive redundant multithreading. In HPEC.
    • (2014) HPEC
    • Hukerikar, S.1    Teranishi, K.2    Diniz, P.C.3    Lucas, R.F.4
  • 10
    • 84876580223 scopus 로고    scopus 로고
    • Warped-DMR: Light-weight error detection for GPGPU
    • H. Jeon and M. Annavaram. 2012. Warped-DMR: Light-weight error detection for GPGPU. In MICRO.
    • (2012) MICRO
    • Jeon, H.1    Annavaram, M.2
  • 11
    • 70350060187 scopus 로고    scopus 로고
    • ORION 2.0: A fast and accurate NoC power and area model for early-stage design space exploration
    • A. B. Kahng, B. Li, L. Peh, and K. Samadi. 2009. ORION 2.0: A fast and accurate NoC power and area model for early-stage design space exploration. In DATE.
    • (2009) DATE
    • Kahng, A.B.1    Li, B.2    Peh, L.3    Samadi, K.4
  • 12
    • 84885228906 scopus 로고    scopus 로고
    • A survey of checker architectures
    • (Aug. 2013)
    • R. Kalayappan and S. R. Sarangi. 2013. A survey of checker architectures. Comput. Surveys 45, 4, Article 48 (Aug. 2013).
    • (2013) Comput. Surveys , vol.45 , pp. 4
    • Kalayappan, R.1    Sarangi, S.R.2
  • 13
    • 57749177456 scopus 로고    scopus 로고
    • Speculative instruction validation for performance-reliability trade-off
    • S. Kumar and A. Aggarwal. 2008. Speculative instruction validation for performance-reliability trade-off. In HPCA.
    • (2008) HPCA
    • Kumar, S.1    Aggarwal, A.2
  • 14
    • 76749146060 scopus 로고    scopus 로고
    • McPAT: An integrated power, area, and timing modeling framework for multicore and manycore architectures
    • S. Li, J. Ahn, R. D. Strong, J. B. Brockman, D. M. Tullsen, and N. P. Jouppi. 2009. McPAT: An integrated power, area, and timing modeling framework for multicore and manycore architectures. In Micro.
    • (2009) Micro
    • Li, S.1    Ahn, J.2    Strong, R.D.3    Brockman, J.B.4    Tullsen, D.M.5    Jouppi, N.P.6
  • 15
  • 16
    • 36049000932 scopus 로고    scopus 로고
    • Using register lifetime predictions to protect register files against soft errors
    • P. Montesinos, W. Liu, and J. Torrellas. 2007. Using register lifetime predictions to protect register files against soft errors. In DSN.
    • (2007) DSN
    • Montesinos, P.1    Liu, W.2    Torrellas, J.3
  • 17
    • 0036287327 scopus 로고    scopus 로고
    • Detailed design and evaluation of redundant multithreading alternatives
    • (May 2002)
    • S. S. Mukherjee, M. Kontz, and S. K. Reinhardt. 2002. Detailed design and evaluation of redundant multithreading alternatives. SIGARCH Comput. Archit. News 30, 2 (May 2002), 99-110.
    • (2002) SIGARCH Comput. Archit. News , vol.30 , Issue.2 , pp. 99-110
    • Mukherjee, S.S.1    Kontz, M.2    Reinhardt, S.K.3
  • 19
    • 21644454187 scopus 로고    scopus 로고
    • Pinpointing representative portions of large intel® itanium® programs with dynamic instrumentation
    • H. Patil, R. Cohn, M. Charney, R. Kapoor, A. Sun, and A. Karunanidhi. 2004. Pinpointing representative portions of large intel® Itanium® programs with dynamic instrumentation. In Micro.
    • (2004) Micro
    • Patil, H.1    Cohn, R.2    Charney, M.3    Kapoor, R.4    Sun, A.5    Karunanidhi, A.6
  • 20
    • 0036290620 scopus 로고    scopus 로고
    • ReVive: Cost-effective architectural support for rollback recovery in shared-memory multiprocessors
    • M. Prvulovic, Z. Zhang, and J. Torrellas. 2002. ReVive: Cost-effective architectural support for rollback recovery in shared-memory multiprocessors. In ISCA.
    • (2002) ISCA
    • Prvulovic, M.1    Zhang, Z.2    Torrellas, J.3
  • 21
    • 57749181987 scopus 로고    scopus 로고
    • Supporting highly-decoupled thread-level redundancy for parallel programs
    • M. W. Rashid and M. C. Huang. 2008. Supporting highly-decoupled thread-level redundancy for parallel programs. In HPCA.
    • (2008) HPCA
    • Rashid, M.W.1    Huang, M.C.2
  • 22
    • 33746693677 scopus 로고    scopus 로고
    • Exploiting coarse-grain verification parallelism for power-efficient fault tolerance
    • (2005)
    • M. W. Rashid, E. J. Tan, M. C. Huang, and D. H. Albonesi. 2005. Exploiting coarse-grain verification parallelism for power-efficient fault tolerance. PACT (2005).
    • (2005) PACT
    • Rashid, M.W.1    Tan, E.J.2    Huang, M.C.3    Albonesi, D.H.4
  • 23
    • 0035691556 scopus 로고    scopus 로고
    • Dual use of superscalar datapath for transient-fault detection and recovery
    • J. Ray, J. C. Hoe, and B. Falsafi. 2001. Dual use of superscalar datapath for transient-fault detection and recovery. In Micro.
    • (2001) Micro
    • Ray, J.1    Hoe, J.C.2    Falsafi, B.3
  • 24
    • 0033726332 scopus 로고    scopus 로고
    • Transient fault detection via simultaneous multithreading
    • S. K. Reinhardt and S. S. Mukherjee. 2000. Transient fault detection via simultaneous multithreading. In ISCA.
    • (2000) ISCA
    • Reinhardt, S.K.1    Mukherjee, S.S.2
  • 27
    • 10444263677 scopus 로고    scopus 로고
    • Architectural support for enhanced SMT job scheduling
    • A. Settle, J. Kihm, A. Janiszewski, and D. Connors. 2004. Architectural support for enhanced SMT job scheduling. In PACT.
    • (2004) PACT
    • Settle, A.1    Kihm, J.2    Janiszewski, A.3    Connors, D.4
  • 28
    • 0033314330 scopus 로고    scopus 로고
    • IBM S/390 parallel enterprise server G5 fault tolerance: A historical perspective
    • (1999)
    • L. Spainhower and T. A. Gregg. 1999. IBM S/390 parallel enterprise server G5 fault tolerance: A historical perspective. IBM J. Res. Dev. 43, 5.6 (1999), 863-873.
    • (1999) IBM J. Res. Dev. , vol.43 , Issue.5-6 , pp. 863-873
    • Spainhower, L.1    Gregg, T.A.2
  • 29
    • 77953101372 scopus 로고    scopus 로고
    • Multiplexed redundant execution: A technique for efficient fault tolerance in chip multiprocessors
    • P. Subramanyan, V. Singh, K. K. Saluja, and E. Larsson. 2010. Multiplexed redundant execution: A technique for efficient fault tolerance in chip multiprocessors. In DATE.
    • (2010) DATE
    • Subramanyan, P.1    Singh, V.2    Saluja, K.K.3    Larsson, E.4
  • 30
    • 0034441012 scopus 로고    scopus 로고
    • Slipstream processors: Improving both performance and fault tolerance
    • K. Sundaramoorthy, Z. Purser, and E. Rotenburg. 2000. Slipstream processors: Improving both performance and fault tolerance. In ASPLOS.
    • (2000) ASPLOS
    • Sundaramoorthy, K.1    Purser, Z.2    Rotenburg, E.3
  • 32
    • 0029666641 scopus 로고    scopus 로고
    • Exploiting choice: Instruction fetch and issue on an implementable simultaneous multithreading processor
    • (May 1996)
    • D. M. Tullsen, S. J. Eggers, J. S. Emer, H. M. Levy, J. L. Lo, and R. L. Stamm. 1996. Exploiting choice: Instruction fetch and issue on an implementable simultaneous multithreading processor. In ACM SIGARCH Comput. Archit. News 24, 2 (May 1996), 191-202.
    • (1996) ACM SIGARCH Comput. Archit. News , vol.24 , Issue.2 , pp. 191-202
    • Tullsen, D.M.1    Eggers, S.J.2    Emer, J.S.3    Levy, H.M.4    Lo, J.L.5    Stamm, R.L.6
  • 33
    • 0029200683 scopus 로고
    • Simultaneous multithreading: Maximizing on-chip parallelism
    • D. M. Tullsen, S. J. Eggers, and H. M. Levy. 1995. Simultaneous multithreading: Maximizing on-chip parallelism. In ISCA.
    • (1995) ISCA
    • Tullsen, D.M.1    Eggers, S.J.2    Levy, H.M.3
  • 34
    • 74049119339 scopus 로고    scopus 로고
    • Selective replication: A lightweight technique for soft errors
    • (Dec. 2009)
    • X. Vera, J. Abella, J. Carretero, and A. González. 2009. Selective replication: A lightweight technique for soft errors. ACM Trans. Comput. Syst. 27, 4, Article 8 (Dec. 2009).
    • (2009) ACM Trans. Comput. Syst. , vol.27 , Issue.4
    • Vera, X.1    Abella, J.2    Carretero, J.3    González, A.4
  • 35
    • 84905504258 scopus 로고    scopus 로고
    • Real-world design and evaluation of compiler-managed GPU redundant multithreading
    • J. Wadden, A. Lyashevsky, S. Gurumurthi, V. Sridharan, and K. Skadron. 2014. Real-world design and evaluation of compiler-managed GPU redundant multithreading. In ISCA.
    • (2014) ISCA
    • Wadden, J.1    Lyashevsky, A.2    Gurumurthi, S.3    Sridharan, V.4    Skadron, K.5
  • 36
    • 33947313321 scopus 로고    scopus 로고
    • A case for fault tolerance and performance enhancement using chip multi-processors
    • (Jan. 2006)
    • Huiyang Zhou. 2006. A case for fault tolerance and performance enhancement using chip multi-processors. IEEE Comput. Archit. Lett. 5, 1 (Jan. 2006), 22-25.
    • (2006) IEEE Comput. Archit. Lett. , vol.5 , Issue.1 , pp. 22-25
    • Zhou, H.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.