메뉴 건너뛰기




Volumn 16, Issue 2, 2016, Pages

Compiler-directed soft error detection and recovery to avoid DUE and SDC via tail-DMR

Author keywords

Acoustic wave detectors; Compilers; Idempotent processing; Soft error resilience; Tail DMR frontier

Indexed keywords

ACOUSTIC WAVES; ACOUSTICS; ERROR CORRECTION; ERRORS; PROGRAM COMPILERS; PROGRAM PROCESSORS; RADIATION HARDENING; RECOVERY;

EID: 85008950198     PISSN: 15399087     EISSN: 15583465     Source Type: Journal    
DOI: 10.1145/2930667     Document Type: Article
Times cited : (30)

References (68)
  • 1
    • 85008878660 scopus 로고    scopus 로고
    • ARM. 2015. Cortex-A57 Technique Reference Manual. Retrieved from http://infocenter.arm.com/help/index.jsp?topic=/com.arm.doc.ddi0488g/index.html.
    • (2015) Cortex-A57 Technique Reference Manual
  • 6
    • 84898026203 scopus 로고    scopus 로고
    • Containment domains: A scalable, efficient and flexible resilience scheme for exascale systems
    • 2013
    • Jinsuk Chung, Ikhwan Lee, Michael Sullivan, Jee Ho Ryoo, Dong Wan Kim, Doe Hyun Yoon, Larry Kaplan, and Mattan Erez. 2013. Containment domains: A scalable, efficient and flexible resilience scheme for exascale systems. Scientific Programming 21, 3-4 (2013), 197-212.
    • (2013) Scientific Programming , vol.21 , Issue.3-4 , pp. 197-212
    • Chung, J.1    Lee, I.2    Sullivan, M.3    Ryoo, J.H.4    Kim, D.W.5    Yoon, D.H.6    Kaplan, L.7    Erez, M.8
  • 9
    • 84876946033 scopus 로고    scopus 로고
    • Idempotent code generation: Implementation, analysis, and evaluation
    • IEEE Computer Society
    • Marc de Kruijf and Karthikeyan Sankaralingam. 2013. Idempotent code generation: Implementation, analysis, and evaluation. In CGO. IEEE Computer Society, 1-12.
    • (2013) CGO , pp. 1-12
    • De Kruijf, M.1    Sankaralingam, K.2
  • 11
    • 77949759608 scopus 로고    scopus 로고
    • Shoestring: Probabilistic soft error reliability on the cheap
    • 2010
    • Shuguang Feng, Shantanu Gupta, Amin Ansari, and Scott Mahlke. 2010. Shoestring: Probabilistic soft error reliability on the cheap. ACM SIGARCH Computer Architecture News 38 (2010), 385-396.
    • (2010) ACM SIGARCH Computer Architecture News , vol.38 , pp. 385-396
    • Feng, S.1    Gupta, S.2    Ansari, A.3    Mahlke, S.4
  • 12
    • 84863372488 scopus 로고    scopus 로고
    • Encore: Lowcost, fine-grained transient fault recovery
    • Shuguang Feng, Shantanu Gupta, Amin Ansari, Scott A. Mahlke, and David I. August. 2011. Encore: Lowcost, fine-grained transient fault recovery. In MICRO'11. 398-409.
    • (2011) MICRO'11 , pp. 398-409
    • Feng, S.1    Gupta, S.2    Ansari, A.3    Mahlke, S.A.4    August, D.I.5
  • 16
    • 84858759524 scopus 로고    scopus 로고
    • Relyzer: Exploiting application-level fault equivalence to analyze application resiliency to transient faults
    • 2012
    • Siva Kumar Sastry Hari, Sarita V. Adve, Helia Naeimi, and Pradeep Ramachandran. 2012b. Relyzer: Exploiting application-level fault equivalence to analyze application resiliency to transient faults. ACM SIGPLAN Notices 47 (2012), 123-134.
    • (2012) ACM SIGPLAN Notices , vol.47 , pp. 123-134
    • Hari, S.K.S.1    Adve, S.V.2    Naeimi, H.3    Ramachandran, P.4
  • 18
    • 84930653090 scopus 로고    scopus 로고
    • UnSync-CMP: MulticoreCMParchitecture for energy efficient soft error reliability
    • January 2014
    • Reiley Jeyapaul, Abhishek Risheekesan, Aviral Shrivastava, and Kyoungwoo Lee. 2014. UnSync-CMP: MulticoreCMParchitecture for energy efficient soft error reliability. Transactions on Parallel and Distributed Systems 25, 1 (January 2014), 254-263.
    • (2014) Transactions on Parallel and Distributed Systems , vol.25 , Issue.1 , pp. 254-263
    • Jeyapaul, R.1    Risheekesan, A.2    Shrivastava, A.3    Lee, K.4
  • 22
    • 84986545349 scopus 로고    scopus 로고
    • Adaptive execution method for multithreaded processor-based parallel system
    • US Patent
    • Chang Hee Jung, Dae Seob Lim, Jae Jin Lee, and Sang Yong Han. 2009. Adaptive execution method for multithreaded processor-based parallel system. US Patent No. 7,526,637.
    • (2009)
    • Jung, C.H.1    Lim, D.S.2    Lee, J.J.3    Han, S.Y.4
  • 26
    • 84864144734 scopus 로고    scopus 로고
    • Efficient soft error protection for commodity embedded microprocessors using profile information
    • 2012
    • Daya Shanker Khudia, Griffin Wright, and Scott Mahlke. 2012. Efficient soft error protection for commodity embedded microprocessors using profile information. ACM SIGPLAN Notices 47 (2012), 99-108.
    • (2012) ACM SIGPLAN Notices , vol.47 , pp. 99-108
    • Khudia, D.S.1    Wright, G.2    Mahlke, S.3
  • 31
  • 33
    • 84889094390 scopus 로고    scopus 로고
    • Epipe: A low-cost faulttolerance technique considering WCET constraints
    • November 2013
    • Jianli Li, Jingling Xue, Xinwei Xie, QingWan, Qingping Tan, and Lanfang Tan. 2013. Epipe: A low-cost faulttolerance technique considering WCET constraints. Journal of System Architecture 59, 10 (November 2013), 1383-1393. DOI:http://dx.doi.org/10.1016/j.sysarc.2013.06.003
    • (2013) Journal of System Architecture , vol.59 , Issue.10 , pp. 1383-1393
    • Li, J.1    Xue, J.2    Xie, X.3    Wan, Q.4    Tan, Q.5    Tan, L.6
  • 41
    • 4544296705 scopus 로고
    • The use of triple-modular redundancy to improve computer reliability
    • 1962
    • Robert E. Lyons and Wouter Vanderkulk. 1962. The use of triple-modular redundancy to improve computer reliability. IBM Journal of Research and Development 6, 2 (1962), 200-209.
    • (1962) IBM Journal of Research and Development , vol.6 , Issue.2 , pp. 200-209
    • Lyons, R.E.1    Vanderkulk, W.2
  • 47
    • 84903145549 scopus 로고    scopus 로고
    • DTune: Leveraging reliable code generation for adaptive dependability tuning under process variation and aging-induced effects
    • ACM, New York, NY, Article 84
    • Semeen Rehman, Florian Kriebel, Duo Sun, Muhammad Shafique, and Jörg Henkel. 2014b. dTune: Leveraging reliable code generation for adaptive dependability tuning under process variation and aging-induced effects. In Proceedings of the 51st Annual Design Automation Conference (DAC'14). ACM, New York, NY, Article 84, 6 pages. DOI:http://dx.doi.org/10.1145/2593069.2593127
    • (2014) Proceedings of the 51st Annual Design Automation Conference (DAC'14) , pp. 6
    • Rehman, S.1    Kriebel, F.2    Sun, D.3    Shafique, M.4    Henkel, J.5
  • 48
    • 81355132234 scopus 로고    scopus 로고
    • Reliable software for unreliable hardware: Embedded code generation aiming at reliability
    • Robert P. Dick and Jan Madsen (Eds.) ACM
    • Semeen Rehman, Muhammad Shafique, Florian Kriebel, and Jrg Henkel. 2011. Reliable software for unreliable hardware: Embedded code generation aiming at reliability. In CODES+ISSS, Robert P. Dick and Jan Madsen (Eds.). ACM, 237-246.
    • (2011) CODES+ISSS , pp. 237-246
    • Rehman, S.1    Shafique, M.2    Kriebel, F.3    Henkel, J.4
  • 49
    • 34249775197 scopus 로고    scopus 로고
    • Automatic instruction-level software-only recovery
    • 2007
    • George A. Reis, Jonathan Chang, and David I. August. 2007. Automatic instruction-level software-only recovery. IEEE Micro 27, 1 (2007), 36-47.
    • (2007) IEEE Micro , vol.27 , Issue.1 , pp. 36-47
    • Reis, G.A.1    Chang, J.2    August, D.I.3
  • 53
    • 34147197380 scopus 로고    scopus 로고
    • An experimental study of soft errors in microprocessors
    • 2005
    • Giacinto Paolo Saggese, Nicholas J. Wang, Zbigniew Kalbarczyk, Sanjay J. Patel, and Ravishankar K. Iyer. 2005. An experimental study of soft errors in microprocessors. IEEE Micro 25, 6 (2005), 30-39.
    • (2005) IEEE Micro , vol.25 , Issue.6 , pp. 30-39
    • Saggese, G.P.1    Wang, N.J.2    Kalbarczyk, Z.3    Patel, S.J.4    Iyer, R.K.5
  • 55
    • 84879061413 scopus 로고    scopus 로고
    • Relyzer: Application resiliency analyzer for transient faults
    • 2013
    • Siva Kumar Sastry Hari, Sarita V. Adve, Helia Naeimi, and Prasadh Ramachandran. 2013. Relyzer: Application resiliency analyzer for transient faults. IEEE Micro 33, 3 (2013), 58-66.
    • (2013) IEEE Micro , vol.33 , Issue.3 , pp. 58-66
    • Hari, S.K.S.1    Adve, S.V.2    Naeimi, H.3    Ramachandran, P.4
  • 58
    • 84879876348 scopus 로고    scopus 로고
    • Exploiting programlevel masking and error propagation for constrained reliability optimization
    • ACM, New York, NY, Article 17
    • Muhammad Shafique, Semeen Rehman, Pau Vilimelis Aceituno, and JörgHenkel. 2013. Exploiting programlevel masking and error propagation for constrained reliability optimization. In Proceedings of the 50th Annual Design Automation Conference (DAC'13). ACM, New York, NY, Article 17, 9 pages. DOI:http://dx.doi.org/10.1145/2463209.2488755
    • (2013) Proceedings of the 50th Annual Design Automation Conference (DAC'13) , pp. 9
    • Shafique, M.1    Rehman, S.2    Aceituno, P.V.3    Henkel, J.4
  • 59
    • 84863554397 scopus 로고    scopus 로고
    • Is dark silicon useful? Harnessing the four horsemen of the coming dark silicon Apocalypse
    • Michael B. Taylor. 2012. Is dark silicon useful? Harnessing the four horsemen of the coming dark silicon Apocalypse. In Proceedings of the 49th Annual Design Automation Conference (DAC'12). 1131-1136.
    • (2012) Proceedings of the 49th Annual Design Automation Conference (DAC'12) , pp. 1131-1136
    • Taylor, M.B.1
  • 60
    • 84864862268 scopus 로고    scopus 로고
    • Setting an error detection infrastructure with low cost acoustic wave detectors
    • Gaurang Upasani, Xavier Vera, and Antonio Gonzalez. 2012. Setting an error detection infrastructure with low cost acoustic wave detectors. In ISCA. 333-343.
    • (2012) ISCA , pp. 333-343
    • Upasani, G.1    Vera, X.2    Gonzalez, A.3
  • 61
    • 84885237368 scopus 로고    scopus 로고
    • Reducing DUE-FIT of caches by exploiting acoustic wave detectors for error recovery
    • Gaurang Upasani, Xavier Vera, and Antonio Gonzalez. 2013. Reducing DUE-FIT of caches by exploiting acoustic wave detectors for error recovery. In IOLTS. 85-91.
    • (2013) IOLTS , pp. 85-91
    • Upasani, G.1    Vera, X.2    Gonzalez, A.3
  • 62
    • 84905453254 scopus 로고    scopus 로고
    • Avoiding core's DUE & SDC via acoustic wave detectors and tailored error containment and recovery
    • Gaurang Upasani, Xavier Vera, and Antonio Gonzalez. 2014a. Avoiding core's DUE & SDC via acoustic wave detectors and tailored error containment and recovery. In ISCA. 37-48.
    • (2014) ISCA , pp. 37-48
    • Upasani, G.1    Vera, X.2    Gonzalez, A.3
  • 64
    • 84961727717 scopus 로고    scopus 로고
    • A case for acousticwave detectors for soft-errors
    • 2016
    • GaurangUpasani, Xavier Vera, and Antonio Gonzalez. 2016. A case for acousticwave detectors for soft-errors. IEEE Transactions on Computing 65, 1 (2016), 5-18.
    • (2016) IEEE Transactions on Computing , vol.65 , Issue.1 , pp. 5-18
    • Xavier Vera, G.1    Gonzalez, A.2
  • 65
    • 84886455941 scopus 로고    scopus 로고
    • Implications of the power wall: Dim cores and reconfigurable logic
    • 2013
    • Liang Wang and Kevin Skadron. 2013. Implications of the power wall: Dim cores and reconfigurable logic. IEEE Micro 33, 5 (2013), 40-48.
    • (2013) IEEE Micro , vol.33 , Issue.5 , pp. 40-48
    • Wang, L.1    Skadron, K.2
  • 66
    • 33748113790 scopus 로고    scopus 로고
    • ReStore: Symptom-based soft error detection in microprocessors
    • 2006
    • Nicholas J.Wang and Sanjay J. Patel. 2006. ReStore: Symptom-based soft error detection in microprocessors. IEEE Transactions on Dependable and Secure Computing 3, 3 (2006), 188-201.
    • (2006) IEEE Transactions on Dependable and Secure Computing , vol.3 , Issue.3 , pp. 188-201
    • Wang, N.J.1    Patel, S.J.2
  • 67
    • 77949732979 scopus 로고    scopus 로고
    • Virtualized and flexible ECC for main memory
    • 2010
    • Doe Hyun Yoon and Mattan Erez. 2010. Virtualized and flexible ECC for main memory. ACM SIGARCH Computer Architecture News 38 (2010), 397-408.
    • (2010) ACM SIGARCH Computer Architecture News , vol.38 , pp. 397-408
    • Yoon, D.H.1    Erez, M.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.