-
3
-
-
84889082400
-
A foundation for the accurate prediction of the soft error vulnerability of scientific applications
-
G. Bronevetsky, B.R. de Supinski, M. Schulz, A foundation for the accurate prediction of the soft error vulnerability of scientific applications, in: SELSE 2009.
-
(2009)
SELSE
-
-
Bronevetsky, G.1
De Supinski, B.R.2
Schulz, M.3
-
4
-
-
0033314330
-
IBM S/390 parallel enterprise server G5 fault tolerance: A historical perspective
-
L. Spainhower, and T.A. Gregg IBM S/390 parallel enterprise server G5 fault tolerance: a historical perspective IBM Journal of Research and Development 43 5 1999 863 873 (Pubitemid 30589652)
-
(1999)
IBM Journal of Research and Development
, vol.43
, Issue.5
, pp. 863-873
-
-
Spainhower, L.1
Gregg, T.A.2
-
5
-
-
27544473955
-
NonStop advanced architecture
-
D. Bernick, B. Bruckert, P.D. Vigna, D. Garcia, R. Jardine, J. Klecka, J. Smullen, NonStop advanced architecture, in: DSN 2005, pp. 12-21.
-
(2005)
DSN
, pp. 12-21
-
-
Bernick, D.1
Bruckert, B.2
Vigna, P.D.3
Garcia, D.4
Jardine, R.5
Klecka, J.6
Smullen, J.7
-
6
-
-
0029763658
-
Triple-triple redundant 777 primary flight computer
-
Y. Yeh, Triple-triple redundant 777 primary flight computer, in: AERO 1996, pp. 293-307.
-
(1996)
AERO
, pp. 293-307
-
-
Yeh, Y.1
-
7
-
-
0033321638
-
DIVA: A reliable substrate for deep submicron microarchitecture design
-
T.M. Austin, DIVA: a reliable substrate for deep submicron microarchitecture design, in: MICRO 1999, pp. 196-207.
-
(1999)
MICRO
, pp. 196-207
-
-
Austin, T.M.1
-
8
-
-
21644436489
-
Microarchitecture and design challenges for gigascale integration
-
S. Borkar, Microarchitecture and design challenges for gigascale integration, in: MICRO 2004.
-
(2004)
MICRO
-
-
Borkar, S.1
-
9
-
-
84860593469
-
The reliability wall for exascale supercomputing
-
X. Yang, Z. Wang, J. Xue, and Y. Zhou The reliability wall for exascale supercomputing IEEE Transactions on Computers 61 6 2012 767 779
-
(2012)
IEEE Transactions on Computers
, vol.61
, Issue.6
, pp. 767-779
-
-
Yang, X.1
Wang, Z.2
Xue, J.3
Zhou, Y.4
-
10
-
-
84876400446
-
What is system hang and how to handle it
-
Y. Zhu, Y. Li, J. Xue, T. Tan, J. Shi, Y. Shen, C. Ma, What is system hang and how to handle it, in: ISSRE, 2012, pp. 141-150.
-
(2012)
ISSRE
, pp. 141-150
-
-
Zhu, Y.1
Li, Y.2
Xue, J.3
Tan, T.4
Shi, J.5
Shen, Y.6
Ma, C.7
-
11
-
-
0032597692
-
AR-SMT: A microarchitectural approach to fault tolerance in microprocessors
-
E. Rotenberg, AR-SMT: a microarchitectural approach to fault tolerance in microprocessors, in: FTCS 1999, pp. 84-91.
-
(1999)
FTCS
, pp. 84-91
-
-
Rotenberg, E.1
-
12
-
-
41349091201
-
Argus: Low-cost, comprehensive error detection in simple cores
-
A. Meixner, M. Bauer, D. Sorin, Argus: low-cost, comprehensive error detection in simple cores, in: MICRO 2007, pp. 210-222.
-
(2007)
MICRO
, pp. 210-222
-
-
Meixner, A.1
Bauer, M.2
Sorin, D.3
-
14
-
-
0036507790
-
Error detection by duplicated instructions in super-scalar processors
-
DOI 10.1109/24.994913, PII S0018952902026076
-
N. Oh, P. Shirvani, and E. McCluskey Error detection by duplicated instructions in super-scalar processors IEEE Transactions on Reliability 51 1 2002 63 75 (Pubitemid 34630924)
-
(2002)
IEEE Transactions on Reliability
, vol.51
, Issue.1
, pp. 63-75
-
-
Oh, N.1
Shirvani, P.P.2
McCluskey, E.J.3
-
15
-
-
33646829087
-
SWIFT: Software implemented fault tolerance
-
DOI 10.1109/CGO.2005.34, 1402092, Proceedings of the 2005 International Symposium on Code Generation and Optimization, CGO 2005
-
G. Reis, J. Chang, N. Vachharajani, R. Rangan, D. August, SWIFT: software implemented fault tolerance, in: CGO 2005, pp. 243-254. (Pubitemid 43773808)
-
(2005)
Proceedings of the 2005 International Symposium on Code Generation and Optimization, CGO 2005
, vol.2005
, pp. 243-254
-
-
Reis, G.A.1
Chang, J.2
Vachharajani, N.3
Rangan, R.4
August, D.I.5
-
16
-
-
67650566031
-
ESoftCheck: Removal of non-vital checks for fault tolerance
-
J. Yu, M.J. Garzaran, M. Snir, ESoftCheck: Removal of non-vital checks for fault tolerance, in: CGO 2009, pp. 35-46.
-
(2009)
CGO
, pp. 35-46
-
-
Yu, J.1
Garzaran, M.J.2
Snir, M.3
-
17
-
-
36049042979
-
Processor-level selective replication
-
DOI 10.1109/DSN.2007.75, 4273005, Proceedings - 37th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, DSN 2007
-
N. Nakka, K. Pattabiraman, R. Iyer, Processor-level selective replication, in: DSN 2007, pp. 544-553. (Pubitemid 350080459)
-
(2007)
Proceedings of the International Conference on Dependable Systems and Networks
, pp. 544-553
-
-
Nakka, N.1
Pattabiraman, K.2
Iyer, R.3
-
18
-
-
77949759608
-
Shoestring: Probabilistic soft error reliability on the cheap
-
S. Feng, S. Gupta, A. Ansari, S. Mahlke, Shoestring: probabilistic soft error reliability on the cheap, in: ASPLOS 2010, pp. 385-396.
-
(2010)
ASPLOS
, pp. 385-396
-
-
Feng, S.1
Gupta, S.2
Ansari, A.3
Mahlke, S.4
-
19
-
-
84863492598
-
Runtime asynchronous fault tolerance via speculation
-
Y. Zhang, S. Ghosh, J. Huang, J.W. Lee, S.A. Mahlke, D.I. August, Runtime asynchronous fault tolerance via speculation, in: CGO 2012.
-
(2012)
CGO
-
-
Zhang, Y.1
Ghosh, S.2
Huang, J.3
Lee, J.W.4
Mahlke, S.A.5
August, D.I.6
-
20
-
-
34547697289
-
Application-level correctness and its impact on fault tolerance
-
DOI 10.1109/HPCA.2007.346196, 4147659, 2007 IEEE 13th Annual International Symposium on High Performance Computer Architecture, HPCA-13
-
X. Li, D. Yeung, Application-level correctness and its impact on fault tolerance, in: HPCA 2007, pp. 181-192. (Pubitemid 47208163)
-
(2007)
Proceedings - International Symposium on High-Performance Computer Architecture
, pp. 181-192
-
-
Li, X.1
Yeung, D.2
-
21
-
-
53349095714
-
A characterization of instruction-level error derating and its implications for error detection
-
J. Cook, C. Zilles, A characterization of instruction-level error derating and its implications for error detection, in: DSN 2008, pp. 482-491.
-
(2008)
DSN
, pp. 482-491
-
-
Cook, J.1
Zilles, C.2
-
22
-
-
84968854658
-
Y-branches: When you come to a fork in the road, take it
-
N.W. Michael, M. Fertig, S. Patel, Y-branches: When you come to a fork in the road, take it, in: PACT 2003, pp. 56-67.
-
(2003)
PACT
, pp. 56-67
-
-
Michael, N.W.1
Fertig, M.2
Patel, S.3
-
23
-
-
34249775197
-
Automatic instruction-level software-only recovery
-
DOI 10.1109/MM.2007.4
-
G.A. Reis, J. Chang, and D.I. August Automatic instruction-level software-only recovery IEEE Micro 27 1 2007 36 47 (Pubitemid 46850886)
-
(2007)
IEEE Micro
, vol.27
, Issue.1
, pp. 36-47
-
-
Reis, G.A.1
Chang, J.2
August, D.I.3
-
24
-
-
84963697733
-
Scheduling fault-tolerant distributed hard real-time tasks independently of the replication strategies
-
P. Chevochot, I. Puaut, Scheduling fault-tolerant distributed hard real-time tasks independently of the replication strategies, in: RTCSA 1999, pp. 356-363.
-
(1999)
RTCSA
, pp. 356-363
-
-
Chevochot, P.1
Puaut, I.2
-
25
-
-
84861597079
-
PartialRC: A partial recomputing method for efficient fault recovery on GPGPUs
-
doi:10.1007/s11390-012-1220-5
-
X.-h. Xu, X.-J. Yang, J. Xue, Y.-F. Lin, and Y.-S. Lin PartialRC: a partial recomputing method for efficient fault recovery on GPGPUs Journal of Computer Science and Technology 27 2 2012 240 255 doi:10.1007/s11390-012-1220-5
-
(2012)
Journal of Computer Science and Technology
, vol.27
, Issue.2
, pp. 240-255
-
-
Xu, X.-H.1
Yang, X.-J.2
Xue, J.3
Lin, Y.-F.4
Lin, Y.-S.5
-
26
-
-
84866653671
-
Low-cost program-level detectors for reducing silent data corruptions
-
S.K.S. Hari, S.V. Adve, H. Naeimi, Low-cost program-level detectors for reducing silent data corruptions, in: DSN 2012, pp. 1-12.
-
(2012)
DSN
, pp. 1-12
-
-
Hari, S.K.S.1
Adve, S.V.2
Naeimi, H.3
-
27
-
-
47349100793
-
Multi-bit error tolerant caches using two-dimensional error coding
-
J. Kim, N. Hardavellas, K. Mai, B. Falsafi, J.C. Hoe, Multi-bit error tolerant caches using two-dimensional error coding, in: MICRO 2007, pp. 197-209.
-
(2007)
MICRO
, pp. 197-209
-
-
Kim, J.1
Hardavellas, N.2
Mai, K.3
Falsafi, B.4
Hoe, J.C.5
-
28
-
-
0033726332
-
Transient fault detection via simultaneous multithreading
-
S.K. Reinhardt, S.S. Mukherjee, Transient fault detection via simultaneous multithreading, in: ISCA 2000, pp. 25-36.
-
(2000)
ISCA
, pp. 25-36
-
-
Reinhardt, S.K.1
Mukherjee, S.S.2
-
30
-
-
33646165711
-
Data cache locking for higher program predictability
-
X. Vera, B. Lisper, J. Xue, Data cache locking for higher program predictability, in: SIGMETRICS, 2003, pp. 272-282.
-
(2003)
SIGMETRICS
, pp. 272-282
-
-
Vera, X.1
Lisper, B.2
Xue, J.3
-
32
-
-
84864151582
-
WCET-aware data selection and allocation for scratchpad memory
-
Q. Wan, H. Wu, J. Xue, WCET-aware data selection and allocation for scratchpad memory, in: LCTES, 2012, pp. 41-50.
-
(2012)
LCTES
, pp. 41-50
-
-
Wan, Q.1
Wu, H.2
Xue, J.3
-
33
-
-
43949126892
-
The worst-case execution-time problem overview of methods and survey of tools
-
doi:10.1145/1347375.1347389. URL: http://doi.acm.org/10.1145/1347375.1347389
-
R. Wilhelm, J. Engblom, A. Ermedahl, N. Holsti, S. Thesing, D. Whalley, G. Bernat, C. Ferdinand, R. Heckmann, T. Mitra, F. Mueller, I. Puaut, P. Puschner, J. Staschulat, and P. Stenström The worst-case execution-time problem overview of methods and survey of tools ACM Trans. Embed. Comput. Syst. 7 3 2008 36:1 36:53 doi:10.1145/1347375.1347389. URL: http://doi.acm.org/10.1145/1347375.1347389
-
(2008)
ACM Trans. Embed. Comput. Syst.
, vol.7
, Issue.3
, pp. 36:1-36:53
-
-
Wilhelm, R.1
Engblom, J.2
Ermedahl, A.3
Holsti, N.4
Thesing, S.5
Whalley, D.6
Bernat, G.7
Ferdinand, C.8
Heckmann, R.9
Mitra, T.10
Mueller, F.11
Puaut, I.12
Puschner, P.13
Staschulat, J.14
Stenström, P.15
-
34
-
-
78650668151
-
Optimal WCET-aware code selection for scratchpad memory
-
H. Wu, J. Xue, S. Parameswaran, Optimal WCET-aware code selection for scratchpad memory, in: EMSOFT, 2010, pp. 59-68.
-
(2010)
EMSOFT
, pp. 59-68
-
-
Wu, H.1
Xue, J.2
Parameswaran, S.3
-
35
-
-
0026156694
-
Experiments with a program timing tool based on source-level timing schema
-
C.Y. Park, and A.C. Shaw Experiments with a program timing tool based on source-level timing schema IEEE Computer 24 5 1991 48 57
-
(1991)
IEEE Computer
, vol.24
, Issue.5
, pp. 48-57
-
-
Park, C.Y.1
Shaw, A.C.2
-
37
-
-
36048974180
-
Chronos: A timing analyzer for embedded software
-
DOI 10.1016/j.scico.2007.01.014, PII S0167642307001633, Experimental Software and Toolkits
-
X. Li, Y. Liang, T. Mitra, and A. Roychoudhury Chronos: a timing analyzer for embedded software Science of Computer Programming 69 1-3 2007 56 67 (Pubitemid 350087241)
-
(2007)
Science of Computer Programming
, vol.69
, Issue.1-3
, pp. 56-67
-
-
Li, X.1
Liang, Y.2
Mitra, T.3
Roychoudhury, A.4
-
38
-
-
84884687003
-
A modular and retargetable framework for tree-based WCET analysis
-
A. Colin, I. Puaut, A modular and retargetable framework for tree-based WCET analysis, in: ECRTS 2001, pp. 37-44.
-
(2001)
ECRTS
, pp. 37-44
-
-
Colin, A.1
Puaut, I.2
-
39
-
-
0036994494
-
WCET analysis of probabilistic hard real-time systems
-
G. Bernat, A. Colin, S.M. Petters, WCET analysis of probabilistic hard real-time systems, in: RTSS 2002, pp. 279-288.
-
(2002)
RTSS
, pp. 279-288
-
-
Bernat, G.1
Colin, A.2
Petters, S.M.3
-
40
-
-
80051996448
-
The Mälardalen WCET benchmarks - Past, present and future
-
J. Gustafsson, A. Betts, A. Ermedahl, B. Lisper, The Mälardalen WCET benchmarks - past, present and future, in: WCET 2010, pp. 137-147.
-
(2010)
WCET
, pp. 137-147
-
-
Gustafsson, J.1
Betts, A.2
Ermedahl, A.3
Lisper, B.4
-
42
-
-
78149269828
-
DAFT: Decoupled acyclic fault tolerance
-
Y. Zhang, J.W. Lee, N.P. Johnson, D.I. August, DAFT: decoupled acyclic fault tolerance, in: PACT 2010, pp. 87-98.
-
(2010)
PACT
, pp. 87-98
-
-
Zhang, Y.1
Lee, J.W.2
Johnson, N.P.3
August, D.I.4
-
43
-
-
84866711634
-
Blockwatch: Leveraging similarity in parallel programs for error detection
-
J. Wei, K. Pattabiraman, Blockwatch: leveraging similarity in parallel programs for error detection, in: DSN 2012, pp. 1-12.
-
(2012)
DSN
, pp. 1-12
-
-
Wei, J.1
Pattabiraman, K.2