-
1
-
-
74049125756
-
-
Technical Report TN-41-01. Technical report, Micron Technology
-
Calculating Memory System Power for DDR3, Technical Report TN-41-01. Technical report, Micron Technology, 2007.
-
(2007)
Calculating Memory System Power for DDR3
-
-
-
3
-
-
74049087888
-
Future scaling of processor-memory interfaces
-
J. H. Ahn, N. P. Jouppi, C. Kozyrakis, J. Leverich, and R. S. Schreiber. Future Scaling of Processor-Memory Interfaces. In International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2009.
-
(2009)
International Conference for High Performance Computing, Networking, Storage and Analysis (SC)
-
-
Ahn, J.H.1
Jouppi, N.P.2
Kozyrakis, C.3
Leverich, J.4
Schreiber, R.S.5
-
5
-
-
84877716218
-
Scaling algebraic multigrid solvers: On the road to exascale
-
A. H. Baker, R. D. Falgout, T. Gamblin, T. V. Kolev, M. Schulz, and U. M. Yang. Scaling Algebraic Multigrid Solvers: On the Road to Exascale. In International Conf. on Competence in HPC, 2011.
-
(2011)
International Conf. on Competence in HPC
-
-
Baker, A.H.1
Falgout, R.D.2
Gamblin, T.3
Kolev, T.V.4
Schulz, M.5
Yang, U.M.6
-
8
-
-
84875168534
-
Online-ABFT: An online algorithm based fault tolerance scheme for soft error detection in iterative methods
-
Z. Chen. Online-ABFT: An Online Algorithm Based Fault Tolerance Scheme for Soft Error Detection in Iterative Methods. In ACM SIGPLAN Symp. on Principles and Practice of Parallel Programming, 2013.
-
(2013)
ACM SIGPLAN Symp. on Principles and Practice of Parallel Programming
-
-
Chen, Z.1
-
10
-
-
79959586938
-
High performance linpack benchmark: A fault tolerant implementation without checkpointing
-
T. Davies, C. Karlsson, H. Liu, C. Ding, and Z. Chen. High Performance Linpack Benchmark: A Fault Tolerant Implementation without Checkpointing. In International Conference on Supercomputing, 2011.
-
(2011)
International Conference on Supercomputing
-
-
Davies, T.1
Karlsson, C.2
Liu, H.3
Ding, C.4
Chen, Z.5
-
13
-
-
80051629238
-
Matrix multiplication on gpus with on-line fault tolerance
-
C. Ding, C. Karlsson, H. Liu, T. Davies, and Z. Chen. Matrix Multiplication on GPUs with On-Line Fault Tolerance. In International Symposium on Parallel and Distributed Processing with Applications, 2011.
-
(2011)
International Symposium on Parallel and Distributed Processing with Applications
-
-
Ding, C.1
Karlsson, C.2
Liu, H.3
Davies, T.4
Chen, Z.5
-
14
-
-
84858403667
-
Algorithm-based fault tolerance for dense matrix factorizations
-
P. Du, A. Bouteiller, G. Bosilca, T. Herault, and J. Dongarra. Algorithm-based Fault Tolerance for Dense Matrix Factorizations. In ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), 2012.
-
(2012)
ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP)
-
-
Du, P.1
Bouteiller, A.2
Bosilca, G.3
Herault, T.4
Dongarra, J.5
-
15
-
-
80955123431
-
High performance dense linear system solver with soft error resilience
-
P. Du, P. Luszczek, S. Tomovand, and J. Dongarra. High Performance Dense Linear System Solver with Soft Error Resilience. In IEEE Cluster, 2011.
-
(2011)
IEEE Cluster
-
-
Du, P.1
Luszczek, P.2
Tomovand, S.3
Dongarra, J.4
-
16
-
-
33845468996
-
Fault tolerance techniques for the merrimac streaming supercomputer
-
M. Erez, N. Jayasena, T. J. Knight, and W. J. Dally. Fault Tolerance Techniques for the Merrimac Streaming Supercomputer. In International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2005.
-
(2005)
International Conference for High Performance Computing, Networking, Storage and Analysis (SC)
-
-
Erez, M.1
Jayasena, N.2
Knight, T.J.3
Dally, W.J.4
-
17
-
-
83155188951
-
Evaluating the viability of process replication reliability for exascale systems
-
K. Ferreira, J. Stearley, J. H. L. III, R. Oldfield, K. Pedretti, R. Brightwell, R. Riesen, P. G. Bridges, and D. Arnold. Evaluating the Viability of Process Replication Reliability for Exascale Systems. In International Conference for High Performance Computing, Networking, Storage and Analysis, 2011.
-
(2011)
International Conference for High Performance Computing, Networking, Storage and Analysis
-
-
Ferreira, K.1
Stearley III, J.2
Oldfield, R.3
Pedretti, K.4
Brightwell, R.5
Riesen, R.6
Bridges, P.G.7
Arnold, D.8
-
18
-
-
84871176503
-
Mechanisms and evaluation of cross-layer fault-tolerance for supercomputing
-
C.-H. Ho, M. de Kruijif, K. Sankaralingam, B. Rountree, M. Schulz, and B. R. de Supinski. Mechanisms and Evaluation of Cross-Layer Fault-Tolerance for Supercomputing. In International Conference on Parallel Processing (ICPP), 2012.
-
(2012)
International Conference on Parallel Processing (ICPP)
-
-
Ho, C.-H.1
De Kruijif, M.2
Sankaralingam, K.3
Rountree, B.4
Schulz, M.5
De Supinski, B.R.6
-
21
-
-
84877692741
-
Classifying soft error vulnerabilities in extreme-scale scientific applications using a binary instrumentation tool
-
D. Li, J. S. Vetter, and W. Yu. Classifying Soft Error Vulnerabilities in Extreme-Scale Scientific Applications Using a Binary Instrumentation Tool. In International Conference for High Performance Computing, Networking, Storage and Analysis, 2012.
-
(2012)
International Conference for High Performance Computing, Networking, Storage and Analysis
-
-
Li, D.1
Vetter, J.S.2
Yu, W.3
-
22
-
-
53349140999
-
Understanding the propagation of hard errors to software and implications for resilient system design
-
M. Li, P. Ramachandran, S. K. Sahoo, S. V. Adve, V. S. Adve, and Y. Zhou. Understanding the Propagation of Hard Errors to Software and Implications for Resilient System Design. In Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2008.
-
(2008)
Architectural Support for Programming Languages and Operating Systems (ASPLOS)
-
-
Li, M.1
Ramachandran, P.2
Sahoo, S.K.3
Adve, S.V.4
Adve, V.S.5
Zhou, Y.6
-
23
-
-
83155182888
-
System implications of memory reliability in exascale computing
-
S. Li, K. Chen, M.-Y. Hsieh, N. Muralimanohar, C. D. Kersey, J. B. Brockman, A. F. Rodrigues, and N. P. Jouppi. System Implications of Memory Reliability in Exascale Computing. In International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2011.
-
(2011)
International Conference for High Performance Computing, Networking, Storage and Analysis (SC)
-
-
Li, S.1
Chen, K.2
Hsieh, M.-Y.3
Muralimanohar, N.4
Kersey, C.D.5
Brockman, J.B.6
Rodrigues, A.F.7
Jouppi, N.P.8
-
24
-
-
84877700379
-
MAGE: Adaptive granularity and ecc for resilient and power efficient memory systems
-
S. Li, D. H. Yoon, K. Chen, J. Zhao, J. H. Ahn, J. B. Brockman, Y. Xie, and N. P. Jouppi. MAGE: Adaptive Granularity and ECC for Resilient and Power Efficient Memory Systems. In International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2012.
-
(2012)
International Conference for High Performance Computing, Networking, Storage and Analysis (SC)
-
-
Li, S.1
Yoon, D.H.2
Chen, K.3
Zhao, J.4
Ahn, J.H.5
Brockman, J.B.6
Xie, Y.7
Jouppi, N.P.8
-
25
-
-
83155174060
-
A realistic evaluation of memory hardware errors and software system susceptibility
-
X. Li, M. C. Huang, K. Shen, and L. Chu. A Realistic Evaluation of Memory Hardware Errors and Software System Susceptibility. In USENIX ATC, 2010.
-
(2010)
USENIX ATC
-
-
Li, X.1
Huang, M.C.2
Shen, K.3
Chu, L.4
-
26
-
-
84899691583
-
Algorithm-based recovery for newton's method without checkpointing
-
H. Liu, T. Davies, C. Ding, C. Karlsson, and Z. Chen. Algorithm-Based Recovery for Newton's Method without Checkpointing. In Workshop on Dependable Par., Distributed and Network-Centric Systems, 2011.
-
(2011)
Workshop on Dependable Par., Distributed and Network-Centric Systems
-
-
Liu, H.1
Davies, T.2
Ding, C.3
Karlsson, C.4
Chen, Z.5
-
27
-
-
78650814177
-
The 48-core scc processor: The programmer's view
-
T. G. Mattson, M. Riepen, T. Lehnig, P. Brett, W. Haas, P. Kennedy, J. Howard, S. Vangal, N. Borkar, G. Ruhl, and S. Dighe. The 48-core SCC Processor: The Programmer's View. In International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2010.
-
(2010)
International Conference for High Performance Computing, Networking, Storage and Analysis (SC)
-
-
Mattson, T.G.1
Riepen, M.2
Lehnig, T.3
Brett, P.4
Haas, W.5
Kennedy, P.6
Howard, J.7
Vangal, S.8
Borkar, N.9
Ruhl, G.10
Dighe, S.11
-
28
-
-
84899687630
-
-
Mcsim: A manycore simulation infrastructure
-
Mcsim: A manycore simulation infrastructure. http://scale. snu. ac. kr/mcsim.
-
-
-
-
31
-
-
79959550547
-
DRAMSim2: A cycle accurate memory system simulator
-
P. Rosenfeld, E. Cooper-Balis, and B. Jacob. DRAMSim2: A Cycle Accurate Memory System Simulator. Computer Architecture Letters, 10(1):16-19, 2011.
-
(2011)
Computer Architecture Letters
, vol.10
, Issue.1
, pp. 16-19
-
-
Rosenfeld, P.1
Cooper-Balis, E.2
Jacob, B.3
-
32
-
-
53349128424
-
Using likely program invariants to detect hardware errors
-
S. K. Sahoo, M.-L. Li, P. Ramachandran, S. V. Adve, V. S. Adve, and Y. Zhou. Using Likely Program Invariants to Detect Hardware Errors. In International Conf. on Dependable Systems and Networks, 2008.
-
(2008)
International Conf. on Dependable Systems and Networks
-
-
Sahoo, S.K.1
Li, M.-L.2
Ramachandran, P.3
Adve, S.V.4
Adve, V.S.5
Zhou, Y.6
-
34
-
-
41649118004
-
Impact of error correction code and dynamic memory recon-guration on high-reliability/low-cost server memory
-
C. Slayman. Impact of Error Correction Code and Dynamic Memory Recon-guration on High-Reliability/Low-Cost Server Memory. In Integrated Reliability Workshop, 2006.
-
(2006)
Integrated Reliability Workshop
-
-
Slayman, C.1
-
36
-
-
84864832751
-
LOT-ECC: Localized and tiered reliability mechanisms for commodity memory systems
-
A. N. Udipi, N. Muralimanohar, R. Balsubramonian, A. Davis, and N. P. Jouppi. LOT-ECC: Localized and Tiered Reliability Mechanisms for Commodity Memory Systems. In International Symposium on Computer Architecture (ISCA), 2012.
-
(2012)
International Symposium on Computer Architecture (ISCA)
-
-
Udipi, A.N.1
Muralimanohar, N.2
Balsubramonian, R.3
Davis, A.4
Jouppi, N.P.5
-
38
-
-
84899671222
-
Online soft error correction in cholesky decomposition
-
P. Wu, L. Chen, L. Tan, and Z. Chen. Online Soft Error Correction in Cholesky Decomposition. UC, Riverside, Technical Report UCR-CS-13-002, 2013.
-
(2013)
UC, Riverside, Technical Report UCR-CS-13-002
-
-
Wu, P.1
Chen, L.2
Tan, L.3
Chen, Z.4
-
39
-
-
84863252530
-
Fault tolerant matrix-matrix multiplication: Correcting soft errors on-line
-
P. Wu, C. Ding, L. Chen, F. Gao, T. Davies, C. Karlsson, and Z. Chen. Fault Tolerant Matrix-Matrix Multiplication: Correcting Soft Errors On-line. In Workshop on Scalable Algorithms for Large-Scale Systems, 2011.
-
(2011)
Workshop on Scalable Algorithms for Large-Scale Systems
-
-
Wu, P.1
Ding, C.2
Chen, L.3
Gao, F.4
Davies, T.5
Karlsson, C.6
Chen, Z.7
-
40
-
-
77952257218
-
Virtualized and flexible ecc for main memory
-
D. H. Yoon and M. Erez. Virtualized and Flexible ECC for Main Memory. In ASPLOS, 2010.
-
(2010)
ASPLOS
-
-
Yoon, D.H.1
Erez, M.2
|