-
1
-
-
84870548923
-
An overview of the BlueGene/L supercomputer
-
ADIGA, N., ALMASI, G., ALMASI, G., ARIDOR, Y., BARIK, R., ET AL. 2002. An overview of the BlueGene/L supercomputer. In Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis. 60-71.
-
(2002)
Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
, pp. 60-71
-
-
Adiga, N.1
Almasi, G.2
Almasi, G.3
Aridor, Y.4
Barik, R.5
-
2
-
-
58149231291
-
A bipolar-selected phase change memory featuring multi-level cell storage
-
BEDESCHI, F., FACKENTHAL, R., RESTA, C., DONZE, E. M., JAGASIVAMANI, M., ET AL. 2009. A bipolar-selected phase change memory featuring multi-level cell storage. IEEE J. Solid-State Circ. 44, 1, 217-227.
-
(2009)
IEEE J. Solid-State Circ.
, vol.44
, Issue.1
, pp. 217-227
-
-
Bedeschi, F.1
Fackenthal, R.2
Resta, C.3
Donze, E.M.4
Jagasivamani, M.5
-
3
-
-
33846118079
-
Designing reliable systems from unreliable components: The challenges of transistor variability and degradation
-
BORKAR, S. Y. 2005. Designing reliable systems from unreliable components: The challenges of transistor variability and degradation. IEEE Micro 25, 6, 10-16.
-
(2005)
IEEE Micro
, vol.25
, Issue.6
, pp. 10-16
-
-
Borkar, S.Y.1
-
4
-
-
74049111423
-
Compiler-Enhanced incremental checkpointing for OpenMP applications
-
BRONEVETSKY, G., MARQUES, D. J., PINGALI, K. K., ET AL. 2008. Compiler-Enhanced incremental checkpointing for OpenMP applications. In Proceedings of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. 275-276.
-
(2008)
Proceedings of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
, pp. 275-276
-
-
Bronevetsky, G.1
Marques, D.J.2
Pingali, K.K.3
-
6
-
-
68249127079
-
Fault tolerance in petascale/exascale systems: Current knowledge, challenges and research opportunities
-
CAPPELLO, F. 2009. Fault tolerance in petascale/exascale systems: Current knowledge, challenges and research opportunities. Int. J. High Perform. Comput. Appl. 23, 3, 212-226.
-
(2009)
Int. J. High Perform. Comput. Appl.
, vol.23
, Issue.3
, pp. 212-226
-
-
Cappello, F.1
-
7
-
-
0022020346
-
Distributed snapshots: Determining global states of distributed systems
-
DOI 10.1145/214451.214456
-
CHANDY, K. M. AND LAMPORT, L. 1985. Distributed snapshots: Determining global states of distributed systems. ACM Trans. Comput. Syst. 3, 1, 63-75. (Pubitemid 15597765)
-
(1985)
ACM Transactions on Computer Systems
, vol.3
, Issue.1
, pp. 63-75
-
-
Chandy K.Mani1
Lamport Leslie2
-
9
-
-
28044460018
-
A higher order estimate of the optimum checkpoint interval for restart dumps
-
DOI 10.1016/j.future.2004.11.016, PII S0167739X04002213
-
DALY, J. T. 2006. A higher order estimate of the optimum checkpoint interval for restart dumps. Future Gener. Comput. Syst. 22, 3, 303-312. (Pubitemid 41689812)
-
(2006)
Future Generation Computer Systems
, vol.22
, Issue.3
, pp. 303-312
-
-
Daly, J.T.1
-
10
-
-
76349091566
-
PCRAMsim: System-level performance, energy, and area modeling for phase-change RAM
-
DONG, X., JOUPPI, N., AND XIE, Y. 2009a. PCRAMsim: System-level performance, energy, and area modeling for phase-change RAM. In Proceedings of the International Conference on Computer-Aided Design. 269-275.
-
(2009)
Proceedings of the International Conference on Computer-Aided Design
, pp. 269-275
-
-
Dong, X.1
Jouppi, N.2
Xie, Y.3
-
11
-
-
74049097178
-
Leveraging 3D PCRAM technologies to reduce checkpoint overhead for future exascale systems
-
DONG, X., MURALIMANOHAR, N., JOUPPI, N., KAUFMANN, R., AND XIE, Y. 2009b. Leveraging 3D PCRAM technologies to reduce checkpoint overhead for future exascale systems. In Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis. 1-12.
-
(2009)
Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
, pp. 1-12
-
-
Dong, X.1
Muralimanohar, N.2
Jouppi, N.3
Kaufmann, R.4
Xie, Y.5
-
12
-
-
12344277946
-
-
Tech. rep. LBNL-54941, Lawrence Berkeley National Laboratory.
-
DUELL, J., HARGROVE, P., AND ROMAN, E. 2002. The design and implementation of Berkeley Lab's Linux checkpoint/restart. Tech. rep. LBNL-54941, Lawrence Berkeley National Laboratory.
-
(2002)
The Design and Implementation of Berkeley Lab's Linux Checkpoint/Restart
-
-
Duell, J.1
Hargrove, P.2
Roman, E.3
-
13
-
-
0042078549
-
A survey of rollback-recovery protocols in message-passing systems
-
ELNOZAHY, E. N., ALVISI, L., WANG, Y.-M., AND JOHNSON, D. B. 2002. A survey of rollback-recovery protocols in message-passing systems. ACM Comput. Surv. 34, 3, 375-408.
-
(2002)
ACM Comput. Surv.
, vol.34
, Issue.3
, pp. 375-408
-
-
Elnozahy, E.N.1
Alvisi, L.2
Wang, Y.-M.3
Johnson, D.B.4
-
14
-
-
79960821385
-
-
Tech. rep. LA-UR-07-7405, Los Alamos National Laboratory
-
GRIDER, G., LONCARIC, J., AND LIMPART, D. 2007. Roadrunner system management report. Tech. rep. LA-UR-07-7405, Los Alamos National Laboratory.
-
(2007)
Roadrunner System Management Report
-
-
Grider, G.1
Loncaric, J.2
Limpart, D.3
-
15
-
-
34548861504
-
A 512kB embedded phase change memory with 416kB/s write throughput at 100μA cell write current
-
HANZAWA, S., KITAI, N., OSADA, K., ET AL. 2007. A 512kB embedded phase change memory with 416kB/s write throughput at 100μA cell write current. In Proceedings of the IEEE International Solid-State Circuits Conference. 474-616.
-
(2007)
Proceedings of the IEEE International Solid-State Circuits Conference
, pp. 474-616
-
-
Hanzawa, S.1
Kitai, N.2
Osada, K.3
-
16
-
-
49149120280
-
Accurate, pre-RTL temperature-aware design using a parameterized, geometric thermal model
-
HUANG, W., SANKARANARAYANAN, K., SKADRON, K., ET AL. 2008. Accurate, pre-RTL temperature-aware design using a parameterized, geometric thermal model. IEEE Trans. Comput. 57, 9, 1277-1288.
-
(2008)
IEEE Trans. Comput.
, vol.57
, Issue.9
, pp. 1277-1288
-
-
Huang, W.1
Sankaranarayanan, K.2
Skadron, K.3
-
17
-
-
79960810439
-
-
INTERNATIONAL TECHNOLOGY ROADMAP FOR SEMICONDUCTORS. Process integration, devices, and structures 2007 edition
-
INTERNATIONAL TECHNOLOGY ROADMAP FOR SEMICONDUCTORS. Process integration, devices, and structures 2007 edition. http://www.itrs.net/.
-
-
-
-
19
-
-
74049106729
-
-
LOS ALAMOS NATIONAL LABORATORY
-
LOS ALAMOS NATIONAL LABORATORY. 2009. Reliability data sets. http://institutes.lanl.gov/data/fdata/.
-
(2009)
Reliability Data Sets
-
-
-
21
-
-
29344473319
-
Predicting the number of fatal soft errors in Los Alamos National Laboratory's ASC Q supercomputer
-
DOI 10.1109/TDMR.2005.855685
-
MICHALAK, S. E., HARRIS, K. W., HENGARTNER, N. W., ET AL. 2005. Predicting the number of fatal soft errors in Los Alamos National Laboratory's ASC Q supercomputer. IEEE Trans. Device Mater. Reliab. 5, 3, 329-335. (Pubitemid 43003054)
-
(2005)
IEEE Transactions on Device and Materials Reliability
, vol.5
, Issue.3
, pp. 329-335
-
-
Michalak, S.E.1
Harris, K.W.2
Hengartner, N.W.3
Takala, B.E.4
Wender, S.A.5
-
22
-
-
50649087527
-
Reliability-aware approach: An incremental checkpoint/restart model in HPC environments
-
NAKSINEHABOON, N., LIU, Y., LEANGSUKSUN, C., NASSAR, R., PAUN, M., AND SCOTT, S. L. 2008. Reliability-aware approach: An incremental checkpoint/restart model in HPC environments. In Proceedings of the 8th IEEE International Symposium on Cluster Computing and the Grid. 783-788.
-
(2008)
Proceedings of the 8th IEEE International Symposium on Cluster Computing and the Grid
, pp. 783-788
-
-
Naksinehaboon, N.1
Liu, Y.2
Leangsuksun, C.3
Nassar, R.4
Paun, M.5
Scott, S.L.6
-
23
-
-
63549128406
-
-
NASA
-
NASA. 2009. NAS parallel benchmarks. http://www.nas.nasa.gov/Resources/Software/npb.html.
-
(2009)
NAS Parallel Benchmarks
-
-
-
24
-
-
47249142074
-
Modeling the impact of checkpoints on next-generation systems
-
OLDFIELD, R. A., ARUNAGIRI, S., TELLER, P. J., ET AL. 2007. Modeling the impact of checkpoints on next-generation systems. In Proceedings of the 24th IEEE Conference on Mass Storage Systems and Technologies. 30-46.
-
(2007)
Proceedings of the 24th IEEE Conference on Mass Storage Systems and Technologies
, pp. 30-46
-
-
Oldfield, R.A.1
Arunagiri, S.2
Teller, P.J.3
-
26
-
-
4544229593
-
Novel μtrench phase-change memory cell for embedded and stand-alone non-volatile memory applications
-
PELLIZZER, F., PIROVANO, A., OTTOGALLI, F., ET AL. 2004. Novel μtrench phase-change memory cell for embedded and stand-alone non-volatile memory applications. In Proceedings of the IEEE Symposium on VLSI Technology. 18-19.
-
(2004)
Proceedings of the IEEE Symposium on VLSI Technology
, pp. 18-19
-
-
Pellizzer, F.1
Pirovano, A.2
Ottogalli, F.3
-
28
-
-
0033077475
-
Memory exclusion: Optimizing the performance of checkpointing systems
-
PLANK, J. S., CHEN, Y., LI, K., BECK, M., AND KINGSLEY, G. 1999. Memory exclusion: Optimizing the performance of checkpointing systems. Softw. Pract. Exper. 29, 2, 125-142.
-
(1999)
Softw. Pract. Exper.
, vol.29
, Issue.2
, pp. 125-142
-
-
Plank, J.S.1
Chen, Y.2
Li, K.3
Beck, M.4
Kingsley, G.5
-
29
-
-
0032179680
-
Diskless checkpointing
-
PLANK, J. S., LI, K., AND PUENING, M. A. 1998. Diskless checkpointing. IEEE Trans. Parall. Distrib. Syst. 9, 10, 972-986. (Pubitemid 128747893)
-
(1998)
IEEE Transactions on Parallel and Distributed Systems
, vol.9
, Issue.10
, pp. 972-986
-
-
Plank, J.S.1
Li, K.2
Puening, M.A.3
-
30
-
-
33847111141
-
High-end computing: The challenge of scale
-
REED, D. 2004. High-end computing: The challenge of scale. In Director's Colloquium.
-
(2004)
Director's Colloquium
-
-
Reed, D.1
-
31
-
-
12444320288
-
On the feasibility of incremental checkpointing for scientific computing
-
SANCHO, J. C., PETRINI, F., JOHNSON, G., AND FRACHTENBERG, E. 2004. On the feasibility of incremental checkpointing for scientific computing. In Proceedings of the 18th International Parallel and Distributed Processing Symposium. 58-67.
-
(2004)
Proceedings of the 18th International Parallel and Distributed Processing Symposium
, pp. 58-67
-
-
Sancho, J.C.1
Petrini, F.2
Johnson, G.3
Frachtenberg, E.4
-
34
-
-
52649100126
-
Corona: System implications of emerging nanophotonic technology
-
VANTREASE, D., SCHREIBER, R., MONCHIERO, M., ET AL. 2008. Corona: System implications of emerging nanophotonic technology. In Proceedings of the 35th International Symposium on Computer Architecture. 153-164.
-
(2008)
Proceedings of the 35th International Symposium on Computer Architecture
, pp. 153-164
-
-
Vantrease, D.1
Schreiber, R.2
Monchiero, M.3
-
36
-
-
33746626966
-
Design space exploration for 3D architectures
-
DOI 10.1145/1148015.1148016
-
XIE, Y., LOH, G. H., BLACK, B., AND BERNSTEIN, K. 2006. Design space exploration for 3D architectures. ACM J. Emerg. Technol. Comput. Syst. 2, 2, 65-103. (Pubitemid 44157546)
-
(2006)
ACM Journal on Emerging Technologies in Computing Systems
, vol.2
, Issue.2
, pp. 65-103
-
-
Xie, Y.1
Loh, G.H.2
Black, B.3
Bernstein, K.4
-
37
-
-
84976846528
-
A first order approximation to the optimal checkpoint interval
-
YOUNG, J. W. 1974. A first order approximation to the optimal checkpoint interval. Comm. ACM 17, 530-531.
-
(1974)
Comm. ACM
, vol.17
, pp. 530-531
-
-
Young, J.W.1
-
38
-
-
47249095732
-
An integrated phase change memory cell with Ge nanowire diode for cross-point memory
-
ZHANG, Y., KIM, S.-B., MCVITTIE, J. P., ET AL. 2007. An integrated phase change memory cell with Ge nanowire diode for cross-point memory. In Proceedings of the IEEE Symposium on VLSI Technology. 98-99.
-
(2007)
Proceedings of the IEEE Symposium on VLSI Technology
, pp. 98-99
-
-
Zhang, Y.1
Kim, S.-B.2
Mcvittie, J.P.3
-
39
-
-
70450277571
-
A durable and energy efficient main memory using phase change memory technology
-
ZHOU, P., ZHAO, B., YANG, J., AND ZHANG, Y. 2009. A durable and energy efficient main memory using phase change memory technology. In Proceedings of the International Symposium on Computer Architecture. 14-23.
-
(2009)
Proceedings of the International Symposium on Computer Architecture
, pp. 14-23
-
-
Zhou, P.1
Zhao, B.2
Yang, J.3
Zhang, Y.4