-
2
-
-
1242287075
-
Smart cooling of data centers
-
36908b
-
C. D. Patel, C. E. Bash, R. Sharma, M. Beitelmal, and R. Friedrich, \Smart cooling of data centers," ASME Conference Proceedings, vol. 2003, no. 36908b, pp. 129-137, 2003.
-
(2003)
ASME Conference Proceedings
, vol.2003
, pp. 129-137
-
-
Patel, C.D.1
Bash, C.E.2
Sharma, R.3
Beitelmal, M.4
Friedrich, R.5
-
3
-
-
34247594029
-
Alternating cold and hot aisles provides more reliable cooling for server farms
-
R. F. Sullivan, \Alternating cold and hot aisles provides more reliable cooling for server farms," White Paper, Uptime Institute, 2000.
-
(2000)
White Paper, Uptime Institute
-
-
Sullivan, R.F.1
-
5
-
-
84899696017
-
-
R. American Society of Heating and A.-C. Engineers
-
R. American Society of Heating and A.-C. Engineers, \2008 ashrae environmental guidelines for datacom equipment. " [Online]. Available: http: //tc99. ashraetcs. org/documents/ASHRAE Extended Environmental Envelope Final Aug 1 2008. pdf
-
2008 ashrae environmental guidelines for datacom equipment
-
-
-
7
-
-
83155184565
-
A'cool' load balancer for parallel applications
-
11
-
O. Sarood and L. V. Kale, \A'cool' load balancer for parallel applications," in Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, ser. SC'11. New York, NY, USA: ACM, 2011, pp. 21:1-21:11. [Online]. Available: http://doi. acm. org/10. 1145/2063384. 2063412
-
(2011)
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, Ser. SC'11. New York, NY, USA: ACM
, vol.21
, pp. 1-21
-
-
Sarood, O.1
Kale, L.V.2
-
8
-
-
84869454748
-
-
12. Los Alamitos, CA, USA: IEEE Computer Society, 2012
-
O. Sarood, P. Miller, E. Totoni, and L. V. Kale, \Cool load balancing for high performance computing data centers," vol. 61, no. 12. Los Alamitos, CA, USA: IEEE Computer Society, 2012, pp. 1752-1764.
-
Cool load balancing for high performance computing data centers
, vol.61
, pp. 1752-1764
-
-
Sarood, O.1
Miller, P.2
Totoni, E.3
Kale, L.V.4
-
10
-
-
85024266067
-
-
7. New York, NY, USA: ACM, Oct. 2003
-
W.-c. Feng, \Making a case for efficient supercomputing," vol. 1, no. 7. New York, NY, USA: ACM, Oct. 2003, pp. 54-64. [Online]. Available: http://doi. acm. org/10. 1145/957717. 957772
-
Making a case for efficient supercomputing
, vol.1
, pp. 54-64
-
-
Feng, W.-C.1
-
11
-
-
34548715722
-
The importance of being low power in high-performance computing
-
August
-
The Importance of Being Low Power in High-Performance Computing," Cyberinfrastructure Technology Watch Quarterly (CTWatch Quarterly), vol. 1, no. 3, August 2005.
-
(2005)
Cyberinfrastructure Technology Watch Quarterly (CTWatch Quarterly)
, vol.1
, Issue.3
-
-
-
12
-
-
84899703545
-
-
Technical ReportDesign Note 002, Ericsson Microelectronics, April Ericsson
-
Ericsson, \Reliability Aspects on Power Supplies," Technical ReportDesign Note 002, Ericsson Microelectronics, April 2000.
-
(2000)
Reliability Aspects on Power Supplies
-
-
-
14
-
-
4544227478
-
The impact of technology scaling on lifetime reliability
-
J. Srinivasan, S. Adve, P. Bose, and J. Rivers, \The impact of technology scaling on lifetime reliability," in Dependable Systems and Networks, 2004 International Conference on, 2004, pp. 177-186.
-
(2004)
Dependable Systems and Networks, 2004 International Conference on
, pp. 177-186
-
-
Srinivasan, J.1
Adve, S.2
Bose, P.3
Rivers, J.4
-
16
-
-
47249098059
-
System-level fault-tolerance in large-scale parallel machines with buered coscheduling
-
F. Petrini, K. Davis, and J. Sancho, \System-level fault-tolerance in large-scale parallel machines with buered coscheduling," in Parallel and Distributed Processing Symposium, 2004. Proceedings. 18th International, 2004, pp. 209-.
-
(2004)
Parallel and Distributed Processing Symposium, 2004. Proceedings. 18th International
-
-
Petrini, F.1
Davis, K.2
Sancho, J.3
-
17
-
-
20444463494
-
FTC-Charm++: An In-Memory Checkpoint-Based Fault Tolerant Runtime for Charm++ and MPI
-
G. Zheng, L. Shi, and L. V. Kale, \FTC-Charm++: An In-Memory Checkpoint-Based Fault Tolerant Runtime for Charm++ and MPI," in 2004 IEEE International Conference on Cluster Computing, San Diego, CA, September 2004, pp. 93-103.
-
(2004)
2004 IEEE International Conference on Cluster Computing, San Diego, CA, September
, pp. 93-103
-
-
Zheng, G.1
Shi, L.2
Kale, L.V.3
-
18
-
-
28044460018
-
A higher order estimate of the optimum checkpoint interval for restart dumps
-
J. T. Daly, \A higher order estimate of the optimum checkpoint interval for restart dumps," Future Generation Comp. Syst., vol. 22, no. 3, pp. 303-312, 2006.
-
(2006)
Future Generation Comp. Syst.
, vol.22
, Issue.3
, pp. 303-312
-
-
Daly, J.T.1
-
19
-
-
84976846528
-
A rst order approximation to the optimal checkpoint interval
-
J. W. Young, \A rst order approximation to the optimal checkpoint interval," Commun. ACM, vol. 17, no. 9, pp. 530-531, 1974.
-
(1974)
Commun. ACM
, vol.17
, Issue.9
, pp. 530-531
-
-
Young, J.W.1
-
20
-
-
83155160949
-
FTI: High performance fault tolerance interface for hybrid systems
-
L. Bautista-Gomez, D. Komatitsch, N. Maruyama, S. Tsuboi, F. Cappello, and S. Matsuoka, \FTI: High performance fault tolerance interface for hybrid systems," in 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC), Nov. 2011, pp. 1-12.
-
(2011)
2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC), Nov.
, pp. 1-12
-
-
Bautista-Gomez, L.1
Komatitsch, D.2
Maruyama, N.3
Tsuboi, S.4
Cappello, F.5
Matsuoka, S.6
-
21
-
-
74049121711
-
Berkeley lab checkpoint/restart (blcr) for linux clusters
-
P. H. Hargrove and J. C. Duell, \Berkeley lab checkpoint/restart (blcr) for linux clusters," in SciDAC, 2006.
-
(2006)
SciDAC
-
-
Hargrove, P.H.1
Duell, J.C.2
-
22
-
-
78650831692
-
Design, modeling, and evaluation of a scalable multi-level checkpointing system
-
A. Moody, G. Bronevetsky, K. Mohror, and B. R. de Supinski, \Design, modeling, and evaluation of a scalable multi-level checkpointing system," in SC, 2010, pp. 1-11.
-
(2010)
SC
, pp. 1-11
-
-
Moody, A.1
Bronevetsky, G.2
Mohror, K.3
De Supinski, B.R.4
-
23
-
-
84899694124
-
-
\Lulesh," http://computation. llnl. gov/casc/ShockHydro/.
-
Lulesh
-
-
-
26
-
-
51049088635
-
Massively parallel cosmological simulations with ChaNGa
-
P. Jetley, F. Gioachin, C. Mendes, L. V. Kale, and T. R. Quinn, \Massively parallel cosmological simulations with ChaNGa," in Proceedings of IEEE International Parallel and Distributed Processing Symposium 2008, 2008.
-
(2008)
Proceedings of IEEE International Parallel and Distributed Processing Symposium 2008
-
-
Jetley, P.1
Gioachin, F.2
Mendes, C.3
Kale, L.V.4
Quinn, T.R.5
-
27
-
-
82955213596
-
Periodic hierarchical load balancing for large supercomputers
-
March
-
G. Zheng, A. Bhatele, E. Meneses, and L. V. Kale, \Periodic Hierarchical Load Balancing for Large Supercomputers," International Journal of High Performance Computing Applications (IJHPCA), March 2011.
-
(2011)
International Journal of High Performance Computing Applications (IJHPCA)
-
-
Zheng, G.1
Bhatele, A.2
Meneses, E.3
Kale, L.V.4
-
28
-
-
84899672036
-
-
Intel turbo boost technology
-
\Intel turbo boost technology," http://www. intel. com/technology/turboboost/.
-
-
-
-
29
-
-
66749092384
-
-
P. Kogge, K. Bergman, S. Borkar, D. Campbell, W. Carlson, W. Dally, M. Denneau, P. Franzon, W. Harrod, J. Hiller, S. Karp, S. Keckler, D. Klein, R. Lucas, M. Richards, A. Scarpelli, S. Scott, A. Snavely, T. Sterling, R. S. Williams, and K. Yelick, \Exascale computing study: Technology challenges in achieving exascale systems," 2008.
-
(2008)
Exascale computing study: Technology challenges in achieving exascale systems
-
-
Kogge, P.1
Bergman, K.2
Borkar, S.3
Campbell, D.4
Carlson, W.5
Dally, W.6
Denneau, M.7
Franzon, P.8
Harrod, W.9
Hiller, J.10
Karp, S.11
Keckler, S.12
Klein, D.13
Lucas, R.14
Richards, M.15
Scarpelli, A.16
Scott, S.17
Snavely, A.18
Sterling, T.19
Williams, R.S.20
Yelick, K.21
more..
-
33
-
-
52249092623
-
Metrics for architecture-level lifetime reliability analysis
-
P. Ramachandran, S. Adve, P. Bose, and J. Rivers, \Metrics for architecture-level lifetime reliability analysis," in Performance Analysis of Systems and software, 2008. ISPASS 2008. IEEE International Symposium on, 2008, pp. 202-212.
-
(2008)
Performance Analysis of Systems and Software, 2008. ISPASS 2008. IEEE International Symposium on
, pp. 202-212
-
-
Ramachandran, P.1
Adve, S.2
Bose, P.3
Rivers, J.4
-
34
-
-
4644313547
-
The case for lifetime reliability-aware microprocessors
-
J. Srinivasan, S. V. Adve, P. Bose, and J. A. Rivers, \The case for lifetime reliability-aware microprocessors," in Proceedings of the 31st annual international symposium on Computer architecture, ser. ISCA'04. Washington, DC, USA: IEEE Computer Society, 2004, pp. 276-. [Online]. Available: http://dl. acm. org/citation. cfm?id=998680. 1006725
-
(2004)
Proceedings of the 31st Annual International Symposium on Computer Architecture, Ser. ISCA'04. Washington, DC, USA: IEEE Computer Society
, pp. 276
-
-
Srinivasan, J.1
Adve, S.V.2
Bose, P.3
Rivers, J.A.4
-
35
-
-
84885193593
-
A message-logging protocol for multicore systems
-
Boston, USA, June
-
E. Meneses, X. Ni, and L. V. Kale, \A Message-Logging Protocol for Multicore Systems," in Proceedings of the 2nd Workshop on Fault-Tolerance for HPC at Extreme Scale (FTXS), Boston, USA, June 2012.
-
(2012)
Proceedings of the 2nd Workshop on Fault-Tolerance for HPC at Extreme Scale (FTXS)
-
-
Meneses, E.1
Ni, X.2
Kale, L.V.3
-
37
-
-
84871643381
-
Assessing Energy Efficiency of Fault Tolerance Protocols for HPC Systems
-
New York, USA, October
-
E. Meneses, O. Sarood, and L. V. Kale, \Assessing Energy Efficiency of Fault Tolerance Protocols for HPC Systems," in Proceedings of the 2012 IEEE 24th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD 2012), New York, USA, October 2012.
-
(2012)
Proceedings of the 2012 IEEE 24th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD 2012)
-
-
Meneses, E.1
Sarood, O.2
Kale, L.V.3
-
38
-
-
83155188951
-
Evaluating the viability of process replication reliability for exascale systems
-
New York, NY, USA: ACM 12
-
K. Ferreira, J. Stearley, J. H. Laros, III, R. Oldfield, K. Pedretti, R. Brightwell, R. Riesen, P. G. Bridges, and D. Arnold, \Evaluating the viability of process replication reliability for exascale systems," in Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis. New York, NY, USA: ACM, 2011, pp. 44:1-44:12. [Online]. Available: http://doi. acm. org/10. 1145/2063384. 2063443
-
(2011)
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
, vol.44
, pp. 1-44
-
-
Ferreira, K.1
Stearley, J.2
Laros III, J.H.3
Oldfield, R.4
Pedretti, K.5
Brightwell, R.6
Riesen, R.7
Bridges, P.G.8
Arnold, D.9
-
39
-
-
22944456833
-
Lifetime reliability: Toward an architectural solution
-
J. Srinivasan, S. Adve, P. Bose, and J. Rivers, \Lifetime reliability: toward an architectural solution," Micro, IEEE, vol. 25, no. 3, pp. 70-80, 2005.
-
(2005)
Micro, IEEE
, vol.25
, Issue.3
, pp. 70-80
-
-
Srinivasan, J.1
Adve, S.2
Bose, P.3
Rivers, J.4
|