-
1
-
-
85081450426
-
-
The UC Berkeley/Stanford Recovery-Oriented Computing (ROC) Project. http://roc.cs.berkeley.edu/.
-
-
-
-
-
2
-
-
85081446154
-
-
TOP500 List 11/2004. http://www.top500.org/lists/2005/11/basic.
-
-
-
-
-
5
-
-
4043157227
-
Reliability, Availability, and Serviceability (RAS) of the IBM eServer z990
-
M. L. Fair, C. R. Conklin, S. B. Swaney, P. J. Meaney, W. J. Clarke, L. C. Alves, I. N. Modi, F. Freier, W. Fischer, and N. E. Weber. Reliability, Availability, and Serviceability (RAS) of the IBM eServer z990. IBM Journal of Research and Development, 48(3/4), 2004.
-
(2004)
IBM Journal of Research and Development
, vol.48
, Issue.3-4
-
-
Fair, M.L.1
Conklin, C.R.2
Swaney, S.B.3
Meaney, P.J.4
Clarke, W.J.5
Alves, L.C.6
Modi, I.N.7
Freier, F.8
Fischer, W.9
Weber, N.E.10
-
8
-
-
34548804807
-
-
IBM
-
IBM. Autonomic computing initiative, 2002. http://www.research.ibm.com/autonomic/index_nf.html.
-
(2002)
Autonomic computing initiative
-
-
-
11
-
-
33845589803
-
-
Y. Liang, Y. Zhang, M. Jette, A. Sivasubramaniam, and R. Sahoo. BlueGene/L failure analysis and prediction models. In Proceedings of the International Conference on Dependable Systems and Networks (DSN), 2006. To Appear.
-
-
-
-
-
12
-
-
27544497222
-
Filtering failure logs for a BlueGene/L prototype
-
Y. Liang, Y. Zhang, A. Sivasubramaniam, R. Sahoo, J. Moreira, and M. Gupta. Filtering failure logs for a BlueGene/L prototype. In Proceedings of the International Conference on Dependable Systems and Networks (DSN), 2005.
-
(2005)
Proceedings of the International Conference on Dependable Systems and Networks (DSN)
-
-
Liang, Y.1
Zhang, Y.2
Sivasubramaniam, A.3
Sahoo, R.4
Moreira, J.5
Gupta, M.6
-
13
-
-
0025502686
-
Error log analysis: Statistical modelling and heuristic trend analysis
-
October
-
T. Y. Lin and D. P. Siewiorek. Error log analysis: Statistical modelling and heuristic trend analysis. IEEE Trans. on Reliability, 39(4):419-432, October 1990.
-
(1990)
IEEE Trans. on Reliability
, vol.39
, Issue.4
, pp. 419-432
-
-
Lin, T.Y.1
Siewiorek, D.P.2
-
14
-
-
84944403418
-
A Systematic Methodology to Compute the Architectural Vulnerability Factors for a High-Performance Microprocessor
-
S. S. Mukherjee, C. Weaver, J. Emer, S. K. Reinhardt, and T. Austin. A Systematic Methodology to Compute the Architectural Vulnerability Factors for a High-Performance Microprocessor. In Proceedings of the International Symposium on Microarchitecture (MICRO), pages 29-40, 2003.
-
(2003)
Proceedings of the International Symposium on Microarchitecture (MICRO)
, pp. 29-40
-
-
Mukherjee, S.S.1
Weaver, C.2
Emer, J.3
Reinhardt, S.K.4
Austin, T.5
-
17
-
-
4544382099
-
Failure Data Analysis of a Large-Scale Heterogeneous Server Environment
-
R. Sahoo, A. Sivasubramaniam, M. Squillante, and Y. Zhang. Failure Data Analysis of a Large-Scale Heterogeneous Server Environment. In Proceedings of the 2004 International Conference on Dependable Systems and Networks, pages 389-398, 2004.
-
(2004)
Proceedings of the 2004 International Conference on Dependable Systems and Networks
, pp. 389-398
-
-
Sahoo, R.1
Sivasubramaniam, A.2
Squillante, M.3
Zhang, Y.4
-
18
-
-
77952378080
-
Critical Event Prediction for Proactive Management in Large-scale Computer Clusters
-
August
-
R. K. Sahoo, A. J. Oliner, I. Rish, M. Gupta, J. E. Moreira, S. Ma, R. Vilalta, and A. Sivasubramaniam. Critical Event Prediction for Proactive Management in Large-scale Computer Clusters. In Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, August 2003.
-
(2003)
Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
-
-
Sahoo, R.K.1
Oliner, A.J.2
Rish, I.3
Gupta, M.4
Moreira, J.E.5
Ma, S.6
Vilalta, R.7
Sivasubramaniam, A.8
-
20
-
-
0026869241
-
Analysis and modeling of correlated failures in multicomputer systems
-
D. Tang and R. K. Iyer. Analysis and modeling of correlated failures in multicomputer systems. IEEE Transactions on Computers, 41(5):567-577, 1992.
-
(1992)
IEEE Transactions on Computers
, vol.41
, Issue.5
, pp. 567-577
-
-
Tang, D.1
Iyer, R.K.2
-
22
-
-
4243934975
-
-
PhD thesis, Dept. of Computer Science, Carnegie-Mellon University
-
M. M. Tsao. Trend Analysis and Fault Prediction. PhD thesis, Dept. of Computer Science, Carnegie-Mellon University, 1983.
-
(1983)
Trend Analysis and Fault Prediction
-
-
Tsao, M.M.1
-
23
-
-
0034832697
-
Analysis and Implementation of Software Rejuvenation in Cluster Systems
-
June
-
K. Vaidyanathan, R. E. Harper, S. W. Hunter, and K. S. Trivedi. Analysis and Implementation of Software Rejuvenation in Cluster Systems. In Proceedings of the ACM SIGMETRICS 2001 Conference on Measurement and Modeling of Computer Systems, pages 62-71, June 2001.
-
(2001)
Proceedings of the ACM SIGMETRICS 2001 Conference on Measurement and Modeling of Computer Systems
, pp. 62-71
-
-
Vaidyanathan, K.1
Harper, R.E.2
Hunter, S.W.3
Trivedi, K.S.4
-
24
-
-
4544360243
-
Networked Windows NT System Field Failure Data Analysis
-
9808, University of Illinois at Urbana-Champaign
-
J. Xu, Z. Kalbarczyk, and R. K. Iyer. Networked Windows NT System Field Failure Data Analysis. Technical Report CRHC 9808, University of Illinois at Urbana-Champaign, 1999.
-
(1999)
Technical Report CRHC
-
-
Xu, J.1
Kalbarczyk, Z.2
Iyer, R.K.3