-
2
-
-
0029703899
-
A comparative analysis of event tupling schemes
-
M. Buckley and D. Siewiorek. A Comparative Analysis of Event Tupling Schemes. In Proc. of Annual Symp. on Fault-Tolerant Computing, pages 294-303, 1996.
-
(1996)
Proc. of Annual Symp. on Fault-Tolerant Computing
, pp. 294-303
-
-
Buckley, M.1
Siewiorek, D.2
-
3
-
-
33845584672
-
Collecting and analyzing failure data of Bluetooth personal area networks
-
Philadelphia, Pennsylvania, USA, June
-
M. Cinque, D. Cotroneo, and S. Russo. Collecting and analyzing failure data of Bluetooth personal area networks. In Proc. Intl. Conf. on Dependable Systems and Networks, Philadelphia, Pennsylvania, USA, June 2006.
-
(2006)
Proc. Intl. Conf. on Dependable Systems and Networks
-
-
Cinque, M.1
Cotroneo, D.2
Russo, S.3
-
4
-
-
47249124464
-
Quantifying temporal and spatial correlation of failure events for proactive management
-
Reliable Distributed Systems, 2007. SRDS 2007
-
S. Fu and C.-Z. Xu. Quantifying temporal and spatial correlation of failure events for proactive management. In 26th IEEE International Symposium on Reliable Distributed Systems (SRDS 2007), pages 175-184, 2007.
-
(2007)
26th IEEE International Symposium on Reliable Distributed Systems (SRDS 2007)
, pp. 175-184
-
-
Fu, S.1
Xu, C.-Z.2
-
5
-
-
80051945788
-
A census of Tandem system availability
-
J. Gray. A census of Tandem system availability. In IEEE Transactions on Reliability, pages 40-9, 1990.
-
(1990)
IEEE Transactions on Reliability
, pp. 40-9
-
-
Gray, J.1
-
8
-
-
0025416073
-
Automatic recognition of intermittent failures: An experimental study of field data
-
DOI 10.1109/12.54845
-
R. Iyer, L. Young, and P. Iyer. Automatic recognition of intermittent failures: An experimental study of field data. IEEE Transactions on Computers, 39:525-537, 1990. (Pubitemid 20702262)
-
(1990)
IEEE Transactions on Computers
, vol.39
, Issue.4
, pp. 525-537
-
-
Iyer, R.K.1
Young, L.T.2
Iyer, P.V.K.3
-
9
-
-
84976815079
-
Measurement and modeling of computer reliability as affected by system activity
-
August
-
R. K. Iyer, D. J. Rossetti, and M. C. Hsueh. Measurement and modeling of computer reliability as affected by system activity. ACM Trans. Comput. Syst., 4:214-237, August 1986.
-
(1986)
ACM Trans. Comput. Syst
, vol.4
, pp. 214-237
-
-
Iyer, R.K.1
Rossetti, D.J.2
Hsueh, M.C.3
-
10
-
-
0022906522
-
-
R. K. Iyer, L. T. Young, and V. Sridhar. Recognition of error symptoms in large systems. In Proceedings of 1986 ACM Fall joint computer conference, ACM '86, pages 797-806, Los Alamitos, CA, USA, 1986. IEEE Computer Society Press. (Pubitemid 17537011)
-
(1986)
Recognition of error symptoms in large systems
, pp. 797-806
-
-
Iyer, R.K.1
Young, L.T.2
Sridhar, V.3
-
13
-
-
33845589803
-
BlueGene/L failure analysis and prediction models
-
Philadelphia, Pennsylvania, USA
-
Y. Liang, Y. Zhang, A. Sivasubramaniam, M. Jette, and R. K. Sahoo. BlueGene/L failure analysis and prediction models. In Proc. Intl. Conf. on Dependable Systems and Networks, Philadelphia, Pennsylvania, USA, 2006.
-
(2006)
Proc. Intl. Conf. on Dependable Systems and Networks
-
-
Liang, Y.1
Zhang, Y.2
Sivasubramaniam, A.3
Jette, M.4
Sahoo, R.K.5
-
15
-
-
4544330153
-
-
MSR-TR-2000-56, Microsoft Research, Microsoft Corporation, Redmond, WA, June
-
B. Murphy and B. Levidow. Windows 2000 Dependability. MSR-TR-2000-56, Microsoft Research, Microsoft Corporation, Redmond, WA, June 2000.
-
(2000)
Windows 2000 Dependability
-
-
Murphy, B.1
Levidow, B.2
-
16
-
-
36049013419
-
What supercomputers say: A study of five system logs
-
DOI 10.1109/DSN.2007.103, 4273008, Proceedings - 37th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, DSN 2007
-
A. J. Oliner and J. Stearley. What supercomputers say: A study of five system logs. In Proc. Intl. Conf. on Dependable Systems and Networks, pages 575-584. IEEE Computer Society, 2007. (Pubitemid 350080462)
-
(2007)
Proceedings of the International Conference on Dependable Systems and Networks
, pp. 575-584
-
-
Oliner, A.1
Stearley, J.2
-
18
-
-
77952378080
-
Critical event prediction for proactive management in large-scale computer clusters
-
New York, NY, USA. ACM
-
R. K. Sahoo, A. J. Oliner, I. Rish, M. Gupta, J. E. Moreira, S. Ma, R. Vilalta, and A. Sivasubramaniam. Critical event prediction for proactive management in large-scale computer clusters. In Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining, KDD '03, pages 426-435, New York, NY, USA, 2003. ACM.
-
(2003)
Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '03
, pp. 426-435
-
-
Sahoo, R.K.1
Oliner, A.J.2
Rish, I.3
Gupta, M.4
Moreira, J.E.5
Ma, S.6
Vilalta, R.7
Sivasubramaniam, A.8
-
19
-
-
4544382099
-
Failure data analysis of a large-scale heterogeneous server environment
-
Florence, Italy
-
R. K. Sahoo, A. Sivasubramaniam, M. S. Squillante, and Y. Zhang. Failure Data Analysis of a Large-Scale Heterogeneous Server Environment. In Proc. Intl. Conf. on Dependable Systems and Networks, Florence, Italy, 2004.
-
(2004)
Proc. Intl. Conf. on Dependable Systems and Networks
-
-
Sahoo, R.K.1
Sivasubramaniam, A.2
Squillante, M.S.3
Zhang, Y.4
-
21
-
-
33847328785
-
Availability assessment of SunOS/Solaris Unix systems based on syslogd and wtmpx log files: A case study
-
IEEE Computer Society
-
C. Simache and M. Kaâniche. Availability assessment of SunOS/Solaris Unix systems based on syslogd and wtmpx log files: A case study. In PRDC, pages 49-56. IEEE Computer Society, 2005.
-
(2005)
PRDC
, pp. 49-56
-
-
Simache, C.1
Kaâniche, M.2
-
22
-
-
0032257078
-
MEADEP: A dependability evaluation tool for engineers
-
D. Tang, M. Hecht, J. Miller, and J. Handal. MEADEP: A dependability evaluation tool for engineers. IEEE Transactions on Reliability, 47(4):443-450, 1998.
-
(1998)
IEEE Transactions on Reliability
, vol.47
, Issue.4
, pp. 443-450
-
-
Tang, D.1
Hecht, M.2
Miller, J.3
Handal, J.4
-
24
-
-
0030379933
-
Analyze-NOW-An environment for collection and analysis of failures in a network of workstations
-
A. Thakur and R. K. Iyer. Analyze-NOW-An Environment for Collection and Analysis of Failures in a Network of Workstations. IEEE Transactions on Reliability, 45(4):560-570, 1996.
-
(1996)
IEEE Transactions on Reliability
, vol.45
, Issue.4
, pp. 560-570
-
-
Thakur, A.1
Iyer, R.K.2
-
25
-
-
77956513188
-
Detecting large-scale system problems by mining console logs
-
W. Xu, L. Huang, A. Fox, D. Patterson, and M. Jordan. Detecting Large-Scale System Problems by Mining Console Logs. In Proceedings of the ACM SIGOPS 22nd symposium on Operating Systems Principles (SOSP), 2009.
-
(2009)
Proceedings of the ACM SIGOPS 22nd Symposium on Operating Systems Principles (SOSP)
-
-
Xu, W.1
Huang, L.2
Fox, A.3
Patterson, D.4
Jordan, M.5