-
1
-
-
85164992771
-
-
Available at
-
sysstat. Available at: http://pagesperso-orange.fr/sebastien.godard/.
-
Sysstat
-
-
-
3
-
-
68049121093
-
Anomaly detection: A survey
-
V. Chandola, A. Banerjee, and V. Kumar. Anomaly detection: A survey. ACM Computing Surveys, 41(3):1-58, 2009.
-
(2009)
ACM Computing Surveys
, vol.41
, Issue.3
, pp. 1-58
-
-
Chandola, V.1
Banerjee, A.2
Kumar, V.3
-
5
-
-
77954752832
-
Correlating instrumentation data to system states: A building block for automated diagnosis and control
-
I. Cohen, M. Goldszmidt, T. Kelly, J. Symons, and J. S. Chase. Correlating instrumentation data to system states: a building block for automated diagnosis and control. In Proceedings of USENIX Symposium on Opearting Systems Design and Implementation (OSDI), 2004.
-
Proceedings of USENIX Symposium on Opearting Systems Design and Implementation (OSDI), 2004
-
-
Cohen, I.1
Goldszmidt, M.2
Kelly, T.3
Symons, J.4
Chase, J.S.5
-
7
-
-
0031276011
-
Bayesian Network Classifiers
-
N. Friedman, D. Geiger, and M. Goldszmidt. Bayesian network classifiers. Machine Learning, 29(2-3):131-163, 1997. (Pubitemid 127510036)
-
(1997)
Machine Learning
, vol.29
, Issue.2-3
, pp. 131-163
-
-
Friedman, N.1
Geiger, D.2
Goldszmidt, M.3
-
10
-
-
76849100508
-
Failure-aware resource management for high-availability computing clusters with distributed virtual machines
-
S. Fu. Failure-aware resource management for high-availability computing clusters with distributed virtual machines. Journal of Parallel and Distributed Computing, 70(4):384-393, 2010.
-
(2010)
Journal of Parallel and Distributed Computing
, vol.70
, Issue.4
, pp. 384-393
-
-
Fu, S.1
-
14
-
-
77956227790
-
Quantifying event correlations for proactive failure management in networked computing systems
-
doi:10.1016/j.jpdc.2010.06.010.
-
S. Fu and C.-Z. Xu. Quantifying event correlations for proactive failure management in networked computing systems. Journal of Parallel and Distributed Computing, 2010. doi:10.1016/j.jpdc.2010.06.010.
-
(2010)
Journal of Parallel and Distributed Computing
-
-
Fu, S.1
Xu, C.-Z.2
-
15
-
-
0037236308
-
The dawning of the autonomic computing era
-
A. G. Ganek and T. A. Corbi. The dawning of the autonomic computing era. IBM Systems Journal, 42(1):5-18, 2003.
-
(2003)
IBM Systems Journal
, vol.42
, Issue.1
, pp. 5-18
-
-
Ganek, A.G.1
Corbi, T.A.2
-
18
-
-
0037253062
-
The vision of autonomic computing
-
J. O. Kephart and D. M. Chess. The vision of autonomic computing. IEEE Computer, 36(1):41-50, 2003.
-
(2003)
IEEE Computer
, vol.36
, Issue.1
, pp. 41-50
-
-
Kephart, J.O.1
Chess, D.M.2
-
19
-
-
33845589803
-
BlueGene/L failure analysis and prediction models
-
Y. Liang, Y. Zhang, A. Sivasubramaniam, M. Jette, and R. K. Sahoo. BlueGene/L failure analysis and prediction models. In Proceedings of IEEE International Conference on Dependable Systems and Networks (DSN), 2006.
-
Proceedings of IEEE International Conference on Dependable Systems and Networks (DSN), 2006
-
-
Liang, Y.1
Zhang, Y.2
Sivasubramaniam, A.3
Jette, M.4
Sahoo, R.K.5
-
20
-
-
27544497222
-
Filtering failure logs for a BlueGene/L prototype
-
Y. Liang, Y. Zhang, A. Sivasubramaniam, R. Sahoo, J. Moreira, and M. Gupta. Filtering failure logs for a BlueGene/L prototype. In Proceedings of IEEE International Conference on Dependable Systems and Networks (DSN), 2005.
-
Proceedings of IEEE International Conference on Dependable Systems and Networks (DSN), 2005
-
-
Liang, Y.1
Zhang, Y.2
Sivasubramaniam, A.3
Sahoo, R.4
Moreira, J.5
Gupta, M.6
-
30
-
-
50649093917
-
Triage: Diagnosing production run failures at the user's site
-
J. Tucek, S. Lu, C. Huang, S. Xanthos, and Y. Zhou. Triage: diagnosing production run failures at the user's site. In Proceedings of ACM Symposium on Operating Systems Principles (SOSP), 2007.
-
Proceedings of ACM Symposium on Operating Systems Principles (SOSP), 2007
-
-
Tucek, J.1
Lu, S.2
Huang, C.3
Xanthos, S.4
Zhou, Y.5
-
32
-
-
67650672322
-
Beyond availability: Towards a deeper understanding of machine failure characteristics in large distributed systems
-
P. Yalagandula, S. Nath, H. Yu, P. B. Gibbons, and S. Sesha. Beyond availability: Towards a deeper understanding of machine failure characteristics in large distributed systems. In Proceedings of USENIX Work- shop on Real, Large Distributed Systems (WORLDS), 2004.
-
Proceedings of USENIX Work- Shop on Real, Large Distributed Systems (WORLDS), 2004
-
-
Yalagandula, P.1
Nath, S.2
Yu, H.3
Gibbons, P.B.4
Sesha, S.5
|