Volume , Issue , 2010, Pages

Failure prediction for autonomic management of networked computer systems with availability assurance

Author keywords

Autonomic systems; Failure management; Networked computer systems; System dependability

Indexed keywords

AUTONOMIC MANAGEMENT; AUTONOMIC SYSTEMS; COMPONENT FAILURES; COMPUTATIONAL GRIDS; FAILURE BEHAVIORS; FAILURE CORRELATION; FAILURE DYNAMICS; FAILURE MANAGEMENT; FAILURE PREDICTION; NETWORKED COMPUTER SYSTEMS; OFFLINE; ONLINE PREDICTION; OPERATION COST; PRODUCTION ENVIRONMENTS; SELF MANAGEMENT; SYSTEM DEPENDABILITY; SYSTEM DESIGNERS

EID: 77954054232     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IPDPSW.2010.5470868     Document Type: Conference Paper
Times cited: 14

References (45)
  • 1
    • 84881083374
    • Weka: The University of Waikato. Machine learning software in Java. Available at: http://www.cs.waikato.ac.nz/ml/weka/.
    • Machine Learning Software in Java
  • 3
    • 40849089513
    • Model-based performance evaluation of distributed checkpointing protocols
    • A. Agbaria and R. Friedman. Model-based performance evaluation of distributed checkpointing protocols. Performance Evaluation, 65(5):345-365, 2008.
    • (2008) Performance Evaluation , vol.65 , Issue.5 , pp. 345-365
    • Agbaria, A.1    Friedman, R.2
  • 9
    • 0036504529
    • Matching and scheduling algorithms for minimizing execution time and failure probability of applications in heterogeneous computing
    • A. Dogan and F. Özgüner. Matching and scheduling algorithms for minimizing execution time and failure probability of applications in heterogeneous computing. IEEE Transactions on Parallel and Distributed Systems, 13(3):308-323, 2002.
    • (2002) IEEE Transactions on Parallel and Distributed Systems , vol.13 , Issue.3 , pp. 308-323
    • Dogan, A.1    Özgüner, F.2
  • 10
    • 0042078549
    • A survey of rollback-recovery protocols in message-passing systems
    • E. N. M. Elnozahy, L. Alvisi, Y.-M. Wang, and D. B. Johnson. A survey of rollback-recovery protocols in message-passing systems. ACM Computing Surveys, 34(3):375-408, 2002.
    • (2002) ACM Computing Surveys , vol.34 , Issue.3 , pp. 375-408
    • Elnozahy, E.N.M.1    Alvisi, L.2    Wang, Y.-M.3    Johnson, D.B.4
  • 13
    • 76849100508
    • Failure-aware resource management for high-availability computing clusters with distributed virtual machines
    • In Press
    • S. Fu. Failure-aware resource management for high-availability computing clusters with distributed virtual machines. Journal of Parallel and Distributed Computing, In Press, 2010.
    • (2010) Journal of Parallel and Distributed Computing
    • Fu, S.1
  • 14
  • 18
    • 33947184459
    • Analytical models for architecture-based software reliability prediction: A unification framework
    • S. S. Gokhale and K. S. Trivedi. Analytical models for architecture-based software reliability prediction: A unification framework. IEEE Transactions on Reliability, 55(4):578-590, 2006.
    • (2006) IEEE Transactions on Reliability , vol.55 , Issue.4 , pp. 578-590
    • Gokhale, S.S.1    Trivedi, K.S.2
  • 29
    • 1442309284
    • On the reliability of the IBM MVS/XA operating system
    • S. Mourad and D. Andrews. On the reliability of the IBM MVS/XA operating system. IEEE Transactions on Software Engineering, 13(10):1135-1139, 1987.
    • (1987) IEEE Transactions on Software Engineering , vol.13 , Issue.10 , pp. 1135-1139
    • Mourad, S.1    Andrews, D.2
  • 34
    • 20444463471
    • A dynamic and reliability-driven scheduling algorithm for parallel real-time jobs executing on heterogeneous clusters
    • X. Qin and H. Jiang. A dynamic and reliability-driven scheduling algorithm for parallel real-time jobs executing on heterogeneous clusters. Journal of Parallel and Distributed Computing, 65(8):885-900, 2005.
    • (2005) Journal of Parallel and Distributed Computing , vol.65 , Issue.8 , pp. 885-900
    • Qin, X.1    Jiang, H.2
  • 44
    • 67650672322
    • Beyond availability: Towards a deeper understanding of machine failure characteristics in large distributed systems
    • P. Yalagandula, S. Nath, H. Yu, P. B. Gibbons, and S. Seshan. Beyond availability: Towards a deeper understanding of machine failure characteristics in large distributed systems. In Proceedings of USENIX WORLDS, 2004.
    • (2004) Proceedings of USENIX WORLDS
    • Yalagandula, P.1    Nath, S.2    Yu, H.3    Gibbons, P.B.4    Seshan, S.5


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.