



Volume 27, Issue 3, 2013, Pages 273-282

Failure prediction for HPC systems and applications: Current situation and open issues

Author keywords

failure prediction; fault tolerance; signal analysis

Indexed keywords

CLASSICAL APPROACH; CURRENT LIMITATION; CURRENT SITUATION; DIFFERENT DISTRIBUTIONS; FAILURE PREDICTION; HIGH PERFORMANCE COMPUTING SYSTEMS; PREDICTION SYSTEMS; PROACTIVE MEASURES;

EID: 84881050143     PISSN: 10943420     EISSN: 17412846     Source Type: Journal    
DOI: 10.1177/1094342013488258     Document Type: Conference Paper
Times cited : (30)

References (35)
  • 15
    • 84858781341
    • Cosmic rays don't strike twice: Understanding the nature of DRAM errors and the implications for system design
    • Hwang AA, Stefanovici IA, Schroeder B. Cosmic rays don't strike twice: understanding the nature of DRAM errors and the implications for system design. SIGARCH Computer Architecture News. 2012; 40(1): 111-122
    • (2012) SIGARCH Computer Architecture News , vol.40 , Issue.1 , pp. 111-122
    • Hwang, A.A.1    Stefanovici, I.A.2    Schroeder, B.3
  • 18
    • 77958132122
    • Mining dependency in distributed systems through unstructured logs analysis
    • Lou J-G, Fu Q, Wang Y, Li J. Mining dependency in distributed systems through unstructured logs analysis. ACM SIGOPS Operating Systems Review. 2010; 44(1): 91-96
    • (2010) ACM SIGOPS Operating Systems Review , vol.44 , Issue.1 , pp. 91-96
    • Lou, J.-G.1    Fu, Q.2    Wang, Y.3    Li, J.4
  • 23
    • 77950267881
    • A survey of online failure prediction methods
    • Salfner F, Lenk M, Malek M. A survey of online failure prediction methods. Computing Surveys. 2010; 42: 1-42
    • (2010) Computing Surveys , vol.42 , pp. 1-42
    • Salfner, F.1    Lenk, M.2    Malek, M.3
  • 24
    • 47249121233
    • Using hidden semi-Markov models for effective online failure prediction
    • Salfner F, Malek M. Using hidden semi-Markov models for effective online failure prediction. Symposium on Reliable Distributed Systems. 2007: 161-174
    • (2007) Symposium on Reliable Distributed Systems , pp. 161-174
    • Salfner, F.1    Malek, M.2
  • 26
    • 84881064744
    • Exascale research: Preparing for the post Moore era
    • Snir M, Gropp W, Kogge P (2011) Exascale research: preparing for the post Moore era. Computer Science Whitepapers. Available at: https://www.ideals.illinois.edu/bitstream/handle/2142/25469/Exascale%20Research.pdf?sequence=2
    • (2011) Computer Science Whitepapers
    • Snir, M.1    Gropp, W.2    Kogge, P.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.