메뉴 건너뛰기




Volumn , Issue , 2009, Pages 173-180

Blue gene/L log analysis and time to interrupt estimation

Author keywords

[No Author keywords available]

Indexed keywords

APPLICATION-LEVEL FAILURES; BLUE GENE/L SUPERCOMPUTER; COMPUTATIONAL ARCHITECTURE; FAILURE BEHAVIORS; FAILURE PREDICTION; LOG ANALYSIS; LOG FILE; LOG INFORMATION; PERFORMANCE MODELING; POWER AWARENESS; RELIABILITY MODELING; TEMPORAL FILTERING;

EID: 70349657128     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ARES.2009.105     Document Type: Conference Paper
Times cited : (18)

References (14)
  • 3
    • 27544497222 scopus 로고    scopus 로고
    • Y. Liang, Y. Zhang, A. Sivasubramaniam, R. Sahoo, J. Moreira, M. Gupta, Filtering failure logs for a Bluegene/L prototype, Dependable Systems and Networks, 2005. DSN 2005. Proceedings. International Conference on 28 June-1 July 2005 pp. 476 - 485.
    • Y. Liang, Y. Zhang, A. Sivasubramaniam, R. Sahoo, J. Moreira, M. Gupta, "Filtering failure logs for a Bluegene/L prototype", Dependable Systems and Networks, 2005. DSN 2005. Proceedings. International Conference on 28 June-1 July 2005 pp. 476 - 485.
  • 6
    • 70349693443 scopus 로고    scopus 로고
    • at Lawrence Livermore National Laboratory
    • Secure Computing Facility, High Performance Computing at Lawrence Livermore National Laboratory, https://computing.llnl.gov/? set= resources&page=SCF-resources#bluegenel
    • Secure Computing Facility, High Performance Computing
  • 7
    • 33845593340 scopus 로고    scopus 로고
    • Schroeder and G. Gibson, A large-scale study of failures in highperformance- computing systems, in Proceedings of the 2006 International Conference on Dependable Systems and Networks, June 2006.
    • Schroeder and G. Gibson, "A large-scale study of failures in highperformance- computing systems," in Proceedings of the 2006 International Conference on Dependable Systems and Networks, June 2006.
  • 10
    • 36049028957 scopus 로고    scopus 로고
    • Defining and measuring supercomputer Reliability, Availability, and Serviceability (RAS)
    • J. Stearley. Defining and measuring supercomputer Reliability, Availability, and Serviceability (RAS). In Proceedings of the Linux Clusters Institute Conference, 2005. See http://www.cs.sandia.gov/̃jrstear/ras.
    • Proceedings of the Linux Clusters Institute Conference, 2005
    • Stearley, J.1
  • 14
    • 70349681940 scopus 로고    scopus 로고
    • http://www.latech.edu/~nta008/patterns.tgz


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.