



Volume , Issue , 2011, Pages

Modeling and tolerating heterogeneous failures in large parallel systems

Author keywords

[No Author keywords available]

Indexed keywords

APPLICATION-CENTRIC; CHECK POINTING; COMPONENT FAILURES; FAILURE MODEL; FAILURE RATE; FAULT-TOLERANT ALGORITHMS; GENERAL MODEL; HARDWARE COMPONENTS; HARDWARE FAILURES; HIGH PERFORMANCE COMPUTING SYSTEMS; OR-NETWORKS; PARALLEL SYSTEM; SPACE AND TIME; SPECIFIC COMPONENT; SUPERCOMPUTING APPLICATIONS; SYSTEM FAILURES;

EID: 83155160934     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2063384.2063444     Document Type: Conference Paper
Times cited: 72

References (27)
  • 1
    • 83155186191
    • Personal communication, May
    • William D. Gropp. Personal communication, May 2010.
    • (2010)
    • Gropp, W.D.1
  • 5
    • 36049041275
    • Understanding disk failure rates: What does an MTTF of 1,000,000 hours mean to you?
    • Oct
    • Bianca Schroeder and Garth Gibson. Understanding disk failure rates: What does an MTTF of 1,000,000 hours mean to you? Transactions on Storage (TOS), 3(3), Oct 2007.
    • (2007) Transactions on Storage (TOS) , vol.3 , Issue.3
    • Schroeder, B.1    Gibson, G.2
  • 6
    • 33845593340
    • A large-scale study of failures in high-performance computing systems
    • DOI 10.1109/DSN.2006.5, 1633514, Proceedings - DSN 2006: 2006 International Conference on Dependable Systems and Networks
    • Bianca Schroeder and Garth A. Gibson. A large-scale study of failures in high-performance computing systems. In Proceedings of the International Conference on Dependable Systems and Networks, pages 249-258, Washington, DC, USA, 2006. IEEE Computer Society. (Pubitemid 44930426)
    • (2006) Proceedings of the International Conference on Dependable Systems and Networks , vol.2006 , pp. 249-258
    • Schroeder, B.1    Gibson, G.A.2
  • 8
    • 38049182471
    • How are real grids used? The analysis of four grid traces and its implications
    • A. Iosup, C. Dumitrescu, D. H. J. Epema, H. Li, and L. Wolters. How are real grids used? The analysis of four grid traces and its implications. In GRID, pages 262-269, 2006.
    • (2006) GRID , pp. 262-269
    • Iosup, A.1    Dumitrescu, C.2    Epema, D.H.J.3    Li, H.4    Wolters, L.5
  • 9
    • 38049172300
    • Catalog of BOINC projects. http://www.boinc-wiki.info/Catalog-of-BOINC-Powered-Projects.
    • Catalog of BOINC Projects
  • 11
    • 84900592671
    • Einstein@Home. http://einstein.phys.uwm.edu.
    • Einstein@Home
  • 13
    • 84976846528
    • A first order approximation to the optimum checkpoint interval
    • September
    • John W. Young. A first order approximation to the optimum checkpoint interval. Commun. ACM, 17:530-531, September 1974.
    • (1974) Commun. ACM , vol.17 , pp. 530-531
    • Young, J.W.1
  • 16
    • 20444471122
    • Towards informatic analysis of syslogs
    • 2004 IEEE International Conference on Cluster Computing, ICCC 2004
    • J. Stearley. Towards informatic analysis of syslogs. In Proceedings of the 2004 IEEE International Conference on Cluster Computing, pages 309-318, Washington, DC, USA, 2004. IEEE Computer Society. (Pubitemid 40822381)
    • (2004) Proceedings - IEEE International Conference on Cluster Computing, ICCC , pp. 309-318
    • Stearley, J.1
  • 20
    • 33244467640
    • Is remote host availability governed by a universal law?
    • John R. Douceur. Is remote host availability governed by a universal law? SIGMETRICS Performance Evaluation Review, 31(3):25-29, 2003.
    • (2003) SIGMETRICS Performance Evaluation Review , vol.31 , Issue.3 , pp. 25-29
    • Douceur, J.R.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.