메뉴 건너뛰기




Volumn , Issue , 2007, Pages

Performance under failures of high-end computing

Author keywords

Application performance; Failure modeling; Fault tolerance

Indexed keywords

APPLICATION PERFORMANCE; CHECKPOINTING; COMPLETION TIMES; COMPUTING PLATFORMS; EFFECTIVE PERFORMANCES; END USERS; EXTENSIVE SIMULATIONS; FAILURE MODELING; FAILURE RATES; FAILURE REPAIRS; FAULT-TOLERANT; MASK FAULTS; PARALLEL TASKS; PETAFLOP MACHINES; PREDICTION MODELS; PRODUCTION COSTS.; SCHEDULING STRATEGIES; SEQUENTIAL EXECUTIONS; SYSTEM FAILURES;

EID: 56749158844     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1362622.1362687     Document Type: Conference Paper
Times cited : (33)

References (19)
  • 2
    • 0003257220 scopus 로고    scopus 로고
    • Host load prediction using linear models
    • Dinda, P., and O'Hallaron, D. "Host load prediction using linear models," Cluster Computing, Vol 3, pp. 265-280, 2000.
    • (2000) Cluster Computing , vol.3 , pp. 265-280
    • Dinda, P.1    O'Hallaron, D.2
  • 3
    • 1542383568 scopus 로고    scopus 로고
    • Reliable matching and scheduling of precedence-constrained tasks in heterogeneous distributed computing
    • Toronto, Canada, Aug
    • Dogan, A., and Ozguner, F. "Reliable matching and scheduling of precedence-constrained tasks in heterogeneous distributed computing," In Proc. of the 29th International Conference on Parallel Processing, pp. 307-314, Toronto, Canada, Aug., 2000.
    • (2000) Proc. of the 29th International Conference on Parallel Processing , pp. 307-314
    • Dogan, A.1    Ozguner, F.2
  • 4
    • 0020765766 scopus 로고
    • The Effects of Checkpointing on Program Execution Time
    • June
    • Duda, A. "The Effects of Checkpointing on Program Execution Time," Information Processing Letters, vol. 16, pp. 221-229, June 1983.
    • (1983) Information Processing Letters , vol.16 , pp. 221-229
    • Duda, A.1
  • 5
    • 0030147013 scopus 로고    scopus 로고
    • Garg, S., Huang, Y., Kintala, C., and Trivedi, K. S., Minimizing Completion Time of a Program by Checkpointing and Rejuvenation, In Proc. of 1996 ACM SIGMETRICS Conference, pp. 252-261, Philadelphia, PA, May 1996.
    • Garg, S., Huang, Y., Kintala, C., and Trivedi, K. S., "Minimizing Completion Time of a Program by Checkpointing and Rejuvenation," In Proc. of 1996 ACM SIGMETRICS Conference, pp. 252-261, Philadelphia, PA, May 1996.
  • 6
    • 0036709549 scopus 로고    scopus 로고
    • Performance Modeling and Prediction of Non-Dedicated Network Computing
    • Sep
    • Gong, L., Sun, X-H., and Waston, E. "Performance Modeling and Prediction of Non-Dedicated Network Computing," IEEE Trans. on Computers, Vol 51, No 9, pp. 1041-1055, Sep., 2002.
    • (2002) IEEE Trans. on Computers , vol.51 , Issue.9 , pp. 1041-1055
    • Gong, L.1    Sun, X.-H.2    Waston, E.3
  • 8
    • 55849086811 scopus 로고    scopus 로고
    • Los Alamos National Laboratory, Science Research
    • Los Alamos National Laboratory, Operational Data to Support and Enable Computer Science Research, http://institute.lanl.gov/data/lanldata.shtml
    • Operational Data to Support and Enable Computer
  • 9
    • 36949009638 scopus 로고    scopus 로고
    • Scalable Diskless Checkpointing for Large Parallel Systems,
    • Ph.D dissertation, Department of Computer Science, University of Illinois at Urbana-Champaign
    • Lu, Charng-da "Scalable Diskless Checkpointing for Large Parallel Systems," Ph.D dissertation, Department of Computer Science, University of Illinois at Urbana-Champaign, 2005.
    • (2005)
    • Lu, C.-D.1
  • 11
    • 0023313354 scopus 로고
    • Queueing Analysis of Fault-Tolerant Computer Systems
    • Nicola, V. F., Kulkarni, V. G., and Trivedi, K. S. "Queueing Analysis of Fault-Tolerant Computer Systems," IEEE Trans. Software Engineering, Vol. SE-13, No. 3, pp. 363-375, 1987.
    • (1987) IEEE Trans. Software Engineering , vol.SE-13 , Issue.3 , pp. 363-375
    • Nicola, V.F.1    Kulkarni, V.G.2    Trivedi, K.S.3
  • 16
    • 0032683084 scopus 로고    scopus 로고
    • Srinivasan, S.., and Jha, N.K. Safety and Reliability Driven Task Allocation in Distributed Systems, IEEE Trans. Parallel and Distributed Systems, 10, No 3, pp. 238-251, 1999.
    • Srinivasan, S.., and Jha, N.K. "Safety and Reliability Driven Task Allocation in Distributed Systems," IEEE Trans. Parallel and Distributed Systems, Vol 10, No 3, pp. 238-251, 1999.
  • 17
    • 85130634439 scopus 로고    scopus 로고
    • Dynamically forecasting network performance using the network weather service
    • Wolski, R. "Dynamically forecasting network performance using the network weather service," Cluster Computing, Vol 1, pp. 119-132, 1998.
    • (1998) Cluster Computing , vol.1 , pp. 119-132
    • Wolski, R.1
  • 18
    • 33748087133 scopus 로고    scopus 로고
    • Grid Harvest Service: A Performance System of Grid Computing
    • Wu, M., and Sun, X.-H. "Grid Harvest Service: A Performance System of Grid Computing," Journal of Parallel and Distributed Computing, Vol. 66, No. 10, pp. 1322-1337, 2006.
    • (2006) Journal of Parallel and Distributed Computing , vol.66 , Issue.10 , pp. 1322-1337
    • Wu, M.1    Sun, X.-H.2
  • 19
    • 84976846528 scopus 로고
    • A First Order Approximation to the Optimal Checkpoint Interval
    • Young, J. W. "A First Order Approximation to the Optimal Checkpoint Interval." Comm. ACM, Vol. 17, No 9, pp. 530-531, 1974.
    • (1974) Comm. ACM , vol.17 , Issue.9 , pp. 530-531
    • Young, J.W.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.