메뉴 건너뛰기




Volumn , Issue , 2008, Pages

Enhancing application robustness through adaptive fault tolerance

Author keywords

[No Author keywords available]

Indexed keywords

FAULT RESILIENCE; PARALLEL AND DISTRIBUTED PROCESSING; PARALLEL APPLICATIONS;

EID: 51049095700     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IPDPS.2008.4536383     Document Type: Conference Paper
Times cited : (5)

References (26)
  • 3
    • 33845593340 scopus 로고    scopus 로고
    • A large scale study of failures in high performance-computing systems
    • B. Schroeder and G. Gibson, "A large scale study of failures in high performance-computing systems," in Proc. of DSN '06, 2006.
    • (2006) Proc. of DSN '06
    • Schroeder, B.1    Gibson, G.2
  • 4
    • 0042078549 scopus 로고    scopus 로고
    • A survey of rollback recovery protocols in message-passing systems
    • E. Elnozahy, L. Alvisi, Y. Wang, and D. Johnson, "A survey of rollback recovery protocols in message-passing systems," ACM Computing Surveys, vol. 34(3), 2002.
    • (2002) ACM Computing Surveys , vol.34 , Issue.3
    • Elnozahy, E.1    Alvisi, L.2    Wang, Y.3    Johnson, D.4
  • 5
    • 9144223280 scopus 로고    scopus 로고
    • Checkpointing for peta-scale systems: A look into the future of practical rollback-recovery
    • E. Elnozahy and J. Plank, "Checkpointing for peta-scale systems: A look into the future of practical rollback-recovery," IEEE Transactions on Dependable and Secure Computing, vol. 1(2), 2004.
    • (2004) IEEE Transactions on Dependable and Secure Computing , vol.1 , Issue.2
    • Elnozahy, E.1    Plank, J.2
  • 12
    • 57049111494 scopus 로고    scopus 로고
    • Adaptive fault management of parallel applications for high performance computing
    • to appear in
    • Z. Lan and Y. Li, "Adaptive fault management of parallel applications for high performance computing", to appear in IEEE Trans. on Computers, 2008.
    • (2008) IEEE Trans. on Computers
    • Lan, Z.1    Li, Y.2
  • 14
    • 51049121489 scopus 로고    scopus 로고
    • A fast recovery mechanism for checkpointing in networked environments
    • Illinois Institute of Technology
    • Y. Li and Z. Lan, "A fast recovery mechanism for checkpointing in networked environments", SCS Tech Report, Illinois Institute of Technology, 2007.
    • (2007) SCS Tech Report
    • Li, Y.1    Lan, Z.2
  • 15
    • 77952378080 scopus 로고    scopus 로고
    • Critical event prediction for proactive management in large-scale computer clusters
    • R. Sahoo, A. Oliner, I. Rish, M. Gupta, J. Moreira, and S. Ma, "Critical event prediction for proactive management in large-scale computer clusters," in Proc. of SIGKDD'03, 2003.
    • (2003) Proc. of SIGKDD'03
    • Sahoo, R.1    Oliner, A.2    Rish, I.3    Gupta, M.4    Moreira, J.5    Ma, S.6
  • 19
    • 33748611921 scopus 로고    scopus 로고
    • Ensemble based systems in decision making
    • R. Polikar, "Ensemble based systems in decision making", IEEE Circuits and Systems Magazine, vol. 6(3), 2006.
    • (2006) IEEE Circuits and Systems Magazine , vol.6 , Issue.3
    • Polikar, R.1
  • 20
    • 0034133513 scopus 로고    scopus 로고
    • Distance-based outliers: Algorithms and applications
    • Edwin M. Knorr, Raymond T. Ng, Vladimir Tucakov, "Distance-based outliers: algorithms and applications", The VLDB Journal,(2000) 8: 237-253.
    • (2000) The VLDB Journal , vol.8 , pp. 237-253
    • Knorr, E.M.1    Ng, R.T.2    Tucakov, V.3
  • 22
    • 33751107476 scopus 로고    scopus 로고
    • Mpi-mitten: Enabling migration technology in mpi
    • C. Du and X. Sun, "Mpi-mitten: Enabling migration technology in mpi," in Proc. of CCGrid'06, 2006.
    • (2006) Proc. of CCGrid'06
    • Du, C.1    Sun, X.2
  • 24
    • 0012283032 scopus 로고    scopus 로고
    • Achieving extreme resolution in numerical cosmology using adaptive mesh refinement: Resolving primordial star formulation
    • G. Bryan, T. Abel, and M. Norman, "Achieving extreme resolution in numerical cosmology using adaptive mesh refinement: Resolving primordial star formulation," in Proc. of SC'01, 2001.
    • (2001) Proc. of SC'01
    • Bryan, G.1    Abel, T.2    Norman, M.3
  • 25
    • 0029633168 scopus 로고
    • Gromacs: A message-passing parallel molecular dynamics implementation
    • H. Berendsen, D. V. der Spoel, and R. van Drunen, "Gromacs: A message-passing parallel molecular dynamics implementation," Comp. Phys. Comm., vol. 91:43-56, 1995.
    • (1995) Comp. Phys. Comm , vol.91 , pp. 43-56
    • Berendsen, H.1    der Spoel, D.V.2    van Drunen, R.3
  • 26
    • 84882885699 scopus 로고    scopus 로고
    • Online, Available
    • Nasa nas parallel benchmarks. [Online]. Available: http://www.nas.nasa. gov/Resources/Software/npb.html
    • Nasa nas parallel benchmarks


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.