메뉴 건너뛰기




Volumn , Issue , 2011, Pages 107-116

Failure prediction and localization in large scientific workflows

Author keywords

Failure prediction; Fault localization; Scientific workflows

Indexed keywords

APPLICATION PERFORMANCE; EXECUTION MANAGEMENT; FAILURE PREDICTION; FAULT LOCALIZATION; HARDWARE FAULTS; PARALLEL EXECUTIONS; REAL TIME EXECUTION; REPRESENTATIVE SAMPLE; SCIENTIFIC APPLICATIONS; SCIENTIFIC WORKFLOWS; STOCHASTIC ERRORS; WORK-FLOWS;

EID: 84857940460     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2110497.2110510     Document Type: Conference Paper
Times cited : (17)

References (37)
  • 5
    • 84857971411 scopus 로고    scopus 로고
    • "Dagman." [Online]. Available: www.cs.wisc.edu/condor/dagman
    • Dagman
  • 8
    • 84857941239 scopus 로고    scopus 로고
    • "SQLAlchemy." [Online]. Available: www.sqlalchemy.org
    • SQLAlchemy
  • 9
    • 84857961175 scopus 로고    scopus 로고
    • "R." [Online]. Available: www.r-project.org
  • 10
    • 77957739405 scopus 로고    scopus 로고
    • Broadband ground-motion simulation using a hybrid approach
    • R. Graves and A. Pitarka, "Broadband ground-motion simulation using a hybrid approach," Bulletin of the Seismological Society of America, vol. 100, no. 5A, p. 2095, 2010.
    • (2010) Bulletin of the Seismological Society of America , vol.100 , Issue.5 A , pp. 2095
    • Graves, R.1    Pitarka, A.2
  • 11
    • 84857977483 scopus 로고    scopus 로고
    • "Broadband working group." [Online]. Available: http://scec.usc.edu/research/cme/groups/broadband
  • 15
    • 84857979025 scopus 로고    scopus 로고
    • "USC Epigenome Center." [Online]. Available: epigenome.usc.edu
  • 16
    • 84857961178 scopus 로고    scopus 로고
    • "LIGO Project." [Online]. Available: www.ligo.caltech.edu
  • 17
    • 70349925723 scopus 로고    scopus 로고
    • A case study on the use of workflow technologies for scientific analysis: Gravitational wave data analysis
    • I. Taylor, E. Deelman, D. Gannon, and M. Shield, Eds. Springer
    • D. Brown, P. Brady, A. Dietz, J. Cao, B. Johnson, and J. McNabb, "A case study on the use of workflow technologies for scientific analysis: Gravitational wave data analysis," in Worflows for e-Sciences, I. Taylor, E. Deelman, D. Gannon, and M. Shield, Eds. Springer, 2006.
    • (2006) Worflows for E-sciences
    • Brown, D.1    Brady, P.2    Dietz, A.3    Cao, J.4    Johnson, B.5    McNabb, J.6
  • 19
    • 84857938026 scopus 로고    scopus 로고
    • "Periodograms." [Online]. Available: www.ipac.caltech.edu
  • 20
    • 10444251156 scopus 로고    scopus 로고
    • A taxonomy of grid monitoring systems
    • Jan.
    • S. Zanikolas and R. Sakellariou, "A taxonomy of grid monitoring systems," Future Generation Computer Systems, vol. 21, no. 1, pp. 163âǎş-188, Jan. 2005.
    • (2005) Future Generation Computer Systems , vol.21 , Issue.1 , pp. 163-188
    • Zanikolas, S.1    Sakellariou, R.2
  • 23
    • 33244488521 scopus 로고    scopus 로고
    • Visual Grid workflow in Triana
    • DOI 10.1007/s10723-005-9007-3
    • I. Taylor, M. Shields, I. Wang, and A. Harrison, "Visual grid workflow in Triana," Journal of Grid Computing, vol. 3, no. 3âǎş4, pp. 153âǎş-169, 2005. (Pubitemid 43280300)
    • (2005) Journal of Grid Computing , vol.3 , Issue.3-4 , pp. 153-169
    • Taylor, I.1    Shields, M.2    Wang, I.3    Harrison, A.4
  • 29
    • 28844461639 scopus 로고    scopus 로고
    • Dynamic instrumentation, performance monitoring and analysis of Grid scientific workflows
    • DOI 10.1007/s10723-005-5299-6
    • H. Truong and S. Dustdar, "Dynamic instrumentation, performance monitoring and analysis of grid scientific workflows," Journal of Grid Computing, vol. 3, no. 1-2, pp. 1-18, 2005. (Pubitemid 41762482)
    • (2005) Journal of Grid Computing , vol.3 , Issue.1-2 , pp. 1-18
    • Truong, H.-L.1    Fahringer, T.2    Dustdar, S.3
  • 33
    • 56749178938 scopus 로고    scopus 로고
    • Exploring event correlation for failure prediction in coalitions of clusters
    • ser. SC '07. New York, NY, USA: ACM http://doi.acm.org/10.1145/1362622. 1362678
    • S. Fu and C.-Z. Xu, "Exploring event correlation for failure prediction in coalitions of clusters," in Proceedings of the 2007 ACM/IEEE conference on Supercomputing, ser. SC '07. New York, NY, USA: ACM, 2007, pp. 41:1-41:12. [Online]. Available: http://doi.acm.org/10.1145/1362622.1362678
    • (2007) Proceedings of the 2007 ACM/IEEE Conference on Supercomputing , pp. 411-4112
    • Fu, S.1    Xu, C.-Z.2
  • 34
    • 74049136080 scopus 로고    scopus 로고
    • Predicting the execution time of grid workflow applications through local learning
    • ser. SC '09. New York, NY, USA: ACM http://doi.acm.org/10.1145/1654059. 1654093
    • F. Nadeem and T. Fahringer, "Predicting the execution time of grid workflow applications through local learning," in Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, ser. SC '09. New York, NY, USA: ACM, 2009, pp. 33:1-33:12. [Online]. Available: http://doi.acm.org/10.1145/1654059.1654093
    • (2009) Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis , pp. 331-3312
    • Nadeem, F.1    Fahringer, T.2
  • 35
    • 33845593340 scopus 로고    scopus 로고
    • A large-scale study of failures in high-performance computing systems
    • DOI 10.1109/DSN.2006.5, 1633514, Proceedings - DSN 2006: 2006 International Conference on Dependable Systems and Networks
    • B. Schroeder and G. A. Gibson, "A large-scale study of failures in high-performance computing systems," in Proceedings of the International Conference on Dependable Systems and Networks. Washington, DC, USA: IEEE Computer Society, 2006, pp. 249-258. [Online]. Available: http://portal.acm.org/ citation.cfm?id=1135532.1135705 (Pubitemid 44930426)
    • (2006) Proceedings of the International Conference on Dependable Systems and Networks , vol.2006 , pp. 249-258
    • Schroeder, B.1    Gibson, G.A.2
  • 37
    • 70049100670 scopus 로고    scopus 로고
    • Adaptive monitoring in enterprise software systems
    • June
    • M. Munawar and P. Ward, "Adaptive monitoring in enterprise software systems," SysML, June 2006.
    • (2006) SysML
    • Munawar, M.1    Ward, P.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.