



2012, Pages 225-234

Algorithm-Based Fault Tolerance for dense matrix factorizations

Author keywords

ABFT; Fail stop failure; Fault tolerance; LU; QR

Indexed keywords

ABFT; ALGORITHM BASED FAULT TOLERANCE; CHECK POINTING; CHECK-POINTING ALGORITHMS; CHECKSUM; CHOLESKY; COMPUTING UNITS; DENSE MATRICES; EIGENVALUES; EXTREME CONDITIONS; FAIL-STOP FAILURES; FAULT-TOLERANT ALGORITHMS; GENERIC SOLUTIONS; HYBRID APPROACH; HYBRID SOLUTION; LINEAR LEAST SQUARES PROBLEMS; MATRIX FACTORIZATIONS; MEAN TIME TO FAILURE; PROBLEM SIZE; QR; QR FACTORIZATIONS; RELIABLE COMPONENTS; SCIENTIFIC APPLICATIONS; SYSTEMS OF LINEAR EQUATIONS; THEORETICAL EVALUATION;

EID: 84858403667     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2145816.2145845     Document Type: Conference Paper
Times cited: 57

References (24)
  • 2
    • 84858381507
    • http://www.top500.org/, 2011.
  • 6
    • 0001873476
    • LAM: An open cluster environment for MPI
    • G. Burns, R. Daoud, and J. Vaigl. LAM: An open cluster environment for MPI. In Proceedings of SC'94, volume 94, pages 379-386, 1994.
    • (1994) Proceedings of SC'94 , vol.94 , pp. 379-386
    • Burns, G.1    Daoud, R.2    Vaigl, J.3
  • 7
    • 68249127079
    • Fault tolerance in petascale/exascale systems: Current knowledge, challenges and research opportunities
    • F. Cappello. Fault tolerance in petascale/exascale systems: Current knowledge, challenges and research opportunities. International Journal of High Performance Computing Applications, 23(3):212, 2009.
    • (2009) International Journal of High Performance Computing Applications , vol.23 , Issue.3 , pp. 212
    • Cappello, F.1
  • 8
    • 33847240498
    • Algorithm-based checkpoint-free fault tolerance for parallel matrix computations on volatile resources
    • IEEE
    • Z. Chen and J. Dongarra. Algorithm-based checkpoint-free fault tolerance for parallel matrix computations on volatile resources. In IPDPS'06, 10 pp. IEEE, 2006.
    • (2006) IPDPS'06 , pp. 10
    • Chen, Z.1    Dongarra, J.2
  • 10
    • 57049092162
    • Algorithm-based fault tolerance for fail-stop failures
    • Z. Chen and J. Dongarra. Algorithm-based fault tolerance for fail-stop failures. IEEE TPDS, 19(12):1628-1641, 2008.
    • (2008) IEEE TPDS , vol.19 , Issue.12 , pp. 1628-1641
    • Chen, Z.1    Dongarra, J.2
  • 15
    • 1542292472
    • FT-MPI: Fault tolerant MPI, supporting dynamic applications in a dynamic world
    • G. Fagg and J. Dongarra. FT-MPI: Fault tolerant MPI, supporting dynamic applications in a dynamic world. EuroPVM/MPI, 2000.
    • (2000) EuroPVM/MPI
    • Fagg, G.1    Dongarra, J.2
  • 16
    • 84858430349
    • Failure tolerance in petascale computers
    • G. Gibson. Failure tolerance in petascale computers. In Journal of Physics: Conference Series, volume 78, page 012022, 2007.
    • (2007) Journal of Physics: Conference Series , vol.78 , pp. 012022
    • Gibson, G.1
  • 19
    • 0021439162
    • Algorithm-based fault tolerance for matrix operations
    • K. Huang and J. Abraham. Algorithm-based fault tolerance for matrix operations. IEEE Transactions on Computers, C-33(6):518-528, 1984. (Pubitemid 14584528)
    • (1984) IEEE Transactions on Computers , vol.C-33 , Issue.6 , pp. 518-528
    • Huang, K.-H.1    Abraham, J.A.2
  • 22
    • 0023995880
    • Analysis of algorithm-based fault tolerance techniques
    • DOI 10.1016/0743-7315(88)90027-5
    • F. Luk and H. Park. An analysis of algorithm-based fault tolerance techniques. Journal of Parallel and Distributed Computing, 5(2):172-184, 1988. (Pubitemid 18589858)
    • (1988) Journal of Parallel and Distributed Computing , vol.5 , Issue.2 , pp. 172-184
    • Luk, F.T.1    Park, H.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.