메뉴 건너뛰기




Volumn 9, Issue 10, 1998, Pages 972-986

Diskless checkpointing

Author keywords

Checkpointing; Copy on write; Error correcting codes; Fault tolerance; Memory redundancy; RAID systems; Rollback recovery

Indexed keywords

DATA STORAGE EQUIPMENT; ERROR CORRECTION; FAULT TOLERANT COMPUTER SYSTEMS; PARALLEL PROCESSING SYSTEMS; PROGRAM DEBUGGING; PROGRAM PROCESSORS; STORAGE ALLOCATION (COMPUTER);

EID: 0032179680     PISSN: 10459219     EISSN: None     Source Type: Journal    
DOI: 10.1109/71.730527     Document Type: Article
Times cited : (275)

References (39)
  • 2
    • 0031570635 scopus 로고    scopus 로고
    • Application Level Fault Tolerance in Heterogeneous Networks of Workstations
    • Sept.
    • A. Beguelin, E. Seligman, and P. Stephan, "Application Level Fault Tolerance in Heterogeneous Networks of Workstations," J. Parallel and Distributed Computing, vol. 43, Sept. 1997.
    • (1997) J. Parallel and Distributed Computing , vol.43
    • Beguelin, A.1    Seligman, E.2    Stephan, P.3
  • 3
    • 0028194811 scopus 로고
    • EVENODD: An Optimal Scheme for Tolerating Double Disk Failures in RAID Architectures
    • Chicago, Apr.
    • M. Blaum, J. Brady, J. Bruck, and J. Menon, "EVENODD: An Optimal Scheme for Tolerating Double Disk Failures in RAID Architectures," Proc. 21st Ann. Int'l Symp. Computer Architecture, pp. 245-254, Chicago, Apr. 1994.
    • (1994) Proc. 21st Ann. Int'l Symp. Computer Architecture , pp. 245-254
    • Blaum, M.1    Brady, J.2    Bruck, J.3    Menon, J.4
  • 6
    • 0029715009 scopus 로고    scopus 로고
    • Efficient Checkpoint Mechanisms for Massively Parallel Machines
    • Sendai, June
    • T. Chiueh and P. Deng, "Efficient Checkpoint Mechanisms for Massively Parallel Machines," Proc. 26th Int'l Symp. Fault-Tolerant Computing, pp. 370-379, Sendai, June 1996.
    • (1996) Proc. 26th Int'l Symp. Fault-Tolerant Computing , pp. 370-379
    • Chiueh, T.1    Deng, P.2
  • 10
    • 0026867749 scopus 로고
    • Manetho: Transparent Roll-back-Recovery with Low Overhead, Limited Rollback and Fast Output Commit
    • May
    • E.N. Elnozahy and W. Zwaenepoel, "Manetho: Transparent Roll-back-Recovery with Low Overhead, Limited Rollback and Fast Output Commit," IEEE Trans. Computers, vol. 41, no. 5, May 1992.
    • (1992) IEEE Trans. Computers , vol.41 , Issue.5
    • Elnozahy, E.N.1    Zwaenepoel, W.2
  • 17
    • 0000674171 scopus 로고
    • Job and Process Recovery in a UNIX-Based Operating System
    • San Diego, Calif., Jan.
    • B.A. Kingsbury and J.T. Kline, "Job and Process Recovery in a UNIX-Based Operating System," Proc. Usenix Winter 1989 Technical Conf., pp. 355-364, San Diego, Calif., Jan. 1989.
    • (1989) Proc. Usenix Winter 1989 Technical Conf. , pp. 355-364
    • Kingsbury, B.A.1    Kline, J.T.2
  • 22
  • 24
    • 0030392072 scopus 로고    scopus 로고
    • Improving the Performance of Coordinated Checkpointers on Networks of Workstations Using RAID Techniques
    • Oct.
    • J.S. Plank, "Improving the Performance of Coordinated Checkpointers on Networks of Workstations Using RAID Techniques," Proc. 15th Symp. Reliable Distributed Systems, pp. 76-85, Oct. 1996.
    • (1996) Proc. 15th Symp. Reliable Distributed Systems , pp. 76-85
    • Plank, J.S.1
  • 25
    • 0031223146 scopus 로고    scopus 로고
    • A Tutorial on Reed-Solomon Coding for Fault-Tolerance in RAID-Like Systems
    • Sept.
    • J.S. Plank, "A Tutorial on Reed-Solomon Coding for Fault-Tolerance in RAID-Like Systems," Software - Practice & Experience, vol. 27, no. 9, pp. 995-1,012, Sept. 1997.
    • (1997) Software - Practice & Experience , vol.27 , Issue.9
    • Plank, J.S.1
  • 27
    • 0031570636 scopus 로고    scopus 로고
    • Fault Tolerant Matrix Operations for Networks of Workstations Using Diskless Checkpointing
    • Sept.
    • J.S. Plank, Y. Kim, and J. Dongarra, "Fault Tolerant Matrix Operations for Networks of Workstations Using Diskless Checkpointing," J. Parallel and Distributed Computing, vol. 43, pp. 125-138, Sept. 1997.
    • (1997) J. Parallel and Distributed Computing , vol.43 , pp. 125-138
    • Plank, J.S.1    Kim, Y.2    Dongarra, J.3
  • 29
    • 0002991145 scopus 로고
    • Ickp - A Consistent Checkpointer for Multicomputers
    • Summer
    • J.S. Plank and K. Li, "Ickp - A Consistent Checkpointer for Multicomputers," IEEE Parallel & Distributed Technology, vol. 2, no. 2, pp. 62-67, Summer 1994.
    • (1994) IEEE Parallel & Distributed Technology , vol.2 , Issue.2 , pp. 62-67
    • Plank, J.S.1    Li, K.2
  • 31
    • 0028994280 scopus 로고
    • Fault-Tolerance for Off-the-Shelf Applications and Hardware
    • Pasadena, Calif., June
    • M. Russinovich and Z. Segall, "Fault-Tolerance for Off-the-Shelf Applications and Hardware," Proc. 25th Int'l Symp. Fault-Tolerant Computing, pp. 67-71, Pasadena, Calif., June 1995.
    • (1995) Proc. 25th Int'l Symp. Fault-Tolerant Computing , pp. 67-71
    • Russinovich, M.1    Segall, Z.2
  • 35
    • 0029251277 scopus 로고
    • The Condor Distributed Processing System
    • Feb.
    • T. Tannenbaum and M. Litzkow, "The Condor Distributed Processing System," Dr. Dobb's J., no. 227, pp. 40-48, Feb. 1995.
    • (1995) Dr. Dobb's J. , Issue.227 , pp. 40-48
    • Tannenbaum, T.1    Litzkow, M.2
  • 37
    • 0031388399 scopus 로고    scopus 로고
    • Impact of Checkpoint Latency on Overhead Ratio of a Checkpointing Scheme
    • August
    • N.H. Vaidya, "Impact of Checkpoint Latency on Overhead Ratio of a Checkpointing Scheme," IEEE Trans. Computers, vol. 46, no. 8, pp. 942-947, August 1997.
    • (1997) IEEE Trans. Computers , vol.46 , Issue.8 , pp. 942-947
    • Vaidya, N.H.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.