메뉴 건너뛰기




Volumn 23, Issue 11, 2011, Pages 1196-1212

A survey on software checkpointing and mobility techniques in distributed systems

Author keywords

autonomous systems; checkpoint rollback; decision phase; distributed applications; strong mobility

Indexed keywords

DISTRIBUTED DATABASE SYSTEMS;

EID: 79960124024     PISSN: 15320626     EISSN: 15320634     Source Type: Journal    
DOI: 10.1002/cpe.1696     Document Type: Article
Times cited : (7)

References (49)
  • 3
    • 14244258300 scopus 로고    scopus 로고
    • Self adaptivity in Grid computing
    • DOI 10.1002/cpe.927, Grid Performance and Grids and Web Services for E-Science
    • Vadhiyar S, Dongarra J,. Self adaptivity in grid computing: Research articles. Concurrency Computation: Practice and Experience 2005; 17 (2-4): 235-257. (Pubitemid 40285777)
    • (2005) Concurrency Computation Practice and Experience , vol.17 , Issue.2-4 , pp. 235-257
    • Vadhiyar, S.S.1    Dongarra, J.J.2
  • 6
    • 68149132323 scopus 로고    scopus 로고
    • Transparent parallel checkpointing and migration in clusters and ClusterGrids
    • Kovacs J,. Transparent parallel checkpointing and migration in clusters and ClusterGrids. International Journal of Computational Science and Engineering 2009; 4 (3): 171-181.
    • (2009) International Journal of Computational Science and Engineering , vol.4 , Issue.3 , pp. 171-181
    • Kovacs, J.1
  • 9
    • 10644223387 scopus 로고    scopus 로고
    • Computing on large-scale distributed systems: Xtrem Web architecture, programming models, security, tests and convergence with grid
    • Cappello F, Djilali S, Fedak G, Herault T, Magniette F, Néri V, Lodygensky O,. Computing on large-scale distributed systems: Xtrem Web architecture, programming models, security, tests and convergence with grid. Future Generation Computer Systems 2005; 21 (3): 417-437.
    • (2005) Future Generation Computer Systems , vol.21 , Issue.3 , pp. 417-437
    • Cappello, F.1    Djilali, S.2    Fedak, G.3    Herault, T.4    Magniette, F.5    Néri, V.6    Lodygensky, O.7
  • 10
    • 79960125492 scopus 로고    scopus 로고
    • IBMCorporation. An architectural blueprint for autonomic computing
    • [2006 October ]
    • IBMCorporation. An architectural blueprint for autonomic computing. White Paper, 2006. Available at: [October 2009 ].
    • (2009) White Paper
  • 12
    • 0042078549 scopus 로고    scopus 로고
    • A survey of rollback-recovery protocols in message-passing systems
    • Elnozahy EN, Alvisi L, Wang Y-M, Johnson DB,. A survey of rollback-recovery protocols in message-passing systems. ACM Computing Survey 2002; 34 (3): 375-408.
    • (2002) ACM Computing Survey , vol.34 , Issue.3 , pp. 375-408
    • Elnozahy, E.N.1    Alvisi, L.2    Wang, Y.-M.3    Johnson, D.B.4
  • 14
    • 27144432456 scopus 로고    scopus 로고
    • A checkpoint/recovery model for heterogeneous dataflow computations using work-stealing
    • Euro-Par 2005 Parallel Processing: 11th International Euro-Par Conference. Proceedings
    • Jafar S, Gautier T, Krings AW, Roch J-L,. A checkpoint/recovery model for heterogeneous dataflow computations using work-stealing. In Euro-Par (Lecture Notes in Computer Science, vol. 3648), Cunha JC, Medeiros PD, (eds). Springer: Berlin, 2005; 675-684. (Pubitemid 41490867)
    • (2005) Lecture Notes in Computer Science , vol.3648 , pp. 675-684
    • Jafar, S.1    Gautier, T.2    Krings, A.3    Roch, J.-L.4
  • 15
    • 33750200181 scopus 로고    scopus 로고
    • ASSIST as a research framework for high-performance Grid programming environments
    • In, Cunha J.C., Rana O.F. (eds). Springer: Berlin, January.
    • Aldinucci M, Coppola M, Danelutto M, Vanneschi M, Zoccolo C,. ASSIST as a research framework for high-performance Grid programming environments. In Grid Computing: Software Environments and Tools, Cunha JC, Rana OF, (eds). Springer: Berlin, January 2006; 230-256.
    • (2006) Grid Computing: Software Environments and Tools , pp. 230-256
    • Aldinucci, M.1    Coppola, M.2    Danelutto, M.3    Vanneschi, M.4    Zoccolo, C.5
  • 20
    • 0003802908 scopus 로고    scopus 로고
    • Agent tcl: A flexible and secure mobile-agent system
    • Hanover, NH, U.S.A.
    • Gray RS,. Agent tcl: A flexible and secure mobile-agent system. Technical Report, Hanover, NH, U.S.A., 1998.
    • (1998) Technical Report
    • Gray, R.S.1
  • 21
    • 52349122889 scopus 로고    scopus 로고
    • Engineering an autonomic container for WSRF-based web services
    • Reich C, Banholzer M, Buyya R, Bubendorfer K,. Engineering an autonomic container for WSRF-based web services. Adcom 2007; 0: 277-282.
    • (2007) Adcom , vol.0 , pp. 277-282
    • Reich, C.1    Banholzer, M.2    Buyya, R.3    Bubendorfer, K.4
  • 22
    • 33646415354 scopus 로고    scopus 로고
    • Self healing and self configuration in a WSRF grid environment
    • In (Lecture Notes in Computer Science, vol. 3719), Hobbs M., Goscinski A.M., Zhou W. (eds). Springer: Berlin.
    • Messig M, Goscinski A,. Self healing and self configuration in a WSRF grid environment. In ICA3PP (Lecture Notes in Computer Science, vol. 3719), Hobbs M, Goscinski AM, Zhou W, (eds). Springer: Berlin, 2005; 149-158.
    • (2005) ICA3PP , pp. 149-158
    • Messig, M.1    Goscinski, A.2
  • 23
    • 0022020346 scopus 로고
    • Distributed snapshots: Determining global states of distributed systems
    • DOI 10.1145/214451.214456
    • Mani Chandy K, Lamport L,. Distributed snapshots: Determining global states of distributed systems. ACM Transactions on Computer Systems 1985; 3 (1): 63-75. (Pubitemid 15597765)
    • (1985) ACM Transactions on Computer Systems , vol.3 , Issue.1 , pp. 63-75
    • Chandy K.Mani1    Lamport Leslie2
  • 25
    • 24944480927 scopus 로고    scopus 로고
    • Transparent fault tolerance for grid applications
    • Advances in Grid Computing - EGC 2005: European Grid Conference, Revised Selected Papers
    • Garbacki P, Biskupski B, Bal HE,. Transparent fault tolerance for grid applications. EGC (Lecture Notes in Computer Science, vol. 3470), Sloot PMA, Hoekstra AG, Priol T, Reinefeld A, Bubak M, (eds). Springer: Berlin, 2005; 671-680. (Pubitemid 41313248)
    • (2005) Lecture Notes in Computer Science , vol.3470 , pp. 671-680
    • Garbacki, P.1    Biskupski, B.2    Bal, H.3
  • 26
    • 41149169653 scopus 로고    scopus 로고
    • A serialization based approach for strong mobility of shared object
    • DOI 10.1145/1294325.1294359, Proceedings of the 2007 5th International Conference on the Principles and Practice of Programming in Java, PPPJ 2007
    • Marzouk S, Ben-Jemaa M, Jmaiel M,. A serialisation based approach for strong mobility of shared object. Proceedings of the First International Workshop on Java for Mobility (Ja4Mo 07) as part of the International Conference on Principles and Practices of Programming in Java (PPPJ 2007), Lisbon, Portugal. ACM: New York, September 2007; 237-242. (Pubitemid 351429500)
    • (2007) ACM International Conference Proceeding Series , vol.272 , pp. 237-242
    • Marzouk, S.1    Ben Jemaa, M.2    Jmaiel, M.3
  • 27
    • 0035201417 scopus 로고    scopus 로고
    • Processor allocation and checkpoint interval selection in cluster computing systems
    • DOI 10.1006/jpdc.2001.1757
    • Plank JS, Thomason MG,. Processor allocation and checkpoint interval selection in cluster computing systems. Journal of Parallel and Distributed Computing 2001; 61 (11): 1570-1590. (Pubitemid 33119054)
    • (2001) Journal of Parallel and Distributed Computing , vol.61 , Issue.11 , pp. 1570-1590
    • Plank, J.S.1    Thomason, M.G.2
  • 28
    • 33847764225 scopus 로고    scopus 로고
    • Model-based checkpoint scheduling for volatile resource environments
    • Department of Computer Science, University of California Santa Barbara, Santa Barbara, CA.
    • Nurmi D, Wolski R, Brevik J,. Model-based checkpoint scheduling for volatile resource environments. Technical Report 2004-25, Department of Computer Science, University of California Santa Barbara, Santa Barbara, CA, 2004.
    • (2004) Technical Report 2004-25
    • Nurmi, D.1    Wolski, R.2    Brevik, J.3
  • 29
    • 0030600996 scopus 로고    scopus 로고
    • Checkpointing in distributed computing systems
    • DOI 10.1006/jpdc.1996.0069
    • Wong KF, Franklin M,. Checkpointing in distributed computing systems. Journal of Parallel and Distributed Computing 1996; 35 (1): 67-75. (Pubitemid 126167709)
    • (1996) Journal of Parallel and Distributed Computing , vol.35 , Issue.1 , pp. 67-75
    • Wong, K.F.1    Franklin, M.2
  • 30
    • 0018454850 scopus 로고
    • On the optimum checkpoint interval
    • Gelenbe E,. On the optimum checkpoint interval. Journal of the ACM 1979; 26 (2): 259-270.
    • (1979) Journal of the ACM , vol.26 , Issue.2 , pp. 259-270
    • Gelenbe, E.1
  • 32
    • 0031388399 scopus 로고    scopus 로고
    • Impact of checkpoint latency on overhead ratio of a checkpointing scheme
    • Vaidya NH,. Impact of checkpoint latency on overhead ratio of a checkpointing scheme. IEEE Transactions on Computers 1997; 46 (8): 942-947. (Pubitemid 127760644)
    • (1997) IEEE Transactions on Computers , vol.46 , Issue.8 , pp. 942-947
    • Vaidya, N.H.1
  • 33
    • 84976846528 scopus 로고
    • A first order approximation to the optimum checkpoint interval
    • Young JW,. A first order approximation to the optimum checkpoint interval. Communications of the ACM 1974; 17 (9): 530-531.
    • (1974) Communications of the ACM , vol.17 , Issue.9 , pp. 530-531
    • Young, J.W.1
  • 35
    • 0028427727 scopus 로고
    • Consistent global checkpoints based on direct dependency tracking
    • DOI 10.1016/0020-0190(94)00038-7
    • Wang Y-M, Lowry A, Kent Fuchs W,. Consistent global checkpoints based on direct dependency tracking. Information Processing Letters 1994; 50 (4): 223-230. (Pubitemid 124013158)
    • (1994) Information Processing Letters , vol.50 , Issue.4 , pp. 223-230
    • Wang, Y.-M.1    Lowry, A.2    Fuchs, W.K.3
  • 36
    • 0031124071 scopus 로고    scopus 로고
    • Consistent global checkpoints that contain a given set of local checkpoints
    • Wang Y-M, AT&T Bell Labs, Hill NJM,. Consistent global checkpoints that contain a given set of localcheckpoints. IEEE Transactions on Computers 1997; 46: 456-468. (Pubitemid 127760472)
    • (1997) IEEE Transactions on Computers , vol.46 , Issue.4 , pp. 456-468
    • Wang, Y.-M.1
  • 39
    • 84866225421 scopus 로고    scopus 로고
    • A communication-induced checkpointing protocol that ensures rollback-dependency trackability
    • Washington, DC, U.S.A. IEEE Computer Society: Silver Spring, MD.
    • Baldoni R,. A communication-induced checkpointing protocol that ensures rollback-dependency trackability. FTCS '97: Proceedings of the 27th International Symposium on Fault-Tolerant Computing (FTCS '97). Washington, DC, U.S.A. IEEE Computer Society: Silver Spring, MD, 1997; 68.
    • (1997) FTCS '97: Proceedings of the 27th International Symposium on Fault-Tolerant Computing (FTCS '97) , pp. 68
    • Baldoni, R.1
  • 43
    • 0141599174 scopus 로고
    • Libckpt: Transparent checkpointing under unix
    • Knoxville, TN, U.S.A.
    • Plank JS, Beck M, Kingsley G, Li K,. Libckpt: Transparent checkpointing under unix. Technical Report, Knoxville, TN, U.S.A., 1994.
    • (1994) Technical Report
    • Plank, J.S.1    Beck, M.2    Kingsley, G.3    Li, K.4
  • 49
    • 0026867749 scopus 로고
    • Manetho: Transparent rollback-recovery with low overhead, limited rollback, and fast output commit
    • Elnozahy EN, Zwaenepoel W,. Manetho: Transparent rollback-recovery with low overhead, limited rollback, and fast output commit. IEEE Transactions on Computers 1992; 41 (5): 526-531.
    • (1992) IEEE Transactions on Computers , vol.41 , Issue.5 , pp. 526-531
    • Elnozahy, E.N.1    Zwaenepoel, W.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.