-
1
-
-
12344290608
-
Application-level checkpointing for shared memory programs. In: ASPLOS-XI
-
ACM Press
-
Bronevetsky, G., Marques, D., Pingali, K., Szwed, P., Schulz, M.: Application-level checkpointing for shared memory programs. In: ASPLOS-XI: Proceedings of the 11th international conference on Architectural support for programming languages and operating systems, ACM Press (2004) 235-247
-
(2004)
Proceedings of the 11Th International Conference on Architectural Support for Programming Languages and Operating Systems
, pp. 235-247
-
-
Bronevetsky, G.1
Marques, D.2
Pingali, K.3
Szwed, P.4
Schulz, M.5
-
2
-
-
0345044000
-
Process migration
-
Milojicic, D.S., Douglis, F., Paindaveine, Y., Wheeler, R., Zhou, S.: Process migration. ACM Comput. Surv. 32(3) (2000) 241-299
-
(2000)
ACM Comput. Surv
, vol.32
, Issue.3
, pp. 241-299
-
-
Milojicic, D.S.1
Douglis, F.2
Paindaveine, Y.3
Wheeler, R.4
Zhou, S.5
-
3
-
-
0036292677
-
Safetynet: Improving the availability of shared memory multiprocessors with global checkpoint/recover
-
Sorin, D.J., Martin, M.M.K., Hill, M.D., Wood, D.A.: Safetynet: Improving the availability of shared memory multiprocessors with global checkpoint/recovery. In: ISCA ’02: Proceedings of the 29th annual international symposium on Computer architecture, IEEE Computer Society (2002) 123-134
-
(2002)
ISCA ’02: Proceedings of the 29Th Annual International Symposium on Computer Architecture, IEEE Computer Society
, pp. 123-134
-
-
Sorin, D.J.1
Martin, M.M.K.2
Hill, M.D.3
Wood, D.A.4
-
5
-
-
20444444457
-
The LAM/MPI checkpoint/restart framework: System-initiated checkpointing
-
Sante Fe, New Mexico, USA
-
Sankaran, S., Squyres, J.M., Barrett, B., Lumsdaine, A., Duell, J., Hargrove, P., Roman, E.: The LAM/MPI checkpoint/restart framework: System-initiated checkpointing. In: Proceedings, LACSI Symposium, Sante Fe, New Mexico, USA (2003)
-
(2003)
Proceedings, LACSI Symposium
-
-
Sankaran, S.1
Squyres, J.M.2
Barrett, B.3
Lumsdaine, A.4
Duell, J.5
Hargrove, P.6
Roman, E.7
-
6
-
-
34547424834
-
Application-transparent checkpoint/ restart for mpi programs over infiniband
-
Columbus, OH
-
Gao, Q., Yu, W., Huang, W., Panda, D.K.: Application-transparent checkpoint/ restart for mpi programs over infiniband. In: ICPP’06: Proceedings of the 35th International Conference on Parallel Processing, Columbus, OH (2006)
-
(2006)
ICPP’06: Proceedings of the 35Th International Conference on Parallel Processing
-
-
Gao, Q.1
Yu, W.2
Huang, W.3
Panda, D.K.4
-
7
-
-
0002067202
-
-
Technical Report UT-CS-94-242
-
Plank, J.S., Beck, M., Kingsley, G., Li, K.: Libckpt: Transparent checkpointing under unix. Technical Report UT-CS-94-242 (1994)
-
(1994)
Libckpt: Transparent Checkpointing under Unix
-
-
Plank, J.S.1
Beck, M.2
Kingsley, G.3
Li, K.4
-
8
-
-
0042909225
-
User-level process checkpoint and restore for migration
-
Bozyigit, M., Wasiq, M.: User-level process checkpoint and restore for migration. SIGOPS Oper. Syst. Rev. 35(2) (2001) 86-96
-
(2001)
SIGOPS Oper. Syst. Rev
, vol.35
, Issue.2
, pp. 86-96
-
-
Bozyigit, M.1
Wasiq, M.2
-
9
-
-
0032070610
-
Arachne: A portable threads system supporting migrant threads on heterogeneous network farms
-
Dimitrov, B., Rego, V.: Arachne: A portable threads system supporting migrant threads on heterogeneous network farms. IEEE Transactions on Parallel and Distributed Systems 9(5) (1998) 459
-
(1998)
IEEE Transactions on Parallel and Distributed Systems
, vol.9
, Issue.5
, pp. 459
-
-
Dimitrov, B.1
Rego, V.2
-
10
-
-
0030106770
-
Ariadne: Architecture of a portable threads system supporting thread migration
-
Mascarenhas, E., Rego, V.: Ariadne: Architecture of a portable threads system supporting thread migration. Software- Practice and Experience 26(3) (1996) 327-356
-
(1996)
Software- Practice and Experience
, vol.26
, Issue.3
, pp. 327-356
-
-
Mascarenhas, E.1
Rego, V.2
-
11
-
-
8344278434
-
-
Technion, Isreal
-
Itzkovitz, A., Schuster, A., Wolfovich, L.: Thread migration and its applications in distributed shared memory systems. Technical Report LPCR9603, Technion, Isreal (1996)
-
(1996)
Thread Migration and Its Applications in Distributed Shared Memory Systems. Technical Report LPCR9603
-
-
Itzkovitz, A.1
Schuster, A.2
Wolfovich, L.3
-
15
-
-
0031570635
-
Application level fault tolerance in heterogeneous networks of workstations
-
Beguelin, A., Seligman, E., Stephan, P.: Application level fault tolerance in heterogeneous networks of workstations. J. Parallel Distrib. Comput. 43(2) (1997) 147-155
-
(1997)
J. Parallel Distrib. Comput
, vol.43
, Issue.2
, pp. 147-155
-
-
Beguelin, A.1
Seligman, E.2
Stephan, P.3
-
16
-
-
84945288051
-
On improving thread migration: Safety and performance
-
Sahni, S., Prasanna, V.K., Shukla, U., eds., Berlin, Germany, Springer-Verlag
-
Jiang, H., Chaudhary, V.: On improving thread migration: Safety and performance. In Sahni, S., Prasanna, V.K., Shukla, U., eds.: Proceedings 9th International Conference on High Performance Computing HiPC2002. Volume 2552 of Lecture Notes in Computer Science., Berlin, Germany, Springer-Verlag (2002) 474-484
-
(2002)
Proceedings 9Th International Conference on High Performance Computing Hipc2002. Volume 2552 of Lecture Notes in Computer Science
, pp. 474-484
-
-
Jiang, H.1
Chaudhary, V.2
-
18
-
-
4544385770
-
Simsnap: Fast-forwarding via native execution and application-level checkpointing
-
Szwed, P.K., Marques, D., Buels, R.M., McKee, S.A., Schulz, M.: Simsnap: Fast-forwarding via native execution and application-level checkpointing. In: INTERACT-8 2004. EighthWorkshop on Interaction between Compilers and Computer Architectures. (2004) 65
-
(2004)
INTERACT-8 2004. Eighthworkshop on Interaction between Compilers and Computer Architectures
, pp. 65
-
-
Szwed, P.K.1
Marques, D.2
Buels, R.M.3
McKee, S.A.4
Schulz, M.5
-
25
-
-
1142268808
-
Collective operations in application-level fault-tolerant mpi
-
ACM Press
-
Bronevetsky, G., Marques, D., Pingali, K., Stodghill, P.: Collective operations in application-level fault-tolerant mpi. In: ICS ’03: Proceedings of the 17th annual international conference on Supercomputing, ACM Press (2003) 234-243
-
(2003)
ICS ’03: Proceedings of the 17Th Annual International Conference on Supercomputing
, pp. 234-243
-
-
Bronevetsky, G.1
Marques, D.2
Pingali, K.3
Stodghill, P.4
-
26
-
-
0038040085
-
Automated applicationlevel checkpointing of mpi programs
-
ACM Press
-
Bronevetsky, G., Marques, D., Pingali, K., Stodghill, P.: Automated applicationlevel checkpointing of mpi programs. In: PPoPP ’03: Proceedings of the ninth ACM SIGPLAN symposium on Principles and practice of parallel programming, ACM Press (2003) 84-94
-
(2003)
Ppopp ’03: Proceedings of the Ninth ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
, pp. 84-94
-
-
Bronevetsky, G.1
Marques, D.2
Pingali, K.3
Stodghill, P.4
-
29
-
-
0042078549
-
A survey of rollbackrecovery protocols in message-passing systems
-
Elnozahy, E.N.M., Alvisi, L., Wang, Y.M., Johnson, D.B.: A survey of rollbackrecovery protocols in message-passing systems. ACM Comput. Surv. 34(3) (2002) 375-408
-
(2002)
ACM Comput. Surv
, vol.34
, Issue.3
, pp. 375-408
-
-
Elnozahy, E.N.M.1
Alvisi, L.2
Wang, Y.M.3
Johnson, D.B.4
-
30
-
-
77952975508
-
Checkpointing-based rollback recovery for parallel applications on the integrade grid middleware
-
ACM Press
-
de Camargo, R.Y., Goldchleger, A., Kon, F., Goldman, A.: Checkpointing-based rollback recovery for parallel applications on the integrade grid middleware. In: Proceedings of the 2nd workshop on Middleware for grid computing, ACM Press (2004) 35-40
-
(2004)
Proceedings of the 2Nd Workshop on Middleware for Grid Computing
, pp. 35-40
-
-
De Camargo, R.Y.1
Goldchleger, A.2
Kon, F.3
Goldman, A.4
|