-
1
-
-
18844416337
-
Quadrics QsNet II: A network for supercomputing applications
-
Stanford University, California, August 18-20
-
D. Addison, J. Beecroft, D. Hewson, M. McLaren, and F. Petrini. Quadrics QsNet II: A Network for Supercomputing Applications. In Hot Chips 14, Stanford University, California, August 18-20, 2003.
-
(2003)
Hot Chips
, vol.14
-
-
Addison, D.1
Beecroft, J.2
Hewson, D.3
McLaren, M.4
Petrini, F.5
-
3
-
-
0003605996
-
-
NAS 95-020, NASA Ames Research Center, Moffett Field, California, December
-
D. Bailey, T. Harris, W. Saphir, R. van der Wijngaart, A. Woo, and M. Yarrow. The NAS Parallel Benchmarks 2.0. NAS 95-020, NASA Ames Research Center, Moffett Field, California, December 1995.
-
(1995)
The NAS Parallel Benchmarks 2.0
-
-
Bailey, D.1
Harris, T.2
Saphir, W.3
Van Der Wijngaart, R.4
Woo, A.5
Yarrow, M.6
-
4
-
-
0032021963
-
The MOSIX multicomputer operating system for high performance cluster computing
-
March
-
A. Barak and O. La'adan. The MOSIX Multicomputer Operating System for High Performance Cluster Computing. Journal of Future Generation Computer Systems, 13(4-5):361-372, March 1998.
-
(1998)
Journal of Future Generation Computer Systems
, vol.13
, Issue.4-5
, pp. 361-372
-
-
Barak, A.1
La'adan, O.2
-
5
-
-
12844286028
-
Application-level checkpointing for shared memory programs
-
Boston, MA, October
-
Greg Bronevetsky, Martin Schulz, Peter Szwed, Daniel Marques, and Keshav Pingali. Application-level Checkpointing for Shared Memory Programs. In Eleventh International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS XI), Boston, MA, October 2004.
-
(2004)
Eleventh International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS XI)
-
-
Bronevetsky, G.1
Schulz, M.2
Szwed, P.3
Marques, D.4
Pingali, K.5
-
6
-
-
0036680081
-
Checkpointing of multithreaded programs
-
August
-
C. Carothers and B. Szymanski. Checkpointing of Multithreaded Programs. Dr. Dobbs Journal, 15(8), August 2002.
-
(2002)
Dr. Dobbs Journal
, vol.15
, Issue.8
-
-
Carothers, C.1
Szymanski, B.2
-
10
-
-
84871146551
-
The performance of consistent checkpointing
-
Houston, TX, October 5-7
-
E. N. Elnozahy, D. B. Johnson, and W. Zwaenepoel. The Performance of Consistent Checkpointing. In Proceedings of the 11th Symposium on Reliable Distributed Systems, Houston, TX, October 5-7, 1992.
-
(1992)
Proceedings of the 11th Symposium on Reliable Distributed Systems
-
-
Elnozahy, E.N.1
Johnson, D.B.2
Zwaenepoel, W.3
-
12
-
-
12444268325
-
System-level fault-tolerance in large-scale parallel machines with buffered coscheduling
-
Santa Fe, NM, April
-
Fabrizio Petrini and Kei Davis and José Carlos Sancho. System-Level Fault-Tolerance in Large-Scale Parallel Machines with Buffered Coscheduling. In In 9th IEEE Workshop on Fault-Tolerant Parallel, Distributed and Network-Centric Systems (FTPDS04), Santa Fe, NM, April 2004.
-
(2004)
In 9th IEEE Workshop on Fault-Tolerant Parallel, Distributed and Network-centric Systems (FTPDS04)
-
-
Petrini, F.1
Davis, K.2
Sancho, J.C.3
-
13
-
-
33645243002
-
BCS MPI: A new approach in the system software design for large-scale parallel computers
-
Phoenix, Arizona, November 10-16
-
Juan Fernández, Eitan Frachtenberg, and Fabrizio Petrini. BCS MPI: A New Approach in the System Software Design for Large-Scale Parallel Computers. In Proceedings of SC2003, Phoenix, Arizona, November 10-16, 2003.
-
(2003)
Proceedings of SC2003
-
-
Fernández, J.1
Frachtenberg, E.2
Petrini, F.3
-
14
-
-
10044248764
-
Architectural support for system software on large-scale clusters
-
Montreal, Quebec, Canada, August
-
Juan Fernández, Eitan Frachtenberg, Fabrizio Petrini, Kei Davis, and José Carlos Sancho. Architectural Support for System Software on Large-Scale Clusters. In The 2004 International Conference on Parallel Processing, (ICPP-04), Montreal, Quebec, Canada, August 2004.
-
(2004)
The 2004 International Conference on Parallel Processing, (ICPP-04)
-
-
Fernández, J.1
Frachtenberg, E.2
Petrini, F.3
Davis, K.4
Sancho, J.C.5
-
16
-
-
33845396625
-
Designing parallel operating systems via parallel programming
-
Pisa, Italy, August
-
Eitan Frachtenberg, Kei Davis, Fabrizio Petrini, Juan Fernández, and José Carlos Sancho. Designing Parallel Operating Systems via Parallel Programming. In Euro-Par 2004, Pisa, Italy, August 2004.
-
(2004)
Euro-par 2004
-
-
Frachtenberg, E.1
Davis, K.2
Petrini, F.3
Fernández, J.4
Sancho, J.C.5
-
17
-
-
12444341477
-
STORM: Lightning-fast resource management
-
Baltimore, MD, November
-
Eitan Frachtenberg, Fabrizio Petrini, Juan Fernández, Scott Pakin, and Salvador Coll. STORM: Lightning-Fast Resource Management. In ACM/IEEE SC2002, Baltimore, MD, November 2002.
-
(2002)
ACM/IEEE SC2002
-
-
Frachtenberg, E.1
Petrini, F.2
Fernández, J.3
Pakin, S.4
Coll, S.5
-
18
-
-
84877019821
-
STORM: Lightning-fast resource management
-
Baltimore, Maryland, November 16-22
-
Eitan Frachtenberg, Fabrizio Petrini, Juan Fernández, Scott Pakin, and Salvador Coll. STORM: Lightning-Fast Resource Management. In Proceedings of SC2002, Baltimore, Maryland, November 16-22 2002.
-
(2002)
Proceedings of SC2002
-
-
Frachtenberg, E.1
Petrini, F.2
Fernández, J.3
Pakin, S.4
Coll, S.5
-
19
-
-
84867482607
-
-
E. Hendriks. VMADump. Available from http://cvs.sourceforge.net/viewcvs. py/bproc/vmadump.
-
VMADump
-
-
Hendriks, E.1
-
21
-
-
12444310559
-
Predictive performance and scalability modeling of a large-scale application
-
November 10-16
-
D. J. Kerbyson, H. J. Alme, A. Hoisie, F.Petrini, H. J. Wasserman, and M. Gittings. Predictive Performance and Scalability Modeling of a Large-Scale Application. In Proceedings of the Supercomputing, November 10-16, 2001.
-
(2001)
Proceedings of the Supercomputing
-
-
Kerbyson, D.J.1
Alme, H.J.2
Hoisie, A.3
Petrini, F.4
Wasserman, H.J.5
Gittings, M.6
-
22
-
-
33845380230
-
-
Lightning Linux Cluster. Available from http://www.lanl.gov/worldview/ news/releases/archive/03-107.shtml.
-
Lightning Linux Cluster
-
-
-
23
-
-
33746293114
-
User and kernel level checkpointing
-
Phoenix, Arizona, November 15-17
-
N. Meyer. User and Kernel Level Checkpointing. In Proceedings of the Sun Microsystems HPC Consortium Meeting, Phoenix, Arizona, November 15-17, 2003. Available from http://checkpointing.psnc.pl/Progress/sat_nmeyer.pdf.
-
(2003)
Proceedings of the Sun Microsystems HPC Consortium Meeting
-
-
Meyer, N.1
-
24
-
-
84978437417
-
The design and implementation of zap: A system for migrating computing environments
-
Boston, MA, December 9-11
-
S. Osman, D. Subhraveti, G. Su, and J. Nieh. The Design and Implementation of Zap: A System for Migrating Computing Environments. In Proceedings of the Fifth Symposium on Operating Systems Design and Implementation, Boston, MA, December 9-11, 2002.
-
(2002)
Proceedings of the Fifth Symposium on Operating Systems Design and Implementation
-
-
Osman, S.1
Subhraveti, D.2
Su, G.3
Nieh, J.4
-
26
-
-
33845436998
-
-
EPCKPT
-
E. Pinheiro. EPCKPT. Available from http://www.research.rutgers.edu/ ~edpin/epckpt.
-
-
-
Pinheiro, E.1
-
27
-
-
85084159983
-
Libckpt: Transparent checkpointing under unix
-
New Orleans, Louisiana, January 16-20
-
J. S. Plank, M. Beck, G. Kingsley, and K. Li. Libckpt: Transparent Checkpointing under Unix. In Proceedings of the Usenix Winter 1995 Technical Conference, New Orleans, Louisiana, January 16-20, 1995.
-
(1995)
Proceedings of the Usenix Winter 1995 Technical Conference
-
-
Plank, J.S.1
Beck, M.2
Kingsley, G.3
Li, K.4
-
30
-
-
12444268355
-
On the feasibility of incremental checkpointing for scientific computing
-
Santa Fe, New Mexico, April 26-30
-
J. C. Sancho, F. Petrini, G. Johnson, J. Fernández, and E. Frachtenberg. On the Feasibility of Incremental Checkpointing for Scientific Computing. In Proceedings of the 18th International Parallel & Distributed Processing Symposium, Santa Fe, New Mexico, April 26-30, 2004.
-
(2004)
Proceedings of the 18th International Parallel & Distributed Processing Symposium
-
-
Sancho, J.C.1
Petrini, F.2
Johnson, G.3
Fernández, J.4
Frachtenberg, E.5
-
31
-
-
20444444457
-
The LAM/MPI checkpoint/restart framework: System-initiated checkpointing
-
Santa Fe, New Mexico, October 12-14
-
S. Sankaran, J. M. Squyres, B. Barrett, A. Lumsdaine, J. Duell, P. Hargrove, and E. Roman. The LAM/MPI Checkpoint/Restart Framework: System-Initiated Checkpointing. In Proceedings of the LACSI Symposium, Santa Fe, New Mexico, October 12-14, 2003.
-
(2003)
Proceedings of the LACSI Symposium
-
-
Sankaran, S.1
Squyres, J.M.2
Barrett, B.3
Lumsdaine, A.4
Duell, J.5
Hargrove, P.6
Roman, E.7
-
32
-
-
84934312471
-
Implementation and evaluation of a scalable application-level checkpoint-recovery scheme for MPI programs
-
Pittsburgh, PA, November 10-16
-
Martin Schulz, Greg Bronevetsky, Rohit Fernandes, Daniel Marques, Keshav Pingali, , and Paul Stodghill. Implementation and Evaluation of a Scalable Application-level Checkpoint-Recovery Scheme for MPI Programs. In ACM/IEEE SC2004, Pittsburgh, PA, November 10-16, 2004.
-
(2004)
ACM/IEEE SC2004
-
-
Schulz, M.1
Bronevetsky, G.2
Fernandes, R.3
Marques, D.4
Pingali, K.5
Stodghill, P.6
-
34
-
-
0003595929
-
-
The ASCI Sweep3D Benchmark. Available from http://www.llnl.gov/ asci_benchmarks/asci/limited/sweep3d/.
-
The ASCI Sweep3D Benchmark
-
-
-
35
-
-
8344283205
-
-
Technical Report CUCS-014-01, Department of Computer Science, Columbia University, New York, November
-
H. Zhong and J. Nieh. CRAK: Linux Checkpoint/Restart as a Kernel Module. Technical Report CUCS-014-01, Department of Computer Science, Columbia University, New York, November 2001.
-
(2001)
CRAK: Linux Checkpoint/Restart as a Kernel Module
-
-
Zhong, H.1
Nieh, J.2
|