-
3
-
-
33845593340
-
A large scale study of failures in high performance-computing systems
-
B. Schroeder and G. Gibson, "A large scale study of failures in high performance-computing systems," in Proc. of DSN '06, 2006.
-
(2006)
Proc. of DSN '06
-
-
Schroeder, B.1
Gibson, G.2
-
4
-
-
0042078549
-
A survey of rollback recovery protocols in message-passing systems
-
E. Elnozahy, L. Alvisi, Y. Wang, and D. Johnson, "A survey of rollback recovery protocols in message-passing systems," ACM Computing Surveys, vol. 34(3), 2002.
-
(2002)
ACM Computing Surveys
, vol.34
, Issue.3
-
-
Elnozahy, E.1
Alvisi, L.2
Wang, Y.3
Johnson, D.4
-
5
-
-
9144223280
-
Checkpointing for peta-scale systems: A look into the future of practical rollback-recovery
-
E. Elnozahy and J. Plank, "Checkpointing for peta-scale systems: A look into the future of practical rollback-recovery," IEEE Transactions on Dependable and Secure Computing, vol. 1(2), 2004.
-
(2004)
IEEE Transactions on Dependable and Secure Computing
, vol.1
, Issue.2
-
-
Elnozahy, E.1
Plank, J.2
-
6
-
-
0035266102
-
Proactive management of software aging
-
V. Castelli, R. Harper, P. Heidelberger, S. Hunter, K. Trivedi, K. Vaidyanathan, and W. Zeggert, "Proactive management of software aging," IBM Journal of Research and Development, vol. 45(2), 2001.
-
(2001)
IBM Journal of Research and Development
, vol.45
, Issue.2
-
-
Castelli, V.1
Harper, R.2
Heidelberger, P.3
Hunter, S.4
Trivedi, K.5
Vaidyanathan, K.6
Zeggert, W.7
-
9
-
-
47249153592
-
A meta-learning failure predictor for Blue Gene/L systems
-
P. Gujrati, Y. Li, Z. Lan, R. Thakur, and J. White, "A meta-learning failure predictor for Blue Gene/L systems," in Proc. of International Conference on Parallel Processing, 2007.
-
(2007)
Proc. of International Conference on Parallel Processing
-
-
Gujrati, P.1
Li, Y.2
Lan, Z.3
Thakur, R.4
White, J.5
-
12
-
-
57049111494
-
Adaptive fault management of parallel applications for high performance computing
-
to appear in
-
Z. Lan and Y. Li, "Adaptive fault management of parallel applications for high performance computing", to appear in IEEE Trans. on Computers, 2008.
-
(2008)
IEEE Trans. on Computers
-
-
Lan, Z.1
Li, Y.2
-
14
-
-
51049121489
-
A fast recovery mechanism for checkpointing in networked environments
-
Illinois Institute of Technology
-
Y. Li and Z. Lan, "A fast recovery mechanism for checkpointing in networked environments", SCS Tech Report, Illinois Institute of Technology, 2007.
-
(2007)
SCS Tech Report
-
-
Li, Y.1
Lan, Z.2
-
15
-
-
77952378080
-
Critical event prediction for proactive management in large-scale computer clusters
-
R. Sahoo, A. Oliner, I. Rish, M. Gupta, J. Moreira, and S. Ma, "Critical event prediction for proactive management in large-scale computer clusters," in Proc. of SIGKDD'03, 2003.
-
(2003)
Proc. of SIGKDD'03
-
-
Sahoo, R.1
Oliner, A.2
Rish, I.3
Gupta, M.4
Moreira, J.5
Ma, S.6
-
16
-
-
33845589803
-
Blue Gene/L failure analysis and prediction models
-
Y. Liang, Y. Zhang, A. Sivasubramaniam, M. Jette, and R. Sahoo, "Blue Gene/L failure analysis and prediction models," in Proc. of DSN'06, 2006.
-
(2006)
Proc. of DSN'06
-
-
Liang, Y.1
Zhang, Y.2
Sivasubramaniam, A.3
Jette, M.4
Sahoo, R.5
-
18
-
-
0003922190
-
-
Wiley Interscience, New York, NY, 2nd edition
-
R. Duda, P. Hart, and D. Stork. Pattern Classification. Wiley Interscience, New York, NY, 2001. 2nd edition.
-
(2001)
Pattern Classification
-
-
Duda, R.1
Hart, P.2
Stork, D.3
-
19
-
-
33748611921
-
Ensemble based systems in decision making
-
R. Polikar, "Ensemble based systems in decision making", IEEE Circuits and Systems Magazine, vol. 6(3), 2006.
-
(2006)
IEEE Circuits and Systems Magazine
, vol.6
, Issue.3
-
-
Polikar, R.1
-
20
-
-
0034133513
-
Distance-based outliers: Algorithms and applications
-
E. M. Knorr, R. T. Ng, and V. Tucakov, "Distance-based outliers: Algorithms and applications," The VLDB Journal, vol. 8, pp. 237-253, 2000.
-
(2000)
The VLDB Journal
, vol.8
, pp. 237-253
-
-
Knorr, E.M.1
Ng, R.T.2
Tucakov, V.3
-
21
-
-
51049108066
-
MPICH-V: A multiprotocol automatic fault tolerant MPI
-
A. Bouteiller, T. Herault, G. Krawezik, P. Lemarinier, and F. Cappello, "MPICH-V: A multiprotocol automatic fault tolerant MPI," International Journal of High Performance Computing and Applications, 2005.
-
(2005)
International Journal of High Performance Computing and Applications
-
-
Bouteiller, A.1
Herault, T.2
Krawezik, G.3
Lemarinier, P.4
Cappello, F.5
-
22
-
-
33751107476
-
MPI-Mitten: Enabling migration technology in MPI
-
C. Du and X. Sun, "MPI-Mitten: Enabling migration technology in MPI," in Proc. of CCGrid'06, 2006.
-
(2006)
Proc. of CCGrid'06
-
-
Du, C.1
Sun, X.2
-
24
-
-
0012283032
-
Achieving extreme resolution in numerical cosmology using adaptive mesh refinement: Resolving primordial star formation
-
G. Bryan, T. Abel, and M. Norman, "Achieving extreme resolution in numerical cosmology using adaptive mesh refinement: Resolving primordial star formation," in Proc. of SC'01, 2001.
-
(2001)
Proc. of SC'01
-
-
Bryan, G.1
Abel, T.2
Norman, M.3
-
25
-
-
0029633168
-
GROMACS: A message-passing parallel molecular dynamics implementation
-
H. Berendsen, D. van der Spoel, and R. van Drunen, "GROMACS: A message-passing parallel molecular dynamics implementation," Comp. Phys. Comm., vol. 91, pp. 43-56, 1995.
-
(1995)
Comp. Phys. Comm
, vol.91
, pp. 43-56
-
-
Berendsen, H.1
van der Spoel, D.2
van Drunen, R.3
-
26
-
-
84882885699
-
-
Online, Available
-
NASA NAS Parallel Benchmarks. [Online]. Available: http://www.nas.nasa.gov/Resources/Software/npb.html
-
NASA NAS Parallel Benchmarks
-
-