-
1
-
-
84881083374
-
-
Weka: The University of Waikato. Available at
-
Weka: The University of Waikato. Machine learning software in Java. Available at: http://www.cs.waikato.ac.nz/ml/weka/.
-
Machine Learning Software in Java
-
-
-
3
-
-
40849089513
-
Model-based performance evaluation of distributed checkpointing protocols
-
A. Agbaria and R. Friedman. Model-based performance evaluation of distributed checkpointing protocols. Performance Evaluation, 65(5):345-365, 2008.
-
(2008)
Performance Evaluation
, vol.65
, Issue.5
, pp. 345-365
-
-
Agbaria, A.1
Friedman, R.2
-
4
-
-
1542679193
-
Objective Bayesian analysis of spatially correlated data
-
J. O. Berger, V. D. Oliveira, and B. Sansó. Objective Bayesian analysis of spatially correlated data. Journal of the American Statistical Association, 96(456):1361-1374, 2001.
-
(2001)
Journal of the American Statistical Association
, vol.96
, Issue.456
, pp. 1361-1374
-
-
Berger, J.O.1
Oliveira, V.D.2
Sansó, B.3
-
5
-
-
27544473955
-
Nonstop advanced architecture
-
D. Bernick, B. Bruckert, P. D. Vigna, D. Garcia, R. Jardine, J. Klecka, and J. Smullen. Nonstop advanced architecture. In Proceedings of IEEE International Conference on Depend- able Systems and Networks (DSN), 2005.
-
(2005)
Proceedings of IEEE International Conference on Depend- Able Systems and Networks (DSN)
-
-
Bernick, D.1
Bruckert, B.2
Vigna, P.D.3
Garcia, D.4
Jardine, R.5
Klecka, J.6
Smullen, J.7
-
6
-
-
74049111423
-
Compiler-enhanced incremental checkpointing for openmp applications
-
G. Bronevetsky, D. J. Marques, K. K. Pingali, R. Rugina, and S. A. McKee. Compiler-enhanced incremental checkpointing for openmp applications. In Proceedings of ACM Symposium on Principles and Practice of Parallel Programming (PPoPP), 2008.
-
(2008)
Proceedings of ACM Symposium on Principles and Practice of Parallel Programming (PPoPP)
-
-
Bronevetsky, G.1
Marques, D.J.2
Pingali, K.K.3
Rugina, R.4
McKee, S.A.5
-
7
-
-
70449914816
-
Dynamic content web applications: Crash, failover, and recovery analysis
-
L. E. Buzato, G. M. D. Vieira, and W. Zwaenepoel. Dynamic content web applications: Crash, failover, and recovery analysis. In Proceedings of International Conference on Dependable Systems and Networks (DSN), 2009.
-
(2009)
Proceedings of International Conference on Dependable Systems and Networks (DSN)
-
-
Buzato, L.E.1
Vieira, G.M.D.2
Zwaenepoel, W.3
-
9
-
-
0036504529
-
Matching and scheduling algorithms for minimizing execution time and failure probability of applications in heterogeneous computing
-
A. Dogan and F. Özgüner. Matching and scheduling algorithms for minimizing execution time and failure probability of applications in heterogeneous computing. IEEE Transactions on Parallel and Distributed Systems, 13(3):308-323, 2002.
-
(2002)
IEEE Transactions on Parallel and Distributed Systems
, vol.13
, Issue.3
, pp. 308-323
-
-
Dogan, A.1
Özgüner, F.2
-
10
-
-
0042078549
-
A survey of rollback-recovery protocols in message-passing systems
-
E. N. M. Elnozahy, L. Alvisi, Y.-M. Wang, and D. B. Johnson. A survey of rollback-recovery protocols in message-passing systems. ACM Computing Surveys, 34(3):375-408, 2002.
-
(2002)
ACM Computing Surveys
, vol.34
, Issue.3
, pp. 375-408
-
-
Elnozahy, E.N.M.1
Alvisi, L.2
Wang, Y.-M.3
Johnson, D.B.4
-
11
-
-
4043157227
-
Reliability, availability, and serviceability (RAS) of the IBM eServer z990
-
M. L. Fair, C. R. Conklin, S. B. Swaney, P. J. Meaney, W. J. Clarke, L. C. Alves, I. N. Modi, F. Freier, W. Fischer, and N. E. Weber. Reliability, availability, and serviceability (RAS) of the IBM eServer z990. IBM Journal of Research and Development, 48(3-4), 2004.
-
(2004)
IBM Journal of Research and Development
, vol.48
, Issue.3-4
-
-
Fair, M.L.1
Conklin, C.R.2
Swaney, S.B.3
Meaney, P.J.4
Clarke, W.J.5
Alves, L.C.6
Modi, I.N.7
Freier, F.8
Fischer, W.9
Weber, N.E.10
-
13
-
-
76849100508
-
Failure-aware resource management for highavailability computing clusters with distributed virtual machines
-
In Press
-
S. Fu. Failure-aware resource management for highavailability computing clusters with distributed virtual machines. Journal of Parallel and Distributed Computing, In Press, 2010.
-
(2010)
Journal of Parallel and Distributed Computing
-
-
Fu, S.1
-
18
-
-
33947184459
-
Analytical models for architecture-based software reliability prediction: A unification framework
-
S. S. Gokhale and K. S. Trivedi. Analytical models for architecture-based software reliability prediction: A unification framework. IEEE Transactions on Reliability, 55(4):578-590, 2006.
-
(2006)
IEEE Transactions on Reliability
, vol.55
, Issue.4
, pp. 578-590
-
-
Gokhale, S.S.1
Trivedi, K.S.2
-
19
-
-
55849147399
-
Dynamic meta-learning for failure prediction in large-scale systems: A case study
-
J. Gu, Z. Zheng, Z. Lan, J. White, E. Hocks, and B.-H. Park. Dynamic meta-learning for failure prediction in large-scale systems: A case study. In Proceedings of IEEE International Conference on Parallel Processing (ICPP), 2008.
-
(2008)
Proceedings of IEEE International Conference on Parallel Processing (ICPP)
-
-
Gu, J.1
Zheng, Z.2
Lan, Z.3
White, J.4
Hocks, E.5
Park, B.-H.6
-
26
-
-
33845589803
-
BlueGene/L failure analysis and prediction models
-
Y. Liang, Y. Zhang, A. Sivasubramaniam, M. Jette, and R. K. Sahoo. BlueGene/L failure analysis and prediction models. In Proceedings of International Conference on Dependable Systems and Networks (DSN), 2006.
-
(2006)
Proceedings of International Conference on Dependable Systems and Networks (DSN)
-
-
Liang, Y.1
Zhang, Y.2
Sivasubramaniam, A.3
Jette, M.4
Sahoo, R.K.5
-
27
-
-
27544497222
-
Filtering failure logs for a BlueGene/L prototype
-
Y. Liang, Y. Zhang, A. Sivasubramaniam, R. Sahoo, J. Moreira, and M. Gupta. Filtering failure logs for a BlueGene/L prototype. In Proceedings of Conference on Dependable Systems and Networks (DSN), 2005.
-
(2005)
Proceedings of Conference on Dependable Systems and Networks (DSN)
-
-
Liang, Y.1
Zhang, Y.2
Sivasubramaniam, A.3
Sahoo, R.4
Moreira, J.5
Gupta, M.6
-
29
-
-
1442309284
-
On the reliability of the IBM MVS/XA operating system
-
S. Mourad and D. Andrews. On the reliability of the IBM MVS/XA operating system. IEEE Transactions on Software Engineering, 13(10):1135-1139, 1987.
-
(1987)
IEEE Transactions on Software Engineering
, vol.13
, Issue.10
, pp. 1135-1139
-
-
Mourad, S.1
Andrews, D.2
-
34
-
-
20444463471
-
A dynamic and reliability-driven scheduling algorithm for parallel real-time jobs executing on heterogeneous clusters
-
X. Qin and H. Jiang. A dynamic and reliability-driven scheduling algorithm for parallel real-time jobs executing on heterogeneous clusters. Journal of Parallel and Distributed Computing, 65(8):885-900, 2005.
-
(2005)
Journal of Parallel and Distributed Computing
, vol.65
, Issue.8
, pp. 885-900
-
-
Qin, X.1
Jiang, H.2
-
44
-
-
67650672322
-
Beyond availability: Towards a deeper understanding of machine failure characteristics in large distributed systems
-
P. Yalagandula, S. Nath, H. Yu, P. B. Gibbons, and S. Sesha. Beyond availability: Towards a deeper understanding of machine failure characteristics in large distributed systems. In Proceedings of USENIX WORLDS, 2004.
-
(2004)
Proceedings of USENIX WORLDS
-
-
Yalagandula, P.1
Nath, S.2
Yu, H.3
Gibbons, P.B.4
Sesha, S.5
|