-
1
-
-
33845595513
-
Performance implications of failures in large-scale cluster scheduling
-
Y. Zhang, M. S. Squillante, A. Sivasubramaniam, and R. K. Sahoo, "Performance implications of failures in large-scale cluster scheduling, " in International Workshop on Job Scheduling Strategies for Parallel Processing (JSSPP), 2004.
-
(2004)
International Workshop on Job Scheduling Strategies for Parallel Processing (JSSPP)
-
-
Zhang, Y.1
Squillante, M.S.2
Sivasubramaniam, A.3
Sahoo, R.K.4
-
3
-
-
33847149362
-
Recent advances in checkpoint/recovery systems
-
G. Bronevetsky, R. Fernandes, D. Marques, K. Pingali, and P. Stodghill, "Recent advances in checkpoint/recovery systems, " in IEEE Parallel and Distributed Processing Symposium (IPDPS), 2006.
-
(2006)
IEEE Parallel and Distributed Processing Symposium (IPDPS)
-
-
Bronevetsky, G.1
Fernandes, R.2
Marques, D.3
Pingali, K.4
Stodghill, P.5
-
4
-
-
9144223280
-
Checkpointing for peta-scale systems: A look into the future of practical rollback-recovery
-
E. Elnozahy and J. Plank, "Checkpointing for peta-scale systems: A look into the future of practical rollback-recovery, " IEEE Transactions on Dependable and Secure Computing, vol. 1, no. 2, pp. 97-108, 2004.
-
(2004)
IEEE Transactions on Dependable and Secure Computing
, vol.1
, Issue.2
, pp. 97-108
-
-
Elnozahy, E.1
Plank, J.2
-
5
-
-
84877693592
-
Fault prediction under the microscope: A closer look into HPC systems
-
A. Gainaru, F. Cappello, M. Snir, and W. Kramer, "Fault prediction under the microscope: A closer look into HPC systems, " in IEEE/ACM Conference on High Performance Computing Networking, Storage and Analysis (SC), 2012.
-
(2012)
IEEE/ACM Conference on High Performance Computing Networking, Storage and Analysis (SC)
-
-
Gainaru, A.1
Cappello, F.2
Snir, M.3
Kramer, W.4
-
6
-
-
84886069616
-
-
"http://www.infodrom.org/projects/sysklogd/."
-
-
-
-
8
-
-
33845589803
-
BlueGene/L failure analysis and prediction models
-
Y. Liang, Y. Zhang, M. Jette, A. Sivasubramaniam, and R. Sahoo, "BlueGene/L failure analysis and prediction models, " in International Conference on Dependable Systems and Networks (DSN), 2006, pp. 425-434.
-
(2006)
International Conference on Dependable Systems and Networks (DSN)
, pp. 425-434
-
-
Liang, Y.1
Zhang, Y.2
Jette, M.3
Sivasubramaniam, A.4
Sahoo, R.5
-
9
-
-
0024610919
-
A tutorial on hidden Markov models and selected applications in speech recognition
-
L. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition, " Proceedings of the IEEE, vol. 77, no. 2, pp. 257-286, 1989.
-
(1989)
Proceedings of the IEEE
, vol.77
, Issue.2
, pp. 257-286
-
-
Rabiner, L.1
-
10
-
-
84863395085
-
Advances and challenges in log analysis
-
A. Oliner, G. Ganapathi, and W. Xu, "Advances and challenges in log analysis, " Communications of the ACM, vol. 55, no. 2, pp. 55-61, 2012.
-
(2012)
Communications of the ACM
, vol.55
, Issue.2
, pp. 55-61
-
-
Oliner, A.1
Ganapathi, G.2
Xu, W.3
-
14
-
-
67049096648
-
Alert detection in system logs
-
A. Oliner, A. Aiken, and J. Stearley, "Alert detection in system logs, " in IEEE International Conference on Data Mining, 2008, pp. 959-964.
-
(2008)
IEEE International Conference on Data Mining
, pp. 959-964
-
-
Oliner, A.1
Aiken, A.2
Stearley, J.3
-
15
-
-
85092792131
-
Analyzing system logs: A new view of what's important
-
S. Sabato, E. Yom-Tov, A. Tsherniak, and S. Rosset, "Analyzing system logs: A new view of what's important, " in Workshop on Computer Systems with Machine Learning (SysML), 2007.
-
(2007)
Workshop on Computer Systems with Machine Learning (SysML)
-
-
Sabato, S.1
Yom-Tov, E.2
Tsherniak, A.3
Rosset, S.4
-
16
-
-
84870004832
-
Experience mining Google's production console logs
-
W. Xu, L. Huang, A. Fox, D. Patterson, and M. Jordan, "Experience mining Google's production console logs, " in Workshop on Managing Systems via Log Analysis and Machine Learning Techniques (SLAML), 2010.
-
(2010)
Workshop on Managing Systems Via Log Analysis and Machine Learning Techniques (SLAML)
-
-
Xu, W.1
Huang, L.2
Fox, A.3
Patterson, D.4
Jordan, M.5
-
18
-
-
33845583903
-
An approach for detecting and distinguishing errors versus attacks in sensor networks
-
C. Basile, M. Gupta, Z. Kalbarczyk, and R. K. Iyer, "An approach for detecting and distinguishing errors versus attacks in sensor networks, " in Proceedings of the International Conference on Dependable Systems and Networks, ser. DSN '06, 2006, pp. 473-484.
-
(2006)
Proceedings of the International Conference on Dependable Systems and Networks, Ser. DSN '06
, pp. 473-484
-
-
Basile, C.1
Gupta, M.2
Kalbarczyk, Z.3
Iyer, R.K.4
|