-
1
-
-
0036041277
-
Improving cluster availability using workstation validation
-
Heath, T., Martin, R.P., Nguyen, T.D.: Improving cluster availability using workstation validation. In: SIGMETRICS, pp. 217-227 (2002)
-
(2002)
SIGMETRICS
, pp. 217-227
-
-
Heath, T.1
Martin, R.P.2
Nguyen, T.D.3
-
2
-
-
77955897418
-
Total recall: System support for automated availability management
-
Bhagwan, R., Tati, K., Cheng, Y., Savage, S., Voelker, G.: Total recall: System support for automated availability management. In: NSDI, pp. 337-350 (2004)
-
(2004)
NSDI
, pp. 337-350
-
-
Bhagwan, R.1
Tati, K.2
Cheng, Y.3
Savage, S.4
Voelker, G.5
-
3
-
-
4544382099
-
Failure data analysis of a large-scale heterogeneous server environment
-
Sahoo, R., Sivasubramaniam, A., Squillante, M., Zhang, Y.: Failure data analysis of a large-scale heterogeneous server environment. In: DSN, p. 772 (2004)
-
(2004)
DSN
, pp. 772
-
-
Sahoo, R.1
Sivasubramaniam, A.2
Squillante, M.3
Zhang, Y.4
-
4
-
-
0027233282
-
Dependability measurement and modeling of a multicomputer system
-
Tang, D., Iyer, R.K.: Dependability measurement and modeling of a multicomputer system. IEEE Trans. Computers 42(1), 62-75 (1993)
-
(1993)
IEEE Trans. Computers
, vol.42
, Issue.1
, pp. 62-75
-
-
Tang, D.1
Iyer, R.K.2
-
5
-
-
33845593340
-
A large-scale study of failures in high-performance computing systems
-
Schroeder, B., Gibson, G.A.: A large-scale study of failures in high-performance computing systems. In: DSN, pp. 249-258 (2006)
-
(2006)
DSN
, pp. 249-258
-
-
Schroeder, B.1
Gibson, G.A.2
-
6
-
-
38049182471
-
How are real grids used? the analysis of four grid traces and its implications
-
Iosup, A., Dumitrescu, C., Epema, D.H.J., Li, H., Wolters, L.: How are real grids used? the analysis of four grid traces and its implications. In: GRID, pp. 262-269 (2006)
-
(2006)
GRID
, pp. 262-269
-
-
Iosup, A.1
Dumitrescu, C.2
Epema, D.H.J.3
Li, H.4
Wolters, L.5
-
7
-
-
0020151687
-
Derivation and calibration of a transient error reliability model
-
Castillo, X., McConnel, S.R., Siewiorek, D.P.: Derivation and calibration of a transient error reliability model. IEEE Trans. Computers 31(7), 658-671 (1982)
-
(1982)
IEEE Trans. Computers
, vol.31
, Issue.7
, pp. 658-671
-
-
Castillo, X.1
McConnel, S.R.2
Siewiorek, D.P.3
-
8
-
-
0020153888
-
A statistical failure/load relationship: Results of a multicomputer study
-
Iyer, R.K., Butner, S.E., McCluskey, E.J.: A statistical failure/load relationship: Results of a multicomputer study. IEEE Trans. Computers 31(7), 697-706 (1982)
-
(1982)
IEEE Trans. Computers
, vol.31
, Issue.7
, pp. 697-706
-
-
Iyer, R.K.1
Butner, S.E.2
McCluskey, E.J.3
-
9
-
-
0025505070
-
A Census of Tandem System Availability between 1985 and 1990
-
Gray, J.: A Census of Tandem System Availability Between 1985 and 1990. IEEE Trans. on Reliability 39, 409-418 (1990)
-
(1990)
IEEE Trans. on Reliability
, vol.39
, pp. 409-418
-
-
Gray, J.1
-
10
-
-
47249113435
-
On the dynamic resource availability in grids
-
Iosup, A., Jan, M., Sonmez, O.O., Epema, D.H.J.: On the dynamic resource availability in grids. In: GRID, pp. 26-33 (2007)
-
(2007)
GRID
, pp. 26-33
-
-
Iosup, A.1
Jan, M.2
Sonmez, O.O.3
Epema, D.H.J.4
-
11
-
-
23944448107
-
Performance implications of failures in large-scale cluster scheduling
-
Zhang, Y., Squillante, M., Sivasubramaniam, A., Sahoo, R.: Performance implications of failures in large-scale cluster scheduling. In: JSSPP, pp. 233-252 (2004)
-
(2004)
JSSPP
, pp. 233-252
-
-
Zhang, Y.1
Squillante, M.2
Sivasubramaniam, A.3
Sahoo, R.4
-
12
-
-
47249131447
-
Exploiting availability prediction in distributed systems
-
Mickens, J.W., Noble, B.D.: Exploiting availability prediction in distributed systems. In: NSDI (2006)
-
(2006)
NSDI
-
-
Mickens, J.W.1
Noble, B.D.2
-
13
-
-
0034444963
-
Feasibility of a serverless distributed file system deployed on an existing set of desktop PCs
-
Bolosky, W.J., Douceur, J.R., Ely, D., Theimer, M.: Feasibility of a serverless distributed file system deployed on an existing set of desktop PCs. In: SIGMETRICS, pp. 34-43 (2000)
-
(2000)
SIGMETRICS
, pp. 34-43
-
-
Bolosky, W.J.1
Douceur, J.R.2
Ely, D.3
Theimer, M.4
-
14
-
-
77954903245
-
The Failure Trace Archive: Enabling comparative analysis of failures in diverse distributed systems
-
Archive data available
-
Kondo, D., Javadi, B., Iosup, A., Epema, D.: The Failure Trace Archive: Enabling comparative analysis of failures in diverse distributed systems. In: CCGRID, pp. 1-10 (2010), Archive data available, http://fta.inria.fr
-
(2010)
CCGRID
, pp. 1-10
-
-
Kondo, D.1
Javadi, B.2
Iosup, A.3
Epema, D.4
-
15
-
-
12344308304
-
Basic concepts and taxonomy of dependable and secure computing
-
Avizienis, A., Laprie, J.C., Randell, B., Landwehr, C.E.: Basic concepts and taxonomy of dependable and secure computing. IEEE Trans. Dependable Sec. Comput. 1(1), 11-33 (2004)
-
(2004)
IEEE Trans. Dependable Sec. Comput.
, vol.1
, Issue.1
, pp. 11-33
-
-
Avizienis, A.1
Laprie, J.C.2
Randell, B.3
Landwehr, C.E.4
-
16
-
-
0025502686
-
Error log analysis: Statistical modeling and heuristic trend analysis
-
Lin, T.T.Y., Siewiorek, D.P.: Error log analysis: statistical modeling and heuristic trend analysis. IEEE Trans. on Reliability 39, 419-432 (1990)
-
(1990)
IEEE Trans. on Reliability
, vol.39
, pp. 419-432
-
-
Lin, T.T.Y.1
Siewiorek, D.P.2
-
18
-
-
0000068589
-
R. A. Fisher and the making of maximum likelihood 1912-1922
-
Aldrich, J.: R. A. Fisher and the making of maximum likelihood 1912-1922. Statistical Science 12(3), 162-176 (1997)
-
(1997)
Statistical Science
, vol.12
, Issue.3
, pp. 162-176
-
-
Aldrich, J.1
-
19
-
-
78349280529
-
-
Tech.Rep. PDS-2010-001, TU Delft
-
Gallet, M., Yigitbasi, N., Javadi, B., Kondo, D., Iosup, A., Epema, D.: A model for space-correlated failures in large-scale distributed systems. Tech.Rep. PDS-2010-001, TU Delft (2010), http://pds.twi.tudelft.nl/reports/2010/ PDS-2010-001.pdf
-
(2010)
A Model for Space-correlated Failures in Large-scale Distributed Systems
-
-
Gallet, M.1
Yigitbasi, N.2
Javadi, B.3
Kondo, D.4
Iosup, A.5
Epema, D.6
|