-
1
-
-
21644455102
-
Performance debugging for distributed systems of black boxes
-
M. K. Aguilera, J. C. Mogul, J. L. Wiener, P. Reynolds, and A. Methitacharoen. Performance debugging for distributed systems of black boxes. In SOSP, pages 74-89, 2003.
-
(2003)
SOSP
, pp. 74-89
-
-
Aguilera, M.K.1
Mogul, J.C.2
Wiener, J.L.3
Reynolds, P.4
Methitacharoen, A.5
-
2
-
-
36949029788
-
Towards highly reliable enterprise network services via inference of multi-level dependencies
-
P. Bahl, R. Chandra, A. Greenberg, S. Kandula, D. A. Maltz, and M. Zhang. Towards highly reliable enterprise network services via inference of multi-level dependencies. In SIGCOMM, 2007.
-
(2007)
SIGCOMM
-
-
Bahl, P.1
Chandra, R.2
Greenberg, A.3
Kandula, S.4
Maltz, D.A.5
Zhang, M.6
-
3
-
-
85030750455
-
Using Magpie for request extraction and workload modelling
-
P. Barham, A. Donnelly, R. Isaacs, and R. Mortier. Using Magpie for request extraction and workload modelling. In OSDI, 2004.
-
(2004)
OSDI
-
-
Barham, P.1
Donnelly, A.2
Isaacs, R.3
Mortier, R.4
-
5
-
-
84952324793
-
An active approach to characterizing dynamic dependencies for problem determination in a distributed environment
-
Seattle, WA
-
A. Brown, G. Kar, and A. Keller. An active approach to characterizing dynamic dependencies for problem determination in a distributed environment. In IEEE IM, pages 377-390, Seattle, WA, 2001.
-
(2001)
IEEE IM
, pp. 377-390
-
-
Brown, A.1
Kar, G.2
Keller, A.3
-
6
-
-
80052414776
-
Path-based failure and evolution management
-
M. Y. Chen, A. Accardi, E. Kiciman, J. Lloyd, D. Patterson, A. Fox, and E. Brewer. Path-based failure and evolution management. In NSDI, 2004.
-
(2004)
NSDI
-
-
Chen, M.Y.1
Accardi, A.2
Kiciman, E.3
Lloyd, J.4
Patterson, D.5
Fox, A.6
Brewer, E.7
-
7
-
-
0036930823
-
Pinpoint: Problem determination in large, dynamic internet services
-
June
-
M. Y. Chen, E. Kiciman, E. Fratkin, A. Fox, and E. Brewer. Pinpoint: problem determination in large, dynamic internet services. In DSN, June 2002.
-
(2002)
DSN
-
-
Chen, M.Y.1
Kiciman, E.2
Fratkin, E.3
Fox, A.4
Brewer, E.5
-
8
-
-
0029215025
-
On the distributed fault diagnosis of computer networks
-
Alexandria, Egypt, June
-
S. Chutani and H. Nussbaumer. On the distributed fault diagnosis of computer networks. In IEEE Symposium on Computers and Communications, pages 71-77, Alexandria, Egypt, June 1995.
-
(1995)
IEEE Symposium on Computers and Communications
, pp. 71-77
-
-
Chutani, S.1
Nussbaumer, H.2
-
9
-
-
84885599987
-
Capturing, indexing, clustering, and retrieving system history
-
I. Cohen, S. Zhang, M. Goldszmidt, J. Symons, T. Kelly, and A. Fox. Capturing, indexing, clustering, and retrieving system history. In SOSP, 2005.
-
(2005)
SOSP
-
-
Cohen, I.1
Zhang, S.2
Goldszmidt, M.3
Symons, J.4
Kelly, T.5
Fox, A.6
-
12
-
-
72249113391
-
Debugging in the (very) large: Ten years of implementation and experience
-
K. Glerum, K. Kinshumann, S. Greenberg, G. Aul, V. Orgovan, G. Nichols, D. Grant, G. Loihle, and G. Hunt. Debugging in the (very) large: Ten years of implementation and experience. In SOSP, 2009.
-
(2009)
SOSP
-
-
Glerum, K.1
Kinshumann, K.2
Greenberg, S.3
Aul, G.4
Orgovan, V.5
Nichols, G.6
Grant, D.7
Loihle, G.8
Hunt, G.9
-
14
-
-
83155166028
-
IP fault localization via risk modeling
-
R. R. Kompella, J. Yates, A. Greenberg, and A. C. Snoeren. IP fault localization via risk modeling. In NSDI, pages 57-70, 2005.
-
(2005)
NSDI
, pp. 57-70
-
-
Kompella, R.R.1
Yates, J.2
Greenberg, A.3
Snoeren, A.C.4
-
15
-
-
72849144813
-
D3S: Debugging deployed distributed systems
-
X. Liu, Z. Guo, X. Wang, F. Chen, X. Lian, J. Tang, M. Wu, M. F. Kaashoek, and Z. Zhang. D3S: debugging deployed distributed systems. In NSDI, 2008.
-
(2008)
NSDI
-
-
Liu, X.1
Guo, Z.2
Wang, X.3
Chen, F.4
Lian, X.5
Tang, J.6
Wu, M.7
Kaashoek, M.F.8
Zhang, Z.9
-
16
-
-
79960508731
-
WiDS Checker: Combating bugs in distributed systems
-
X. Liu, W. Lin, A. Pan, and Z. Zhang. WiDS Checker: Combating bugs in distributed systems. In NSDI, 2007.
-
(2007)
NSDI
-
-
Liu, X.1
Lin, W.2
Pan, A.3
Zhang, Z.4
-
17
-
-
34548045473
-
Emergent (mis)behavior vs. complex software systems
-
J. C. Mogul. Emergent (mis)behavior vs. complex software systems. In EuroSys, 2006.
-
(2006)
EuroSys
-
-
Mogul, J.C.1
-
18
-
-
52649106991
-
Junior: The Stanford entry in the Urban Challenge
-
M. Montemerlo et al. Junior: The Stanford entry in the Urban Challenge. Journal of Field Robotics, 25(9):569-597, 2008.
-
(2008)
Journal of Field Robotics
, vol.25
, Issue.9
, pp. 569-597
-
-
Montemerlo, M.1
-
20
-
-
77956573133
-
Using correlated surprise to infer shared influence
-
A. J. Oliner, A. V. Kulkarni, and A. Aiken. Using correlated surprise to infer shared influence. In DSN, 2010.
-
(2010)
DSN
-
-
Oliner, A.J.1
Kulkarni, A.V.2
Aiken, A.3
-
21
-
-
36049013419
-
What supercomputers say: A study of five system logs
-
A. J. Oliner and J. Stearley. What supercomputers say: A study of five system logs. In DSN, 2007.
-
(2007)
DSN
-
-
Oliner, A.J.1
Stearley, J.2
-
22
-
-
77954721776
-
-
Technical report, CMU-PDL-08-112
-
X. Pan, J. Tan, S. Kavulya, R. Gandhi, and P. Narasimhan. Ganesha: Black-box fault diagnosis for MapReduce systems. Technical report, CMU-PDL-08-112, 2008.
-
(2008)
Ganesha: Black-box Fault Diagnosis for MapReduce Systems
-
-
Pan, X.1
Tan, J.2
Kavulya, S.3
Gandhi, R.4
Narasimhan, P.5
-
23
-
-
77957793926
-
Pip: Detecting the unexpected in distributed systems
-
P. Reynolds, C. Killian, J. L. Wiener, J. C. Mogul, M. A. Shah, and A. Vahdat. Pip: Detecting the unexpected in distributed systems. In NSDI, 2006.
-
(2006)
NSDI
-
-
Reynolds, P.1
Killian, C.2
Wiener, J.L.3
Mogul, J.C.4
Shah, M.A.5
Vahdat, A.6
-
24
-
-
34250636365
-
WAP5: Black-box performance debugging for wide-area systems
-
P. Reynolds, J. L. Wiener, J. C. Mogul, M. K. Aguilera, and A. Vahdat. WAP5: black-box performance debugging for wide-area systems. In WWW, 2006.
-
(2006)
WWW
-
-
Reynolds, P.1
Wiener, J.L.2
Mogul, J.C.3
Aguilera, M.K.4
Vahdat, A.5
-
25
-
-
4544231612
-
Real-time problem determination in distributed systems using active probing
-
I. Rish, M. Brodie, N. Odintsova, S. Ma, and G. Grabarnik. Real-time problem determination in distributed systems using active probing. In NOMS, 2004.
-
(2004)
NOMS
-
-
Rish, I.1
Brodie, M.2
Odintsova, N.3
Ma, S.4
Grabarnik, G.5
-
26
-
-
12244279838
-
Detecting causal relationships in distributed computations: In search of the holy grail
-
March
-
R. Schwarz and F. Mettern. Detecting causal relationships in distributed computations: in search of the holy grail. Distributed Computing, 7(3):149-174, March 1994.
-
(1994)
Distributed Computing
, vol.7
, Issue.3
, pp. 149-174
-
-
Schwarz, R.1
Mettern, F.2
-
28
-
-
77954724309
-
-
The Computer Failure Data Repository (CFDR). The HPC4 data. http://cfdr.usenix.org/data.html, 2009.
-
(2009)
The HPC4 Data
-
-
-
29
-
-
33750024797
-
Stanley: The robot that won the DARPA Grand Challenge
-
June
-
S. Thrun and M. Montemerlo, et al. Stanley: The robot that won the DARPA Grand Challenge. Journal of Field Robotics, 23(9):661-692, June 2006.
-
(2006)
Journal of Field Robotics
, vol.23
, Issue.9
, pp. 661-692
-
-
Thrun, S.1
Montemerlo, M.2
-
30
-
-
72249121870
-
Detecting large-scale system problems by mining console logs
-
W. Xu, L. Huang, A. Fox, D. Patterson, and M. I. Jordan. Detecting large-scale system problems by mining console logs. In SOSP, 2009.
-
(2009)
SOSP
-
-
Xu, W.1
Huang, L.2
Fox, A.3
Patterson, D.4
Jordan, M.I.5
-
31
-
-
0028166447
-
Understanding "why" in software process modelling, analysis, and design
-
Sorrento, Italy, May
-
E. S. K. Yu and J. Mylopoulos. Understanding "why" in software process modelling, analysis, and design. In ICSE, Sorrento, Italy, May 1994.
-
(1994)
ICSE
-
-
Yu, E.S.K.1
Mylopoulos, J.2
|