-
1
-
-
0032597670
-
An analysis of communication induced checkpointing
-
L. Alvisi, E. Elnozahy, S. Rao, S. A. Husain, and A. De Mel. An analysis of communication induced checkpointing. In Fault-Tolerant Computing, 1999. doi:10.1109/FTCS.1999.781058.
-
(1999)
Fault-Tolerant Computing
-
-
Alvisi, L.1
Elnozahy, E.2
Rao, S.3
Husain, S.A.4
De Mel, A.5
-
2
-
-
0028532579
-
Why cryptosystems fail
-
Nov
-
R. J. Anderson. Why cryptosystems fail. Commun. ACM, 37, Nov. 1994. doi:10.1145/188280.188291.
-
(1994)
Commun. ACM
, vol.37
-
-
Anderson, R.J.1
-
4
-
-
85077335065
-
Flux: A language for programming high-performance servers
-
B. Burns, K. Grimaldi, A. Kostadinov, E. D. Berger, and M. D. Corner. Flux: A language for programming high-performance servers. In USENIX ATC, 2006.
-
(2006)
USENIX ATC
-
-
Burns, B.1
Grimaldi, K.2
Kostadinov, A.3
Berger, E.D.4
Corner, M.D.5
-
5
-
-
0022020346
-
Distributed snapshots: Determining global states of a distributed system
-
Feb
-
K. M. Chandy and L. Lamport. Distributed snapshots: Determining global states of a distributed system. ACM TOCS, 3(1):63-75, Feb. 1985. doi:10.1145/214451.214456.
-
(1985)
ACM TOCS
, vol.3
, Issue.1
, pp. 63-75
-
-
Chandy, K.M.1
Lamport, L.2
-
8
-
-
85030321143
-
MapReduce: Simplified data processing on large clusters
-
acmid:1251264
-
J. Dean and S. Ghemawat. MapReduce: simplified data processing on large clusters. In OSDI, 2004. acmid:1251264.
-
(2004)
OSDI
-
-
Dean, J.1
Ghemawat, S.2
-
9
-
-
0042078549
-
A survey of rollback-recovery protocols in message-passing systems
-
Sept
-
E. N. M. Elnozahy, L. Alvisi, Y.-M. Wang, and D. B. Johnson. A survey of rollback-recovery protocols in message-passing systems. ACM Comput. Surv., 34:375-408, Sept. 2002. doi:10.1145/568522.568525.
-
(2002)
ACM Comput. Surv.
, vol.34
, pp. 375-408
-
-
Elnozahy, E.N.M.1
Alvisi, L.2
Wang, Y.-M.3
Johnson, D.B.4
-
11
-
-
0041898464
-
Condor-G: A computation management agent for multiinstitutional grids
-
J. Frey, T. Tannenbaum, M. Livny, I. Foster, and S. Tuecke. Condor-G: A computation management agent for multiinstitutional grids. Cluster Computing, 5(3):237-246, 2002.
-
(2002)
Cluster Computing
, vol.5
, Issue.3
, pp. 237-246
-
-
Frey, J.1
Tannenbaum, T.2
Livny, M.3
Foster, I.4
Tuecke, S.5
-
13
-
-
85094320514
-
Friday: Global comprehension for distributed replay
-
D. Geels, G. Altekar, P. Maniatis, T. Roscoe, and I. Stoica. Friday: Global comprehension for distributed replay. In NSDI, 2007. http://www.usenix.org/event/nsdi07/tech/geels.html.
-
(2007)
NSDI
-
-
Geels, D.1
Altekar, G.2
Maniatis, P.3
Roscoe, T.4
Stoica, I.5
-
15
-
-
34548041192
-
Dryad: Distributed data-parallel programs from sequential building blocks
-
Mar. acmid:1273005
-
M. Isard, M. Budiu, Y. Yu, A. Birrell, and D. Fetterly. Dryad: distributed data-parallel programs from sequential building blocks. SIGOPS OS Rev., 41:59-72, Mar. 2007. acmid:1273005.
-
(2007)
SIGOPS OS Rev.
, vol.41
, pp. 59-72
-
-
Isard, M.1
Budiu, M.2
Yu, Y.3
Birrell, A.4
Fetterly, D.5
-
16
-
-
85080555471
-
-
T. Kelly. http://ai.eecs.umich.edu/~tpkelly/Ken/.
-
-
-
Kelly, T.1
-
17
-
-
79954434449
-
-
Technical report, HP Labs
-
T. Kelly, A. H. Karp, M. Stiegler, T. Close, and H. K. Cho. Output-valid rollback-recovery. Technical report, HP Labs, 2010. http://www.hpl.hp.com/techreports/2010/HPL-2010-155.pdf.
-
(2010)
Output-Valid Rollback-Recovery
-
-
Kelly, T.1
Karp, A.H.2
Stiegler, M.3
Close, T.4
Cho, H.K.5
-
18
-
-
85080536037
-
-
C. Killian. http://www.macesystems.org/maceken/.
-
-
-
Killian, C.1
-
19
-
-
78751480311
-
Finding latent performance bugs in systems implementations
-
C. Killian, K. Nagaraj, S. Pervez, R. Braud, J. W. Anderson, and R. Jhala. Finding latent performance bugs in systems implementations. In FSE, 2010. doi:10.1145/1882291.1882297.
-
(2010)
FSE
-
-
Killian, C.1
Nagaraj, K.2
Pervez, S.3
Braud, R.4
Anderson, J.W.5
Jhala, R.6
-
20
-
-
35448934440
-
Mace: Language support for building distributed systems
-
C. E. Killian, J. W. Anderson, R. Braud, R. Jhala, and A. M. Vahdat. Mace: language support for building distributed systems. In PLDI, 2007. doi:10.1145/1250734.1250755.
-
(2007)
PLDI
-
-
Killian, C.E.1
Anderson, J.W.2
Braud, R.3
Jhala, R.4
Vahdat, A.M.5
-
21
-
-
77953746059
-
Splay: Distributed systems evaluation made simple
-
L. Leonini, É. Rivière, and P. Felber. Splay: Distributed systems evaluation made simple. In NSDI, 2009. Available from: http://www.usenix.org/event/nsdi09/tech/.
-
(2009)
NSDI
-
-
Leonini, L.1
Rivière, É.2
Felber, P.3
-
22
-
-
0024138827
-
Condor - a hunter of idle workstations
-
M. Litzkow, M. Livny, and M. Mutka. Condor - a hunter of idle workstations. In ICDCS, volume 43, 1988.
-
(1988)
ICDCS
, vol.43
-
-
Litzkow, M.1
Livny, M.2
Mutka, M.3
-
23
-
-
78650646163
-
Smart dust? Not quite, but we're getting there
-
Jan
-
S. Lohr. Smart dust? Not quite, but we're getting there. New York Times, Jan. 2010. http://www.nytimes.com/2010/01/31/business/31unboxed.html.
-
(2010)
New York Times
-
-
Lohr, S.1
-
24
-
-
84885583527
-
Implementing declarative overlays
-
B. T. Loo, T. Condie, J. M. Hellerstein, P. Maniatis, T. Roscoe, and I. Stoica. Implementing declarative overlays. In SOSP, 2005. doi:10.1145/1095810.1095818.
-
(2005)
SOSP
-
-
Loo, B.T.1
Condie, T.2
Hellerstein, J.M.3
Maniatis, P.4
Roscoe, T.5
Stoica, I.6
-
25
-
-
84885616829
-
Exploring failure transparency and the limits of generic recovery
-
D. E. Lowell, S. Chandra, and P. M. Chen. Exploring failure transparency and the limits of generic recovery. In OSDI, 2000.
-
(2000)
OSDI
-
-
Lowell, D.E.1
Chandra, S.2
Chen, P.M.3
-
27
-
-
85084162072
-
A toolkit for user-level file systems
-
D. Mazières. A toolkit for user-level file systems. In USENIX ATC, 2001.
-
(2001)
USENIX ATC
-
-
Mazières, D.1
-
28
-
-
85049119901
-
CIEL: A universal execution engine for distributed data-flow computing
-
D. G. Murray, M. Schwarzkopf, C. Smowton, S. Smith, A. Madhavapeddy, and S. Hand. CIEL: a universal execution engine for distributed data-flow computing. In NSDI, 2011. http://www.usenix.org/event/nsdi11/tech/full_papers/Murray.pdf.
-
(2011)
NSDI
-
-
Murray, D.G.1
Schwarzkopf, M.2
Smowton, C.3
Smith, S.4
Madhavapeddy, A.5
Hand, S.6
-
30
-
-
85080516050
-
-
http://linux-mm.org/OverCommitAccounting.
-
-
-
-
34
-
-
27544432494
-
Scalability and accuracy in a large-scale network emulator
-
A. Vahdat, K. Yocum, K. Walsh, P. Mahadevan, D. Kostić, J. Chase, and D. Becker. Scalability and accuracy in a large-scale network emulator. In OSDI, 2002. http://www.usenix.org/event/osdi02/tech/vahdat.html.
-
(2002)
OSDI
-
-
Vahdat, A.1
Yocum, K.2
Walsh, K.3
Mahadevan, P.4
Kostić, D.5
Chase, J.6
Becker, D.7
-
35
-
-
70450040454
-
Protection and communication abstractions for web browsers in MashupOS
-
H. Wang, X. Fan, J. Howell, and C. Jackson. Protection and communication abstractions for web browsers in MashupOS. In SOSP, 2007.
-
(2007)
SOSP
-
-
Wang, H.1
Fan, X.2
Howell, J.3
Jackson, C.4
-
36
-
-
79960538448
-
InContext: Simple parallelism for distributed applications
-
S. Yoo, H. Lee, C. Killian, and M. Kulkarni. InContext: simple parallelism for distributed applications. In HPDC, 2011. doi:10.1145/1996130.1996144.
-
(2011)
HPDC
-
-
Yoo, S.1
Lee, H.2
Killian, C.3
Kulkarni, M.4
-
37
-
-
33846555330
-
Multiprocessor support for event-driven programs
-
June
-
N. Zeldovich, A. Yip, F. Dabek, R. Morris, D. Mazières, and F. Kaashoek. Multiprocessor support for event-driven programs. In USENIX ATC, June 2003.
-
(2003)
USENIX ATC
-
-
Zeldovich, N.1
Yip, A.2
Dabek, F.3
Morris, R.4
Mazières, D.5
Kaashoek, F.6