-
1
-
-
0033752069
-
Failure detection and consensus in the crash-recovery model
-
M. K. Aguilera, W. Chen, and S. Toueg. Failure detection and consensus in the crash-recovery model. Distributed Computing, 13(2):99-125, 2000.
-
(2000)
Distributed Computing
, vol.13
, Issue.2
, pp. 99-125
-
-
Aguilera, M.K.1
Chen, W.2
Toueg, S.3
-
2
-
-
0029193089
-
LogGP: Incorporating long messages into the LogP model|one step closer towards a realistic model for parallel computation
-
New York, NY, USA ACM
-
A. Alexandrov, M. F. Ionescu, K. E. Schauser, and C. Scheiman. LogGP: Incorporating Long Messages into the LogP Model|One Step Closer Towards a Realistic Model for Parallel Computation. In Proc. 7th Annual ACM Symposium on Parallel Algorithms and Architectures, SPAA '95, pages 95-105, New York, NY, USA, 1995. ACM.
-
(1995)
Proc. 7th Annual ACM Symposium on Parallel Algorithms and Architectures, SPAA '95
, pp. 95-105
-
-
Alexandrov, A.1
Ionescu, M.F.2
Schauser, K.E.3
Scheiman, C.4
-
3
-
-
85021207134
-
Corfu: A shared log design for ash clusters
-
Berkeley, CA, USA
-
M. Balakrishnan, D. Malkhi, V. Prabhakaran, T. Wobber, M. Wei, and J. D. Davis. Corfu: A shared log design for ash clusters. In Proc. 9th USENIX Conference on Networked Systems Design and Implementation, NSDI'12, pages 1-1, Berkeley, CA, USA, 2012.
-
(2012)
Proc. 9th USENIX Conference on Networked Systems Design and Implementation, NSDI'12
, pp. 1-2
-
-
Balakrishnan, M.1
Malkhi, D.2
Prabhakaran, V.3
Wobber, T.4
Wei, M.5
Davis, J.D.6
-
4
-
-
85065181066
-
The chubby lock service for loosely-coupled distributed systems
-
Berkeley, CA, USA
-
M. Burrows. The Chubby Lock Service for Loosely-coupled Distributed Systems. In Proc. 7th Symposium on Operating Systems Design and Implementation, OSDI '06, pages 335-350, Berkeley, CA, USA, 2006.
-
(2006)
Proc. 7th Symposium on Operating Systems Design and Implementation, OSDI '06
, pp. 335-350
-
-
Burrows, M.1
-
5
-
-
0001038609
-
Practical byzantine fault tolerance
-
Berkeley, CA, USA
-
M. Castro and B. Liskov. Practical Byzantine Fault Tolerance. In Proc. 3rd Symposium on Operating Systems Design and Implementation, OSDI '99, pages 173-186, Berkeley, CA, USA, 1999.
-
(1999)
Proc. 3rd Symposium on Operating Systems Design and Implementation, OSDI '99
, pp. 173-186
-
-
Castro, M.1
Liskov, B.2
-
7
-
-
0028444938
-
RAID: High-performance
-
Surv June
-
P. M. Chen, E. K. Lee, G. A. Gibson, R. H. Katz, and D. A. Patterson. RAID: High-performance, Reliable Secondary Storage. ACM Comput. Surv., 26(2):145-185, June 1994.
-
(1994)
Reliable Secondary Storage ACM Comput
, vol.26
, Issue.2
, pp. 145-185
-
-
Chen, P.M.1
Lee, E.K.2
Gibson, G.A.3
Katz, R.H.4
Patterson, D.A.5
-
8
-
-
77954889082
-
Benchmarking cloud serving systems with YCSB
-
New York, NY, USA, ACM
-
B. F. Cooper, A. Silberstein, E. Tam, R. Ramakrishnan, and R. Sears. Benchmarking Cloud Serving Systems with YCSB. In Proc. 1st ACM Symposium on Cloud Computing, SoCC '10, pages 143-154, New York, NY, USA, 2010. ACM.
-
(2010)
Proc. 1st ACM Symposium on Cloud Computing, SoCC '10
, pp. 143-154
-
-
Cooper, B.F.1
Silberstein, A.2
Tam, E.3
Ramakrishnan, R.4
Sears, R.5
-
9
-
-
85065170765
-
Spanner: Google's globally-distributed database
-
Berkeley, CA, USA
-
J. C. Corbett, J. Dean, M. Epstein, A. Fikes, C. Frost, J. J. Furman, S. Ghemawat, A. Gubarev, C. Heiser, P. Hochschild, W. Hsieh, S. Kanthak, E. Kogan, H. Li, A. Lloyd, S. Melnik, D. Mwaura, D. Nagle, S. Quinlan, R. Rao, L. Rolig, Y. Saito, M. Szymaniak, C. Taylor, R. Wang, and D. Woodford. Spanner: Google's Globally-distributed Database. In Proc. 10th USENIX Conference on Operating Systems Design and Implementation, OSDI'12, pages 251-264, Berkeley, CA, USA, 2012.
-
(2012)
Proc. 10th USENIX Conference on Operating Systems Design and Implementation, OSDI'12
, pp. 251-264
-
-
Corbett, J.C.1
Dean, J.2
Epstein, M.3
Fikes, A.4
Frost, C.5
Furman, J.J.6
Ghemawat, S.7
Gubarev, A.8
Heiser, C.9
Hochschild, P.10
Hsieh, W.11
Kanthak, S.12
Kogan, E.13
Li, H.14
Lloyd, A.15
Melnik, S.16
Mwaura, D.17
Nagle, D.18
Quinlan, S.19
Rao, R.20
Rolig, L.21
Saito, Y.22
Szymaniak, M.23
Taylor, C.24
Wang, R.25
Woodford, D.26
more..
-
10
-
-
37549003336
-
MapReduce: Simplified data processing on large clusters
-
ACM, Jan
-
J. Dean and S. Ghemawat. MapReduce: Simplified Data Processing on Large Clusters. Commun. ACM, 51(1):107-113, Jan. 2008.
-
(2008)
Commun
, vol.51
, Issue.1
, pp. 107-113
-
-
Dean, J.1
Ghemawat, S.2
-
11
-
-
70450064414
-
Dynamo: Amazon's highly available key-value store
-
Oct
-
G. DeCandia, D. Hastorun, M. Jampani, G. Kakulapati, A. Lakshman, A. Pilchin, S. Sivasubramanian, P. Vosshall, and W. Vogels. Dynamo: Amazon's Highly Available Key-value Store. SIGOPS Oper. Syst. Rev., 41(6):205-220, Oct. 2007.
-
(2007)
SIGOPS Oper. Syst. Rev
, vol.41
, Issue.6
, pp. 205-220
-
-
DeCandia, G.1
Hastorun, D.2
Jampani, M.3
Kakulapati, G.4
Lakshman, A.5
Pilchin, A.6
Sivasubramanian, S.7
Vosshall, P.8
Vogels, W.9
-
12
-
-
84936942406
-
Fail-in-place Network Design: Interaction between topology, routing algorithm and failures
-
Piscataway, NJ, USA, IEEE Press
-
J. Domke, T. Hoeer, and S. Matsuoka. Fail-in-place Network Design: Interaction Between Topology, Routing Algorithm and Failures. In Proc. International Conference for High Performance Computing, Networking, Storage and Analysis, SC '14, pages 597-608, Piscataway, NJ, USA, 2014. IEEE Press.
-
(2014)
Proc. International Conference for High Performance Computing, Networking, Storage and Analysis, SC '14
, pp. 597-608
-
-
Domke, J.1
Hoeer, T.2
Matsuoka, S.3
-
13
-
-
85076931521
-
FaRM: Fast remote memory
-
Berkeley, CA, USA
-
A. Dragojevic, D. Narayanan, O. Hodson, and M. Castro. FaRM: Fast Remote Memory. In Proc. 11th USENIX Conference on Networked Systems Design and Implementation, NSDI'14, pages 401-414, Berkeley, CA, USA, 2014.
-
(2014)
Proc. 11th USENIX Conference on Networked Systems Design and Implementation, NSDI'14
, pp. 401-414
-
-
Dragojevic, A.1
Narayanan, D.2
Hodson, O.3
Castro, M.4
-
14
-
-
0022045868
-
Impossibility of distributed consensus with one faulty process
-
Apr
-
M. J. Fischer, N. A. Lynch, and M. S. Paterson. Impossibility of Distributed Consensus with One Faulty Process. J. ACM, 32(2):374-382, Apr. 1985.
-
(1985)
J. ACM
, vol.32
, Issue.2
, pp. 374-382
-
-
Fischer, M.J.1
Lynch, N.A.2
Paterson, M.S.3
-
16
-
-
84899678292
-
Enabling highly-scalable remote memory access programming with mpi-3 one sided
-
New York, NY, USA, ACM
-
R. Gerstenberger, M. Besta, and T. Hoeer. Enabling Highly-scalable Remote Memory Access Programming with MPI-3 One Sided. In Proc. International Conference on High Performance Computing, Networking, Storage and Analysis, SC '13, pages 53:1-53:12, New York, NY, USA, 2013. ACM.
-
(2013)
Proc. International Conference on High Performance Computing, Networking, Storage and Analysis, SC '13
, pp. 531-5312
-
-
Gerstenberger, R.1
Besta, M.2
Hoeer, T.3
-
18
-
-
83155160934
-
Modeling and tolerating heterogeneous failures in large parallel systems
-
New York, NY, USA, ACM
-
E. Heien, D. Kondo, A. Gainaru, D. LaPine, B. Kramer, and F. Cappello. Modeling and Tolerating Heterogeneous Failures in Large Parallel Systems. In Proc. of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, SC '11, pages 45:1-45:11, New York, NY, USA, 2011. ACM.
-
(2011)
Proc. of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, SC '11
, pp. 451-4511
-
-
Heien, E.1
Kondo, D.2
Gainaru, A.3
LaPine, D.4
Kramer, B.5
Cappello, F.6
-
19
-
-
0025460579
-
Linearizability: A correctness condition for concurrent objects
-
July
-
M. P. Herlihy and J. M. Wing. Linearizability: A Correctness Condition for Concurrent Objects. ACM Trans. Program. Lang. Syst., 12(3):463-492, July 1990.
-
(1990)
ACM Trans. Program. Lang. Syst
, vol.12
, Issue.3
, pp. 463-492
-
-
Herlihy, M.P.1
Wing, J.M.2
-
20
-
-
79951761350
-
ZooKeeper: Wait-free coordination for internet-scale systems
-
Berkeley, CA, USA
-
P. Hunt, M. Konar, F. P. Junqueira, and B. Reed. ZooKeeper: Wait-free Coordination for Internet-scale Systems. In Proc. 2010 USENIX Conference on USENIX Annual Technical Conference, USENIX ATC'10, pages 11-11, Berkeley, CA, USA, 2010.
-
(2010)
Proc. 2010 USENIX Conference on USENIX Annual Technical Conference, USENIX ATC'10
, pp. 11
-
-
Hunt, P.1
Konar, M.2
Junqueira, F.P.3
Reed, B.4
-
21
-
-
84987721797
-
-
InfiniBand trade association Release 1.2.1
-
InfiniBand Trade Association. InfiniBand Architecture Specification: Volume 1, Release 1.2.1. 2007
-
InfiniBand Architecture Specification
, vol.1
, pp. 2007
-
-
-
22
-
-
80155183109
-
Memcached design on high performance RDMA capable interconnects
-
Washington, DC, USA
-
J. Jose, H. Subramoni, M. Luo, M. Zhang, J. Huang, M. Wasi-ur Rahman, N. S. Islam, X. Ouyang, H. Wang, S. Sur, and D. K. Panda. Memcached Design on High Performance RDMA Capable Interconnects. In Proc. 2011 International Conference on Parallel Processing, ICPP '11, pages 743-752, Washington, DC, USA, 2011.
-
(2011)
Proc. 2011 International Conference on Parallel Processing, ICPP '11
, pp. 743-752
-
-
Jose, J.1
Subramoni, H.2
Luo, M.3
Zhang, M.4
Huang, J.5
Wasi-Ur Rahman, M.6
Islam, N.S.7
Ouyang, X.8
Wang, H.9
Sur, S.10
Panda, D.K.11
-
23
-
-
77956573226
-
Paxos for system builders: An overview
-
New York, NY, USA, ACM
-
J. Kirsch and Y. Amir. Paxos for System Builders: An Overview. In Proc. 2nd Workshop on Large-Scale Distributed Systems and Middleware, LADIS '08, pages 3:1-3:6, New York, NY, USA, 2008. ACM.
-
(2008)
Proc. 2nd Workshop on Large-Scale Distributed Systems and Middleware, LADIS '08
, pp. 31-36
-
-
Kirsch, J.1
Amir, Y.2
-
24
-
-
0017972109
-
The implementation of reliable distributed multiprocess systems
-
L. Lamport. The implementation of reliable distributed multiprocess systems. Computer Networks (1976), 2(2):95-114, 1978.
-
(1978)
Computer Networks (1976)
, vol.2
, Issue.2
, pp. 95-114
-
-
Lamport, L.1
-
25
-
-
0032058184
-
The part-time parliament
-
ACM, May
-
L. Lamport. The Part-time Parliament. ACM Trans. Comput. Syst., 16(2):133-169, May 1998.
-
(1998)
Trans. Comput. Syst
, vol.16
, Issue.2
, pp. 133-169
-
-
Lamport, L.1
-
26
-
-
1242321054
-
Paxos made simple
-
Dec
-
L. Lamport. Paxos Made Simple. SIGACT News, 32(4):51-58, Dec. 2001.
-
(2001)
SIGACT News
, vol.32
, Issue.4
, pp. 51-58
-
-
Lamport, L.1
-
28
-
-
84884857545
-
ZHT: A light-weight reliable persistent dynamic scalable zero-hop distributed hash table
-
Washington, DC, USA
-
T. Li, X. Zhou, K. Brandstatter, D. Zhao, K. Wang, A. Rajendran, Z. Zhang, and I. Raicu. ZHT: A Light-Weight Reliable Persistent Dynamic Scalable Zero-Hop Distributed Hash Table. In Proc. 2013 IEEE 27th International Symposium on Parallel and Distributed Processing, IPDPS '13, pages 775-787, Washington, DC, USA, 2013.
-
(2013)
Proc 2013 IEEE 27th International Symposium on Parallel and Distributed Processing, IPDPS '13
, pp. 775-787
-
-
Li, T.1
Zhou, X.2
Brandstatter, K.3
Zhao, D.4
Wang, K.5
Rajendran, A.6
Zhang, Z.7
Raicu, I.8
-
30
-
-
85076877328
-
Mencius: Building efficient replicated state machines for WANs
-
Berkeley, CA, USA
-
Y. Mao, F. P. Junqueira, and K. Marzullo. Mencius: Building Efficient Replicated State Machines for WANs. In Proc. 8th USENIX Conference on Operating Systems Design and Implementation, OSDI'08, pages 369-384, Berkeley, CA, USA, 2008.
-
(2008)
Proc. 8th USENIX Conference on Operating Systems Design and Implementation, OSDI'08
, pp. 369-384
-
-
Mao, Y.1
Junqueira, F.P.2
Marzullo, K.3
-
31
-
-
84973360726
-
-
Jun
-
C. D. Martino, F. Baccanico, Z. Kalbarczyk, R. Iyer, J. Fullop, and W. Kramer. Lessons Learned From the Analysis of System Failures at Petascale: The Case of Blue Waters. Jun 2014.
-
(2014)
Lessons Learned from the Analysis of System Failures at Petascale: The Case of Blue Waters
-
-
Martino, C.D.1
Baccanico, F.2
Kalbarczyk, Z.3
Iyer, R.4
Fullop, J.5
Kramer, W.6
-
32
-
-
85077206568
-
Using one-sided RDMA reads to build a fast, CPU-efficient key-value store
-
Berkeley, CA, USA
-
C. Mitchell, Y. Geng, and J. Li. Using One-sided RDMA Reads to Build a Fast, CPU-efficient Key-value Store. In Proc. 2013 USENIX Conference on Annual Technical Conference, USENIX ATC'13, pages 103-114, Berkeley, CA, USA, 2013.
-
(2013)
Proc. 2013 USENIX Conference on Annual Technical Conference, USENIX ATC'13
, pp. 103-114
-
-
Mitchell, C.1
Geng, Y.2
Li, J.3
-
33
-
-
78650831692
-
Design, modeling, and evaluation of a scalable multi-level checkpointing system
-
Washington, DC, USA
-
A. Moody, G. Bronevetsky, K. Mohror, and B. R. d. Supinski. Design, Modeling, and Evaluation of a Scalable Multi-level Checkpointing System. In Proc. 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, SC '10, pages 1-11, Washington, DC, USA, 2010.
-
(2010)
Proc. 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, SC '10
, pp. 1-11
-
-
Moody, A.1
Bronevetsky, G.2
Mohror, K.3
Supinski B R, D.4
-
34
-
-
84889654415
-
There is more consensus in egalitarian parliaments
-
New York, NY, USA. ACM
-
I. Moraru, D. G. Andersen, and M. Kaminsky. There is More Consensus in Egalitarian Parliaments. In Proc. 24th ACM Symposium on Operating Systems Principles, SOSP '13, pages 358-372, New York, NY, USA, 2013. ACM.
-
(2013)
Proc. 24th ACM Symposium on Operating Systems Principles, SOSP '13
, pp. 358-372
-
-
Moraru, I.1
Andersen, D.G.2
Kaminsky, M.3
-
36
-
-
84947200665
-
Failure trends in a large disk drive population
-
Berkeley, CA, USA
-
E. Pinheiro, W.-D. Weber, and L. A. Barroso. Failure Trends in a Large Disk Drive Population. In Proc. 5th USENIX Conference on File and Storage Technologies, FAST '07, pages 2-2, Berkeley, CA, USA, 2007.
-
(2007)
Proc. 5th USENIX Conference on File and Storage Technologies, FAST '07
, pp. 2
-
-
Pinheiro, E.1
Weber, W.-D.2
Barroso, L.A.3
-
37
-
-
79960074613
-
Minimum density RAID-6 codes
-
June
-
J. S. Plank, A. L. Buchsbaum, and B. T. Vander Zanden. Minimum Density RAID-6 Codes. Trans. Storage, 6(4):16:1-16:22, June 2011.
-
(2011)
Trans. Storage
, vol.6
, Issue.4
, pp. 161-1622
-
-
Plank, J.S.1
Buchsbaum, A.L.2
Vander Zanden, B.T.3
-
39
-
-
84877700680
-
Design and modeling of a non-blocking checkpointing system
-
Los Alamitos, CA, USA
-
K. Sato, N. Maruyama, K. Mohror, A. Moody, T. Gamblin, B. R. de Supinski, and S. Matsuoka. Design and Modeling of a Non-blocking Checkpointing System. In Proc. International Conference on High Performance Computing, Networking, Storage and Analysis, SC '12, pages 19:1-19:10, Los Alamitos, CA, USA, 2012.
-
(2012)
Proc. International Conference on High Performance Computing, Networking, Storage and Analysis, SC '12
, pp. 191-1910
-
-
Sato, K.1
Maruyama, N.2
Mohror, K.3
Moody, A.4
Gamblin, T.5
De Supinski, B.R.6
Matsuoka, S.7
-
40
-
-
0025564050
-
Implementing fault-tolerant services using the state machine approach: A tutorial
-
ACM, Dec
-
F. B. Schneider. Implementing Fault-tolerant Services Using the State Machine Approach: A Tutorial. ACM Comput. Surv., 22(4):299-319, Dec. 1990.
-
(1990)
Comput. Surv
, vol.22
, Issue.4
, pp. 299-319
-
-
Schneider, F.B.1
-
41
-
-
84904536046
-
High availability, elasticity, and strong consistency for massively parallel scans over relational data
-
Aug
-
P. Unterbrunner, G. Alonso, and D. Kossmann. High Availability, Elasticity, and Strong Consistency for Massively Parallel Scans over Relational Data. The VLDB Journal, 23(4):627-652, Aug. 2014.
-
(2014)
The VLDB Journal
, vol.23
, Issue.4
, pp. 627-652
-
-
Unterbrunner, P.1
Alonso, G.2
Kossmann, D.3
|