-
1
-
-
84870515887
-
-
http://dumps.wikimedia.org/enwiki/.
-
-
-
-
2
-
-
84870571522
-
-
https://www.grid5000.fr/.
-
-
-
-
3
-
-
33745224219
-
FARSITE: Federated, available, and reliable storage for an incompletely trusted environment
-
Atul Adya, William J. Bolosky, Miguel Castro, Gerald Cermak, Ronnie Chaiken, John R. Douceur, Jon Howell, Jacob R. Lorch, Marvin Theimer, and Roger P. Wattenhofer. FARSITE: Federated, Available, and Reliable Storage for an Incompletely Trusted Environment. In OSDI, 2002.
-
(2002)
OSDI
-
-
Adya, A.1
Bolosky, W.J.2
Castro, M.3
Cermak, G.4
Chaiken, R.5
Douceur, J.R.6
Howell, J.7
Lorch, J.R.8
Theimer, M.9
Wattenhofer, R.P.10
-
4
-
-
76349109236
-
Extreme binning: Scalable, parallel deduplication for chunk-based file backup
-
Deepavali Bhagwat, Kave Eshghi, Darrell D. E. Long, and Mark Lillibridge. Extreme Binning: Scalable, Parallel Deduplication for Chunk-based File Backup. In MASCOTS, 2009.
-
(2009)
MASCOTS
-
-
Bhagwat, D.1
Eshghi, K.2
Long, D.D.E.3
Lillibridge, M.4
-
5
-
-
84976810280
-
Copy detection mechanisms for digital documents
-
S. Brin, J. Davis, and H. Garcia-Molina. Copy Detection Mechanisms for Digital Documents. In SIGMOD, 1995.
-
(1995)
SIGMOD
-
-
Brin, S.1
Davis, J.2
Garcia-Molina, H.3
-
6
-
-
85077072489
-
Tradeoffs in scalable data routing for deduplication clusters
-
Wei Dong, Fred Douglis, Kai Li, Hugo Patterson, Sazzala Reddy, and Philip Shilane. Tradeoffs in Scalable Data Routing for Deduplication Clusters. In FAST, 2011.
-
(2011)
FAST
-
-
Dong, W.1
Douglis, F.2
Li, K.3
Patterson, H.4
Reddy, S.5
Shilane, P.6
-
7
-
-
79955995661
-
HYDRAstor: A scalable secondary storage
-
Cezary Dubnicki, Leszek Gryz, Lukasz Heldt, Michal Kaczmarczyk, Wojciech Kilian, Przemyslaw Strzelczak, Jerzy Szczepkowski, Cristian Ungureanu, and Michal Welnicki. HYDRAstor: a Scalable Secondary Storage. In FAST, 2009.
-
(2009)
FAST
-
-
Dubnicki, C.1
Gryz, L.2
Heldt, L.3
Kaczmarczyk, M.4
Kilian, W.5
Strzelczak, P.6
Szczepkowski, J.7
Ungureanu, C.8
Welnicki, M.9
-
8
-
-
10444235961
-
Loglog counting of large cardinalities
-
M. Durand and P. Flajolet. Loglog counting of large cardinalities. In ESA, 2003.
-
(2003)
ESA
-
-
Durand, M.1
Flajolet, P.2
-
10
-
-
59949100196
-
The diverse and exploding digital universe: An updated forecast of worldwide information growth through 2011
-
March
-
J. F. Gantz, C. Chute, A. Manfrediz, S. Minton, D. Reinsel, W. Schlichting, and A. Toncheva. The Diverse and Exploding Digital Universe: An Updated Forecast of Worldwide Information Growth Through 2011. Technical report, An IDC White Paper - sponsored by EMC, March 2008.
-
(2008)
Technical Report, an IDC White Paper - Sponsored by EMC
-
-
Gantz, J.F.1
Chute, C.2
Manfrediz, A.3
Minton, S.4
Reinsel, D.5
Schlichting, W.6
Toncheva, A.7
-
11
-
-
84860571208
-
Building a high-performance deduplication system
-
Fanglu Guo and Petros Efstathopoulos. Building a High-performance Deduplication System. In USENIX ATC, 2011.
-
(2011)
USENIX ATC
-
-
Guo, F.1
Efstathopoulos, P.2
-
12
-
-
85077053929
-
Bimodal content defined chunking for backup streams
-
Erik Kruus, Cristian Ungureanu, and Cezary Dubnicki. Bimodal Content Defined Chunking for Backup Streams. In FAST, 2010.
-
(2010)
FAST
-
-
Kruus, E.1
Ungureanu, C.2
Dubnicki, C.3
-
14
-
-
84933071250
-
Sparse indexing: Large scale, inline deduplication using sampling and locality
-
Mark Lillibridge, Kave Eshghi, Deepavali Bhagwat, Vinay Deolalikar, Greg Trezise, and Peter Camble. Sparse Indexing: Large Scale, Inline Deduplication Using Sampling and Locality. In FAST, 2009.
-
(2009)
FAST
-
-
Lillibridge, M.1
Eshghi, K.2
Bhagwat, D.3
Deolalikar, V.4
Trezise, G.5
Camble, P.6
-
15
-
-
85077032135
-
A study of practical deduplication
-
Dutch T. Meyer and William J. Bolosky. A Study of Practical Deduplication. In FAST, 2011.
-
(2011)
FAST
-
-
Meyer, D.T.1
Bolosky, W.J.2
-
16
-
-
34547637575
-
Discovering and exploiting keyword and attribute-value co-occurrences to improve P2P routing indices
-
Sebastian Michel, Matthias Bender, Nikos Ntarmos, Peter Triantafillou, Gerhard Weikum, and Christian Zimmer. Discovering and Exploiting Keyword and Attribute-Value Co-occurrences to Improve P2P Routing Indices. In CIKM, 2006.
-
(2006)
CIKM
-
-
Michel, S.1
Bender, M.2
Ntarmos, N.3
Triantafillou, P.4
Weikum, G.5
Zimmer, C.6
-
18
-
-
84870815282
-
Alternatives for detecting redundancy in storage systems data
-
C. Policroniades and I. Pratt. Alternatives for Detecting Redundancy in Storage Systems Data. In USENIX ATC, 2004.
-
(2004)
USENIX ATC
-
-
Policroniades, C.1
Pratt, I.2
-
19
-
-
79955492701
-
Exploiting similarity for multi-source downloads using file handprints
-
Himabindu Pucha, David G. Andersen, and Michael Kaminsky. Exploiting Similarity for Multi-Source Downloads Using File Handprints. In NSDI, 2007.
-
(2007)
NSDI
-
-
Pucha, H.1
Andersen, D.G.2
Kaminsky, M.3
-
20
-
-
76349123641
-
Fast, inexpensive content-addressed storage in foundation
-
Sean Rhea, Russ Cox, and Alex Pesterev. Fast, inexpensive content-addressed storage in foundation. In USENIX ATC, 2008.
-
(2008)
USENIX ATC
-
-
Rhea, S.1
Cox, R.2
Pesterev, A.3
-
21
-
-
77953300991
-
Efficient similarity estimation for systems exploiting data redundancy
-
Kanat Tangwongsan, Himabindu Pucha, David G. Andersen, and Michael Kaminsky. Efficient Similarity Estimation for Systems Exploiting Data Redundancy. In INFOCOM, 2010.
-
(2010)
INFOCOM
-
-
Tangwongsan, K.1
Pucha, H.2
Andersen, D.G.3
Kaminsky, M.4
-
22
-
-
85077053402
-
HydraFS: A high-throughput file system for the HYDRAstor content-addressable storage system
-
Cristian Ungureanu, Benjamin Atkin, Akshat Aranya, Salil Gokhale, Stephen Rago, Grzegorz Calkowski, Cezary Dubnicki, and Aniruddha Bohra. HydraFS: a High-Throughput File System for the HYDRAstor Content-Addressable Storage System. In FAST, 2010.
-
(2010)
FAST
-
-
Ungureanu, C.1
Atkin, B.2
Aranya, A.3
Gokhale, S.4
Rago, S.5
Calkowski, G.6
Dubnicki, C.7
Bohra, A.8
-
23
-
-
85077122692
-
Characteristics of backup workloads in production systems
-
Grant Wallace, Fred Douglis, Hangwei Qian, Philip Shilane, Stephen Smaldone, Mark Chamness, and Windsor Hsu. Characteristics of Backup Workloads in Production Systems. In FAST, 2012.
-
(2012)
FAST
-
-
Wallace, G.1
Douglis, F.2
Qian, H.3
Shilane, P.4
Smaldone, S.5
Chamness, M.6
Hsu, W.7