-
4
-
-
76349109236
-
Extreme binning: Scalable, parallel deduplication for chunk-based file backup
-
Sept
-
BHAGWAT, D., ESHGHI, K., LONG, D. D. E., AND LILLIBRIDGE, M. Extreme binning: Scalable, parallel deduplication for chunk-based file backup. In Proceedings of the 17th IEEE International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS 2009) (Sept. 2009).
-
(2009)
Proceedings of the 17th IEEE International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS 2009)
-
-
Bhagwat, D.1
Eshghi, K.2
Long, D.D.E.3
Lillibridge, M.4
-
5
-
-
33846663173
-
Improving duplicate elimination in storage systems
-
BOBBARJUNG, D. R., JAGANNATHAN, S., AND DUBNICKI, C. Improving duplicate elimination in storage systems. Trans. Storage 2, 4 (2006), 424-448.
-
(2006)
Trans. Storage
, vol.2
, Issue.4
, pp. 424-448
-
-
Bobbarjung, D.R.1
Jagannathan, S.2
Dubnicki, C.3
-
6
-
-
84976810280
-
Copy detection mechanisms for digital documents
-
In
-
BRIN, S., DAVIS, J., AND GARCIA-MOLINA, H. Copy detection mechanisms for digital documents. In In Proceedings of the ACM SIGMOD Annual Conference (1995), pp. 398-409.
-
(1995)
Proceedings of the ACM SIGMOD Annual Conference
, pp. 398-409
-
-
Brin, S.1
Davis, J.2
Garcia-Molina, H.3
-
8
-
-
0013206133
-
Collection statistics for fast duplicate document detection
-
CHOWDHURY, A., FRIEDER, O., GROSSMAN, D., AND MCCABE, M. C. Collection statistics for fast duplicate document detection. ACM Trans. Inf. Syst. 20, 2 (2002), 171-191.
-
(2002)
ACM Trans. Inf. Syst.
, vol.20
, Issue.2
, pp. 171-191
-
-
Chowdhury, A.1
Frieder, O.2
Grossman, D.3
McCabe, M.C.4
-
11
-
-
85058564018
-
-
United States Patent June
-
DOUGLIS, F., KULKARNI, P., LAVOIE, J. D., AND TRACEY, J. M. Method and apparatus for data redundancy elimination at the block level. United States Patent 20050131939, June 2005.
-
(2005)
Method and Apparatus for Data Redundancy Elimination at the Block Level
-
-
Douglis, F.1
Kulkarni, P.2
Lavoie, J.D.3
Tracey, J.M.4
-
12
-
-
79955995661
-
Hydrastor: A scalable secondary storage
-
USENIX Association
-
DUBNICKI, C., GRYZ, L., HELDT, L., KACZMARCZYK, M., KILIAN, W., STRZELCZAK, P., SZCZEPKOWSKI, J., UNGUREANU, C., AND WELNICKI, M. HYDRAstor: a Scalable Secondary Storage. In Proccedings of the 7th conference on File and storage technologies (2009), USENIX Association, pp. 197-210.
-
(2009)
Proccedings of the 7th Conference on File and Storage Technologies
, pp. 197-210
-
-
Dubnicki, C.1
Gryz, L.2
Heldt, L.3
Kaczmarczyk, M.4
Kilian, W.5
Strzelczak, P.6
Szczepkowski, J.7
Ungureanu, C.8
Welnicki, M.9
-
13
-
-
57349172483
-
A framework for analyzing and improving content-based chunking algorithms
-
HP Laboratories, 10
-
ESHGHI, K., AND TANG, H. K. A framework for analyzing and improving content-based chunking algorithms. Technical report HPL-2005-30R1, HP Laboratories, 10 2005.
-
(2005)
Technical Report HPL-2005-30R1
-
-
Eshghi, K.1
Tang, H.K.2
-
14
-
-
32344441912
-
Finding similar files in large document repositories
-
New York, NY, USA
-
FORMAN, G., ESHGHI, K., AND CHIOCCHETTI, S. Finding similar files in large document repositories. In KDD'05: Proceeding of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining (New York, NY, USA, 2005), pp. 394-400.
-
(2005)
KDD'05: Proceeding of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery in Data Mining
, pp. 394-400
-
-
Forman, G.1
Eshghi, K.2
Chiocchetti, S.3
-
16
-
-
67249153907
-
Rank-indexed hashing: A compact construction of bloom filters and variants
-
Oct
-
HUA, N., ZHAO, H., LIN, B., AND XU, J. Rank-indexed hashing: A compact construction of bloom filters and variants. In IEEE International Conference on Network Protocols (ICNP 2008) (Oct. 2008), pp. 73-82.
-
(2008)
IEEE International Conference on Network Protocols (ICNP 2008)
, pp. 73-82
-
-
Hua, N.1
Zhao, H.2
Lin, B.3
Xu, J.4
-
18
-
-
33846694964
-
-
Tech. rep., Technical Report Dept. of Comp. Sc., Univ. of Texas at Austin
-
JAIN, N., DAHLIN, M., AND TEWARI, R. TAPER: Tiered Approach for Eliminating Redundancy in Replica Synchronization. Tech. rep., Technical Report TR-05-42, Dept. of Comp. Sc., Univ. of Texas at Austin, 2005.
-
(2005)
TAPER: Tiered Approach for Eliminating Redundancy in Replica Synchronization
-
-
Jain, N.1
Dahlin, M.2
Tewari, R.3
-
19
-
-
70349675338
-
Optimal fast hashing
-
Apr
-
KANIZO, Y., HAY, D., AND KESLASSY, I. Optimal fast hashing. In 28th IEEE International Conference on Computer Communications (INFOCOM) (Apr. 2009), pp. 2500-2508.
-
(2009)
28th IEEE International Conference on Computer Communications (INFOCOM)
, pp. 2500-2508
-
-
Kanizo, Y.1
Hay, D.2
Keslassy, I.3
-
20
-
-
85091109842
-
Redundancy elimination within large collections of files
-
KULKARNI, P., DOUGLIS, F., LAVOIE, J., AND TRACEY, J. Redundancy Elimination Within Large Collections of Files. In Proceedings of the USENIX Annual Technical Conference (2004).
-
(2004)
Proceedings of the USENIX Annual Technical Conference
-
-
Kulkarni, P.1
Douglis, F.2
Lavoie, J.3
Tracey, J.4
-
21
-
-
84933071250
-
Sparse indexing: Large scale, inline deduplication using sampling and locality
-
USENIX Association
-
LILLIBRIDGE, M., ESHGHI, K., BHAGWAT, D., DEOLALIKAR, V., TREZISE, G., AND CAMBLE, P. Sparse indexing: large scale, inline deduplication using sampling and locality. In Proceedings of the 7th conference on File and storage technologies (2009), USENIX Association, pp. 111-123.
-
(2009)
Proceedings of the 7th Conference on File and Storage Technologies
, pp. 111-123
-
-
Lillibridge, M.1
Eshghi, K.2
Bhagwat, D.3
Deolalikar, V.4
Trezise, G.5
Camble, P.6
-
23
-
-
79953769032
-
Using the power of two choices to improve bloom filters
-
LUMETTA, S., AND MITZENMACHER, M. Using the power of two choices to improve bloom filters. Internet Mathematics 4, 1 (2007), 17-34.
-
(2007)
Internet Mathematics
, vol.4
, Issue.1
, pp. 17-34
-
-
Lumetta, S.1
Mitzenmacher, M.2
-
25
-
-
0036036764
-
A low-bandwidth network file system
-
New York, NY, USA
-
MUTHITACHAROEN, A., CHEN, B., AND MAZIÈRES, D. A low-bandwidth network file system. In SOSP'01: Proceedings of the eighteenth ACM symposium on Operating systems principles (New York, NY, USA, 2001), pp. 174-187.
-
(2001)
SOSP'01: Proceedings of the Eighteenth ACM Symposium on Operating Systems Principles
, pp. 174-187
-
-
Muthitacharoen, A.1
Chen, B.2
Mazières, D.3
-
27
-
-
84885575252
-
Persifs: A versioned file system with an efficient representation
-
New York, NY, USA
-
PORTS, D. R. K., CLEMENTS, A. T., AND DEMAINE, E. D. PersiFS: a versioned file system with an efficient representation. In SOSP'05: Proceedings of the twentieth ACM symposium on Operating systems principles (New York, NY, USA, 2005), pp. 1-2.
-
(2005)
SOSP'05: Proceedings of the Twentieth ACM Symposium on Operating Systems Principles
, pp. 1-2
-
-
Ports, D.R.K.1
Clements, A.T.2
Demaine, E.D.3
-
30
-
-
1142267351
-
Winnowing: Local algorithms for document fingerprinting
-
New York, NY, USA
-
SCHLEIMER, S., WILKERSON, D. S., AND AIKEN, A. Winnowing: local algorithms for document fingerprinting. In SIGMOD'03: Proceedings of the 2003 ACM SIGMOD international conference on Management of data (New York, NY, USA, 2003), pp. 76-85.
-
(2003)
SIGMOD'03: Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data
, pp. 76-85
-
-
Schleimer, S.1
Wilkerson, D.S.2
Aiken, A.3
-
31
-
-
84949965216
-
-
Tech. rep., Department of Computer Science, University of Texas at Austin
-
SPIRIDONOV, A., THAKER, S., AND PATWARDHAN, S. Sharing and bandwidth consumption in the low bandwidth file system. Tech. rep., Department of Computer Science, University of Texas at Austin, 2005.
-
(2005)
Sharing and Bandwidth Consumption in the Low Bandwidth File System
-
-
Spiridonov, A.1
Thaker, S.2
Patwardhan, S.3
-
32
-
-
2442563450
-
Improved file synchronization techniques for maintaining large replicated collections over slow networks
-
Washington, DC, USA
-
SUEL, T., NOEL, P., AND TRENDAFILOV, D. Improved File Synchronization Techniques for Maintaining Large Replicated Collections over Slow Networks. In ICDE'04: Proceedings of the 20th International Conference on Data Engineering (Washington, DC, USA, 2004), p. 153.
-
(2004)
ICDE'04: Proceedings of the 20th International Conference on Data Engineering
, pp. 153
-
-
Suel, T.1
Noel, P.2
Trendafilov, D.3
-
34
-
-
0003570191
-
-
Technical report TR-CS-96-05, Australian National University, Deparment of Computer Science, FEIT, ANU
-
TRIDGELL, A., AND MACKERRAS, P. The rsync algorithm. Technical report TR-CS-96-05, Australian National University, Deparment of Computer Science, FEIT, ANU, 1996.
-
(1996)
The Rsync Algorithm
-
-
Tridgell, A.1
Mackerras, P.2
-
35
-
-
77953974152
-
DEBAR: A scalable high-performance de-duplication storage system for Backup and Archiving
-
YANG, T., JIANG, H., FENG, D., AND NIU, Z. DEBAR: A Scalable High-Performance De-duplication Storage System for Backup and Archiving. CSE Technical reports (2009), 58.
-
(2009)
CSE Technical Reports
, pp. 58
-
-
Yang, T.1
Jiang, H.2
Feng, D.3
Niu, Z.4
-
37
-
-
28444469734
-
Deep store: An archival storage system architecture
-
Washington, DC, USA
-
YOU, L. L., POLLACK, K. T., AND LONG, D. D. E. Deep Store: An Archival Storage System Architecture. In ICDE'05: Proceedings of the 21st International Conference on Data Engineering (Washington, DC, USA, 2005), pp. 804-8015.
-
(2005)
ICDE'05: Proceedings of the 21st International Conference on Data Engineering
, pp. 804-8015
-
-
You, L.L.1
Pollack, K.T.2
Long, D.D.E.3
-
38
-
-
85066881368
-
Avoiding the disk bottleneck in the data domain deduplication file system
-
Berkeley, CA, USA, USENIX Association
-
ZHU, B., LI, K., AND PATTERSON, H. Avoiding the disk bottleneck in the data domain deduplication file system. In FAST'08: Proceedings of the 6th USENIX Conference on File and Storage Technologies (Berkeley, CA, USA, 2008), USENIX Association, pp. 1-14.
-
(2008)
FAST'08: Proceedings of the 6th USENIX Conference on File and Storage Technologies
, pp. 1-14
-
-
Zhu, B.1
Li, K.2
Patterson, H.3
-
39
-
-
20444480211
-
Hierarchical bloom filter arrays (HBA): A novel, scalable metadata management system for large cluster-based storage
-
Sept
-
ZHU, Y., JIANG, H., AND WANG, J. Hierarchical Bloom filter arrays (HBA): a novel, scalable metadata management system for large cluster-based storage. In Cluster Computing, 2004 IEEE International Conference on (Sept. 2004), pp. 165-174.
-
(2004)
Cluster Computing, 2004 IEEE International Conference on
, pp. 165-174
-
-
Zhu, Y.1
Jiang, H.2
Wang, J.3
|