Tech Report: HPL-2008-30R2: Efficient Detection of Large Scale Redundancy in Enterprise File Systems
doi:10.1145/1496909.1496926
Keywords: data mining; directory similarity and de-duplication; file systems; min-hashing; scalability; set sketches; storage management
In order to catch and reduce waste in the exponential demand for ...