duperemove: add page (#4231)

This commit is contained in:
sedrubal 2020-08-01 19:33:41 +02:00 committed by GitHub
parent 4aeb0ec69d
commit d36da71fb6
No known key found for this signature in database
GPG Key ID: 4AEE18F83AFDEB23

22
pages/linux/duperemove.md Normal file
View File

@ -0,0 +1,22 @@
# duperemove
> Finds duplicate file system extents and optionally schedule them for deduplication.
> An extent is small part of a file inside the file system.
> On some file systems one extent can be referenced multiple times, when parts of the content of the files are identical.
> More information: <https://markfasheh.github.io/duperemove/>.
- Search for duplicate extents in a directory and show them:
`duperemove -r {{path/to/directory}}`
- Deduplicate duplicate extents on a Btrfs or XFS (experimental) file system:
`duperemove -r -d {{path/to/directory}}`
- Use a hash file to store extent hashes (less memory usage and can be reused on subsequent runs):
`duperemove -r -d --hashfile={{path/to/hashfile}} {{path/to/directory}}`
- Limit I/O threads (for hashing and dedupe stage) and CPU threads (for duplicate extent finding stage):
`duperemove -r -d --hashfile={{path/to/hashfile}} --io-threads={{N}} --cpu-threads={{N}} {{path/to/directory}}`