Find (partial content) duplicate files.
Updated Dec 10, 2022 - Python
Implementation of a rolling hash function supporting both string prefix and suffix hashing.
Print FastCDC rolling hash chunks and checksums.
Golem, a Rust tool for plagiarism detection.
Some data structures and algorithms written in Java.
Fast detection of maximal exact matches via fixed sampling of query k-mers and Bloom filtering of index k-mers
Command-line tool for generating file deltas using a rolling hash algorithm.
Karp-Rabin sequence-matching implementation using Python.
Content-Addressable File System (used by BitWrk)
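For readers new to the topic, the rolling hash that these projects share can be illustrated with a minimal Karp-Rabin substring search. This is a sketch only, not code from any repository listed above; the function name `find_all` and the `base` and `mod` defaults are assumptions chosen for illustration.

```python
def find_all(pattern: str, text: str, base: int = 256, mod: int = 1_000_000_007) -> list:
    """Return every start index where pattern occurs in text (Karp-Rabin sketch)."""
    m, n = len(pattern), len(text)
    if m == 0 or m > n:
        return []

    # Weight of the leading character in a window: base^(m-1) mod mod.
    high = pow(base, m - 1, mod)

    # Hash the pattern and the first window of the text.
    p_hash = w_hash = 0
    for i in range(m):
        p_hash = (p_hash * base + ord(pattern[i])) % mod
        w_hash = (w_hash * base + ord(text[i])) % mod

    matches = []
    for start in range(n - m + 1):
        # Compare the actual strings only when hashes agree, to rule out collisions.
        if w_hash == p_hash and text[start:start + m] == pattern:
            matches.append(start)
        if start + m < n:
            # Roll the window: drop text[start], append text[start + m].
            w_hash = ((w_hash - ord(text[start]) * high) * base
                      + ord(text[start + m])) % mod
    return matches
```

Each window hash is derived from the previous one in constant time, which is what lets tools such as delta generators and content-defined chunkers scan large inputs without rehashing every window from scratch.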