C Implementation of FastCDC for chunking
FastCDC is a content-defined chunking (CDC) algorithm that runs faster than Rabin-based CDC.
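To make the idea concrete, here is a minimal sketch of gear-hash chunking in the spirit of FastCDC. It is not the code in this repository: the size limits, the two masks, the table seeding, and the names `gear_init` and `next_chunk` are all assumptions chosen for illustration. It skips the minimum chunk size, then uses a stricter mask before the average size and a looser one after it.

```c
#include <stddef.h>
#include <stdint.h>

/* All constants below are illustrative assumptions, not this repo's values. */
#define MIN_SIZE 2048u    /* no cut point is tested before this offset */
#define AVG_SIZE 8192u    /* switch from the strict to the loose mask here */
#define MAX_SIZE 65536u   /* a cut is forced at this size */
#define MASK_S (0x7FFFULL << 32)  /* 15 bits: fewer matches, used first */
#define MASK_L (0x07FFULL << 32)  /* 11 bits: more matches, used later */

static uint64_t gear[256];

/* Fill the gear table with fixed pseudo-random values (splitmix64 steps). */
static void gear_init(void) {
    uint64_t x = 0x9E3779B97F4A7C15ULL;
    for (int i = 0; i < 256; i++) {
        x += 0x9E3779B97F4A7C15ULL;
        uint64_t z = x;
        z = (z ^ (z >> 30)) * 0xBF58476D1CE4E5B9ULL;
        z = (z ^ (z >> 27)) * 0x94D049BB133111EBULL;
        gear[i] = z ^ (z >> 31);
    }
}

/* Return the length of the next chunk starting at buf.
 * The result never exceeds n, so the caller cannot read past the data. */
static size_t next_chunk(const unsigned char *buf, size_t n) {
    if (n <= MIN_SIZE) return n;                 /* tail too short to split */
    size_t limit  = n < MAX_SIZE ? n : MAX_SIZE;
    size_t normal = limit < AVG_SIZE ? limit : AVG_SIZE;
    uint64_t fp = 0;
    size_t i = MIN_SIZE;
    for (; i < normal; i++) {                    /* strict mask phase */
        fp = (fp << 1) + gear[buf[i]];
        if ((fp & MASK_S) == 0) return i + 1;
    }
    for (; i < limit; i++) {                     /* loose mask phase */
        fp = (fp << 1) + gear[buf[i]];
        if ((fp & MASK_L) == 0) return i + 1;
    }
    return limit;                                /* forced cut */
}
```

Call `gear_init()` once, then call `next_chunk()` in a loop, advancing the buffer pointer by the returned length until the remaining length reaches zero.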
Compile:
cd src && g++ -I./ -o main main.cpp fastcdc/fastcdc.c -lcrypto
Execute:
./main
This repository is mainly based on https://github.com/wxiacode/FastCDC-c. However, the original code contains numerous issues:
For example, it treats every file smaller than 1 GB as if it were 1 GB in size, so chunking reads past the end of the actual data. This is an illegal memory access, even though no error is reported.
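One way to avoid this class of bug is to chunk only the bytes that were actually read. The sketch below is an illustration, not this repository's code: `read_file` is a hypothetical helper that returns the file contents together with the real length, which is then the only length the chunker should see.

```c
#include <stdio.h>
#include <stdlib.h>

/* Hypothetical helper: read a whole file and report its real length.
 * Returns a malloc'd buffer (the caller frees it) or NULL on failure. */
static unsigned char *read_file(const char *path, size_t *len_out) {
    FILE *fp = fopen(path, "rb");
    if (!fp) return NULL;

    /* Find the file size instead of assuming a fixed capacity. */
    if (fseek(fp, 0, SEEK_END) != 0) { fclose(fp); return NULL; }
    long size = ftell(fp);
    if (size < 0) { fclose(fp); return NULL; }
    rewind(fp);

    /* Allocate at least one byte so an empty file is not mistaken for
     * an allocation failure. The cast keeps this valid as C++ too. */
    unsigned char *buf = (unsigned char *)malloc(size > 0 ? (size_t)size : 1);
    if (!buf) { fclose(fp); return NULL; }

    /* Report the bytes actually read, never the buffer capacity. */
    size_t got = fread(buf, 1, (size_t)size, fp);
    fclose(fp);
    *len_out = got;
    return buf;
}
```

The chunking loop should then stop at `*len_out` bytes, so a 10 KB file is never scanned as if it held 1 GB.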
Additionally, the original repository lacks integrity checks and does not save the content of data chunks. We have added these features in our repository.
As an aside, it is surprising that DeepSketch (CCF-A, FAST 2022; https://github.com/dgist-datalab/deepsketch-fast2022/blob/main/compress.h) implements Finesse (a resemblance detection algorithm) with fixed-size chunking.
Dependencies:
- openssl
- zlib
- uthash