A Python implementation of locality-sensitive hashing (LSH).
To run, first clone the repo using:
git clone https://github.com/POOSARLADIVAKAR/LSH.git
cd LSH
-
Add all files to the Corpus folder and pass that folder to preprocessing via directory_path
-
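As a rough illustration of what handing a corpus folder to preprocessing can look like, here is a minimal sketch. The helper name `load_corpus` is hypothetical and not taken from this repo; it assumes the files are plain text and that `directory_path` points at the Corpus folder.

```python
import os


def load_corpus(directory_path):
    """Read every regular file in directory_path into a {filename: text} dict.

    Hypothetical helper for illustration; the repo's own loader may differ.
    """
    documents = {}
    for name in sorted(os.listdir(directory_path)):
        path = os.path.join(directory_path, name)
        if not os.path.isfile(path):
            continue  # skip sub-directories
        # errors="ignore" drops bytes that are not valid UTF-8
        with open(path, encoding="utf-8", errors="ignore") as handle:
            documents[name] = handle.read()
    return documents
```

Calling `load_corpus("Corpus")` from the repo root would then return one entry per file in the folder.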
Run python main.py, or run the individual files
-
Run individual files:
- python preprocess.py
- python shingle.py
- python singnature.py
- python lsh.py
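The four scripts above follow the usual LSH stages: preprocessing, shingling, MinHash signatures, and banding. The sketch below shows one way those stages can fit together using only the standard library. Every function name, the shingle size `k`, the number of hash functions, and the band count are assumptions chosen for illustration, not the repo's actual code or settings.

```python
import random
import re
import zlib
from collections import defaultdict
from itertools import combinations

PRIME = (1 << 61) - 1  # Mersenne prime, larger than any 32-bit shingle hash


def preprocess(text):
    """Lowercase and collapse runs of non-alphanumeric characters to one space."""
    return re.sub(r"[^a-z0-9]+", " ", text.lower()).strip()


def shingle(text, k=5):
    """Return the set of CRC32-hashed character k-shingles of text."""
    count = max(len(text) - k + 1, 1)
    return {zlib.crc32(text[i:i + k].encode("utf-8")) for i in range(count)}


def minhash_signature(shingles, num_hashes=100, seed=0):
    """Build a MinHash signature from hash functions (a*x + b) mod PRIME."""
    rng = random.Random(seed)  # fixed seed: every document uses the same functions
    coeffs = [(rng.randrange(1, PRIME), rng.randrange(0, PRIME))
              for _ in range(num_hashes)]
    return [min((a * s + b) % PRIME for s in shingles) for a, b in coeffs]


def lsh_candidates(signatures, bands=20):
    """Return pairs of names whose signatures agree on at least one full band."""
    rows = len(next(iter(signatures.values()))) // bands
    candidates = set()
    for band in range(bands):
        buckets = defaultdict(list)
        for name, sig in signatures.items():
            buckets[tuple(sig[band * rows:(band + 1) * rows])].append(name)
        for names in buckets.values():
            candidates.update(combinations(sorted(names), 2))
    return candidates


docs = {
    "a": "The quick brown fox jumps over the lazy dog.",
    "b": "The quick brown fox jumps over the lazy dog!",
    "c": "An entirely unrelated sentence about hashing.",
}
signatures = {name: minhash_signature(shingle(preprocess(text)))
              for name, text in docs.items()}
print(lsh_candidates(signatures))
```

Documents "a" and "b" differ only in punctuation, so they preprocess to the same string and always land in the same buckets. More bands with fewer rows each make the scheme more permissive; fewer bands with more rows make it stricter.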
The following Python modules are required:
- pandas
- numpy
- pickle
- tqdm
- os
- time
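Of the modules listed, pickle, os, and time ship with the Python standard library, so only pandas, numpy, and tqdm normally need to be installed. A small check like the following reports anything that cannot be imported. The function name `missing_modules` is a hypothetical helper, not part of this repo.

```python
import importlib.util

REQUIRED = ["pandas", "numpy", "pickle", "tqdm", "os", "time"]


def missing_modules(names=REQUIRED):
    """Return the module names from `names` that cannot be found for import."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules()
    print("Missing modules:", ", ".join(missing) if missing else "none")
```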