allx / tika-similarity
This project is forked from chrismattmann/tika-similarity.
Tika-Similarity uses the Tika-Python package (a Python port of Apache Tika) to compute file similarity based on metadata features.
License: Apache License 2.0
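
As a rough illustration of what "similarity based on metadata features" can mean, the sketch below computes a Jaccard score over the metadata keys of two files. It is a minimal sketch under stated assumptions, not necessarily the project's own method: the function name `metadata_jaccard` and the sample dictionaries are hypothetical, and the choice to compare key sets only (ignoring values) is an assumption. In practice the dictionaries would come from the `"metadata"` entry of the result of Tika-Python's `parser.from_file(path)`, which needs a running Tika server and is therefore left out here.

```python
def metadata_jaccard(meta_a: dict, meta_b: dict) -> float:
    """Jaccard similarity of the metadata key sets of two files."""
    keys_a, keys_b = set(meta_a), set(meta_b)
    union = keys_a | keys_b
    if not union:
        # Convention chosen for this sketch: two empty key sets count as identical.
        return 1.0
    return len(keys_a & keys_b) / len(union)


# Hypothetical metadata dictionaries, shaped like the "metadata" entry
# returned by tika-python's parser.from_file(path).
pdf_meta = {"Content-Type": "application/pdf", "Author": "A", "Content-Length": "100"}
png_meta = {"Content-Type": "image/png", "Content-Length": "2048", "Image-Width": "640"}

score = metadata_jaccard(pdf_meta, png_meta)
print(f"{score:.2f}")
```

The two sample files share two of four distinct keys. Comparing keys rather than values keeps the measure cheap and tolerant of differing file contents; a value-aware measure would be a separate design choice.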