tssk / tika
This project is forked from apache/tika.
The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).
Home Page: https://tika.apache.org/
License: Apache License 2.0