This is just a playground repository so far! :)
-
Make sure you have pip installed (try typing pip). If not, install it with the command:
sudo easy_install pip.
-
Install pdfminer
sudo pip install pdfminer==20131113
-
Run the pdfreader.py:
python pdfreader.py
A program that takes as input a PDF file and spits out a JSON formatted document with suggested tags.
Steps:
- Convert a PDF to text
- Using different types of data from the AlchemyAPI
- Keywords
- Concepts
- Entities
We need some sort of IT