This simple script extracts to a CSV file all the sources included in the HTML files found in the directory specified by the user and in its subdirectories.
- If you use a virtual environment, create and activate it first.
- Then install dependencies:
pip install -r requirements.txt
- Then run the script, passing the parent directory to search for HTML files.
python main.py <your directory>
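
For readers curious about what this kind of extraction involves, here is a minimal sketch using only the Python standard library. It is not the code in main.py. It assumes that "sources" means the values of `src` attributes, that only files ending in `.html` are scanned, and that the output goes to a file called `sources.csv`. The names `SrcCollector`, `collect_sources` and `write_csv` are hypothetical and chosen only for this illustration.

```python
import csv
import sys
from html.parser import HTMLParser
from pathlib import Path


class SrcCollector(HTMLParser):
    """Collects (tag, value) for every non-empty src attribute seen."""

    def __init__(self):
        super().__init__()
        self.sources = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; names arrive lowercased.
        for name, value in attrs:
            if name == "src" and value:
                self.sources.append((tag, value))


def collect_sources(root):
    """Return (file, tag, src) rows for every .html file under root."""
    rows = []
    for path in sorted(Path(root).rglob("*.html")):
        parser = SrcCollector()
        # errors="replace" keeps going on files with unexpected encodings.
        parser.feed(path.read_text(encoding="utf-8", errors="replace"))
        parser.close()
        rows.extend((str(path), tag, src) for tag, src in parser.sources)
    return rows


def write_csv(rows, out_path="sources.csv"):
    """Write the rows to a CSV file with a header line (assumed layout)."""
    with open(out_path, "w", newline="", encoding="utf-8") as fh:
        writer = csv.writer(fh)
        writer.writerow(["file", "tag", "src"])
        writer.writerows(rows)


if __name__ == "__main__":
    if len(sys.argv) == 2:
        write_csv(collect_sources(sys.argv[1]))
```

The sketch walks the tree with `Path.rglob`, so subdirectories are covered without extra code, and it records the tag name next to each source so that images, scripts and frames can be told apart in the CSV.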