- crawls goodreads using python bs4 (crawler/crawlBooks.py)
- pushes data to SOLR via bash script (script/solrUpdateScript.sh)
- utilizes SOLR + PHP to create a web search engine
0xburo / goodreads-crawler Goto Github PK
View Code? Open in Web Editor NEWbs4 crawler for goodreads + PHP / SOLR search engine