illCrawler - is crawler for web pages from darknet (tor network). You can use it for creating your own database of tor sites, own tor search engine and etc.
-
The minimum required PHP version 5, MySQL.
-
Tor proxy (Socks5). It can be local tor proxy (example: Privoxy + tor).
-
Create database in MySQL.
$ git clone https://github.com/rvasources/illCrawler
-
Import
.sql
file to MySQL database. -
Add domain(s) to database.
-
Open
illum.php
file and change database data.
- Crawling all pages of all domains in database (it saves title, metategs, text and url of page).
- Getting and checking all new domains which got from scanned pages.
- Updating content of outdated pages (1 time in 2 days).
- Removing died sites from database.
Software has 4 functions which you can use with special prefixes. For example: ./illum.php --crawler
. This software working as a daemon (you can add it to crontab).