Crawler using Selenium, generating a graph to display results.
This crawler can be use as a spider.
Install python3 and pip:
sudo apt install python3 python3-pipInstall chromium and chromium-driver:
sudo apt install chromium chromium-driverThe installation has been tested in Debian bullseye/sid x86_64 (february 2019)
git clone https://github.com/x1n5h3n/Advance_Crawler.gitcd Advance_Crawlerpip3 install -r requirements.txtCrawl an url using default options:
python3 crawler.py -u https://github.comCrawl an url using a blacklist of url (ex: https://google.com):
python3 crawler.py -u https://github.com -b blacklist.txtCrawl an url with a define depth (default 1):
python3 crawler.py -u https://github.com -d 10Crawl an url and display result with a mode (default domain):
- domain mode will return unique domains found
 - link mode will return unique links found
 
python3 crawler.py -u https://github.com -m link
python3 crawler.py -u https://github.com -m domainCrawl an url using a mobile emulation (default false):
python3 crawler.py -u https://github.com -e trueCrawl an url without crawling external domains (default true):
python3 crawler.py -u https://github.com -a falseCrawl like a spider:
python3 crawler.py -u https://github.com -d 1337 -m link -a falsePrint help:
python3 crawler.py -hThis project is licensed under the GPLv3 License - see the LICENSE file for details.