A web crawler implemented with Selenium and Google Chrome.
- Debian 8 (Jessie)
- Python 2.7
- Google Chrome
- selenium==3.11.0
- pyvirtualdisplay==0.2.1
- Install Google Chrome: https://www.linuxbabe.com/desktop-linux/how-to-install-google-chrome-on-debian-the-easy-way
- Install selenium: https://selenium-python.readthedocs.io/installation.html
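
Once the pieces above are installed, a quick way to check that they work together is to open Chrome inside a virtual display. The following is a minimal sketch, not the code in crawler.py: the helper names `chrome_session` and `window_size_arg` are hypothetical, and it assumes that Xvfb (which pyvirtualdisplay wraps) is installed and that a chromedriver binary matching the installed Chrome is on the PATH, neither of which is listed above.

```python
from contextlib import contextmanager


def window_size_arg(width, height):
    """Build the Chrome command-line switch for the browser window size."""
    return "--window-size=%d,%d" % (width, height)


@contextmanager
def chrome_session(width=1366, height=768):
    """Yield a Chrome WebDriver that runs inside a virtual X display.

    Hypothetical helper for illustration only.
    """
    # Imported lazily so this module can be loaded without the dependencies.
    from pyvirtualdisplay import Display
    from selenium import webdriver

    # visible=0 starts an off-screen Xvfb display (assumes Xvfb is installed).
    display = Display(visible=0, size=(width, height))
    display.start()
    driver = None
    try:
        options = webdriver.ChromeOptions()
        options.add_argument(window_size_arg(width, height))
        # Assumption: Chrome is often started as root on servers, where the
        # sandbox has to be disabled. Drop this line if it is not needed.
        options.add_argument("--no-sandbox")
        # chrome_options is the Selenium 3.x keyword; assumes chromedriver
        # is on the PATH.
        driver = webdriver.Chrome(chrome_options=options)
        yield driver
    finally:
        # Always release the browser and the display, even after an error.
        if driver is not None:
            driver.quit()
        display.stop()
```

A smoke test could then be `with chrome_session() as driver: driver.get("https://example.com/")` followed by printing `driver.title`. Wrapping both resources in one context manager keeps stray Chrome and Xvfb processes from piling up when a crawl fails midway.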
$ python crawler.py target-folder/
The page ranges in "urllist.txt" were collected manually. In the future, this step should be automated.
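
As a small step in that direction, a manually recorded page range can at least be expanded into individual URLs by code. The sketch below is illustrative only: the real layout of "urllist.txt" is not described here, so it assumes a hypothetical format of one entry per line, consisting of a URL template containing `{page}`, the first page number, and the last page number, separated by whitespace. The function names are likewise made up for this example.

```python
def expand_page_range(line):
    """Expand one 'template first last' entry into a list of page URLs.

    Assumed (hypothetical) entry format:
        https://example.com/list?page={page} 1 3
    """
    template, first, last = line.split()
    # The range is inclusive on both ends.
    return [template.format(page=n) for n in range(int(first), int(last) + 1)]


def read_url_list(path):
    """Read a file of range entries and return every expanded URL in order."""
    urls = []
    with open(path) as handle:
        for line in handle:
            line = line.strip()
            # Skip blank lines and '#' comments (also an assumed convention).
            if line and not line.startswith("#"):
                urls.extend(expand_page_range(line))
    return urls
```

Discovering the last page number automatically, for example by reading a site's pagination controls, would still have to be added separately.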