A realistic scraper that extracts your Facebook friends' birthdays in a calendar format. Hopefully it will keep working for a long time, until Facebook decides to make a UI change.
- Create a new venv with `python -m venv env`
- Install the requirements with `pip install -r requirements.txt`
- Install the project as editable with `pip install -e .` (yes, with the period)
- `cd src/`
- `scrapy crawl facebook`
- Enjoy :)
Optional steps (for running under Docker/Airflow):
`docker-compose up airflow-init` - initializes the DB and creates a default user, `airflow`, with the password `airflow`.
Airflow is a great task scheduler that can create DAGs (directed acyclic graphs) of tasks.
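To make the DAG idea concrete, here is a minimal sketch in plain Python. It does not use Airflow at all: it relies on the standard library's `graphlib` to show how a scheduler can derive a run order from task dependencies. The task names (`log_in`, `crawl_birthdays`, `build_calendar`, `notify`) are made up for illustration and are not taken from this project.

```python
from graphlib import CycleError, TopologicalSorter  # standard library, Python 3.9+

# Hypothetical tasks: each key maps to the set of tasks it depends on.
dependencies = {
    "crawl_birthdays": {"log_in"},
    "build_calendar": {"crawl_birthdays"},
    "notify": {"build_calendar"},
}


def execution_order(deps):
    """Return a run order in which every task comes after its dependencies."""
    return list(TopologicalSorter(deps).static_order())


def is_acyclic(deps):
    """Return True if the dependency graph has no cycle, i.e. it is a valid DAG."""
    try:
        execution_order(deps)
    except CycleError:
        return False
    return True


order = execution_order(dependencies)
print(order)
print(is_acyclic({"a": {"b"}, "b": {"a"}}))  # two tasks waiting on each other
```

The "acyclic" part matters because a cycle would leave two tasks waiting on each other forever, so a scheduler has to reject such a graph instead of running it.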
Please run `docker-compose up airflow-init` the first time, to upgrade the DB models and create a user account that grants access to Airflow's interface.
`docker-compose up airflow-webserver airflow-worker airflow-scheduler flower` - fires up the essential services to make Airflow run.
Almost copy-pasted @stephen-fox's guide.