Extract datasets for a football team of your choice, with various types of useful information, into Excel files.
This script, in its current format takes your football team and year as input, and extracts all matches played by the team with relevant information, such as score, formations, lineups, and stats. The project has only been updated twice since its creation, but in the near future, I will be extracting more detailed events per match. Long-term, I hope to be able to extract more detailed statistics per match, as well as develop a user-friendly GUI.- Anaconda distribution of Python 3.7
- Selenium library on Python and Chrome extension, to scrape websites (make sure you install the appropriate versions
- xlsxwriter library on Python, to write information on Excel files
- Pandas library on Python, for stats tables and future computations
- fbref website, as a source for all stats
The current implementation extracts, per match, the following information:
- Date
- Competition
- Scores, including xG or Penalties
- Team Managers & Captains
- Stats such as Passing, Shots, Possession
- Formation & Lineups
- (new) Stats per player per team, on different sheets
In the near future, I hope to implement the following:
- Extract various match events
- Adding more features for dataset generation, e.g. get all formations in one file, or generate stats per player in a team (partially done!)
- Creating a GUI for the project that takes few variables as input rather than have people edit code