Skip to content

SSITB/EPL_Extract

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 

Repository files navigation

EPL_Extract

Extract datasets for a football team of your choice, with various types of useful information, into Excel files.

Description

This script, in its current format takes your football team and year as input, and extracts all matches played by the team with relevant information, such as score, formations, lineups, and stats. The project has only been updated twice since its creation, but in the near future, I will be extracting more detailed events per match. Long-term, I hope to be able to extract more detailed statistics per match, as well as develop a user-friendly GUI.

Installation/Dependencies

  • Anaconda distribution of Python 3.7
  • Selenium library on Python and Chrome extension, to scrape websites (make sure you install the appropriate versions
  • xlsxwriter library on Python, to write information on Excel files
  • Pandas library on Python, for stats tables and future computations
  • fbref website, as a source for all stats

Project Motivation

I wished to analyze the effect of frequent lineup changes made in Chelsea FC's early 2020-2021 season on their performance, but couldn't find online datasets that provided lineups for all matches consolidated in one source. So, I sought out to create those datasets myself by web scraping and automation, and I kept expanding the scope from thereon to include this information for other teams and across different years.

Current Implementation/Future Steps

The current implementation extracts, per match, the following information:

  • Date
  • Competition
  • Scores, including xG or Penalties
  • Team Managers & Captains
  • Stats such as Passing, Shots, Possession
  • Formation & Lineups
  • (new) Stats per player per team, on different sheets

In the near future, I hope to implement the following:

  • Extract various match events
  • Adding more features for dataset generation, e.g. get all formations in one file, or generate stats per player in a team (partially done!)
  • Creating a GUI for the project that takes few variables as input rather than have people edit code

image

image

image

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%