Causal Analysis of Corner Kicks

This work was selected as one of the Research Presentations at the OPTA FORUM 2024, held in London, on the 20th of March.

The Presentation Video Slide Deck used Brief Overview Learn about the Features used Check out the Results Conclusion

Brief Overview

This project proposes to use a Causal Framework to first estimate the effect of various granular level features on shot attempts, during corner kick situations. Then it shows how we can intervene on a combination of these features to distill the causal information from the data and compare how different Attacking setups perform against different Defensive Setups and also on different Corner Types.

Causal graph

Proposed Casual Model

How is this work beneficial?

A lot of things that happen during corner kicks aren't independent of each other, they are causally entangled. For eg., the attacking team knows the type of delivery they expect the corner taker to deliver, they also know the type of defensive setups they are coming up against. Utilizing a causal approach takes into account relationships like these during the modelling process. This work shows how we can use a Causal approach to account for various features and boil it down to a numerical percentage of how good a particular attack is against a particular type of defense or during a particular type of delivery. Thus better guiding the analysts/coaches on what the optimal matchups are, against a particular opposition.

We take inspiration from the work of Laurie Shaw and utilize a Causal framework to solve one of their proposed future works, which was to quantize how different Attacking setups performed against different Defensive Setups.

The current iteration of the work uses only categorical features due to limited time availability and labelled data. The main value of the proposed framework is that it can be extended to continuous features as well. xG, Goalkeeper positioning, ball speed are some continuous features that can be integrated. One of the main advantages is that the data from the training grounds can also be used as part of this analysis. As these training sessions provide the perfect environment for performing Controlled Randomized tests.

Features

How did we come up with the Proposed Causal Graph?

All the features/variables are assumed to be impacted by other features from which they have an incoming edge. All the 5 variables, Corner Type, Attacking Setup, Defense Type, Closely Marked & Dynamic Movement are assumped to be impacting Shot Attempts during a corner situation.

The way the team plans to attack the corners is assumed to be impacted by the type of delivery they expect and the defensive setup they are up against. Whether the attacking team needs to make movements to get to the ball location is dependent on the way they setup and the type of delivery they expect to be put in. This, then further impacts whether the defenders are able to closely guard the attackers or not, which is also impacted by how the defensive team sets up.

The following 6 features were extracted from the combination of Opta's tracking and events data. The original dataset comprised of 100 matches from the 2021-22 Ligue 1 season. The original and the extracted data are both proprietary, thus cannot be shared.

Already recorded in the data:

Extracted using Clustering some features:

Extracted using Thresholding the data distribution:

Results

Based on the dataset used, Outswinging corners were seen producing more shots on goal than In-swinging corners. In order to see some exploratory analysis of the different features in the dataset, I would recommend checking out the slide deck or the presentation video.

Attack vs Corner Type

Matchups - Attack vs Corner Type

Near post runs, as well as players being positioned close to the near post were seen the best shot producing options on Out-swinging corners. Near post runs were found to be the best attacking choice for in-swining corners as well.

Attack vs Defense Type

Matchups - Attack vs Defense Type

On comparing how different types of attacking setups did against the different defensive setups, we saw that against Zonal defenses, when the attackers made two way movements, they were successful in producing more shots. Near post runs were once again seen to be fruitful, this time when facing man-marking defenses.

Conclusion

The idea behind this project was to find a way to be able to quantify how different attacking setups fare against different defensive setups. Thus being able to give a data point to the coaches, of specific matchups to focus upon, against different oppositions. This framework enables us to do that while also conditioning on various intertwined features that impact shots during corner kicks. As mentioned above, the framework provides the flexibility to add on more granular level features, continuous or categorical, and condition on them. The addition of such features will also give us the ability to further compute counter factuals for each situation as well. Automated feature extraction and the use of a causal approach, that provides explainable & actionable insights, are the main advantages of this work.



Get In Touch

If you want to know more about me or my work, please don't hesitate in reaching out.