Clustering Passes to analyze what teams do frequently.
Most of this is an implementation of https://www.americansocceranalysis.com/home/2019/3/11/using-k-means-to-learn-what-soccer-passing-tells-us-about-playing-styles with a few changes of my own.
I divided all the passes of the 2017-2018 Premier League season into 50 clusters. This is how all of them look.
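As a rough illustration of the clustering step, here is a minimal pure-Python k-means sketch. It assumes each pass is represented by its start and end coordinates (x_start, y_start, x_end, y_end); that feature choice, the toy data and the function name `kmeans` are assumptions for illustration, not the code used in this project. In practice a library implementation such as scikit-learn's `KMeans` would be the usual choice, with k=50 on the full season of passes.

```python
import random

def kmeans(points, k, n_iter=50, seed=0):
    """Minimal Lloyd's k-means over a list of equal-length tuples."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # start from k passes picked at random
    labels = [0] * len(points)
    for _ in range(n_iter):
        # Assignment step: each pass goes to its nearest centroid
        # (squared Euclidean distance).
        labels = [
            min(range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centroids[c])))
            for p in points
        ]
        # Update step: each centroid moves to the mean of its passes.
        new_centroids = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                new_centroids.append(
                    tuple(sum(col) / len(members) for col in zip(*members)))
            else:
                new_centroids.append(centroids[c])  # empty cluster: keep as is
        if new_centroids == centroids:
            break  # converged
        centroids = new_centroids
    return labels, centroids

# Hypothetical toy data: three short passes in one area, three long ones in another.
passes = [
    (10.0, 30.0, 20.0, 32.0), (12.0, 28.0, 22.0, 30.0), (11.0, 31.0, 19.0, 33.0),
    (90.0, 10.0, 100.0, 40.0), (92.0, 12.0, 102.0, 38.0), (88.0, 8.0, 98.0, 42.0),
]
labels, centroids = kmeans(passes, k=2)
```

Each centroid is itself a "typical pass" (a start point and an end point), which is what makes the clusters easy to draw as arrows on a pitch.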
Based on the number of passes in each cluster, I looked at the ones that occur most frequently.
Long balls are the least frequent, probably because they are less likely to be completed.
I used pass accuracy and the average threat created by the passes in a particular cluster to find those which had the highest payoff.
Here the line widths indicate the average threat created by a pass in that cluster. Crosses, passes into the box, and the passes that set these up have the highest payoff.
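The per-cluster numbers behind this ranking can be sketched roughly as below. The field names (`cluster`, `completed`, `threat`), the toy values, and in particular the way accuracy and threat are combined into a single `payoff` score (their product) are assumptions for illustration; the exact combination used here may differ.

```python
from collections import defaultdict

def summarise_clusters(passes):
    """passes: iterable of dicts with 'cluster', 'completed' (bool), 'threat' (float)."""
    groups = defaultdict(list)
    for p in passes:
        groups[p["cluster"]].append(p)

    summary = {}
    for cluster, members in groups.items():
        n = len(members)
        accuracy = sum(p["completed"] for p in members) / n   # share of completed passes
        mean_threat = sum(p["threat"] for p in members) / n   # average threat per pass
        summary[cluster] = {
            "count": n,
            "accuracy": accuracy,
            "mean_threat": mean_threat,
            # Assumed payoff score: accuracy weighted by average threat.
            "payoff": accuracy * mean_threat,
        }
    return summary

# Hypothetical toy data: cluster 0 is safe but harmless, cluster 1 is risky but dangerous.
toy_passes = [
    {"cluster": 0, "completed": True, "threat": 0.02},
    {"cluster": 0, "completed": True, "threat": 0.04},
    {"cluster": 1, "completed": True, "threat": 0.20},
    {"cluster": 1, "completed": False, "threat": 0.10},
]
summary = summarise_clusters(toy_passes)
# Clusters ordered from highest to lowest payoff.
ranked = sorted(summary, key=lambda c: summary[c]["payoff"], reverse=True)
```

Sorting the same summary by `count` instead of `payoff` gives the most frequent clusters.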
This is an attempt to classify all progressive passes into 10 clusters.
By comparing the percentage of a team's passes that fall in a particular cluster with the same percentage for all the other teams, we can get a sense of the kinds of passes they use more than the rest and the kinds they use less than an average team. Given below are the most frequent clusters used by Burnley.
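A minimal sketch of that comparison follows, assuming each pass is reduced to a (team, cluster) pair and that the other teams' passes are pooled together before computing their percentages (averaging per-team percentages would be an equally reasonable choice). The function names and the toy data are hypothetical.

```python
from collections import Counter

def cluster_shares(cluster_labels):
    """Fraction of passes falling in each cluster."""
    counts = Counter(cluster_labels)
    total = sum(counts.values())
    return {c: n / total for c, n in counts.items()}

def team_vs_rest(passes, team):
    """passes: list of (team, cluster) pairs.

    Returns {cluster: team share minus the pooled share of all other teams}.
    Positive values mean the team uses that cluster more than the rest.
    """
    team_share = cluster_shares(c for t, c in passes if t == team)
    rest_share = cluster_shares(c for t, c in passes if t != team)
    clusters = set(team_share) | set(rest_share)
    return {c: team_share.get(c, 0.0) - rest_share.get(c, 0.0) for c in clusters}

# Hypothetical toy data with two clusters (3 and 7).
toy = [
    ("Burnley", 3), ("Burnley", 3), ("Burnley", 3), ("Burnley", 7),
    ("Other A", 3), ("Other A", 7), ("Other A", 7), ("Other B", 7),
]
diff = team_vs_rest(toy, "Burnley")
# Clusters ordered from most over-used to most under-used by the team.
over_used = sorted(diff, key=diff.get, reverse=True)
```

Working with shares rather than raw counts keeps teams that simply attempt more passes from dominating the comparison.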