Comments (4)
Those fields described by @JoGall are generated by the data cleaning functions of https://github.com/statsbomb/StatsBombR.
Got a decent understanding because I'm finishing to port them all to Python (check out https://github.com/ElSaico/pyStatsBomb in the next few days - I'll owe you all the API functionality because I lack the necessary
Shots
Shot5
, Shot6
, etc. seem to be earlier glitches from importing that already got fixed: statsbomb/StatsBombR@2e38647
All distance variables use the same unit as the positions, i.e. they're scaled to a 120x80 pitch.
DistToGoal
is exactly what it implies, but DistToKeeper
refers, counter-intuitively, to the distance between keeper and goal (!). The distance between shot and goal is in DistSGK
.
All angular variables are in degrees. AngleToGoal
and AngleToKeeper
are the opening angles formed by DistToGoal
and DistToKeeper
, respectively, while AngleDeviation
is the opening angle between both.
Freeze frames
density
and density.income
are both described in the README:
- Density is calculated as the aggregated inverse distance for each defender behind the ball.
- Density in the cone is the density filtered for only defenders who are in the cone between the shooter, and each goal post.
The other variables are:
DefendersInCone
- amount of defending players between the shooter and the goaldistance.ToD1
- distance between shooter and nearest defending playerdistance.ToD2
- distance between shooter and second-nearest defending playerInCone.GK
- whether the goalkeeper is in the path between the shooter and the goalAttackersBehindBall
andDefendersBehindBall
- self-explanatoryDefArea
- area of the smallest square that covers all opposite defenders (which means centre-backs and full-backs only)
All variables exclude the defending goalkeeper, except obviously for InCone.GK
Time
All extra time-related variables are in milliseconds and seem to have pretty descriptive names.
from open-data.
I'd also really like to know what variables like density
, density.incone
, AngleDeviation
, Shot5
, Shot6
, etc... mean, and whether variables like DistToGoal
and DistToKeeper
are given in metres or arbitrary pitch units.
from open-data.
Thanks for taking the time for such a detailed reply @ElSaico!
I thought DistToKeeper
was much lower than expected so wondered if it was given in an unexpected unit of measurement, that makes more sense! For anyone else reading, DistToKeeper
is the distance from the GK to the centre of the goal (not the nearest part of the goal line).
I didn't notice density
and density.incone
in the documentation when I first looked -- seems they'd be very useful for xG models. I haven't seen several of the other variables (e.g. DistSGK
, AttackersBehindBall
, DefArea
) as I don't think they're available in the free data but good to know.
Good luck with pyStatsBomb and making the data accessible to more people!
from open-data.
At some point we'll tidy up StatsBombR and document the inner workings of @YamStats brain, but for the most part it's provided as is to give people a bit of a leg up using the data. Happy to see issues raised in the other repo for any other improvements. In the meantime, the docs have been updated today so there shouldn't be anything in the raw data that's not covered now.
from open-data.
Related Issues (20)
- open_goal appears missing
- Missing Stadium Data in the 2003/04 PL Matches File
- Where is the current/most up-to-date PL data? Do I need full API access for that?
- Problem unzip
- Can't fully download zip file
- Location in meters
- Details of pass id 6a793934-d6d0-4e0e-a09c-7c0a69f67a0f appears incorrect
- Only data until mid-2021?
- zip file corrupted
- Stata
- Women's World Cup: erroneous competition stages
- Invalid locations for specific events: GoalKeeper, Pressure and ¿more?
- Missing counterpress column
- Champions League Matches data missing
- Versioning
- HTTP Error (503 Server Error: first byte timeout for url)
- Trouble fetching Open La Liga 360 data frames
- Check the data: Two managers are registered for the home team.
- Ligue 1 2015-2016 Matches file is empty HOT 1
- StatsBombFreeEvents() command bug HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from open-data.