|
3 | 3 | This Project was making a public dataset for professional dota leagues and matches.
|
4 | 4 | I included events from this [list](https://liquipedia.net/dota2/Tier_1_Tournaments)
|
5 | 5 |
|
6 |
| -Database Specification: |
7 |
| - |
8 |
| -**Leagues** |
9 |
| - |
10 |
| -| Field | type | Description | |
11 |
| -| ---------- | ------- | ----------------------------------------- | |
12 |
| -| leagueName | String | The name of the league | |
13 |
| -| leagueID | String | The league's dotaTV id | |
14 |
| -| prizePool | Int | The league's prizepool in USD | |
15 |
| -| startDate | Date | The league's first main event day | |
16 |
| -| endDate | Date | The league's last main event day | |
17 |
| -| teams | Array | This contains json team objects | |
18 |
| -| url | String | This string is the url source for this data | |
19 |
| - |
20 |
| - |
21 |
| -To use the steam api you must generate a key and set the environmental variable 'STEAM_API_KEY' to its value. |
22 |
| - |
23 |
| -LOCALMONGOSTR |
24 |
| -Notes: |
25 |
| - |
26 |
| -Some teams have a name mismatch between the actual name and the name on the results notably: VG.r (Vici Gaming Reborn), 4 Anchors + Sea Captain, RoX, Relax, they have not been processed. |
27 |
| - |
28 |
| -Summit 5, 6 and 7 have a duplicate eventID so I have not included those teams in the team collection, the results are still intact however |
29 |
| - |
30 |
| -Events without event IDs (There will be no match data for these events): |
31 |
| - |
32 |
| -The International 2011 |
33 |
| - |
34 |
| -ASUS Open 2012 Finals |
35 |
| - |
36 |
| -Thor Open 2012 |
37 |
| - |
38 |
| -World Cyber Games 2012 |
39 |
| - |
40 |
| -Electronic Sports World Cup 2012 |
41 |
| - |
42 |
| -The Premier League: Season 2 |
43 |
| - |
44 |
| -DreamHack Summer 2012 |
45 |
| - |
46 |
| -StarLadder StarSeries Season 1 |
47 |
| - |
48 |
| -The Premier League: Season 1 |
49 |
| - |
50 |
| -The Premier League: Season 1 |
51 |
| - |
52 |
| -Dota2 Star Championship |
| 6 | +## Database Specification |
| 7 | + |
| 8 | +### Leagues |
| 9 | + |
| 10 | +| Field | Type | Description | |
| 11 | +| ---------- | ------- | ------------------------ | |
| 12 | +| leagueName | String | The name of the league. | |
| 13 | +| leagueID | String | The league's dotaTV id. | |
| 14 | +| prizePool | Int | The league's prizepool in USD. | |
| 15 | +| startDate | Date | The league's first main event day. | |
| 16 | +| endDate | Date | The league's last main event day. | |
| 17 | +| teams | Array | This contains json team objects. | |
| 18 | +| url | String | This string is the url source for this data. | |
| 19 | + |
| 20 | +### Teams |
| 21 | + |
| 22 | +| Field | Type | Description | |
| 23 | +| ---------- | ------- | ------------------------ | |
| 24 | +| leagueName | String | The name of the league. | |
| 25 | +| result | String | The place the team got. Ties are rounded up.| |
| 26 | +| org | String | The team's organization | |
| 27 | +| players | Array | An array of {role: \<the role that player had\>), player: \<playerObject\>} | |
| 28 | + |
| 29 | +### Players |
| 30 | + |
| 31 | +| Field | Type | Description | |
| 32 | +| ---------- | ------- | ------------------------ | |
| 33 | +| tag | String | The players tag. | |
| 34 | +| playerName | String | The player's name (romanized in some cases). | |
| 35 | +| birthday | Date | The player's birthday. | |
| 36 | +| playerID | Int | The player's ID | |
| 37 | +| country | Array | An array of strings with player's nation's names. | |
| 38 | +| roles | String | The roles this player has played. | |
| 39 | +| url | String | The url source of this data. | |
| 40 | + |
| 41 | +### Matches |
| 42 | + |
| 43 | +| Field | Type | Description | |
| 44 | +| ---------- | ------- | ------------------------ | |
| 45 | +| players | Array | Array of players that contains their stats for this match. | |
| 46 | +| radiantWin | Boolean | True if radiant won. | |
| 47 | +| duration | Int | Duration of the match in seconds. | |
| 48 | +| matchID | Int | The ID of the match. | |
| 49 | +| leagueID | Int | The ID of the league. | |
| 50 | + |
| 51 | +## Dataset Notes |
| 52 | + |
| 53 | +- Some teams have a name mismatch between the actual name and the name on the results notably: VG.r (Vici Gaming Reborn), 4 Anchors + Sea Captain, RoX, Relax, they have not been processed. |
| 54 | +- Summit 5, 6 and 7 have a duplicate eventID |
| 55 | +- Players with no page on Liquipedia have not been processed. |
| 56 | +- Some events do not an eventID they are: |
| 57 | + - The International 2011 |
| 58 | + - ASUS Open 2012 Finals |
| 59 | + - World Cyber Games 2012 |
| 60 | + - Electronic Sports World Cup 2012 |
| 61 | + - The Premier League: Season 2 |
| 62 | + - DreamHack Summer 2012 |
| 63 | + - StarLadder StarSeries Season 1 |
| 64 | + - The Premier League: Season 1 |
| 65 | + - The Premier League: Season 1 |
| 66 | + - Dota2 Star Championship |
| 67 | + |
| 68 | +## Additional Resources |
| 69 | + |
| 70 | +In addition to the Dataset I also made some python classes for data collection and statistical operations. They are located in the source folder: |
| 71 | + |
| 72 | + /Dota2EsportDataset/Database_Objects/ |
| 73 | +I have also made the data collection tools available, feel free to use them to update the dataset or scan a different list of tournaments. |
| 74 | + |
| 75 | +In order to use them you will need to set a few environmental variables: |
| 76 | + |
| 77 | +- To use the steam api you must generate a key and set the environmental variable 'STEAM_API_KEY' to its value. |
| 78 | +- I was using mongoDB locally so I set the environmental variable LOCALMONGOSTR to my mongoString. |
| 79 | + |
| 80 | +For more details on the steam API see [here](https://wiki.teamfortress.com/wiki/WebAPI) |
| 81 | + |
| 82 | +Thanks to the volunteers / editors of [Liquipedia](https://liquipedia.net/dota2/Main_Page) without whom this project would not have been possible. |
| 83 | + |
| 84 | +Shoutout to [DatDota](http://www.datdota.com/) and Noxville for helping me out. |
0 commit comments