- 106306032 資管四 楊存濬
- 106306069 資管四 汪君儫
- 106306076 資管四 謝采辰
- 108304056 統計二 陳岷逸
- What are the important factors in a League of Lengends match to win?
- What champions should I choose for a higher winning rate?
- How is the relation of KDA (Kill, Death, Assist) between different roles in a match?
- Use the dataset to train model in order to predict the League of Legends match outcome.
- Online Reactive Visual Graphs via Shiny
- Link to Shiny Demo
- In "docs" folder.
-
Source:
- Take data integrity and consistency into account, we believe that professional players will have more stable performance in games. So we select active professional players in North America teams as targets of analysis, and crawl data of recent 100 matches of each player via “Riot developer API”.
-
Input Format:
-
Excel file format - (921 obs. of 320 variables)
-
Attribute Information:
- KDA = (kill+assist) / (death)
- blue_ad_kda
- blue_sup_kda
- blue_mid_kda
- blue_jungle_kda
- blue_top_kda
- firstBaron
- firstInhibitor
- firstTower
-
-
Further Data Analysis:
- In Champion, we provide User to query the winning rate of the top 10 champion in each position of blue and purple.
- In KDA, we observed the player's KDA (kill+assist) / (death).We found that the KDA data has the characteristics of left-biased peaks, so the data +1 is taken as a new feature that can be considered.
- From the data, we can see that the correlation between support and ADC is high, while the correlation between top and other teammates KDA is low. However, there is no obvious correlation between each other.
-
Data Preprocessing:
- No missing data
- In the beginning, we use covariance matrix to do the PCA, but the Scree-plots suggest that using a covariance matrix is not the correct approach for calculating the principal components.
- We chose correlation matrix to try again
-
Which method do you use?
- Decision Tree
- Naive Bayes
- Random Forest
- Support Vector Machine (SVM)
-
What is a null model for comparison?
- blue team win 0.5103149
- purple team win 0.4896851
- Assume blue team wins every match
-
How do your perform evaluation?
- Cross-validation
- Accuracy
- Area Under Curve (AUC)
-
Which metrics did you use?
- AUC / Accuracy / Sensitivity / Specificity / Precision / Recall / F1
- Decision tree: 0.9 / 0.99 / 0.9 / 0.9 / 0.91 / 0.9 / 0.9
- Naive Bayes: 0.92 / 0.99 / 0.92 / 0.93 / 0.92 / 0.92 / 0.92
- Random forest: 0.98 / 0.94 / 0.93 / 0.95 / 0.95 / 0.93 / 0.94
- Svm: 0.97 / 0.99 / 0.88 / 0.94 / 0.94 / 0.88 / 0.91
-
Is your improvement significant?
- Yes, from 0.51 to 0.98
-
What is the challenging part of your project?
- Our data mainly uses the data from the professional players, so the condition and mindset might be different from casual players.
- Might require more features to make even more precise predicitonsin the future.
- The limitation of data being able to aquire due to League of Legends official website.
- Packages you used:
- openxlsx
- corrplot
- caret
- rpart
- ROCR
- pROC
- e1071
- randomForest
- class
- ggplot2
- ggradar
- scales
- dplyr
- tibble
- shiny