League of Legends - Analysis and Prediction

Group Members

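106306032 Information Management, year 4, 楊存濬
106306069 Information Management, year 4, 汪君儫
106306076 Information Management, year 4, 謝采辰
108304056 Statistics, year 2, 陳岷逸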

Goals

Which factors matter most for winning a League of Legends match?
Which champions should I choose for a higher win rate?
How do the KDA (Kill, Death, Assist) values of the different roles in a match relate to each other?
Train a model on the dataset to predict the outcome of a League of Legends match.

Demo

Online Reactive Visual Graphs via Shiny
Link to Shiny Demo

Folder organization and its related information

Docs

In "docs" folder.

Data

Source:
- For data integrity and consistency, we assume that professional players perform more consistently across games. We therefore selected active professional players on North American teams as the targets of analysis and crawled each player's 100 most recent matches via the Riot developer API.
Input Format:
- Excel file (921 observations of 320 variables)
- Attribute Information:
  - KDA = (kills + assists) / deaths
  1. blue_ad_kda
  2. blue_sup_kda
  3. blue_mid_kda
  4. blue_jungle_kda
  5. blue_top_kda
  6. firstBaron
  7. firstInhibitor
  8. firstTower
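
The KDA definition above can be sketched in a few lines. This is an illustrative Python snippet, not the project's R code; the function name `kda` and the choice to floor deaths at 1 (to avoid dividing by zero in a deathless game) are assumptions, and the project may handle that case differently.

```python
def kda(kills: int, deaths: int, assists: int) -> float:
    """Return (kills + assists) / deaths, following the definition above."""
    # Assumption: a game with zero deaths would divide by zero, so deaths
    # are floored at 1 here. The original project may treat this differently.
    return (kills + assists) / max(deaths, 1)


# Hypothetical stat line for one player: 5 kills, 2 deaths, 9 assists.
example = kda(kills=5, deaths=2, assists=9)
print(example)
```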
Further Data Analysis:
- Under Champion, users can query the win rates of the top 10 champions in each position for the blue and purple teams.
- Under KDA, we examined each player's KDA, (kill+assist) / (death). The KDA distribution peaks toward the left, so the data plus 1 is considered as a possible new feature.
- The data show that the KDA correlation between support and ADC is relatively high, while the correlation between top and the other teammates' KDA is low. Overall, however, there is no obvious correlation among the roles.
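
The role-to-role comparison described above rests on a pairwise correlation of KDA columns. A minimal sketch, assuming Pearson correlation (the source does not say which coefficient was used) and using made-up values rather than the real dataset:

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical per-match KDA values; the column names follow the attribute
# list above, but the numbers are invented for illustration only.
blue_ad_kda = [2.0, 4.5, 6.0, 8.0, 3.0]
blue_sup_kda = [1.5, 4.0, 5.0, 9.0, 2.5]
blue_top_kda = [3.0, 1.0, 4.0, 2.0, 5.0]

print("ad vs sup:", round(pearson(blue_ad_kda, blue_sup_kda), 3))
print("ad vs top:", round(pearson(blue_ad_kda, blue_top_kda), 3))
```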
Data Preprocessing:
- The dataset has no missing data.
- We first ran PCA on the covariance matrix, but the scree plots suggested that the covariance matrix is not the right basis for computing the principal components.
- We then repeated the PCA using the correlation matrix.
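
One common reason to prefer the correlation matrix is that variables on different scales let the largest-variance variable dominate a covariance-based PCA. Whether that was the cause here is an assumption; the source only reports what the scree plots suggested. The sketch below uses invented values and hypothetical helper names to show how the two matrices differ: the correlation matrix is the covariance matrix rescaled so every variable has unit variance.

```python
import math


def covariance_matrix(columns):
    """Sample covariance matrix of a list of equal-length columns."""
    n = len(columns[0])
    means = [sum(col) / n for col in columns]
    return [
        [
            sum((a - means[i]) * (b - means[j]) for a, b in zip(ci, cj)) / (n - 1)
            for j, cj in enumerate(columns)
        ]
        for i, ci in enumerate(columns)
    ]


def correlation_matrix(columns):
    """Covariance matrix divided by the product of standard deviations."""
    cov = covariance_matrix(columns)
    sd = [math.sqrt(cov[i][i]) for i in range(len(columns))]
    size = len(columns)
    return [[cov[i][j] / (sd[i] * sd[j]) for j in range(size)] for i in range(size)]


# Invented data: a KDA-like column with a wide range and a 0/1 indicator.
kda_column = [1.0, 12.0, 3.5, 20.0, 6.0]
first_tower = [0.0, 1.0, 0.0, 1.0, 1.0]

cov = covariance_matrix([kda_column, first_tower])
cor = correlation_matrix([kda_column, first_tower])
print("covariance diagonal:", [round(cov[i][i], 3) for i in range(2)])
print("correlation diagonal:", [round(cor[i][i], 3) for i in range(2)])
```

In the covariance matrix the wide-range column has a far larger variance than the indicator, so it would drive the first principal component; in the correlation matrix both diagonals are 1.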

Code

Which method do you use?
- Decision Tree
- Naive Bayes
- Random Forest
- Support Vector Machine (SVM)
What is a null model for comparison?
- Blue team win rate: 0.5103149
- Purple team win rate: 0.4896851
- Null model: assume the blue team wins every match
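
The null model above always predicts the more frequent outcome, so its accuracy equals the share of that outcome. A minimal Python sketch with a made-up outcome list (the function name and the data are illustrative assumptions, not the project's code):

```python
from collections import Counter


def null_model_accuracy(outcomes):
    """Return the majority label and the accuracy of always predicting it."""
    majority_label, majority_count = Counter(outcomes).most_common(1)[0]
    return majority_label, majority_count / len(outcomes)


# Invented outcomes: 51 blue wins and 49 purple wins out of 100 matches.
outcomes = ["blue"] * 51 + ["purple"] * 49
label, accuracy = null_model_accuracy(outcomes)
print(label, accuracy)
```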
How do you perform evaluation?
- Cross-validation
- Accuracy
- Area Under Curve (AUC)
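
As a rough illustration of the cross-validation step, the sketch below splits row indices into k folds so that each row is used for testing exactly once. The number of folds (5) and the shuffling seed are assumptions; the source does not state how the folds were built. The row count of 921 comes from the data description.

```python
import random


def k_fold_indices(n_samples, k, seed=0):
    """Yield (train_indices, test_indices) pairs for k-fold cross-validation."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Deal the shuffled indices into k folds of near-equal size.
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        test = folds[i]
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, test


# 921 observations as in the dataset description; k = 5 is an assumption.
splits = list(k_fold_indices(n_samples=921, k=5))
for fold_number, (train, test) in enumerate(splits, start=1):
    print(fold_number, len(train), len(test))
```

A model would be fitted on each `train` set and scored on the matching `test` set, and the per-fold scores averaged.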

Results

Which metrics did you use?
- Metric order in each row: AUC / Accuracy / Sensitivity / Specificity / Precision / Recall / F1
- Decision tree: 0.9 / 0.99 / 0.9 / 0.9 / 0.91 / 0.9 / 0.9
- Naive Bayes: 0.92 / 0.99 / 0.92 / 0.93 / 0.92 / 0.92 / 0.92
- Random forest: 0.98 / 0.94 / 0.93 / 0.95 / 0.95 / 0.93 / 0.94
- SVM: 0.97 / 0.99 / 0.88 / 0.94 / 0.94 / 0.88 / 0.91
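
For reference, the metrics listed above can be derived from a binary confusion matrix plus predicted scores. The sketch below uses the standard definitions with invented counts and scores; it is not the project's evaluation code (which, judging by the package list, likely relied on R packages such as caret, ROCR, and pROC). Note that recall and sensitivity are the same quantity.

```python
def binary_metrics(tp, fp, tn, fn):
    """Standard metrics from true/false positive and negative counts."""
    sensitivity = tp / (tp + fn)  # also called recall
    specificity = tn / (tn + fp)
    precision = tp / (tp + fp)
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    f1 = 2 * precision * sensitivity / (precision + sensitivity)
    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "precision": precision,
        "recall": sensitivity,
        "f1": f1,
    }


def auc(scores, labels):
    """AUC as the probability that a positive outscores a negative (ties count half)."""
    positives = [s for s, y in zip(scores, labels) if y == 1]
    negatives = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in positives for n in negatives)
    return wins / (len(positives) * len(negatives))


# Invented confusion counts and scores, for illustration only.
metrics = binary_metrics(tp=45, fp=5, tn=40, fn=10)
for name, value in metrics.items():
    print(name, round(value, 3))
print("auc", auc([0.9, 0.1, 0.8, 0.3], [1, 1, 0, 0]))
```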
Is your improvement significant?
- Yes, from 0.51 (null model) to 0.98 (random forest)
What is the challenging part of your project?
- Our data come mainly from professional players, whose condition and mindset may differ from those of casual players.
- More features may be needed to make more precise predictions in the future.
- The data we could acquire were limited by what the official League of Legends website makes available.

Reference

Packages you used:
- openxlsx
- corrplot
- caret
- rpart
- ROCR
- pROC
- e1071
- randomForest
- class
- ggplot2
- ggradar
- scales
- dplyr
- tibble
- shiny
