Comments (11)
24/10: Read through Game Theory-Based Opponent Modeling in Large Imperfect Information Games (first pass excl. full experimental results). Created Github project and issues to track progress and provide stepping stones in the right direction.
from cs344-opponent-exploitation-poker.
25/10 Read part of Morrhill and von Stengel and wrote their entries into the GitHub issue
from cs344-opponent-exploitation-poker.
26/10 Read part of intro to CFR again, researched what Poker game to play primarily looking at work by Jeary. Decided on 2 player Limit texas hold 'em for all initial work. Defined future work and created the milestone for the progress report.
from cs344-opponent-exploitation-poker.
28/10 Supervisor meeting, added more issues to Github
Tabula: "Discussed which game to focus on, abstraction methods, opponent exploitation and potentially using EFR vs CFR and what benefits that might bring."
from cs344-opponent-exploitation-poker.
31/10 Read part of Evaluating State-Space Abstractions in Extensive-Form Games . Reading Hindsight and Sequential Rationality of Correlated Play to help understand EFR.
1/11 Continued reading Hindsight and Sequential Rationality of Correlated Play and started reading the follow on paper.
from cs344-opponent-exploitation-poker.
2/11 Supervisor meeting : "Discussed EFR, I will write on the motivation and intuition for the algorithm. Project will likely look at how opponent exploitation can be applied along such an algorithm."
Started with EFR writeup
from cs344-opponent-exploitation-poker.
3/11 Continued with EFR writeup. Found that deviations permitted by EFR results in a payoff increase when playing against a more static opponent (data from Morhill's EFR paper). Would be interesting to see performance against exploitable opponents.
from cs344-opponent-exploitation-poker.
6/10 Continued with EFR writeup in Overleaf.
from cs344-opponent-exploitation-poker.
07/11 Understanding line by line the EFR algorithm. Continued with EFR writeup, added EFR algorithm, wrote notes on EFR. Developed more intuition regarding correlated equilibria. Need to further develop understanding of CFR by using https://www.ma.imperial.ac.uk/~dturaev/neller-lanctot.pdf.
from cs344-opponent-exploitation-poker.
08/11 Reading CFR intro thoroughly to help to writeup
from cs344-opponent-exploitation-poker.
09/11 Started progress report added literature review section and project management section.
from cs344-opponent-exploitation-poker.
Related Issues (20)
- Create a beautiful AI presentation HOT 1
- Create list of presentation requirements
- Draft a skeleton structure for the presentation slides
- Implement Jam/Fold opponent in OpenSpiel
- Create exploitability results for presentation
- Add remaining deviation types HOT 1
- TIPS deviation type
- Behavioural deviation type
- Add background information slides
- Add EFR slides
- Add methodology slides
- Add project management slides
- Add evaluation slides
- Introduction slides
- Future work using online learning
- MCCFR Opponent HOT 1
- Epsilon Exploitable Opponent HOT 1
- Relevant work
- Review literature around general reinforcement learning
- Write full final report structure
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from cs344-opponent-exploitation-poker.