bupticybee / alphanlholdem Goto Github PK

View Code? Open in Web Editor NEW

42.0 5.0 12.0 2.24 MB

An unoffical implementation of AlphaHoldem. 1v1 nl-holdem AI.

License: GNU Affero General Public License v3.0

Python 16.57% JavaScript 2.75% HTML 0.50% Jupyter Notebook 80.18%

alpha-go alphastar artificial-intelligence reinforcement-learning texas-holdem texas-holdem-poker

alphanlholdem's People

Contributors

Stargazers

Watchers

Forkers

williamyuanv0 tzuren solips1sm wildcamel web-logs2 cat-zzz sjtuwy lizhongguo dashbaord202401 zhigal memsyi

alphanlholdem's Issues

the file agi.mis is missing

Hi! icybee,
The file agi.mis in train_league.py might be missing.

Issues in RLCard environment

Thanks for the great work!

Would you mind sharing more details about the issues in the RLCard environment? We want to fix them.

Trinal-Clip PPO in Alphaholdem is not used？

Thanks for your wonderful project. But I wonder why Trinal-Clip PPO in Alphaholdem is not used.

Issues with available releases of the Ray Library?

Installation Requirements Outdated?

ERROR: Could not find a version that satisfies the requirement tensorflow==1.15.2 (from versions: 2.8.0rc0, 2.8.0rc1, 2.8.0, 2.8.1, 2.8.2, 2.8.3, 2.8.4, 2.9.0rc0, 2.9.0rc1, 2.9.0rc2, 2.9.0, 2.9.1, 2.9.2, 2.9.3, 2.10.0rc0, 2.10.0rc1, 2.10.0rc2, 2.10.0rc3, 2.10.0, 2.10.1, 2.11.0rc0, 2.11.0rc1, 2.11.0rc2, 2.11.0, 2.11.1, 2.12.0rc0, 2.12.0rc1, 2.12.0, 2.12.1, 2.13.0rc0, 2.13.0rc1, 2.13.0rc2, 2.13.0, 2.13.1, 2.14.0rc0, 2.14.0rc1, 2.14.0)
ERROR: No matching distribution found for tensorflow==1.15.2

Written with an extremely outdated library

This was apparently written in 2023 but is using a version of ray released in early 2020. Not sure why that is but it makes it impossible to use because the ray 0.8.3 branch does not contain tune. The only version of ray[tune] that is available is for the current releases and not the legacy releases.

Issues with available releases of the Ray Library?

Is there any way to make this work on any of the Ray releases that are available for install using pip? The furthest version that I can go back to is 0.8.6, but there seems to be a lot of unsupported code when I tried that version.

Is 0.8.3 a must? If so, are there any sources for it still available that I can compile for my machine?

Thanks in advance!

How do you judge/track the convergency of the holdem model?

Even after ~ 1 billion self-play, over 1000 checkpoints, the model seems still not converge

Thanks for your wonderful project. May I ask how do you judge the convergency of a self-play game model? Are there any evaluation metrics recommended?

I don't think evaluating with rewards (or utilities) is a good choice, because it is continuously self-improving on both sides at the training stage.

bupticybee / alphanlholdem Goto Github PK

alphanlholdem's People

Contributors

Stargazers

Watchers

Forkers

alphanlholdem's Issues

the file agi.mis is missing

Issues in RLCard environment

Trinal-Clip PPO in Alphaholdem is not used？

Issues with available releases of the Ray Library?

Installation Requirements Outdated?

Written with an extremely outdated library

Issues with available releases of the Ray Library?

How do you judge/track the convergency of the holdem model?

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent