rule-based AI for Dou Dizhu maybe help the agent train with reasonable actions ? impro

<a href="https://paperswithcode.com/paper/combinational-q-learning-for-dou-di-zhu" rel

rule-based AI for Dou Dizhu can guidance the agent train with astrict actions? about rlcard HOT 5 CLOSED

datamllab commented on May 19, 2024

rule-based AI for Dou Dizhu can guidance the agent train with astrict actions?

from rlcard.

Comments (5)

daochenzha commented on May 19, 2024

@sunnyForIOS I think so. One way could be to generate self-play data with rules. We can use supervised learning to train an initial model. Then keep training the model with NFSP or DQN.

We are developing some rules, which should be available around the end of this year.

from rlcard.

AIMan-Zzx commented on May 19, 2024

so should we change the kicker net such as three with one, because the RHCP AI do not decode the kicker action with the same method which we use now . by the way ,I have a idea, when any player has only two or one card, we can use guess his handcards then use mcst to train replace the previous net,

from rlcard.

daochenzha commented on May 19, 2024

@sunnyForIOS I agree that using mcts when any player has only two or one hands would be a good idea. The search space is reasonable for mcts, which may deliver better performance than the neural net.

I think kicker net could also be trained based on the rules? But I am not sure whether it is applicable or not.

from rlcard.

AIMan-Zzx commented on May 19, 2024

Combinational Q-Learning for Dou Di Zhu in this paper the rule-based AI RHCP player is nomel level except no cooperate
maybe you had read it

from rlcard.

daochenzha commented on May 19, 2024

Yeah, I have read this paper. RHCP seems to be a good rule baseline to guide the agent.

from rlcard.

Recommend Projects

rule-based AI for Dou Dizhu can guidance the agent train with astrict actions? about rlcard HOT 5 CLOSED

Comments (5)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent