
Comments (7)

zqwerty commented on June 8, 2024

In multi-domain dialogue, a single turn often contains multiple dialogue acts. DQN cannot handle this directly, because it selects one discrete action per step. In ConvLab, the most frequent dialogue act combinations are treated as actions, which cannot cover every possible combination. That said, we welcome contributions that add more RL methods. You can follow the structure of the existing RL policies in ConvLab-2. We look forward to your success!

from convlab-2.

sherlock1987 commented on June 8, 2024

Hey, my focus is the reward function side of RL, so there are not many RL policies I can use here: only PPO and PG. Are there any other RL methods I could implement on this platform?

I ran some statistics, and you are right: there are more than 8000 actions in this multi-domain dataset.


zqwerty commented on June 8, 2024

You could try DQN, A3C, A2C, or any other that you are interested in.


sherlock1987 commented on June 8, 2024

Thanks, but I am not sure DQN will work. ConvLab has only 300 action combinations in total, whereas ConvLab-2 has more than 8000. Given that both use the same dataset, do you know why ConvLab-2 has so many more combinations?


sherlock1987 commented on June 8, 2024

Hey, about the action space, do you guys have any pointers? For example, should I add some kind of action encoder to ConvLab-2, so that I can narrow the action space from 8000 to 300 by considering only the most frequent actions?


zqwerty commented on June 8, 2024

In ConvLab, we select the most frequent action combinations from all combinations in the dataset (https://github.com/ConvLab/ConvLab/blob/master/data/multiwoz/da_slot_cnt.json). You can try this approximation in ConvLab-2, too.
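As a rough illustration of this "most frequent combinations" approximation, here is a minimal sketch in plain Python. It is not ConvLab or ConvLab-2 code: the function name `build_action_vocab`, the `(intent, domain, slot)` tuple format, and the toy turns are all assumptions made for the example, and the layout of `da_slot_cnt.json` is not assumed here.

```python
from collections import Counter


def build_action_vocab(turns, top_k):
    """Map the top_k most frequent dialogue act combinations to action ids.

    `turns` is an iterable of system turns, each a list of hypothetical
    (intent, domain, slot) tuples. Order inside a turn is ignored, so each
    combination is stored as a frozenset.
    """
    counts = Counter(frozenset(turn) for turn in turns)
    most_common = counts.most_common(top_k)
    # Each kept combination becomes one discrete action for a DQN-style policy.
    vocab = {combo: idx for idx, (combo, _) in enumerate(most_common)}
    # Fraction of turns whose combination is still representable.
    total = sum(counts.values())
    coverage = sum(n for _, n in most_common) / total if total else 0.0
    return vocab, coverage


# Toy data for illustration only; real turns would come from the dataset.
toy_turns = [
    [("Inform", "Hotel", "Price")],
    [("Inform", "Hotel", "Price")],
    [("Request", "Hotel", "Area"), ("Inform", "Hotel", "Price")],
    [("Inform", "Hotel", "Price"), ("Request", "Hotel", "Area")],
    [("Bye", "General", "none")],
]

vocab, coverage = build_action_vocab(toy_turns, top_k=2)
print(len(vocab), coverage)
```

The coverage value shows the trade-off: a smaller `top_k` gives a smaller action space, but turns whose combination falls outside the vocabulary can no longer be produced exactly by the policy.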


truthless11 commented on June 8, 2024

Please refer to #96

