Code Monkey home page Code Monkey logo

Comments (3)

wadefk avatar wadefk commented on August 25, 2024 1

hello,koukou

from kobe.

qibinc avatar qibinc commented on August 25, 2024 1

Hi @jiqiujia ,

Thanks for your interest in this work!

I'm glad that someone finally noticed that part of code! Indeed we've tried to add RL objective and optimize with policy gradient. But unfortunately, they didn't work well enough, so we gave it up before the submission of the paper.

Specifically, we've tried two types of reward, they are:

BLEU reward

As many people did (e.g., https://arxiv.org/pdf/1705.04304), BLEU is straight-forward for combining RL and NLP. We expected an improved BLEU after adding this objective. However, it doesn't work and slows down the training. Also no significant improvement is shown in the generated descriptions, so we gave up BLEU reward.

Attractiveness reward

Our ultimate objective in this paper is to generate good product descriptions, which means the more users are interested and attracted, the better. However, the current objective is merely maximizing the likelihood of generated product descriptions in the dataset, which ignores the users' reaction (e.g., click through rate, page stay time).

Considering this issue, we designed an "attractiveness reward function", which is a trained classifier. This classifier is trained to predict user CTR (Click Through Rate) of a product description. With this reward, we expect the generated descriptions to be more attractive to users (in Chinese, more like 标题党). However, the model tends to overfit the reward function, (i.e., the reward function gives a very high reward but the generated description is not actually attractive judged by human).

To address this problem, we considered adversarial training of the seq2seq model and the classifier (reward giver) to be necessary. We didn't have enough time back then and left that part for future work.

If you are interested in this direction and want further discussions, contact me on my Messenger or add my WeChat: chenqibin99

Best,
Qibin Chen

from kobe.

jiqiujia avatar jiqiujia commented on August 25, 2024

I get it. Thank you for your reply.

from kobe.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.