Code Monkey home page Code Monkey logo

recosa's People

Contributors

zhanghainan avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

recosa's Issues

Want Code

您好!看了你的论文 ReCoSa: Detecting the Relevant Contexts with Self-Attention for
Multi-turn Dialogue Generation,就直接过来了想看看您的源码,不知道什么时候可以release?

和transformer的差别

首先感谢作者公开的源码
源代码中是使用tensorflow写的,但是我对tensorflow并不熟悉
请问作者 我是否可以 使用pytorch的transformer 在最前面加一层lstm encoder层
万分感谢

Why you convert the whole dataset to tensor?

We conduct the Ubuntu experiment follow your pipeline, but we get "ValueError: Cannot create a tensor proto whose content is larger than 2GB". We found that you convert the whole training data to tensor in load_data.py:

def get_batch_data():

# Load data
X,X_length, Y, sources,targets = load_train_data()
# calc total batch count
num_batch = len(X) // hp.batch_size
# Convert to tensor
X = tf.convert_to_tensor(X, tf.int32)
Y = tf.convert_to_tensor(Y, tf.int32)

We are sure that the Ubuntu dataset is mush larger than 2GB, so we are confused how did you do the Ubuntu experiment?

how to obtain the jd dataset

您好,请问论文中用到的京东对话数据集有什么方式获取呢,目前京东官方已经不提供下载了

上下文输入

你好!十分感谢你的开源代码。只是在阅读时,有一处细节不太了解——在模型输入时,你将多轮的上下文拆解成多个样本。即,

The dialogue data:Hello How are you? Good, you? I'm fine, what's new?
Souce looks like:

Hello How are you?

Hello How are you? Good, you?

Hello How are you? Good, you? I'm fine, what's new?

Target:

Good, you?

I'm fine, what's new?

Nothing much...

请问,这是多轮对话的通用处理方式吗?还是直接将多轮的上下文作为输入,也可以?

可能这个问题略显幼稚,但还是期待你的回复。

Run the code on DailyDialog but have terrible result

Hi, thanks for your open source codes of this work.
I try to apply your code on a new dataset DialogDialog, but I found that the outputs of the model are all the token '.' which means nothing.

So, I'm very curious that if this code is not appropriate to other datasets?
Can you help me troubleshoot the issue?

Question about the Transformer Decoder

Hi. When I employ the Transformer Encoder-Decoder framework on dialogue generation task, there are two serious issues on the generated results. First, the generated results are always begin with word 'i'; second, the repetition badly weakens the performance.

Target: hmm i guess on the 28th
Predict: i i am i the bus of and and and and and and are and and and and and and and and and and and

The above is a typical bad example. Did you face such issue? Could you provide some suggestions to handle this problem? Thank you.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.