gansfallingshort's People

Contributors

lucascaccia, optimass, pclucas14

gansfallingshort's Issues

Which args should I use to train SeqGAN?

Hi,

Sorry, one more question: which args should I use to train SeqGAN (not LeakGAN)?
Is it the one in real_data_experiments/trained_models/news/word/best_gan/args.json?
It has "seqgan_reward": 0, so I guess that one is for LeakGAN?

Thanks!

best_mle

There is no best_mle directory in trained_models/news/word/. Could you please provide the best_mle model? Thanks very much.

Reproduce best_mle_temp_rlm_score

I ran score_models.py to evaluate the LM and RLM, loading the trained_models/news/word/best_mle/models/gen80th model, but the RLM results I got were worse than those in your best_mle_temp_rlm_score.csv.
Could the parameter settings during RLM training be incorrect? I used the parameter settings from get_rlm_args in your code, with args.mle_epochs=80.
With alpha=1.0 I measured rlm=4.02, which is noticeably worse than the reported 3.991.
Thanks.

Issue running CoT

Hi, I like the paper very much and have recommended it to people in my group; thanks for the good work.
A question about the code: it seems that you implemented CoT. I tried to run it using real_data_experiments/trained_models/news/word/best_CoT_nlltest/args.json, but I hit this error:

    File "main.py", line 203, in main
        avg_accs += [(fake_acc+real_acc)/2]
    UnboundLocalError: local variable 'fake_acc' referenced before assignment

I guess fake_acc should not be calculated when training with CoT. Is the code ready to run CoT, or does it need revision?
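If that guess is right, one minimal workaround (a sketch only, not a verified fix for the repo; the helper name here is illustrative) is to accumulate the discriminator accuracies only when they were actually computed, so a skipped discriminator step no longer trips the UnboundLocalError:

```python
def accumulate_accuracy(avg_accs, fake_acc=None, real_acc=None):
    """Append the mean discriminator accuracy only when both halves exist.

    A CoT-style step may skip the discriminator update, leaving
    fake_acc/real_acc unset; in that case we simply don't record anything.
    """
    if fake_acc is not None and real_acc is not None:
        avg_accs.append((fake_acc + real_acc) / 2)
    return avg_accs

# A step that computed both accuracies appends their mean:
accs = accumulate_accuracy([], fake_acc=0.5, real_acc=0.75)
# A CoT-style step that skipped the discriminator leaves the list unchanged:
accs = accumulate_accuracy(accs)
```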
Thanks!

Calculating Self-BLEU scores

According to the original Self-BLEU paper (Zhu et al., 2018), each generated sentence should be compared against all the *other* generations as references.

The current Self-BLEU implementation includes the selected hypothesis in its own list of references. This risks inflating the Self-BLEU scores, since there will always be an exact match between the hypothesis and one of the references.

    def get_bleu(self):
        ngram = self.gram
        bleu = list()
        reference = self.get_reference()
        weight = tuple((1. / ngram for _ in range(ngram)))
        with open(self.test_data) as test_data:
            for hypothesis in test_data:
                hypothesis = nltk.word_tokenize(hypothesis)
                bleu.append(nltk.translate.bleu_score.sentence_bleu(reference, hypothesis, weight,
                                                                    smoothing_function=SmoothingFunction().method1))
        return sum(bleu) / len(bleu)

I understand that this matches the Texygen implementation as-is, but I was wondering whether we should remove the target hypothesis from the set of references, or am I missing something here?
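A possible leave-one-out variant might look like the sketch below (the function name and the choice to pass pre-tokenized sentences are illustrative, not from the repo; it reuses NLTK's sentence_bleu as in the quoted code):

```python
from nltk.translate.bleu_score import sentence_bleu, SmoothingFunction

def self_bleu_leave_one_out(sentences, ngram=3):
    """Average BLEU of each sentence against all *other* sentences.

    `sentences` is a list of token lists. Excluding the hypothesis from
    its own reference set removes the guaranteed exact match that
    inflates the score when the hypothesis is left in.
    """
    weight = tuple(1.0 / ngram for _ in range(ngram))
    smoothing = SmoothingFunction().method1
    scores = []
    for i, hypothesis in enumerate(sentences):
        references = sentences[:i] + sentences[i + 1:]  # leave one out
        scores.append(sentence_bleu(references, hypothesis, weight,
                                    smoothing_function=smoothing))
    return sum(scores) / len(scores)
```

Note that a fully duplicated corpus still scores 1.0 under leave-one-out (some other reference matches exactly), which is the desired diversity signal; only the self-match artifact is removed.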

Thanks for the help in advance

GAN training

While using the best_mle parameters, I found that this code also runs GAN training.
