
text_simplification's Introduction

Integrating Transformer and Paraphrase Rules for Sentence Simplification

Paper Link: http://www.aclweb.org/anthology/D18-1355

Note some improvements over the original EMNLP paper:

  • We modified the code to support subword units, and the model performs well.
  • We found that replacing named entities (e.g., replacing John with person0) might not be a good idea, since it loses information. Instead, subword units help reduce the huge vocabulary caused by named entities (see the tokenizer sketch after this list).
  • We found that the context (memory) addressing is probably redundant; without it, the model achieves the same or even better performance.
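
As an illustration of the subword point above, here is a minimal sketch (using the Hugging Face transformers package, not this repository's own pipeline) of how a BERT subtoken vocabulary splits a rare name into known pieces instead of mapping it to a placeholder:

# Illustrative only: uses the Hugging Face `transformers` tokenizer, not this
# repo's code, to show how subwords handle rare named entities.
from transformers import BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
print(tokenizer.tokenize("Saptono visited Pittsburgh"))
# e.g. ['sap', '##ton', '##o', 'visited', 'pittsburgh'] -- the name is kept
# (as pieces) rather than being replaced by a placeholder such as person0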

Data Download:

https://drive.google.com/open?id=132Jlza-16Ws1DJ7h4O89TyxJiFSFAPw7

Pretrained Model Download:

https://drive.google.com/open?id=16gO8cLXttGR64_xvLHgMwgJeB1DzT93N

Commands to run the model:

python model/train.py -ngpus 1 -bsize 64 -fw transformer -out bertal_wkori_direct -op adagrad -lr 0.01 --mode transbert_ori -nh 8 -nhl 6 -nel 6 -ndl 6 -lc True -eval_freq 0 --fetch_mode tf_example_dataset --subword_vocab_size 0 --dmode wk --tie_embedding all --bert_mode bert_token:bertbase:init --environment aws --memory direct

python model/eval.py -ngpus 1 -bsize 256 -fw transformer -out bertal_wkori_direct -op adagrad -lr 0.01 --mode transbert_ori -nh 8 -nhl 6 -nel 6 -ndl 6 -lc True -eval_freq 0 --subword_vocab_size 0 --dmode wk --tie_embedding all --bert_mode bert_token:bertbase:init --environment aws
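
If you prefer to launch both steps from Python, here is a minimal, equivalent sketch using subprocess (the flags are exactly those from the two commands above):

# Run training and then evaluation with the same flags as the commands above.
import subprocess

common = [
    "-fw", "transformer", "-out", "bertal_wkori_direct",
    "-op", "adagrad", "-lr", "0.01", "--mode", "transbert_ori",
    "-nh", "8", "-nhl", "6", "-nel", "6", "-ndl", "6",
    "-lc", "True", "-eval_freq", "0", "--subword_vocab_size", "0",
    "--dmode", "wk", "--tie_embedding", "all",
    "--bert_mode", "bert_token:bertbase:init", "--environment", "aws",
]

subprocess.run(["python", "model/train.py", "-ngpus", "1", "-bsize", "64",
                "--fetch_mode", "tf_example_dataset", "--memory", "direct", *common],
               check=True)
subprocess.run(["python", "model/eval.py", "-ngpus", "1", "-bsize", "256", *common],
               check=True)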

Argument instructions

  • bsize: batch size
  • out: the output folder, which will contain the log, the best model, and the result report
  • tie_embedding: "all" ties the encoder, decoder, and projection weights to a single embedding matrix; we found this can speed up training (see the sketch after this list)
  • bert_mode: how BERT is used; "bert_token" indicates we use the subtoken vocabulary from BERT, and "bertbase" indicates we use the BERT base version (due to memory constraints, we have not tried the BERT large version yet)
  • environment: the path configuration of the experiment; please change it in model/model_config.py to fit your system
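
To clarify what --tie_embedding all does, here is an illustrative TensorFlow sketch (not the repository's actual implementation): the encoder input embedding, the decoder input embedding, and the output projection all reuse a single weight matrix, which is why tying can speed up training, as noted above.

import tensorflow as tf

# Illustrative sketch of --tie_embedding all, not the repo's actual code.
vocab_size, hidden_dim = 30522, 512
shared_w = tf.Variable(tf.random.normal([vocab_size, hidden_dim]), name="shared_w")

def embed(token_ids):
    # used by both the encoder and the decoder input layers
    return tf.nn.embedding_lookup(shared_w, token_ids)

def project_to_logits(decoder_states):
    # decoder_states: [batch, hidden_dim]; the output projection reuses the
    # same matrix, transposed, instead of a separately learned weight
    return tf.matmul(decoder_states, shared_w, transpose_b=True)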

More configuration options can be found in util/arguments.py.

Citation

Zhao, Sanqiang, et al. "Integrating Transformer and Paraphrase Rules for Sentence Simplification." Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. 2018.

@article{zhao2018integrating,
  title={Integrating Transformer and Paraphrase Rules for Sentence Simplification},
  author={Zhao, Sanqiang and Meng, Rui and He, Daqing and Andi, Saptono and Bambang, Parmanto},
  journal={arXiv preprint arXiv:1810.11193},
  year={2018}
}


text_simplification's Issues

Where is L_critic implemented?

The file loss.py seems to contain a function for sequence loss, but not critic loss. In fact, after searching the repo for "critic_loss", "critic", "crit", and similar terms, I haven't been able to find the function anywhere. Do you mind referring me to the place in the code where L_critic is implemented?

Thanks for your time!

Model output

Hi, would it be possible for you to share your model output on the Newsela and Turk datasets?

Thank you!

How to obtain the *.map and *.features files?

Can you please provide instructions on how to get the *.map files (for example ../text_simplification_data/test2/nmap/test.8turkers.tok.map) and the *.features files (for example text_simplification_data/test2/ncomp/test.8turkers.tok.norm.features) using your code?

It would be great if you could share the data files you used to be able to run the code, thanks!
