Code Monkey home page Code Monkey logo

plm_annotator's Introduction

Hi there 👋 I'm Xiachong Feng.

Xiachong's GitHub stats

plm_annotator's People

Contributors

xcfcode avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

plm_annotator's Issues

关于第一步get loss的问题

两个问题:

  1. python get_loss.py -d ami 处理速度慢;

每个样例数据需要处理大约1~2min,请问是否为正常速度?

  1. 第二步中python recover_word_loss.py -d [自己数据集]报错,
    Load train_loss.json finished, Data size:100 Load valid_loss.json finished, Data size:30 Load test_loss.json finished, Data size:30 Traceback (most recent call last): File "recover_word_loss.py", line 90, in <module> process(train_datas, dataset, "train") File "recover_word_loss.py", line 70, in process res.append(process_one(data)) File "recover_word_loss.py", line 62, in process_one words, losses = recover_word_level(subwords, losses) # recover word-level losses File "recover_word_loss.py", line 49, in recover_word_level assert len(dialogue.split()) == len(word_level_losses.split()) AssertionError
    我猜测可能是datasets可能有固定json格式?我用的是[["summary1", "dialogues1"], ... ["summary_N_", "dialogues_N_"]]这种格式。
    其中summary为“ summary ”
    dialogue为“utterence1\n utterence2\n ... ”
    如果有严格的格式要求,还望麻烦告知一下!

感谢您的工作,也希望您能尽快解决我的问题,不胜感激!

Re-implementation results

Hi,
I tried to re-implement the results on SAMSum following the README.md. However, the rouge score I got are only:
rouge-1: 55.71 55.75 53.04
rouge-2: 29.65 29.20 27.85
rouge-l: 51.50 51.57 49.71
ROUGE 1-2-L F: 53.04-27.85-49.71

I can get the exact matched scores shown in your paper by the generated summaries provided:
rouge-1: 57.90 54.90 53.70
rouge-2: 31.22 29.53 28.79
rouge-l: 53.72 51.54 50.81
ROUGE 1-2-L F: 53.70-28.79-50.81

I tested with the best checkpoint which actually didn't update after the first several epochs.
Is this variance acceptable? Are there any details I may lost before?

Thank you very much!

能分享一下源码吗

你的另一篇论文,Incorporating Commonsense Knowledge into Abstractive Dialogue
Summarization via Heterogeneous Graph Networks,事实感知的我挺感兴趣的,但是没有找到源码,方便分享一下嘛

Code for Data Preprocessing

Hello, can you release the code for giving annotations about keywords extraction, redundancy detection, and topic segmentation? We want to process some new datasets.

Thanks!

Max sentence length shouldn't exceed 800

I download your model and put it in ckpt to reproduce your reslults, but it runs error "max sentence length shouldn't exceed 800". I used the data download from here, and ran bpe.sh and binarise.sh to preprocess them. If there is any wrong? I would appreciate if got your reply.

Special Tokens Representation

Hello! Do these special tokens have special representations while doing the bpe tokenization? Or just treat them as a normal word. Because I haven't found any of the ##KEY## [TS] [RDD] in the encoder.json which is provided by your Google Drive.
I would appreciate if got your reply. @xcfcode

Rouge Score

Hello.
Could you please tell the rouge score in your paper is r, p or f? And for bart baseline , did you do any preprocessing like deleting \n\r or add some special tokens ? thanks

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.