Code Monkey home page Code Monkey logo

hgsurvey's Introduction

HGSurvey

A collection of papers and resources related to Large Language Models.

The organization of papers refers to our survey "Neural Headline Generation: A Comprehensive Survey".

Summary of the models and frameworks used for headline generation tasks

Strategy Work Code Model Description Learning Framework
Extractive MLRank[87] Transformer+ Non-autoregressive Seq2seq
Abstractive NACC[137] [Code] Transformer+ Non-autoregressive Seq2seq
NAUS[136] [Code] Transformer+ Non-autoregressive Seq2seq
TI-C-NHG[171] Transformer + Copy Mechanism Seq2seq
IT5[151] [Code] T5 Transfer Learning
Heng[57] UNILM Adversarial Learning
Amin et al.[91] GRU + Attention Seq2seq
Kanungo et al.[70] MLM + Transformer Reinforcement Learning
SLGen[95] GCN + GRU + Attention Seq2seq
DeepTitle[6] BERT Transfer Learning
HG-News[3] GPT-2 + Pointer Network Transfer Learning
Matsushita et al.[94] BART Transfer Learning
PENS[30] [Code] Transformer + Pointer Network Seq2seq
ZmBART[80] mBART Transfer Learning
Shavrina et al.[172] ruGPT-3 Fine-tuning
CNHG[161] BiGRU + Attention Reinforcement Learning
Shu et al.[130] Variational Auto-Encoder+RNN Seq2seq
Littman et al.[144] BERT Fine-tuning
Scarlatos et al.[67] [Code] GPT-2 Fine-tuning
Jang et al.[170] Transformer Adversarial Learning
Hybrid MuD2H[96] GCN + BiLSTM Seq2seq
SHEG[5] GRU + CNN + BiLSTM Seq2seq
Song et al.[104] BiLSTM Reinforcement Learning
Liu et al.[78] BERT Transfer Learning

Summary of style infusion strategies for headline generation tasks

Methods Work Code Description Style
Pre-processing Littman et al.[144] Construct datasets Satirical
Lorenzo et al.[127] Train on style datasets
In-process Zhan et al.[173] Multi-task learning
Song et al.[104] Popularity classifier Attractive
TitleStylist[10] [Code] Multi-task learning Clickbait
Li et al.[74] Style constraints module Attractive
Shu et al.[130] Style discriminator Clickbait
Xu et al.[8] Reinforce learning Clickbait
Post-processing Alnajjar et al.[9] Word replacement Humor
Stegeren et al.[1] Word replacement
Alnajjar et al.[77] Headline replacement Metaphorical
Fu et al.[193] Style transfer model

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.