
iterative_summarization's Introduction

Iterative summarization interpretability

An experiment on iterative summarization, potentially linked to Natural Abstraction. Made for the Alignment Jam #4; see the write-up for more details.
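As a rough illustration of the idea, here is a minimal sketch, assuming "iterative summarization" means feeding a model's summary back in as its next input. The names `iterate_summaries` and `first_half_of_words` are hypothetical and do not come from the repository; the toy summarizer only stands in for the fine-tuned GPT-2 model so that the loop can be run without any weights.

```python
from typing import Callable, List


def iterate_summaries(
    text: str,
    summarize: Callable[[str], str],
    max_steps: int = 5,
) -> List[str]:
    """Apply `summarize` repeatedly, feeding each summary back in as input.

    Returns the chain [text, summary_1, summary_2, ...]. Stops early when a
    step leaves the text unchanged (a fixed point) or after `max_steps` steps.
    """
    chain = [text]
    for _ in range(max_steps):
        summary = summarize(chain[-1])
        if summary == chain[-1]:
            break  # fixed point: summarizing again changes nothing
        chain.append(summary)
    return chain


def first_half_of_words(text: str) -> str:
    """Toy stand-in for a model (assumption): keep the first half of the words, rounded up."""
    words = text.split()
    keep = (len(words) + 1) // 2
    return " ".join(words[:keep])


if __name__ == "__main__":
    for step, summary in enumerate(
        iterate_summaries("a b c d e f g h", first_half_of_words)
    ):
        print(step, summary)
```

Any callable that maps a string to a shorter string can be passed as `summarize`, so a wrapper around a real model's generation call could replace the toy function without changing the loop.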

Related work and sources

Demo notebooks

The repository includes the notebooks with their cell outputs stripped; see below.

  • Training notebook (resulting weights publicly available, see the notebook)


  • Interpretability notebook


Future work

With more time and resources, the fine-tuning could be repeated on a bigger model with a bigger dataset. A follow-up could focus the experiment on abstraction and iterative models; see the write-up for references.

iterative_summarization's People

Contributors

xmaster6y


iterative_summarization's Issues

Notebook and write-up

User story

Work done for the Alignment Jam #4 on Mechanistic Interpretability; a first draft of ideas on the subject.

Acceptance Criteria

  • The notebook and the code are documented
  • The write-up corresponds to the code

Tasks

  • Fine-tuning of GPT-2 for summarization
  • Rendering using Neel Nanda's toolbox
  • Interpreting and explaining the resulting graphs
  • Add the write-up
