# Interpreting Context Look-ups in Transformers: Investigating Attention-MLP Interactions
This repository contains the code and data for the paper "Interpreting Context Look-ups in Transformers: Investigating Attention-MLP Interactions". The paper explores the interplay between attention heads and specialized "next-token" neurons in the Multilayer Perceptron (MLP) layers of transformers, focusing on how these components interact to predict specific tokens.
- Run the notebooks. The main notebooks in `notebooks/` are numbered 1-7, while notebooks 8 and 9 contain additional experiments (head ablation and neuron ablation). The main notebooks save their data to `experiment_data/`; a batch-execution sketch follows below.
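
For convenience, the notebooks can also be executed in sequence from the command line. The following is a minimal, hypothetical runner script (not part of the repository); it assumes a standard Jupyter installation with `nbconvert` available, and that each notebook's filename begins with its number so lexicographic sorting preserves the intended order.

```python
"""Execute every notebook in notebooks/ in filename order."""
import subprocess
from pathlib import Path

NOTEBOOK_DIR = Path("notebooks")
OUTPUT_DIR = Path("experiment_data")  # where the main notebooks write their results


def run_notebooks() -> None:
    # Sorted glob assumes notebooks are prefixed with their number (1-9),
    # so lexicographic order matches the intended execution order.
    for nb in sorted(NOTEBOOK_DIR.glob("*.ipynb")):
        print(f"Executing {nb} ...")
        subprocess.run(
            [
                "jupyter", "nbconvert",
                "--to", "notebook",
                "--execute",
                "--inplace",  # overwrite each notebook with its executed version
                str(nb),
            ],
            check=True,  # stop immediately if a notebook fails
        )
    print(f"Done. Outputs from the main notebooks should be under {OUTPUT_DIR}/")


if __name__ == "__main__":
    run_notebooks()
```

Run the script from the repository root so that the relative paths `notebooks/` and `experiment_data/` resolve correctly.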
To cite this work, please use the following BibTeX entry:
```bibtex
@misc{neo2024interpreting,
      title={Interpreting Context Look-ups in Transformers: Investigating Attention-MLP Interactions},
      author={Clement Neo and Shay B. Cohen and Fazl Barez},
      year={2024},
      eprint={2402.15055},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}
```