Code Monkey home page Code Monkey logo

selective_tuning's Introduction

Selective Tuning

Requirements

  1. Python > 3.6.1
  2. pip install -r requirements.txt

Data

In the data directory is the following datasets:

  1. sst - which is the Stanford Sentiment Treebank. Which has be downloaded from this link.. Number of train, dev, and test instances; 8544, 1101, and 2210 respectively.

Tasks

  1. Fine grained sentence level sentiment analysis using the Stanford Sentiment Treebank with 5 sentiment labels.

Models

In the model_configs directory there are two models:

  1. Standard word embedding model
  2. BERT model

These model files use a batch size of 1 and to ensure that the optimiser updates every N we set the num_gradient_accumulation_steps to N.

Running these models with VSCode in debug mode

If you want to run these models in VSCode in debug model use the model_test.py file and put your break point on any line in the selective_tuning/allen/training/modified_trainer.py file.

The modified_trainer.py is the file that contains the code that stores the gradients for each sample within a batch on lines 367 to 413.

selective_tuning's People

Contributors

apmoore1 avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.