Code Monkey home page Code Monkey logo

mt-bert's Introduction

MT-BERT

Table of Contents

About The Project

In MT-BERT we reproduce a neural language understanding model based on the paper by Liu et al.(2019). Such model implements a Multi-Task Deep Neural Network (MT-DNN) for learning representations across multiple NLU tasks. MT-DNN extends the model proposed in paper by Liu et al.(2015) by incorporating a pre-trained bidirectional transformer language model, known as BERT.

More details about the project are available in the presentation

Original implementation available at repo

Built With

Getting Started

To get a local copy up and running follow these simple steps.

Prerequisites

The project provide a Pipfile file that can be managed with pipenv. pipenv installation is strongly encouraged in order to avoid dependency/reproducibility problems.

  • pipenv
pip install pipenv

Installation

  1. Clone the repo
git clone https://gitlab.com/reddeadrecovery/mt-bert
  1. Install Python dependencies
pipenv install

Usage

Here's a brief description of each and every file in the repo:

  • model.py: Model definition
  • task.py: Task dataset preprocessing and definition
  • train_glue.py: Training file for Multi task training on GLUE
  • fine_tune_task.py: Fine tuning, domain adaptation and single task training file
  • utils.py: utils file

There is also a executable jupyter notebook:train.ipnyb

Authors

Acknowledgments

Machine Learning © Course held by Professor Paolo Frasconi - Computer Engineering Master Degree @University of Florence

mt-bert's People

Contributors

abaldrati avatar giovanniberti avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.