Code Monkey home page Code Monkey logo

multimodal-emotion-recognition's Introduction

Multimodal Emotion Recognition

This repository contains a project realized as part of the Natural Language Processing exam of the Master's degree in Artificial Intelligence, University of Bologna.

Project structure

The project is structured as follows:

├───checkpoints # Model checkpoints
│   ├───audio_mel # Feature extractor audio_mel model
│   ├───audio_wav2vec2 # Feature extractor audio_wav2vec2 model
│   ├───text # Feature extractor text model (roberta-base and roberta-large)
│   └───m2fnet # M2FNet model
├───data
│   └───MELD.Raw
│       ├───dev_splits_complete # Validation set
│       │   ├───mel_spectrograms
│       │   └───wav
│       ├───output_repeated_splits_test # Test set
│       │   ├───mel_spectrograms
│       │   └───wav
│       └───train_splits # Training set
│           ├───mel_spectrograms
│           └───wav
├───embeddings # Embeddings coming from the feature extractor models
│   ├───audio_mel # audio: original
│   ├───audio_wav2vec2 # audio: Wav2Vec2.0
│   ├───text_base # text: roberta-base
│   └───text_large # text: roberta-large
├───paper # paper references
├───scripts # bash scripts for dataset download, audio extraction and venv creation
└───src
    └───feature_extractors # Feature extractors models and training scripts
        ├───audio_mel
        │   └───losses # Adaptive triplet based loss function
        ├───audio_wav2vec2
        └───text
	# M2FNet model and training scripts
    config.yaml
    dataset.py
    model.py
    test.py
    train.py
    utils.py

In every subfolder of src there is a config.yaml file that contains the configurations of the corresponding model.

Prerequisites

This project was developed in Python3 and pytorch. Run the following command to install the prerequisites:

# Linux
pip install --no-cache -r ./requirements_linux.txt

# Windows
pip install --no-cache -r ./requirements.txt

Otherwise, you can build a ready-to-go virtual environment by running the following scripts from the project's folder:

# Linux
> ./scripts/build-venv.sh

# Windows
> .\scripts\build-venv.bat

Download FFMPEG

Download ffmpeg from here. Follow online tutorials to install it correctly based on your OS.

Download dataset

Now, you need to download and prepare the dataset. Run the following commands from the project's folder:

# Linux
> ./scripts/MELD_download.sh # Download dataset
> ./scripts/video2wav.sh # Extract audio

# Windows
> .\scripts\MELD_download.bat # Download dataset
> .\scripts\video2wav.bat # Extract audio

Group members

Reg No. Name Surname Email Username
1005278 Ludovico Granata [email protected] LudovicoGranata
973719 Parsa Dahesh [email protected] ParsaD23
984854 Simone Persiani [email protected] iosonopersia

multimodal-emotion-recognition's People

Stargazers

 avatar  avatar

Watchers

 avatar  avatar  avatar

Forkers

ludovicogranata

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.