Code Monkey home page Code Monkey logo

multimodal-llm's Introduction

README for the Multimodal Output Analysis Tool (MOAT)

Overview

The Multimodal Output Analysis Tool (MOAT) is a Streamlit-based application designed to assist researchers and practitioners in analyzing and understanding the outputs from multimodal Large Language Models (MLLMs) within the context of the ScienceQA (or other Huggingface) dataset. It provides functionalities to load data, display questions with their associated answer choices and images, and perform answer analysis with extracted reasoning.

Features

  • Load Data: Automatically loads the ScienceQA dataset (any other can be specified in the script) and any .csv data files from a the Input_files folder into dataframes for analysis.
  • Display Questions: Shows the question text, answer choices, and associated images from the dataset. It creates as many columns as there are usable .csv files in the folder, so if you want 25 columns ร  10px width, feel free.
  • Answer Analysis: Provides a detailed analysis of the answers including the extracted answer, extracted reasoning, and comparison with the correct answer.
  • Statistics Calculation: Calculates and displays statistics like the length of the dataframe, the number of correct matches, and the number of extracted reasonings.
  • Data Visualization: Plots the number of correct matches and the relative accuracy for each dataframe.
  • Interactivity: Users can generate random numbers to analyze random entries or input specific numbers for targeted analysis.

Installation

To run MOAT, you need to have Python installed on your machine along with the necessary libraries. Here are the steps to set up the application:

  1. Clone the GitHub repository:
git clone <repository-url>
  1. Navigate to the cloned repository directory.

  2. Install the required packages using pip:

pip install -r requirements.txt
  1. Insert your output .csv files into the Input_files folder

  2. Run the Streamlit application:

streamlit run multimodal_annotator.py

Customization

The tool is designed to be flexible and can be customized to suit different datasets and requirements. You can modify the source code to include different datasets, add new analysis metrics, or adjust the visualizations.

License

Moat is distribute under the MIT license.

Support

For support, please open an issue on the GitHub repository, and a maintainer (me) will (maybe) assist you.

multimodal-llm's People

Contributors

louispk avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.