Code Monkey home page Code Monkey logo

fgrr_medvqa's Introduction

Fine-Grained Knowledge Fusion for Retrieval-Augmented Medical Visual Question

💡Overview

Fine-Grained Re-Weighting (FGRW) is an online retrieval-augmented framework for Medical Visual Question Answering, utilizing fine-grained encoding of multi-source knowledge and computing re-calculate relevance scores between queries and knowledge. These scores serve as supervised priors, guiding the fusion of queries and knowledge and reducing interference from redundant information in answering questions. overview

📔PEIR Digital Library

The Pathology Education Informational Resource (PEIR) Digital Library, a public multidisciplinary image database for medical education, provided us with 30k image-text pairs, serving as an additional multi-modal knowledge base.

🔨Setup

Requirement

conda create -n fgrr python=3.8
conda activate fgrr
pip install -r requirements.txt

📑Data Preparation

We test our model on:

External knowledge bases beyond PEIR.

📑Pre-extracted Feature and Annotations

Coming Soon.

Dataset Source Feature Annotation
VQA-RAD . . .
SLAKE . . .
PathVQA . . .
PEIR . . .
ROCO . . .

🔨Pre-trained weights

Coming Soon.

🔨Training

Note: The first time you run this, it will take time to load knowledge features from ./Annotations/memory/memo_list.json into the temporary folder ./temp/. Please be patient.

 bash trainval_pathvqa.sh

📝Acknowledgements

We also reference the excellent repos of BioMedCLIP in addition to other specific repos to the baselines we examined (see paper).

📝Citation

If you find this paper useful, please consider staring 🌟 this repo and citing 📑 our paper:


fgrr_medvqa's People

Contributors

ecoxial2007 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.