Code Monkey home page Code Monkey logo

ai-vengers's Introduction

Contributors Stargazers Issues

AI-Vengers

This GitHub repository holds training and validation code for Deep Learning and Machine Learning models to detect racial demographics of patients through the use of medical images.

The data and models folders will be empty as -

  1. The data and model file occupy a large amount of data and cannot be pushed onto GitHub repositories.
  2. Some of the data used to conduct experiments is proprietary. URLs were attached for the open-source datasets.
  3. The trained ML/DL models can be used to re-create the patient information unto certain levels which could leak the proprietary data.

The training data folder has training code for all the experiments. The experiments, corresponding data and model were as following -

Training Folder Name Training File Name Data Model
CXR_training CheXpert_resnet34_race_detection_2021_06_29.ipynb CheXpert ResNet34
CXR_training Emory_CXR_resnet34_race_detection_2021_06_29.ipynb Emory CXR ResNet34
CXR_training MIMIC_resnet34_race_detection_2021_06_29.ipynb MIMIC Resnet34
EM-CS_training Emory_C-spine_race_detection_2021_06_29.ipynb Emory Cervical Spine Resnet34
EM_Mammo_Training training code.ipynb Mammogram EfficientNetB2
Densenet121_CXR_Training Lung_segmentation_MIMIC.ipynb MIMIC U-Net
Densenet121_CXR_Training Race classification with No Finding label only_MIMIC_Densenet121.ipynb MIMIC DenseNet121
Densenet121_CXR_Training Race classification_MIMIC_Densenet121.ipynb MIMIC DenseNet121
Densenet121_CXR_Training Race_classification_Emory_Densenet121.ipynb Emory CXR DenseNet121
digital_hand_atlas dha_2_classes.ipynb Digital Hand Atlas ResNet50

The final ipython-notebook โ€” bias_pred.ipynb has validation code for all the above training models (except frequency training).

To run the validation code -

  1. Fork/Download the GitHub repository.
  2. Fetch the data from the data URLs for open-source datasets and drop them in the data folder.
  3. Run the corresponding training code and save the trained model in the models folder.
  4. Change the model path in the validation code and the corresponding function.

https://emory-hiti.github.io/AI-Vengers/

ai-vengers's People

Contributors

anbhimi avatar blackboxradiology avatar chima20097 avatar hzhang0 avatar judywawira avatar medicresource avatar ryan-re-wang avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

ai-vengers's Issues

The splitting for chexpert dataset

Hey there, I didn't find the description of how you split the chexpert data in your paper, would you mind briefly introduce how you did it? You split it randomly or follow some specific rules (keep the perveance rate etc)? Many thanks in advance

Pathology Detection Training

Hello there,
You mentioned that you trained the models on 14 pathology labels including (no finding and support devices).
The thing is that the cheXpert dataset contains up to four values for each label (nan, 0, 1, -1).
How you normalized those values before training? Which values are assumed as positive and which are assumed as negative?
If you considered some values to represent uncertain how you dealt with those uncertain values?

Pretrained Models

Can you host pretrained models on a certain hosting service and share links to it?
I mean to resolve the problem of not being able to share on github.

race labels for Chexpert dataset?

Where can I find the race labels for chexpert dataset?

I got approval from Stanford University to download the dataset, but I am unable to locate race information.

Could you please help me find the race info for chexpert dataset

Image processing code?

Hi - Could you please release the code used for preprocessing the input images/Xrays?
(Mainly to make comparisons against other datasets as equivalent as possible, and also to save on time?)

e.g. in the example CXR training scripts:
"resnet34, **preprocess_input** = Classifiers.get('resnet34')

(Xrays are greyscale, not RGB, and processing them to match the expected inputs of pretrained models can have gotchas in it. I'd rather use exactly what you did)

Pathology detection training

Hi,

Super interesting research! I'm looking to replicate the results as part of my own group's research (for ProjectX). I've looked through the Densenet121 training notebooks, but noticed that they're exclusively for race detection (experiment A1). My group is interested in exploring the implicitly learned weights during pathology detection training (experiment A3).

Am I correct in thinking that my group will have to train our own pathology detection algorithm as in experiment A3? Are there any pointers you could give us?

Thanks,
David

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.