emory-hiti / ai-vengers

Jupyter Notebook 98.07% Python 1.92% Shell 0.01%

ai-vengers's Introduction

AI-Vengers

This GitHub repository holds training and validation code for deep learning and machine learning models that detect patients' racial demographics from medical images.

The data and models folders are empty for the following reasons:

The data and model files are too large to push to a GitHub repository.
Some of the data used in the experiments is proprietary. URLs are attached for the open-source datasets.
The trained ML/DL models could be used to partially reconstruct patient information, which could leak the proprietary data.

The training data folder contains the training code for all experiments. The experiments, with their corresponding data and models, are as follows:

Training Folder Name	Training File Name	Data	Model
CXR_training	CheXpert_resnet34_race_detection_2021_06_29.ipynb	CheXpert	ResNet34
CXR_training	Emory_CXR_resnet34_race_detection_2021_06_29.ipynb	Emory CXR	ResNet34
CXR_training	MIMIC_resnet34_race_detection_2021_06_29.ipynb	MIMIC	ResNet34
EM-CS_training	Emory_C-spine_race_detection_2021_06_29.ipynb	Emory Cervical Spine	ResNet34
EM_Mammo_Training	training code.ipynb	Mammogram	EfficientNetB2
Densenet121_CXR_Training	Lung_segmentation_MIMIC.ipynb	MIMIC	U-Net
Densenet121_CXR_Training	Race classification with No Finding label only_MIMIC_Densenet121.ipynb	MIMIC	DenseNet121
Densenet121_CXR_Training	Race classification_MIMIC_Densenet121.ipynb	MIMIC	DenseNet121
Densenet121_CXR_Training	Race_classification_Emory_Densenet121.ipynb	Emory CXR	DenseNet121
digital_hand_atlas	dha_2_classes.ipynb	Digital Hand Atlas	ResNet50

The final IPython notebook, bias_pred.ipynb, contains validation code for all of the models trained above (except frequency training).

To run the validation code:

Fork/Download the GitHub repository.
Fetch the data from the data URLs for open-source datasets and drop them in the data folder.
Run the corresponding training code and save the trained model in the models folder.
Update the model path in the validation code and in the corresponding function.
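
The steps above can be sketched as a small pre-flight check that runs before the validation notebook. This is a minimal sketch, not code from the repository: the `data` and `models` folder names come from the text above, while the helper names and the model file name `resnet34_race.h5` are hypothetical.

```python
from pathlib import Path


def resolve_model_path(repo_root, model_filename):
    """Return where a trained model is expected to be saved (the models folder)."""
    return Path(repo_root) / "models" / model_filename


def find_missing(repo_root, model_filename):
    """List the expected inputs that are not present yet."""
    root = Path(repo_root)
    missing = []
    # Step 2: the open-source datasets are dropped into the data folder.
    if not (root / "data").is_dir():
        missing.append("data")
    # Step 3: the trained model is saved into the models folder.
    if not resolve_model_path(root, model_filename).is_file():
        missing.append(f"models/{model_filename}")
    return missing


if __name__ == "__main__":
    # Hypothetical file name; use whatever name the training notebook saved.
    print(find_missing(".", "resnet34_race.h5"))
```

An empty list means both the data folder and the saved model are in place, and the path returned by `resolve_model_path` is the value to set in the validation code.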

https://emory-hiti.github.io/AI-Vengers/

ai-vengers's Issues

The splitting for chexpert dataset

Hey there, I couldn't find a description of how you split the CheXpert data in your paper. Would you mind briefly explaining how you did it? Did you split it randomly or follow specific rules (keeping the prevalence rate, etc.)? Many thanks in advance.
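
For readers with the same question, one common way to split such data is at the patient level, so that no patient appears in more than one subset. The sketch below illustrates that general technique only; it is not confirmed to be the authors' procedure. The function name and the record layout are hypothetical, and the 60/10/30 default is only a guess prompted by a file name quoted in another issue on this page.

```python
import random
from collections import defaultdict


def split_by_patient(records, fractions=(0.6, 0.1, 0.3), seed=0):
    """Split (patient_id, image_path) pairs into train/val/test by patient.

    All images of a patient land in the same subset, which avoids leakage
    between subsets. The fractions apply to patients, not to images.
    """
    by_patient = defaultdict(list)
    for patient_id, image_path in records:
        by_patient[patient_id].append(image_path)

    # Sort first so the shuffle is reproducible for a given seed.
    patient_ids = sorted(by_patient)
    random.Random(seed).shuffle(patient_ids)

    n_train = round(fractions[0] * len(patient_ids))
    n_val = round(fractions[1] * len(patient_ids))
    groups = (
        patient_ids[:n_train],
        patient_ids[n_train:n_train + n_val],
        patient_ids[n_train + n_val:],
    )
    return [
        [(pid, path) for pid in group for path in by_patient[pid]]
        for group in groups
    ]


if __name__ == "__main__":
    demo = [(f"p{i}", f"p{i}/img{j}.jpg") for i in range(10) for j in range(2)]
    train, val, test = split_by_patient(demo)
    print(len(train), len(val), len(test))
```

Preserving the prevalence of each label would additionally require stratifying the patients before shuffling, which this sketch does not do.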

Open Source Data URLs are missing

URLs for the open-source datasets used in the experiments are missing from the data folder.

Pathology Detection Training

Hello there,
You mentioned that you trained the models on 14 pathology labels (including No Finding and Support Devices).
The CheXpert dataset contains up to four values for each label (nan, 0, 1, -1).
How did you normalize those values before training? Which values are treated as positive and which as negative?
If you treated some values as uncertain, how did you handle them?
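
To make the question concrete, here is a minimal sketch of the usual ways the four raw values are mapped to training targets. It is not the authors' answer. It assumes the usual CheXpert reading of the values (blank/nan = not mentioned, 0 = negative, 1 = positive, -1 = uncertain); treating a blank as negative is itself an assumption, and the function and policy names are hypothetical.

```python
import math


def normalize_label(value, uncertain_policy="zeros"):
    """Map a raw label (nan, 0, 1, -1) to 0.0, 1.0, or None (= skip this label).

    uncertain_policy controls only the -1 case:
      "zeros"  -> uncertain counts as negative
      "ones"   -> uncertain counts as positive
      "ignore" -> uncertain is excluded from the loss (returned as None)
    """
    # Assumption: a blank / nan entry (label not mentioned) counts as negative.
    if value is None or (isinstance(value, float) and math.isnan(value)):
        return 0.0
    if value == -1:
        if uncertain_policy == "zeros":
            return 0.0
        if uncertain_policy == "ones":
            return 1.0
        if uncertain_policy == "ignore":
            return None
        raise ValueError(f"unknown policy: {uncertain_policy!r}")
    if value in (0, 1):
        return float(value)
    raise ValueError(f"unexpected label value: {value!r}")


if __name__ == "__main__":
    raw = [float("nan"), 0, 1, -1]
    for policy in ("zeros", "ones", "ignore"):
        print(policy, [normalize_label(v, policy) for v in raw])
```

Which policy was actually used in the experiments is exactly what the issue asks, so the choice here should be treated as open.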

Pretrained Models

Could you host the pretrained models on a hosting service and share the links?
That would work around the problem of not being able to share them on GitHub.

race labels for Chexpert dataset?

Where can I find the race labels for the CheXpert dataset?

I got approval from Stanford University to download the dataset, but I am unable to locate the race information.

Could you please help me find the race information for the CheXpert dataset?

Explainable image classification

Great work guys.

Maybe you can find something by using methods such as SHAP or LIME for Image classification.
Would be really interesting.

Here is a link: https://towardsdatascience.com/knowing-what-and-why-explaining-image-classifier-predictions-680a15043bad

Best
Can

Image processing code?

Hi - Could you please release the code used for preprocessing the input images/Xrays?
(Mainly to make comparisons against other datasets as equivalent as possible, and also to save on time?)

e.g. in the example CXR training scripts:
resnet34, preprocess_input = Classifiers.get('resnet34')

(Xrays are greyscale, not RGB, and processing them to match the expected inputs of pretrained models can have gotchas in it. I'd rather use exactly what you did)
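
One of the gotchas mentioned above is feeding a single-channel image to a network that expects three channels. The sketch below shows one generic way to do that with NumPy: min-max scale the image to [0, 255] and repeat it across three channels, before any model-specific `preprocess_input` is applied. It is an illustration under those assumptions, not the preprocessing used in the repository, and the function name is hypothetical.

```python
import numpy as np


def to_three_channel(image):
    """Scale a 2-D grayscale array to [0, 255] and repeat it to shape (H, W, 3)."""
    image = np.asarray(image, dtype=np.float32)
    if image.ndim != 2:
        raise ValueError(f"expected a 2-D grayscale image, got shape {image.shape}")

    lo, hi = image.min(), image.max()
    if hi > lo:
        scaled = (image - lo) / (hi - lo) * 255.0
    else:
        # Constant image: avoid dividing by zero.
        scaled = np.zeros_like(image)

    # Copy the single channel three times along a new last axis.
    return np.repeat(scaled[..., np.newaxis], 3, axis=-1)


if __name__ == "__main__":
    xray = np.array([[0, 1024], [2048, 4095]])  # toy 12-bit pixel values
    print(to_three_channel(xray).shape)
```

Differences in scaling range, bit depth, or resizing between pipelines can change results, which is why the issue asks for the exact original code.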

Pretrained model

Hi @blackboxradiology, could you please share the pre-trained models for the datasets, e.g. CheXpert, MIMIC-CXR, etc.?

Thank you

Pathology detection training

Hi,

Super interesting research! I'm looking to replicate the results as part of my own group's research (for ProjectX). I've looked through the Densenet121 training notebooks, but noticed that they're exclusively for race detection (experiment A1). My group is interested in exploring the implicitly learned weights during pathology detection training (experiment A3).

Am I correct in thinking that my group will have to train our own pathology detection algorithm as in experiment A3? Are there any pointers you could give us?

Thanks,
David

race labels for MIMIC-CXR ?

Hi,

I was wondering how to obtain the race labels for MIMIC-CXR.

I do have access to https://physionet.org/content/mimic-cxr/2.0.0/ and https://physionet.org/content/mimic-cxr-jpg/2.0.0/ but could not locate where you got the white/asian/black labels.

For example, how do I create the modified_viewposition_race_4-race-ethnicity_60-10-30_split_with_gender_age_ver_b.csv that you use in the training code?

Thanks for any help,
Best,
Robin
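
The page does not say where the labels come from. Purely as an illustration of the kind of join such a CSV implies, the sketch below attaches a per-patient race value from a separate demographics table to image rows. It assumes both tables share a patient identifier; the function name and the column names `subject_id` and `race` are hypothetical placeholders, not confirmed field names from the authors' pipeline.

```python
from collections import defaultdict


def attach_race(image_rows, demographic_rows, id_key="subject_id", race_key="race"):
    """Add a race value to each image row, keeping only unambiguous patients.

    A patient may appear several times in the demographics table; patients
    whose records disagree (or have no value) are dropped rather than guessed.
    """
    values = defaultdict(set)
    for row in demographic_rows:
        if row.get(race_key):
            values[row[id_key]].add(row[race_key])

    unambiguous = {pid: next(iter(v)) for pid, v in values.items() if len(v) == 1}

    return [
        {**row, race_key: unambiguous[row[id_key]]}
        for row in image_rows
        if row[id_key] in unambiguous
    ]


if __name__ == "__main__":
    images = [{"subject_id": 1, "path": "a.jpg"}, {"subject_id": 2, "path": "b.jpg"}]
    demographics = [
        {"subject_id": 1, "race": "ASIAN"},
        {"subject_id": 2, "race": "WHITE"},
        {"subject_id": 2, "race": "BLACK"},  # conflicting records for patient 2
    ]
    print(attach_race(images, demographics))
```

The same pattern works with rows read through `csv.DictReader`, though identifiers would then be strings rather than integers.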
