Howdy, I'm Kyle 👋
🌱 currently growing TurnKey Trips and bringing healthcare accessibility to rural America.
🤓 teaching data science & cognitive science at UCSD
💬 love connecting with people
🎾 likely on a tennis court
Repository to perform intracranial hemorrhage detection using data from RSNA's Medical Imaging competition.
License: MIT
Goal is to get a baseline model trained and submitted this week.
Rotation: could be bound to 90-degree increments, or perhaps 10-degree steps...
Flip: yes, this is a good option to include.
Scaling: could random zooms work? Possibly, but we need to be careful: an aggressive zoom could crop out the actual hemorrhage, losing class 1 data and causing it to be misclassified.
Translation: maybe +/- 50px? How should the exposed area be filled (constant, edge, reflect, wrap)?
Denoising/Noise: add Gaussian noise?
Training data augmentation. Real-time data augmentation (Fig. 3, M4) was performed by applying geometric transformations (rotation, scaling and translation) to make models learn invariant features to geometric perturbations. In addition, to improve invariance of the model to noise, either standard or denoised images was randomly selected to be used. We generated denoised images for standard cases by applying a median filter with a window size of 3 and used the scanner-generated denoised images if they already existed in the datasets. For the cases only with scanner-generated denoised images, only the denoised images were used as we were concerned about a bias that might be produced by reversing the denoising processes that are unknown to us. Rotation angles ranging from −10° to 10° with an interval of 1°, scaling ratios of heights and widths ranging from 90% to 100% with an interval of 1%, translation parameters ranging from −12 to 12 pixels in x and y directions with an interval of 1 pixel, and a median filter with a window size of 3 were used for augmentation. All these parameters were randomly selected in the predefined ranges. Lee et al. NATURE BIOMEDICAL ENGINEERING | VOL 3 | MARCH 2019 | 173–182
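A minimal sketch of the augmentation scheme quoted above, using the Lee et al. parameter ranges (rotation ±10°, scale 90–100%, translation ±12 px, median filter window 3). This is an assumption about how we might wire it up, not this repo's actual pipeline; it leans on `scipy.ndimage` for the resampling.

```python
import numpy as np
from scipy import ndimage

rng = np.random.default_rng(0)

def augment(img):
    """Randomly augment one 2D slice per the Lee et al. ranges:
    rotation -10..10 deg, scale 90..100%, translation +/-12 px,
    and a 50/50 chance of a 3x3 median-filtered ('denoised') copy."""
    angle = rng.integers(-10, 11)            # whole degrees
    scale = rng.integers(90, 101) / 100.0    # height/width ratio
    dx, dy = rng.integers(-12, 13, size=2)   # pixel shifts

    out = ndimage.rotate(img, angle, reshape=False, mode='nearest')
    # shrink by resampling, then pad back to the original shape
    out = ndimage.zoom(out, scale, mode='nearest')
    pad_h = img.shape[0] - out.shape[0]
    pad_w = img.shape[1] - out.shape[1]
    out = np.pad(out, ((0, pad_h), (0, pad_w)), mode='edge')
    out = ndimage.shift(out, (dy, dx), mode='nearest')

    if rng.random() < 0.5:                   # randomly use a denoised variant
        out = ndimage.median_filter(out, size=3)
    return out
```

All parameters are drawn uniformly from the predefined ranges, matching the paper's description; the boundary mode (`nearest` vs. reflect/wrap) ties back to the open translation question above.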
Right now predict.py processes one image at a time; it would be nice to run these as a batch. The current average run time is about 1:45.
Need to make use of the data loader to do batch processing of predictions and write them out to a CSV file. Most of the infrastructure for this already exists; we might need to tweak the data loader slightly. Might be a good task for @utcsox to work on.
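One possible shape for this, sketched with stand-in hooks: `model_fn` and `load_fn` are hypothetical placeholders for whatever predict.py and the data loader actually expose, and the column names are assumptions rather than the competition's required format.

```python
import csv
import numpy as np

def predict_in_batches(model_fn, ids, load_fn, out_csv, batch_size=32):
    """Run predictions over image IDs in batches, streaming rows to a CSV.
    `model_fn` maps an (N, H, W) batch to an (N,) score array; `load_fn`
    maps a single ID to an (H, W) array. Both are stand-ins for the real
    model / data-loader hooks in this repo."""
    with open(out_csv, 'w', newline='') as f:
        writer = csv.writer(f)
        writer.writerow(['ID', 'Label'])
        for start in range(0, len(ids), batch_size):
            chunk = ids[start:start + batch_size]
            batch = np.stack([load_fn(i) for i in chunk])
            for img_id, score in zip(chunk, model_fn(batch)):
                writer.writerow([img_id, float(score)])
```

Batching the forward pass is where the 1:45-per-image cost should collapse, since the model call amortizes over the whole batch.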
The script should define a tf model along with constants that can be imported into the train.py script.
Which model-{custom-name}.py you use will be passed as an arg to the train.py script.
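A sketch of how train.py could resolve that argument into a module. The `--model` flag name and the expectation that each model script exposes its own constants (e.g. a `BATCH_SIZE`) are assumptions for illustration; `importlib` handles the hyphenated `model-{custom-name}` filenames that a plain `import` statement cannot.

```python
import argparse
import importlib

def load_model_module(name):
    """Import a model-{custom-name}.py script by name; hyphenated module
    names work via importlib even though `import model-x` would not."""
    return importlib.import_module(name.replace('.py', ''))

# Hypothetical CLI wiring for train.py, e.g.:
#   python train.py --model model-vgg16
parser = argparse.ArgumentParser()
parser.add_argument('--model', required=True,
                    help='model-{custom-name} script defining the tf model')
```

train.py would then call something like `load_model_module(args.model)` and read the model and its constants off the returned module object.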
Create an intracranial-hemorrhage-detection/eda/ directory at the top level and put *.ipynb EDA scripts in this folder. Accept Tony/Chris onto the team; general housekeeping.
Need to add the ability to recenter, window, and level a DICOM. These should be flagged, with params set for the size of the window/level.
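The window/level step could look like the sketch below, assuming the slice has already been converted to Hounsfield units (raw DICOM pixels would first need `RescaleSlope`/`RescaleIntercept` applied). The function name and the 0..1 output range are choices for illustration, not existing repo API.

```python
import numpy as np

def window_image(hu, level, width):
    """Apply a window/level to a slice already in Hounsfield units:
    clip to [level - width/2, level + width/2], then rescale to 0..1."""
    lo, hi = level - width / 2, level + width / 2
    return (np.clip(hu, lo, hi) - lo) / (hi - lo)
```

With `level` and `width` exposed as the flagged params, the brain/subdural/stroke presets noted below drop straight in, e.g. `window_image(hu, level=40, width=80)` for a brain window.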
Thoughts on maximizing 3D representation via interpolation w/o training a 3D model:
Slice interpolation was introduced to mimic how radiologists integrate information from all adjacent images of a contiguous three-dimensional (3D) volume concurrently, rather than examine each single axial 2D slice in isolation. Interpolated images from adjacent slices were provided to the model with a modified loss function during training to imitate the 3D integration of image interpretation by radiologists. Lee et al. NATURE BIOMEDICAL ENGINEERING | VOL 3 | MARCH 2019 | 173–182
Though note this salient point from the same paper about strict 3D voxel-based approaches:
Another approach to address inter-slice dependency is to build a 3D network that directly inputs the voxel data from the entire imaging volume into a 3D format rather than as pixel-data from discrete axial slices in a 2D format. To compare the 3D versus 2D approaches, we trained a 3D model using previously described methodology [21] by using case-level labels aggregated from slice-level labels, as well as volume data with a standardized dimensionality (24 × 512 × 512 voxels) generated using 2D slices. The resulting 3D model, however, achieved a mAP of only 0.328 for the multi-label classification of our five ICH subtypes, which is substantially inferior to the mAP we obtained with our existing 2D model (mAP of 0.686). This finding is consistent with the 'curse of dimensionality' reported in a previous study [24], which noted that the amount of data required to train a deep-learning model scales exponentially with the dimensionality of the data.
Starting with vgg16 and resnet50 was a good idea, and it seems that others in the field have taken a similar course of action.
Our proposed system for the detection and classification of ICH uses multiple ImageNet-pretrained deep convolutional neural networks (DCNNs), a preprocessing pipeline, an atlas creation module and a prediction-basis selection module (Fig. 1). The four DCNNs used for building our model are VGG16, ResNet-50, Inception-v3 and Inception-ResNet-v2. Lee et al. NATURE BIOMEDICAL ENGINEERING | VOL 3 | MARCH 2019 | 173–182
Might be nice for the data loader to assist with:
We need to talk more about this soon, as this will allow us to apply more complex training schemes and modeling.
Need to implement the basic data loader. Shouldn't be too bad for this first version.
Tony, I am going to work on this today (Sat 9/28) and hopefully have something you can look at by tomorrow. If you can verify that it will work and look over the code that would be great.
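For the basic first version, a generator-style loader could be as small as the sketch below. The function name, `load_fn` hook, and infinite-epoch behavior are assumptions about the eventual design, not the code Tony will be reviewing; augmentation/windowing would slot in where the batch is assembled.

```python
import numpy as np

def data_loader(ids, load_fn, labels, batch_size=32, shuffle=True, seed=0):
    """Minimal generator loader: yields (images, labels) batches forever,
    reshuffling each epoch. `load_fn` maps an ID to an (H, W) array."""
    ids = np.asarray(ids)
    labels = np.asarray(labels)
    rng = np.random.default_rng(seed)
    while True:
        order = rng.permutation(len(ids)) if shuffle else np.arange(len(ids))
        for start in range(0, len(ids), batch_size):
            idx = order[start:start + batch_size]
            x = np.stack([load_fn(i) for i in ids[idx]])
            yield x, labels[idx]
```

Keeping the loader ID-driven like this is what would later let the same machinery serve both training and the batched prediction work above.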
brain: W:80 L:40
subdural: W:130–300 L:50–100
stroke: W:8 L:32 or W:40 L:40
soft tissue: W:350–400 L:20–60
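These presets could live in one place so the window/level flags can reference them by name. The midpoints chosen for the ranged entries below are my own placeholder picks, still an open question:

```python
# Window (W) / level (L) presets from the notes above, in Hounsfield units.
# Ranged entries (subdural, soft tissue) use a representative midpoint here.
WINDOW_PRESETS = {
    'brain':       {'width': 80,  'level': 40},
    'subdural':    {'width': 215, 'level': 75},   # from W:130-300, L:50-100
    'stroke':      {'width': 40,  'level': 40},   # the W:40 L:40 variant
    'soft_tissue': {'width': 375, 'level': 40},   # from W:350-400, L:20-60
}
```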
Once a model is saved, we can write a row to a CSV file with the model's name and specifications: what the model is and how it was trained (e.g. batch size), perhaps even the training time, just to keep a record.
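A minimal append-to-registry sketch; the file path, column set, and keyword-driven specs are all suggestions rather than a settled schema.

```python
import csv
import os
from datetime import datetime

def log_model_run(path, name, **specs):
    """Append one row of model metadata (name, timestamp, arbitrary
    training specs such as batch_size/epochs) to a CSV registry."""
    fields = ['name', 'timestamp'] + sorted(specs)
    new_file = not os.path.exists(path)
    with open(path, 'a', newline='') as f:
        writer = csv.DictWriter(f, fieldnames=fields)
        if new_file:
            writer.writeheader()
        writer.writerow({'name': name,
                         'timestamp': datetime.now().isoformat(timespec='seconds'),
                         **specs})
```

One caveat with free-form `**specs`: every run appended to the same file should use the same spec keys, or the columns will drift.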
Following some of the work Jeremy Howard did in his Kaggle kernel. Some random HU DICOM histograms:
Need to add true upsampling and be mindful of class distributions. Generate new train/validation CSVs for the data loader.
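A naive version of that upsampling, sketched with pandas. The label column name (`any`) and balance-to-parity target are assumptions; splitting into train/validation before upsampling (so resampled duplicates never leak into validation) is the one invariant worth keeping.

```python
import pandas as pd

def upsample(df, label_col='any', seed=0):
    """Upsample the positive class by resampling it with replacement
    until classes are balanced, then shuffle. Apply to the train split
    only, after the train/validation split, to avoid leakage."""
    pos = df[df[label_col] == 1]
    neg = df[df[label_col] == 0]
    extra = len(neg) - len(pos)
    if extra <= 0:
        return df
    boost = pos.sample(n=extra, replace=True, random_state=seed)
    return (pd.concat([df, boost])
              .sample(frac=1, random_state=seed)
              .reset_index(drop=True))
```

The balanced frame can then be written out with `to_csv` as the new train CSV for the data loader.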