Melanoma Detection Using CNN

Melanoma is a type of cancer that can be deadly if not detected early. It accounts for 75% of skin cancer deaths. A solution which can evaluate images and alert the dermatologists about the presence of melanoma has the potential to reduce a lot of manual effort needed in diagnosis.The purpose is to build a CNN based model which can accurately detect melanoma.

Melanoma Detection Using CNN

General Information

Melanoma is a type of cancer that can be deadly if not detected early. It accounts for 75% of skin cancer deaths. A solution which can evaluate images and alert the dermatologists about the presence of melanoma has the potential to reduce a lot of manual effort needed in diagnosis.The purpose is to build a CNN based model which can accurately detect melanoma. The model being built is a multiclass classification model using a custom convolutional neural network in TensorFlow.

The dataset consists of 2357 images of malignant and benign oncological diseases, which were formed from the International Skin Imaging Collaboration (ISIC). All images were sorted according to the classification taken with ISIC, and all subsets were divided into the same number of images, with the exception of melanomas and moles, whose images are slightly dominant. The data set contains the following diseases:

Actinic keratosis
Basal cell carcinoma
Dermatofibroma
Melanoma
Nevus
Pigmented benign keratosis
Seborrheic keratosis
Squamous cell carcinoma
Vascular lesion

Business Goals

The purpose is to build a CNN based model which can accurately detect melanoma. The model being built is a multiclass classification model using a custom convolutional neural network in TensorFlow.

Model Building

Data Reading/Data Understanding → Defining the path for train and test images
Dataset Creation→ Create train & validation dataset from the train directory with a batch size of 32. Also, make sure you resize your images to 180*180.
Dataset visualisation → Create a code to visualize one instance of all the nine classes present in the dataset
Model Building & training :
- Create a CNN model, which can accurately detect 9 classes present in the dataset.
- While building the model, rescale images to normalize pixel values between (0,1).
- Choose an appropriate optimiser and loss function for model training
- Train the model for ~20 epochs
- Check if there is any evidence of model overfit or underfit.
Chose an appropriate data augmentation strategy to resolve underfitting/overfitting
Model Building & training on the augmented data :
- Choose data augmentation technique to address issues of underfit\overfit in previous model.
- Train the model for ~20 epochs
- Check if the earlier issue is resolved or not.
Class distribution: Examine the current class distribution in the training dataset
- Which class has the least number of samples?
- Which classes dominate the data in terms of the proportionate number of samples?
Handling class imbalances: Rectify class imbalances present in the training dataset with Augmentor library.
Model Building & training on the rectified class imbalance data :
- Check for Class Imbalance and apply Class Rebalancing technique to address Class imbalance
- Train the model for ~30 epochs
- Check if the earlier issue is resolved or not and impact on model performance.

Dataset

The dataset is available in the google drive. The dataset consists of 2357 images of malignant and benign oncological diseases, which were formed from the International Skin Imaging Collaboration (ISIC). All images were sorted according to the classification taken with ISIC, and all subsets were divided into the same number of images, with the exception of melanomas and moles, whose images are slightly dominant. The data set contains the following diseases:

Actinic keratosis
Basal cell carcinoma
Dermatofibroma
Melanoma
Nevus
Pigmented benign keratosis
Seborrheic keratosis
Squamous cell carcinoma
Vascular lesion

Conclusions

The class rebalance in the final model helped in reducing overfititng of the data and thus the loss is reduced. It also enhanced the overall accuracy of the model.
Initially we tried building model without the ImageDataGenerator which created data to highly overfit.
Then we introduced dropout to address overfitting and ImageDataGenerator for data augmentation which reduced the over fit, but significantly reduced the overall accuracy as well.
At last we tried Batch Normalization and Augumentation which really helped in carry forward

Technologies Used

pandas
numpy
matplotlib
tensorflow
keras
augmentor

Acknowledgements

Pratik Patil

Contact

https://github.com/patil-pratik-87

patil-pratik-87 / melanoma-detection Goto Github PK

melanoma-detection's Introduction

Melanoma Detection Using CNN

Table of Contents

General Information

Business Goals

Model Building

Dataset

Conclusions

Technologies Used

Acknowledgements

Contact

melanoma-detection's People

Contributors

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent