Technology stack:
Python
Keras - TensorFlow
Matplotlib
Numpy
Seaborn
sklearn
I have used the Diabetes dataset hosted by Kaggle.
(https://www.kaggle.com/uciml/pima-indians-diabetes-database)
The datasets consist of several medical predictor (independent) variables and one target (dependent) variable, Outcome. Independent variables include the number of pregnancies the patient has had, their BMI, insulin level, age, and so on.
Visualization.ipynb: This notebook consists of a study of the dataset using matplotlib and seabron libraries.
utils.ipynb: This notebook consists of the study of data before preprocessing and the preprocessing and feature engineering itself.
main.ipynb: This notebook consists of the multilayer perceptron model with specifications as follows:
Sequential model with 3 layers: 1st layer - 32 hidden neurons with relu activation, 2nd layer - 16 hidden neurons with relu activation, output layer - 1 neuron with sigmoid activation. Adam optimizer with binary_crossentropy loss.
1. Accuracy:
![Accuracy](/images/Accuracy.PNG)