
Heart Failure Prediction using Microsoft Azure

Project Overview

Cardiovascular diseases (CVDs) are the number one cause of death globally, taking an estimated 17.9 million lives each year and accounting for 31% of all deaths worldwide. Heart failure is a common event caused by CVDs. People with cardiovascular disease, or who are at high cardiovascular risk, need early detection and management, where a machine learning model can be of great help. This project involves training a machine learning model to predict mortality by heart failure using Microsoft Azure and deploying the model as a web service. We also identify the main factors that contribute to mortality.

Project Architecture

Figure 1 : The following diagram shows the overall architecture and workflow of the project.

Project Details

Project Set Up and Installation

Firstly, we need an Azure subscription to access the Azure workspace. For this project, the Azure subscription provided by Udacity was used.

Create a Workspace

The workspace is the top-level resource that provides a centralized place to work with all the artifacts we create when we use Azure Machine Learning. The workspace keeps a history of all training runs, including logs, metrics, output, and a snapshot of our scripts.

The workspace can be created with the help of the Create and manage Azure Machine Learning workspaces documentation.

Set up Compute Instance

A compute instance is a managed cloud-based workstation which is used as a fully configured and managed development environment.

A compute instance with name notebook139012 and virtual machine size of STANDARD_DS3_V2 was created.

Figure 2 : The screenshot below shows the registered compute instances.

Set up Compute Cluster

A compute cluster is a managed compute infrastructure that allows us to easily create single- or multi-node compute. Compute clusters scale up automatically when a job is submitted and can run jobs securely in a virtual network environment.

A compute cluster new-compute with virtual machine size of STANDARD_D2_V2 and max_nodes = 4 was created.
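The cluster above can also be provisioned from code. Below is a minimal sketch using the Azure ML SDK v1 `AmlCompute` API, assuming `ws` is an already-loaded `Workspace` object:

```python
# Sketch of provisioning the compute cluster described above (SDK v1).
# Assumes `ws` is an already-loaded azureml.core.Workspace.
from azureml.core.compute import AmlCompute, ComputeTarget

compute_config = AmlCompute.provisioning_configuration(
    vm_size='STANDARD_D2_V2',   # VM size used in this project
    max_nodes=4)                # cluster scales out to at most 4 nodes

cpu_cluster = ComputeTarget.create(ws, 'new-compute', compute_config)
cpu_cluster.wait_for_completion(show_output=True)
```

This requires a live Azure subscription, so it is a provisioning configuration sketch rather than something runnable locally.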

Set up Docker

Docker is an open platform for developing, shipping, and running applications. Here Docker is used for running Swagger on localhost.

Docker can be set up by following the Get Docker guide.

Dataset

A dataset external to the Azure ML ecosystem and supported by Azure ML's AutoML API was used.

Overview

The dataset used in this project is the Heart Failure Prediction dataset from Kaggle. It consists of 299 rows and 13 columns.

Exploratory Data Analysis

The preliminary exploratory data analysis on the dataset is shown in Exploratory Data Analysis.ipynb.

Task

Heart failure is a common event caused by cardiovascular diseases which is the number 1 cause of death globally. People with cardiovascular disease or who are at high cardiovascular risk need early detection and management wherein a machine learning model can be of great help.

The features of the dataset are:

  • age - Age
  • anaemia - Decrease of red blood cells or hemoglobin (boolean)
  • creatinine_phosphokinase - Level of the CPK enzyme in the blood (mcg/L)
  • diabetes - If the patient has diabetes (boolean)
  • ejection_fraction - Percentage of blood leaving the heart at each contraction (percentage)
  • high_blood_pressure - If the patient has hypertension (boolean)
  • platelets - Platelets in the blood (kiloplatelets/mL)
  • serum_creatinine - Level of serum creatinine in the blood (mg/dL)
  • serum_sodium - Level of serum sodium in the blood (mEq/L)
  • sex - Woman or man (binary)
  • smoking - If the patient smokes or not (boolean)
  • time - Follow-up period (days)
  • DEATH_EVENT - If the patient deceased during the follow-up period (boolean)

Out of the 13 features mentioned above, the first 12 are used for training the model and the last feature, DEATH_EVENT, is the target column. The task is to classify DEATH_EVENT as 0 or 1, i.e. to predict mortality by heart failure.

Access

The dataset is first downloaded from Kaggle as a CSV file. It is then uploaded from local files and registered in the Azure workspace as a Tabular dataset.
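The registration step can also be scripted instead of done through the Studio UI. A hedged sketch with SDK v1, where `ws` is the Workspace and the local file name and target path are assumptions:

```python
# Sketch of uploading and registering the CSV as a Tabular dataset (SDK v1).
# Assumes `ws` is the Workspace; file name and target path are illustrative.
from azureml.core import Dataset

datastore = ws.get_default_datastore()
datastore.upload_files(['./heart_failure_clinical_records_dataset.csv'],
                       target_path='data/', overwrite=True)

tabular_ds = Dataset.Tabular.from_delimited_files(
    path=(datastore, 'data/heart_failure_clinical_records_dataset.csv'))
tabular_ds.register(workspace=ws, name='heart-failure-dataset')
```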

Figure 3 : The registered datasets are shown below.

This dataset can then be accessed in our jupyter notebook by

ds = Dataset.get_by_name(ws, 'heart-failure-dataset')

Automated ML

Overview

Automated machine learning is the process of automating the time-consuming, iterative tasks of machine learning model development.

AutoML is used to automate the repetitive tasks by creating a number of pipelines in parallel that try different algorithms and parameters. It iterates through ML algorithms paired with feature selections, where each iteration produces a model with a training score. The higher the score, the better the model is considered to fit the data. The process terminates when the exit criteria defined in the experiment are satisfied.

AutoML Configuration

Instantiate an AutoMLConfig object for AutoML Configuration.

The parameters used here are:

  • n_cross_validations = 3 : Since our dataset is small, we apply cross-validation with 3 folds instead of a train/validation data split.
  • primary_metric = 'accuracy' : The primary metric parameter determines the metric to be used during model training for optimization. Accuracy is chosen as the primary metric for this binary classification task.
  • experiment_timeout_minutes = 30 : This defines how long, in minutes, our experiment should continue to run. Here this timeout is set to 30 minutes.
  • max_concurrent_iterations = 4 : To help manage child runs and when they can be performed, we match the number of maximum concurrent iterations of our experiment to the number of nodes in the cluster. So, we get a dedicated cluster per experiment.
  • task = 'classification' : This specifies the experiment type as classification.
  • compute_target = cpu_cluster : Azure Machine Learning Managed Compute is a managed service that enables the ability to train machine learning models on clusters of Azure virtual machines. Here compute target is set to cpu_cluster which is already defined with 'STANDARD_D2_V2' and maximum nodes equal to 4.
  • training_data = train_data : This specifies the training data to be used in this experiment which is set to train_data which is a part of the dataset uploaded to the datastore.
  • label_column_name = 'DEATH_EVENT' : The target column here is set to DEATH_EVENT which has values 1 if the patient deceased or 0 if the patient survived.
  • featurization= 'auto' : This indicates that as part of preprocessing, data guardrails and featurization steps are performed automatically.
# Automl settings
automl_settings = {
    "n_cross_validations": 3,
    "primary_metric": 'accuracy',
    "experiment_timeout_minutes": 30,
    "max_concurrent_iterations": 4
}

# automl config here
automl_config = AutoMLConfig(task = 'classification',
                             compute_target = cpu_cluster,
                             training_data = train_data,
                             label_column_name = 'DEATH_EVENT',
                             featurization = 'auto',
                             **automl_settings
                             )

Then the experiment was submitted using these settings and configurations.
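The submission step and the retrieval of the best model can be sketched as follows; the experiment name is illustrative, and `ws` and `automl_config` are assumed to be defined as above:

```python
# Submit the AutoML experiment and wait for it to finish (SDK v1).
# Assumes `ws` and `automl_config` are defined as above.
from azureml.core import Experiment

experiment = Experiment(ws, 'automl-heart-failure')
remote_run = experiment.submit(automl_config, show_output=True)
remote_run.wait_for_completion()

# Retrieve the best run and the fitted model it produced
best_run, best_automl_model = remote_run.get_output()
print(best_automl_model)
```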

Results

Once submitted, the progress of the run can be viewed using RunDetails widget.

Figure 4 : The screenshot below shows the status of the run using RunDetails widget.

Figure 5 : The following figure shows the scatter plot of the Accuracy produced by training different models.

Figure 6 : The following screenshot shows the different algorithms that were trained.

The Best Model generated by the AutoML run was the Voting Ensemble Model.

Figure 7 : The best AutoML Run Details with its Run Id is shown below :

Figure 8 : The details of the best Model run are shown below.

Figure 9 : The best AutoML Run metrics are shown below.

Figure 10 : A screenshot of the AutoML best model parameters, found by printing the detail of the best_automl_model object returned from the get_output method of the run object.

Figure 11 : The Precision-Recall Curve and ROC Curve are shown below.

Azure Automated ML has a feature called Explainability, which shows information about the data after the model is trained. In this experiment, the Explanations tab visualizes the importance of each feature in the dataset. This gives insights into how to improve the dataset in the future for better performance.

Figure 12 : The figure below shows the Aggregate Feature importance.

Hyperparameter Tuning

Overview

Hyperparameters are adjustable parameters that let us control the model training process. Model performance depends heavily on these hyperparameters. Hyperparameter tuning is typically computationally expensive and manual. Azure Machine Learning lets us automate hyperparameter tuning and run experiments in parallel to efficiently optimize hyperparameters.

For this, the data was first taken from the heart-failure-dataset. The dataset was then passed to the clean_data function in the train.py file to clean the data, and split into a train set and a test set. The training data was then passed to a Logistic Regression model, since this is a binary classification task.
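The train.py flow described above can be sketched as below. This is a simplified stand-in, not the exact script: clean_data here only drops incomplete rows, and the real script additionally parses --C and --max_iter from the command line and logs metrics to the run.

```python
# Simplified sketch of the training flow in train.py, not the exact script.
import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

def clean_data(df):
    # Drop incomplete rows and separate the DEATH_EVENT target column
    df = df.dropna()
    y = df.pop('DEATH_EVENT')
    return df, y

def train(df, C=1.0, max_iter=100):
    x, y = clean_data(df)
    x_train, x_test, y_train, y_test = train_test_split(
        x, y, test_size=0.2, random_state=42)
    model = LogisticRegression(C=C, max_iter=max_iter)
    model.fit(x_train, y_train)
    return model, model.score(x_test, y_test)  # model and test accuracy
```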

HyperDrive Configuration

Hyperparameter Sampling Method and Space :

The hyperparameters of the logistic regression model such as Inverse of Regularization strength (C) and Maximum Number of Iterations (max_iter) were tuned using Microsoft Azure Machine Learning's hyperparameter tuning package HyperDrive.

The search space for randomly choosing hyperparameters was selected. The search space in this project was specified as choice for max_iter and uniform for C. This search space was then fed to a parameter sampler, which specified the method for selecting hyperparameters from the search space.

This experiment used a RandomParameterSampling sampler to randomly select values from the search space specified for C and max_iter. In this sampling algorithm, hyperparameter values are randomly selected from the defined search space. Random sampling supports early termination of low-performance runs, thus requiring less computational effort.

# Create the different params that we will be using during training
param_sampling = RandomParameterSampling({'--C': uniform(0.01, 100),
                                          '--max_iter': choice(16, 32, 64, 128, 256)})

Termination Policy :

Here BanditPolicy was used as the stopping policy, since a Bandit Policy with a smaller allowable slack is the most aggressive compute-saving policy.

Bandit policy is based on slack factor/slack amount and evaluation interval. Bandit terminates runs where the primary metric is not within the specified slack factor/slack amount compared to the best performing run.

early_termination_policy = BanditPolicy(slack_factor=0.1, evaluation_interval = 2, delay_evaluation=5)

HyperDrive Configuration :

hyperdrive_run_config = HyperDriveConfig(hyperparameter_sampling = param_sampling,
                                         primary_metric_name = "Accuracy", 
                                         primary_metric_goal = PrimaryMetricGoal.MAXIMIZE, 
                                         max_total_runs = 25, 
                                         max_concurrent_runs=4, 
                                         policy=early_termination_policy, 
                                         run_config=estimator)

Results

Once submitted, the progress of the run can be viewed using RunDetails widget.

Figure 13 : The screenshot below shows the status of the run using RunDetails widget.

Figure 14 : The screenshot below shows the different Run details.

Figure 15 : The following figure shows the scatter plot of the Accuracy produced by training different models.

Figure 16 : The following figure shows the 2D scatter plot of the Accuracy obtained with different values of C and max_iter.

Figure 17 : The following screenshot shows the HyperDrive Run Details.

The best model generated by the HyperDrive run was a Logistic Regression model with:

  • C value : 85.769
  • max_iter value : 128

Figure 18 : The details of the best HyperDrive run are shown below.

Figure 19 : The best HyperDrive Run metrics are shown below.

Model Deployment

The AutoML run produced a more accurate model (88.79% accurate Voting Ensemble) than the HyperDrive-optimized Logistic Regression (80% accurate). Thus, we deploy the model generated by the AutoML run.

Azure Container Instance is used for deploying the model. Azure Container Instance offers the fastest and simplest way to run a container in Azure, without having to manage any virtual machines and without having to adopt a higher-level service. ACI is a great solution for any scenario that can operate in isolated containers, including simple applications, task automation, and build jobs.

To deploy the AutoML model, we first download the score.py script from the files generated by the AutoML run. A file containing the environment details and dependencies, myenv.yml, is also downloaded.
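The environment and deployment configuration objects can be built from these files; a minimal sketch, in which the CPU and memory values are illustrative defaults rather than values taken from the project:

```python
# Build the inference environment from the downloaded myenv.yml and an
# ACI deployment configuration (SDK v1). Resource sizes are illustrative.
from azureml.core import Environment
from azureml.core.webservice import AciWebservice

myenv = Environment.from_conda_specification(name='myenv',
                                             file_path='myenv.yml')
aci_config = AciWebservice.deploy_configuration(cpu_cores=1, memory_gb=1,
                                                auth_enabled=True)
```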

Then we use the Model.deploy() method to deploy our best performing model:

inference_config = InferenceConfig(entry_script='score.py', environment=myenv)

service = Model.deploy(workspace=ws, 
                       name='aci-service', 
                       models=[model], 
                       inference_config=inference_config, 
                       deployment_config=aci_config)

service.wait_for_deployment(show_output=True)

Figure 20 : The screenshot below shows the deployed model endpoint with Deployment State - Healthy.

Next we interact with the endpoint. For this, two data samples were extracted randomly from the test dataset and converted to JSON format. The following command passes the data to the model as an HTTP POST request and records the response -
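Constructing the request payload can be sketched as below; the feature values are illustrative placeholders, not the actual samples drawn from the test set:

```python
# Build a JSON payload with two illustrative patient records; the feature
# values here are placeholders, not real samples from the test set.
import json

sample = {
    'data': [
        {'age': 60, 'anaemia': 0, 'creatinine_phosphokinase': 250,
         'diabetes': 0, 'ejection_fraction': 38, 'high_blood_pressure': 0,
         'platelets': 262000, 'serum_creatinine': 1.1, 'serum_sodium': 137,
         'sex': 1, 'smoking': 0, 'time': 120},
        {'age': 45, 'anaemia': 1, 'creatinine_phosphokinase': 500,
         'diabetes': 1, 'ejection_fraction': 30, 'high_blood_pressure': 1,
         'platelets': 250000, 'serum_creatinine': 1.3, 'serum_sodium': 136,
         'sex': 0, 'smoking': 0, 'time': 90},
    ]
}
input_data = json.dumps(sample)
headers = {'Content-Type': 'application/json'}
```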

resp = requests.post(service.scoring_uri, input_data, headers = headers)

Figure 21 : Screenshot below shows the sample data response from the deployed model.

Here we can see that the labels of these samples were predicted as [0, 0], and their actual labels were also [0, 0].

Screen Recording

A screen recording of the project can be viewed using this YouTube link.

This shows a demo of:

  • Working model
  • Deployed model
  • Sample request sent to the endpoint and its response

Standout Suggestions

1. Converting the Model into ONNX Format:

Open Neural Network Exchange (ONNX) is an open standard format for representing machine learning models. ONNX helps in enabling interoperability between different frameworks and streamlining the path from research to production.

Our best model, i.e. the AutoML model, is converted to ONNX format, which allows us to interchange models between various ML frameworks and tools.

For this we set the parameter return_onnx_model = True to retrieve the best ONNX model, instead of the Python model.

from azureml.automl.runtime.onnx_convert import OnnxConverter
best_run, onnx_mdl = remote_run.get_output(return_onnx_model=True) 

This model is then saved and the results are predicted.
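The save step can be sketched as follows; the output path is illustrative, and mdl_bytes and onnx_res in the prediction snippet are assumed to be the saved model bytes and the ONNX resource returned by the run:

```python
# Save the best ONNX model returned by get_output; the path is illustrative
from azureml.automl.runtime.onnx_convert import OnnxConverter

OnnxConverter.save_onnx_model(onnx_mdl, './outputs/best_model.onnx')
```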

onnxrt_helper = OnnxInferenceHelper(mdl_bytes, onnx_res)
pred_onnx, pred_prob_onnx = onnxrt_helper.predict(test_df)
print(pred_onnx)

More details about the generated ONNX model can be viewed in this notebook.

2. Enabling Application Insights:

Application Insights, a feature of Azure Monitor, is an extensible Application Performance Management (APM) service. It is used to monitor our live applications. It will automatically detect performance anomalies, and includes powerful analytics tools to help us diagnose issues and to understand what users actually do with our application.

The application insights for displaying logs can be enabled for the deployed model endpoint by:

service.update(enable_app_insights = True)

Figure 22 : The screenshot below shows the Application Insight Enabled for the deployed model.

Figure 23 : We can access the logs through the Application Insights URL

3. Consuming Model Endpoint:

In this step, we will consume the deployed model using Swagger to interact with the HTTP REST API endpoint documentation. For this we will first download the swagger.json file that Azure provides for deployed models. Next we will run the swagger.sh file to pull the latest swagger-ui docker image and run it on port 9000. Finally we run serve.py script to serve the swagger.json for our model on an HTTP server.

To interact with the deployed web service's API resources, we go to localhost:9000 and change the URL in the Swagger UI to http://localhost:8000/swagger.json.

Figure 24 : This is shown in the figure below

Figure 25 : The screenshot below shows the HTTP API POST operation with the parameters being passed.

Figure 26 : The response received by the POST operation.

Improvements and Future Work

  • The AutoML Run gave insights about the importance of each feature. We can use that to create a better dataset.

  • The HyperDrive run can be trained with different values of the hyperparameters to see which hyperparameter gives the best accuracy with the Logistic Regression algorithm.

  • For the HyperDrive run, different algorithms such as Random Forest or Voting Ensemble can be used to obtain a better accuracy.

  • The settings and configuration of the AutoML run, such as experiment_timeout_minutes and n_cross_validations, can be changed to other options and the performance can be compared.

  • Similarly, the configuration of the HyperDrive such as Bandit Policy and Random Parameter Sampling can be altered to be able to fit the model better.

References

  1. Automated ML
  2. HyperDrive
  3. Deploy and Consume Model
  4. Machine Learning Engineer with Microsoft Azure

Contributors

  • singh2010nidhi
