Code Monkey home page Code Monkey logo

realtimerdeployment's Introduction

Author: Hong Ooi

Real-time model deployment with R using Azure Container Registry and Azure Kubernetes Service

Overview

This repository hosts deployment artifacts for the reference architecture "Real-time model deployment with R". You can use these artifacts to deploy a containerised predictive service in Azure.

Design

The workflow in this repository builds a sample machine learning model: a random forest for housing prices, using the Boston housing dataset that ships with R. It then builds a Docker image with the components to host a predictive service:

  • Microsoft Machine Learning Server (only the R components).
  • The Azure CLI (necessary to use Model Operationalization).
  • the model object plus any packages necessary to use it (randomForest in this case).
  • a script that is run on container startup.

This image is pushed to a Docker registry hosted in Azure, and then deployed to a Kubernetes cluster, also in Azure.

Prerequisites

To use this repository, you should have the following:

  • A recent version of R. It's recommended to use Microsoft R Open, although the standard R distribution from CRAN will work perfectly well.

  • The following packages for working with Azure. Note that if you are using Microsoft R, they will probably not be in the MRAN snapshot that is your default repository; run options(repos="https://cloud.r-project.org") to set your repository to an up-to-date CRAN mirror. before running install.packages.

  • Docker, kubectl and helm installed on your machine.

  • An Azure subscription.

Setup

Edit the file resource_specs.R to contain the following:

  • Your Azure Active Directory tenant. This can be either your directory name or a GUID.
  • Your subscription ID.
  • The name of the resource group that will hold the resources created. The resource group will be created if it does not already exist.
  • The location of the resource group; for a list of regions where AKS is available, see this page.
  • The names for the ACR and AKS resources to be created.
  • The number of nodes for the AKS cluster.

Deployment steps

In general, you should not run these scripts in an automated fashion, eg via source() or by pressing Ctrl-Shift-Enter in RStudio. This is because the process of creating and deploying resources in the cloud involves significant latencies; it's sometimes necessary to wait until a given step has finished before starting on the next step. Because of this, you should step through the scripts line by line, following any instructions in the comments.

Building the model image

The script 00_train_model.R trains a simple model (a random forest for house prices, using the Boston dataset), and saves the model object to a .RDS file.

Creating the Azure resources

The script 01_create_resources.R checks if the resource group for the deployment exists, and creates it if necessary. It then creates the ACR and AKS resources. Note that creating an AKS resource can take several minutes.

Installing an ingress controller

The script 02_install_ingress.R installs nginx on the Kubernetes cluster, and downloads a TLS certificate from Let's Encrypt.

Deploying the service

The script 03_deploy_service.R deploys the actual predictive service. First, it builds the Docker image for the predictive service, saving the MLS model operationalization password to Key Vault. The image is about 2GB in size. It then pushes the image built previously to the container registry, and creates a deployment and service on the Kubernetes cluster using that image. This step involves uploading the image to Azure, so may take some time depending on the speed of your Internet connection. At the end, it brings up the Kubernetes dashboard so you can verify that the deployment has succeeded.

Note that the MMLS install used in this image is licensed for development and testing purposes only. For a production image, contact your Microsoft representative about licensing details.

Testing the service

The script 04_test_service.R tests that the service works properly (which is not the same as testing that the deployment succeeded). It uses the httr package to send requests to the API endpoint; you can check that the responses are as expected.

realtimerdeployment's People

Contributors

hongooi73 avatar microsoftopensource avatar msftgits avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.