Code Monkey home page Code Monkey logo

modeldrift's Introduction

Productionizing Machine Learning: From Deployment to Drift Detection

Here is a prototype on productionizing a ML model pipeline, and monitoring it for drift, for subsequent retraining and deployment.

This uses glassware manufacturing dataset, which is synthesized to showcase model drift.

To review the code in notebook format using HTML https://joelcthomas.github.io/modeldrift

Architecture Overview

Deployment to Drift Detection - a Typical Workflow

  • To understand the data, we start with EDA (Exploratory Data Analysis)
  • Using historical data, we explore various modeling methods, tune its hyperparameters, and identify our best model
  • All the experiment runs are tracked using MLflow and we tag the best model for production use
  • While scoring in a streaming pipeline, production model is accessed from MLflow
  • Model is stable for first ‘x’ days
  • Model Drift KPIs
    • KPIs and its margin depends on the model and business problem
    • Sometimes more than 1 KPI maybe needed at times to capture behavior changes
  • After ‘y’ days, we see model drift occur, as identified by tracking KPIs
  • This triggers re-training process
  • Once again, we explore various modeling methods, tune its hyperparameters, and identify our new best model
  • The new model is tagged as current production model in MLflow
  • We once again observe that KPIs are back within acceptable range
  • Over time, based on business demands, it may be needed to update KPIs and its acceptable limits

Run

To reproduce this example, please import attached model_drift_webinar.dbc file to databricks workspace.

Instructions on how to import notebooks in databricks

For more information on using databricks
https://docs.databricks.com/

modeldrift's People

Contributors

joelcthomas avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.