This repository offers content for a 3 day single scenario deep-dive training outline that has been used for the (ML) track of the Partner AI airlifts in the spring of 2019
This material should enable you to learn how to combine the strengths of Azure Databricks (ADB) for data wrangling and ML model building and training on big data, with the strengths of Azure Machine Learning (AML) services for ML Experimentation, Management, Deployment, and DevOps.
For this purpose, we will use a real-world scenario for anomaly detection and predictive maintenance on factory floors. This use-case scenario was selected based on popular demand by the field, and its applicability across industry verticals.
The scenario involves structured as well as unstructured time series data. The structured time series data will consist of telemetry data collected by sensors on many manufacturing machines. The unstructured time series data will be video recordings of cameras surveilling factory floors. The telemetry data will mainly be used on the first 2 days of the airlift, to show the integration of ADB and AML for classical ML. The video data will be used on the third day, to extend our use-case to detecting anomalies (e.g. accidents) in surveillance videos of factory floors, invoking deep learning on GPUs with AML compute.
Day 1 | Day 2 | Day 3 |
---|---|---|
Machine Learning on Azure โ Solution Brief | Logistic Regression w/ Spark ML (Hands On) | CI/CD and DevOps (Hands On) |
Data Prep w/ ADB (1; Hands On) | Random Forests w/ Spark ML (Hands On) | Deep Learning w/ GPUs on AmlCompute (Hands On) |
Data Prep w/ ADB (2; Hands On) | AML Model Management and ML Experimentation (Hands On) | Hyperparameter tuning w/ HyperDrive (Hands On) |
Model Development w/ Spark (1; Hands On) | Automated ML (Hands On w/ AML + ADB) | Hackathon |
Model Development w/ Spark (2; Hands On) | Real-time scoring w/ AKS, CI/CD and DevOps | Hackathon |
Model Development w/ Spark (3; Hands On) | Real-time scoring w/ AKS - Deployment to AKS (Hands On) | Hackathon |
Overview of AML service integration into ADB and capabilities | Recap of CI/CD plan AML Pipelines (Hands On) | Hackathon + Final Discussions |
This content can roughly be devided into 3 parts. The first part of this be performed with Azure Databricks, the other parts of the training will be done with VS Code.
Use the following links to get to the right entry point for the different parts:
- Part 1: Azure Databricks and its integration with Azure Machine Learning Services: Readme
- Part 2: Continuous Integration and Continuous Delivery (CI/CD): Readme
- Part 3: Deep learning with Azure Machine Learning Services using VS Code: Readme
We invite everybody to contribute.
- Email us with questions an comments:
[email protected]
- Submit issues and comments in our github repository:
https://github.com/Azure/LearnAI-CustomAI-Airlift/issues