
An MLOps platform competing in the MLOpsVN Hackathon 2023


demo2-mlops-workflow's Introduction

Hi 👋, I'm Huy Vu Nguyen

🌱 I’m currently enhancing my skills in building ELT/ETL pipelines, data modeling, and testing.
👨‍💻 All of my projects are available at https://github.com/DurianDan
💬 Ask me about data engineering in healthcare, dbt, Airflow, and more.
📫 How to reach me: [email protected]
📄 Learn about my experience: data-engineer-resumes


Dynamically make predictions based on phase_id and prob_id:
- Implement a function that creates the corresponding Predictor and SQLModel.
- The Predictor must dynamically load the right registered model from mlflow based on phase_id and prob_id.
- The SQLModel must save the raw data from the request to the PostgreSQL server.
Load the latest raw data from PostgreSQL: data scientists send a request to this route to get the latest data for research and for building new models.
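
The dynamic Predictor lookup described above could be sketched roughly as follows. This is only an illustration, not the project's actual code: the `Predictor` dataclass, `make_predictor_factory`, and the injected `load_model` callable are hypothetical names, and the real implementation may be structured differently. The loader is passed in so the sketch runs without an mlflow server.

```python
from dataclasses import dataclass
from functools import lru_cache
from typing import Any, Callable

# Hypothetical loader type: takes a registered model name, returns a model object.
ModelLoader = Callable[[str], Any]


@dataclass
class Predictor:
    """Wraps one loaded model for a (phase_id, prob_id) pair."""

    phase_id: int
    prob_id: int
    model: Any

    def predict(self, rows):
        # Delegate to the wrapped model; assumes it exposes .predict().
        return self.model.predict(rows)


def make_predictor_factory(load_model: ModelLoader):
    """Return a cached factory so each model is loaded at most once."""

    @lru_cache(maxsize=None)
    def get_predictor(phase_id: int, prob_id: int) -> Predictor:
        # Model name follows the phase{phase_id}-prob{prob_id} convention.
        name = f"phase{phase_id}-prob{prob_id}"
        return Predictor(phase_id, prob_id, load_model(name))

    return get_predictor
```

Because the factory caches its results, a newly registered model version would only be picked up after calling `get_predictor.cache_clear()`. Whether to cache at all is a design choice this sketch assumes, not something the project states.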

When training a model: list the Python packages used to create the model in all requirements files: deployment/pip_requirements/*requirements.txt
- !!! Make sure the packages and their versions match on both the API server and the mlflow server,
- so that the API can correctly run the loaded mlflow model.
When a model is ready to be registered (for serving in the API): tutorial to register a model
- Name the model using the format: phase{phase_id}-prob{prob_id}
  E.g.: phase2-prob1
- You must specify the stage as "production" or "development".
  - When a stage is specified, mlflow can load the latest model in that stage. Details: get_latest_versions
  - The API then uses the stage name and the model name to load the latest mlflow model.
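
As a rough sketch of the naming and loading convention above, the helpers below build the registered model name and an mlflow `models:/<name>/<stage>` URI. The function names and `ALLOWED_STAGES` are hypothetical. Whether your mlflow version accepts these exact stage strings is not verified here, since the stage names are taken as written in this README.

```python
# Stage names as listed in this README (assumption: the registry accepts them).
ALLOWED_STAGES = ("production", "development")


def registered_model_name(phase_id: int, prob_id: int) -> str:
    """Build the model name, e.g. phase2-prob1."""
    return f"phase{phase_id}-prob{prob_id}"


def model_uri(phase_id: int, prob_id: int, stage: str) -> str:
    """Build a stage-based model URI for the mlflow model registry."""
    if stage not in ALLOWED_STAGES:
        raise ValueError(f"stage must be one of {ALLOWED_STAGES}, got {stage!r}")
    return f"models:/{registered_model_name(phase_id, prob_id)}/{stage}"


def load_latest_model(phase_id: int, prob_id: int, stage: str):
    """Load the latest model in the given stage (needs a reachable mlflow server)."""
    # Imported lazily so the helpers above work without mlflow installed.
    import mlflow.pyfunc

    return mlflow.pyfunc.load_model(model_uri(phase_id, prob_id, stage))
```

For example, `model_uri(2, 1, "production")` returns `models:/phase2-prob1/production`.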

Check that the Python packages match across the files in deployment/pip_requirements/: fastapi.requirements.txt and mlflow.requirements.txt
- Every package installed on the mlflow server must also be installed on the API server.
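
One way to automate this check is a small script that compares the two requirements files. The sketch below is a simplified illustration: `parse_requirements` and `find_mismatches` are hypothetical helpers, and the parser only handles plain `name==version` style lines. It does not handle extras, environment markers, or `-r` includes.

```python
import re


def parse_requirements(text: str) -> dict:
    """Map each package name to its version specifier string."""
    pins = {}
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments
        if not line or line.startswith("-"):  # skip blanks and pip options
            continue
        match = re.match(r"^([A-Za-z0-9_.\-]+)\s*(.*)$", line)
        if match:
            # Normalize so scikit_learn and scikit-learn compare equal.
            name = match.group(1).lower().replace("_", "-")
            pins[name] = match.group(2).replace(" ", "")
    return pins


def find_mismatches(mlflow_reqs: str, api_reqs: str) -> dict:
    """Report mlflow packages that are missing or pinned differently in the API."""
    mlflow_pins = parse_requirements(mlflow_reqs)
    api_pins = parse_requirements(api_reqs)
    problems = {}
    for name, spec in mlflow_pins.items():
        if name not in api_pins:
            problems[name] = "missing from API requirements"
        elif api_pins[name] != spec:
            problems[name] = f"mlflow has '{spec}', API has '{api_pins[name]}'"
    return problems
```

To use it, read the contents of mlflow.requirements.txt and fastapi.requirements.txt (for instance with `pathlib.Path.read_text`) and pass them to `find_mismatches`. An empty result means every mlflow package is present in the API requirements with the same specifier.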