Code Monkey home page Code Monkey logo

demo2-mlops-workflow's Introduction

Hi ๐Ÿ‘‹, I'm Huy Vu Nguyen

Data Engineer specializing in Healthcare Data Models

duriandan

  • ๐ŸŒฑ Iโ€™m currently enhancing my skills in Building ELT/ETL pipelines, data modeling and testing.

  • ๐Ÿ‘จโ€๐Ÿ’ป All of my projects are available at https://github.com/DurianDan

  • ๐Ÿ’ฌ Ask me about data engineering in healthcare, dbt, Airflow, and more.

  • ๐Ÿ“ซ How to reach me: [email protected]

  • ๐Ÿ“„ Know about my experiences: data-engineer-resumes

Connect with me:

huy-vu-nguyen-1a9851168 huyk25sinh

Languages and Tools:

docker airflow fastapi grafana typescript linux mysql pandas postgresql javascript postman python react rust scikit_learn selenium

duriandan

ย duriandan

duriandan

demo2-mlops-workflow's People

Watchers

 avatar

demo2-mlops-workflow's Issues

Complete the API to dynamically make prediction based on problems and get latest raw data for data scientist.

The API needs to have 2 routes that:

  • Dynamically make prediction based on phase_id and prob_id:
    • Implement a function to create corresponding Predictor and SQLModel
    • The Predictor needs to dynamically load the right registered model from mlflow based on phase_id and prob_id
    • The SQLModel needs to save the raw data from the request to the PostgreSQL server
  • Load latest raw data from postgres: The data scientist will send request to this route to get the latest data for researching and creating new model.

Notes for Data Scientist:

  • When training a model: specify the python packages used to create the model in all requirements.txt files: deployment/pip_requirements/*requirements.py;

    • !!!Make sure the packages and their versions are matched both in the api server and the mlflow server.
    • So that the API can correctly run the loaded mlflow model
  • when a model is ready to registered (for serving in the API): tutorial to register a model

    • Name the model with format: phase{phase_id}-prob{prob_id}
      E.g.: phase2-prob1
    • Must specify the stage as "production" or "development".
      • mlflow can load the latest model in a stage, when the stage is specified. details: get_latest_version
      • then the API will use the stage name and model name to load the latest mlflow model

CI/CD:

  • check for similar python packages in deployment/pip_requirements/: fastapi.requirements.txt and mlflow.requirements.txt
    • every packages in mlflow server must be installed in the API server

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.