Code Monkey home page Code Monkey logo

mlops-nba's Introduction

nba-mlops

Description

This project demonstrates the implementation of data pipelines in an AI context. It utilizes DVC (Data Version Control) for data versioning and management. The pipelines are designed to efficiently process and transform data, enabling seamless integration with AI models and workflows. We will then use MLFlow to experiment track our experiments, and use it as a model store.

Requirements

The requirements for the project are the following:

  • python3.9+
  • make command
    • For windows users, you can download make command following this link. For more details on other versions, follow this page
    • For linux/mac users, download make command following your sudo apt-get update & apt-get -y install make

To check make is correctly installed, type make --version

Setup the environment

Start by running make--version and python --version to make sure you have all the prerequists.

  • Run make setup
  • activate your environement :
    • Windows: .\wenv\Scripts\activate
    • Linux: ./venv/bin/activate
  • Start developping !

PS: To check that you're on the right envrionnement, type python -m mlops_nba.main.

Dev tools available:

Those command are targeting the mlops_nba folder and the configuration is here.

  • Code Quality: You can trigger those commands with make check.

    • Formatting with black + isort: To format use make format and check with make black and make isort for black and isort respectively
    • type-checking with mypy: You can use make mypy to check the types and detect errors
    • Linting with flake8 + pylint: You can use make flake8 and make pylint to lint your code using flake8 and pylint respectively.
  • Tests:

    • For testing we use pytest and target the tests in the mlops_nba using make test
    • You can generate a coverage report using make coverage and a html version using make coverage-html

    Next steps:

    • Create a preprocessed stage having the aggreation of all currated data
    • Implmenet data-quality for all data stages
    • Add unit and integration tests for all pipelines (Github actions)

mlops-nba's People

Contributors

absabry avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.