Code Monkey home page Code Monkey logo

uptrain's Introduction

UpTrain is an open-source framework to monitor machine learning models, built for engineers


Read documentation »
Try it out »

Performance

UpTrain is an open-source, data-secure tool for ML practitioners to observe and refine their ML models by monitoring their performance, checking for (data) distribution shifts, and collecting edge cases to retrain them upon. It integrates seamlessly with your existing production pipelines and takes minutes to get started ⚡.

  • Data Drift Checks - identify distribution shifts in your model inputs.
  • Performance Monitoring - track the performance of your models in realtime and get alerted as soon as a dip is observed.
  • Embeddings Support - specialized dashboards to understand model-inferred embeddings.
  • Edge Case Signals - user-defined signals and statistical techniques to detect out-of-distribution data-points.
  • Data Integrity Checks - checks for missing or inconsistent data, duplicate records, data quality, etc.
  • Customizable metrics - define custom metrics that make sense for your use case.
  • Automated Retraining - automate model retraining by attaching your training and inference pipelines.
  • Model Bias - track popularity bias in your recommendation models.
  • Data Security - your data never goes out of your machine.

🚨Coming soon🚨

  • Realtime Dashboards - to visualize your model's health.
  • Slack Integration - get alerts on Slack.
  • Label Shfit - identify drifts in your predictions. Specially useful in cases when ground truth is unavailable.
  • Prediction Stability - filter cases where model prediction is not stable.
  • AI Explainability - understand relative importance of multiple features on predictions.
  • Adversarial Checks - combat adversarial attacks

And more.

Get started 🙌

You can quickly get started with Google collab here.

To run it in your machine, follow the steps below:

Install the package through pip:

pip install uptrain

Run your first example:

git clone [email protected]:uptrain-ai/uptrain.git
cd uptrain/examples
pip install jupyterlab
jupyter lab

For more info, visit our get started guide.

UpTrain in action 🎬

One of the most common use cases of ML today is language models, be it text summarization, NER, chatbots, language translation, etc. UpTrain provides ways to visualize differences in the training and real-world data via UMAP clustering of text embeddings (inferred from bert).

Meme

Additionally, UpTrain also provides statistical measures to quantify these differences and enables automated alerts whenever this drift crosses a certain threshold.

Meme Meme

Why UpTrain 🤔?

Machine learning (ML) models are widely used to make critical business decisions. Still, no ML model is 100% accurate, and, further, their accuracy deteriorates over time 😣. For example, Sales prediction becomes inaccurate over time due to a shift in consumer buying habits. Additionally, due to the black box nature of ML models, it's challenging to identify and fix their problems.

UpTrain solves this. We make it easy for data scientists and ML engineers to understand where their models are going wrong and help them fix them before others complain 🗣️.

UpTrain can be used for a wide variety of Machine learning models such as LLMs, recommendation models, prediction models, Computer vision models, etc.

We are constantly working to make UpTrain better. Want a new feature or need any integrations? Feel free to create an issue or contribute directly to the repository.

Meme

License 💻

This repo is published under Apache 2.0 license. We're currently focused on developing non-enterprise offerings that should cover most use cases. In the future, we will add a hosted version which we might charge for.

Stay Updated ☎️

We are continuously adding tons of features and use cases. Please support us by giving the project a star ⭐!

Provide feedback (Harsher the better 😉)

We are building UpTrain in public. Help us improve by giving your feedback here.

Contributors 🖥️

We welcome contributions to uptrain. Please see our contribution guide for details.

uptrain's People

Contributors

a-r-r-o-w avatar ananis25 avatar lokeshwarlakhi avatar nishantb06 avatar progs2002 avatar scorcism avatar sourabhagr avatar vipgupta avatar

Stargazers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.