Code Monkey home page Code Monkey logo

pycaret's Introduction

drawing

An open-source, low-code machine learning library in Python
๐Ÿš€ PyCaret 3.0-rc is now out. pip install --pre pycaret

Official โ€ข Docs โ€ข Install โ€ข Tutorials โ€ข FAQs โ€ข Cheat sheet โ€ข Discussions โ€ข Contribute โ€ข Resources โ€ข Blog โ€ข LinkedIn โ€ข YouTube โ€ข Slack

Python pytest on push Documentation Status PyPI version License

Slack

alt text

Welcome to PyCaret

PyCaret is an open-source, low-code machine learning library in Python that automates machine learning workflows. It is an end-to-end machine learning and model management tool that speeds up the experiment cycle exponentially and makes you more productive.

In comparison with the other open-source machine learning libraries, PyCaret is an alternate low-code library that can be used to replace hundreds of lines of code with few lines only. This makes experiments exponentially fast and efficient. PyCaret is essentially a Python wrapper around several machine learning libraries and frameworks such as scikit-learn, XGBoost, LightGBM, CatBoost, spaCy, Optuna, Hyperopt, Ray, and few more.

The design and simplicity of PyCaret are inspired by the emerging role of citizen data scientists, a term first used by Gartner. Citizen Data Scientists are power users who can perform both simple and moderately sophisticated analytical tasks that would previously have required more technical expertise.

Important Links
โญ Tutorials New to PyCaret? Checkout our official notebooks!
๐Ÿ“‹ Example Notebooks Example notebooks created by community.
๐Ÿ“™ Official Blog Tutorials and articles by contributors.
๐Ÿ“š Documentation The detailed API docs of PyCaret
๐Ÿ“บ Video Tutorials Our video tutorial from various events.
โœˆ๏ธ Cheat sheet Cheat sheet for all functions across modules.
๐Ÿ“ข Discussions Have questions? Engage with community and contributors.
๐Ÿ› ๏ธ Changelog Changes and version history.
๐ŸŒณ Roadmap PyCaret's software and community development plan.

Installation

PyCaret's default installation only installs hard dependencies as listed in the requirements.txt file.

pip install pycaret

To install the full version:

pip install pycaret[full]

Supervised Workflow

Classification Regression

Unsupervised Workflow

Clustering Anomaly Detection

โšก PyCaret Time Series Module

PyCaret time series module is now available with the main pycaret installation. Staying true to simplicity of PyCaret, it is consistent with our existing API and fully loaded with functionalities. Statistical testing, model training and selection (30+ algorithms), model analysis, automated hyperparameter tuning, experiment logging, deployment on cloud, and more. All of this with only few lines of code (just like the other modules of pycaret).

Important Links
โญ Time Series Quickstart Get started with Time Series Analysis
๐Ÿ“š Time Series Notebooks New to Time Series? Checkout our official (detailed) notebooks!
๐Ÿ“บ Time Series Video Tutorials Our video tutorial from various events.
โ“ Time Series FAQs Have questions? Queck out the FAQ's
๐Ÿ› ๏ธ Time Series API Interface The detailed API interface for the Time Series Module
๐ŸŒณ Time Series Features and Roadmap PyCaret's software and community development plan.

Installation

pip install --pre pycaret

alt text

Who should use PyCaret?

PyCaret is an open source library that anybody can use. In our view the ideal target audience of PyCaret is:

  • Experienced Data Scientists who want to increase productivity.
  • Citizen Data Scientists who prefer a low code machine learning solution.
  • Data Science Professionals who want to build rapid prototypes.
  • Data Science and Machine Learning students and enthusiasts.

PyCaret GPU support

With PyCaret >= 2.2, you can train models on GPU and speed up your workflow by 10x. To train models on GPU simply pass use_gpu = True in the setup function. There is no change in the use of the API, however, in some cases, additional libraries have to be installed as they are not installed with the default version or the full version. As of the latest release, the following models can be trained on GPU:

  • Extreme Gradient Boosting (requires no further installation)
  • CatBoost (requires no further installation)
  • Light Gradient Boosting Machine requires GPU installation
  • Logistic Regression, Ridge Classifier, Random Forest, K Neighbors Classifier, K Neighbors Regressor, Support Vector Machine, Linear Regression, Ridge Regression, Lasso Regression requires cuML >= 0.15

PyCaret Intel sklearnex support

You can apply Intel optimizations for machine learning algorithms and speed up your workflow. To train models with Intel optimizations use sklearnex engine. There is no change in the use of the API, however, installation of Intel sklearnex is required:

pip install scikit-learn-intelex

License

PyCaret is completely free and open-source and licensed under the MIT license.

Contributors

pycaret's People

Contributors

ajarman avatar andrinbuerli avatar artificialzeng avatar ayushexel avatar batmanscode avatar bhanuteja2001 avatar celestinoxp avatar cspartalis avatar daikikatsuragawa avatar desaizeeshan22 avatar drmario-gh avatar goodwanghan avatar hamzafarooq avatar harsh204016 avatar incubatorshokuhou avatar jonasvdd avatar krishnansg avatar moezali1 avatar netoferraz avatar ngupta23 avatar reza1615 avatar ryankarlos avatar ryanxjhan avatar satya-pattnaik avatar sayantan1410 avatar tremamiguel avatar tvdboom avatar wkuopt avatar wolfryu avatar yard1 avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.