Code Monkey home page Code Monkey logo

eda-polars-way's Introduction

eda-polars-way

Tutorial: Exploratory Data Analysis, the Polars Way

(as given at PyCon Italia 2024 and EuroPython 2024).

Preparation

Please prepare a Python environment that you can use during the workshop. We will work in Jupyter Notebook. However, you can also use jupyter lab or one of the IDES, Visual Studio Code or PyCharm.

Clone this repository

git clone https://github.com/janpipek/eda-polars-way.git

or using gh client:

gh repo clone janpipek/eda-polars-way

Alternatively, you can just download the repo as a package from here:

https://github.com/janpipek/eda-polars-way/archive/refs/heads/main.zip

Prepare Python Environment

The included requirements.txt file should be enough for you to create a Python environment using the pip command.

Python version 3.10+ is required.

First, cd into the repository directory:

cd eda-polars-way

Pip/uv installation

# Activate the environment (every time you open the shell)
python -m venv .venv         # (or `uv venv`)
source .venv/bin/activate    # Linux, Mac
.venv\Scripts\activate.bat   # Windows

# Install the required packages (once)
python -m pip install -r requirements.txt  (or `uv pip install -r requirements.txt`)

(note that we require the new, stable 1.0 version of polars)

(Absolutely lazy) on-line environment

This is not recommended but working in case you have probelms installing on your laptop.

Create an account at https://deepnote.com (for free) and launch the repo by clicking the button:

Note that you will have to install additional packages (there is a command you need to uncomment).

How to use this repo

All contents (a bit of text + all exercises) are located in exercises.ipynb. The exercise are partly filled and accompanied by hints. If you are still unsure, in solutions.ipynb, you have working code to answer the questions. To help SQL-savvy, the solutions-sql.ipynb file contains solution using the SQL API of polars).

Data sources

All the data sources are believed to be open and publicly distributable, see data/README.md for more details.

Useful links

Official documentation

Articles

Talks & videos

On-line courses

eda-polars-way's People

Contributors

janpipek avatar

Stargazers

bnln avatar  avatar Jean Jimbo  avatar  avatar auxten avatar Hendrik avatar  avatar Jan Doležal avatar  avatar Aneta Popelová avatar Sebastiano Ferraris avatar Diego Kisai avatar Bobo Jamson avatar  avatar Dr. Hunter Thompson Lockwood avatar Gustavo Juantorena avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.