Photo by Pascal Bernardon on Unsplash
This repository contains Jupyter notebooks that you should work through together as pair programmers.
Please make sure you have forked the repo and set up a new virtual environment.
To do so, run the following commands:
python -m venv .venv
source .venv/bin/activate
pip install --upgrade pip
pip install -r requirements.txt
The included requirements.txt file lists all libraries and dependencies needed to work with Pandas and NumPy.
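To confirm that the installation worked, you can run a quick check inside the activated environment. The following is a minimal sketch, not part of the repo itself: the helper name `missing_packages` and the package list are assumptions made for illustration, so adjust the list to match what requirements.txt actually contains.

```python
import importlib.util


def missing_packages(names):
    """Return the names that cannot be found in the active environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# Hypothetical package list; adjust it to match your requirements.txt.
required = ["pandas", "numpy"]

missing = missing_packages(required)
if missing:
    print("Missing packages:", ", ".join(missing))
else:
    print("All required packages are importable.")
```

If any package is reported as missing, check that the virtual environment is activated and rerun the install command.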
By the end of this repo you should:
- understand why Pandas and NumPy are valuable tools for data scientists
- be comfortable handling data in Pandas DataFrames and NumPy arrays
- know how to combine datasets
- be able to perform a basic exploratory data analysis (EDA) on a new dataset
- know how to create basic plots to better understand your data
This repo covers:
- Introduction to Pandas
- Practice Pandas functionalities
- Introduction to Pandas visualization
- Practice Pandas functionalities and visualizations
- More Pandas practice and EDA
- Combining DataFrames
- Introduction to NumPy
- Practice NumPy functionalities
- One last exercise :)
- Working with large data
- Dealing with date and time data
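
As a small taste of the topics listed above, here is a minimal sketch that combines two DataFrames, takes a first look at the result, and hands a column over to NumPy. The data and all column names (`store_id`, `revenue`, `city`) are invented for illustration and do not come from the notebooks.

```python
import numpy as np
import pandas as pd

# Hypothetical toy data, invented for illustration only.
sales = pd.DataFrame({
    "store_id": [1, 2, 3, 1],
    "revenue": [120.0, 80.5, np.nan, 99.5],
})
stores = pd.DataFrame({
    "store_id": [1, 2, 3],
    "city": ["Berlin", "Hamburg", "Munich"],
})

# Combine the two datasets on their shared key column.
combined = sales.merge(stores, on="store_id", how="left")

# A first look at the data: shape, missing values, summary statistics.
print(combined.shape)
print(combined.isna().sum())
print(combined.describe())

# Aggregate revenue per city; missing values are skipped by default.
revenue_per_city = combined.groupby("city")["revenue"].sum()
print(revenue_per_city)

# Convert a column to a NumPy array and compute a mean that ignores NaN.
revenue_array = combined["revenue"].to_numpy()
print(np.nanmean(revenue_array))
```

The notebooks cover each of these steps, along with plotting, in much more detail.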