This document is aim to explain about exploratory data analysis using python. Exploratory data analysis is an activity to check fundamental of data description to ensure that our data fit to the next round.
- Pandas Utilization
- Statistic Descriptive
- Simple Visualization
- Exploratory data analysis using univariate Part I (simple)
- Exploratory data analysis using univariate Part II (simple)
- Exploratory data analysis using multivariate
- Outlier detection
- Exploratory data analysis of missing value handler
- Exploratory data analysis categorical data -- To be continued
- Data reduction
- Numerical data transformation/normalization
Based on the Jiawei Han, dkk. Exploratory data analysis can be used to do preprocessing data in data mining process. There are three main process that we have to do :
- Data cleansing/cleaning (missing value, etc)
- Data integration and reduction (PCA, SVM, etc)
- Data transformation/normalization