Analyzed Apple's dataset to check how many people bought Airpods after buying a Mac or iPhone. Thereafter, using ML and predictive analytics to check future outcomes. Therefore, as we can see this project has 3 phases-
- Data Cleaning and Preprocessing
- Exploratory Data Analysis
- Development of an end-to-end ETL/ELT data pipeline
- Application of Statistical and mathematical models (Predictive Analytics)
I've learned in-depth application of different concepts and techniques like-
- Databricks
- Delta Tables
- Parquet
- Apache Spark
- PySpark
- Apache SQL
- SQL
- Broadcast Join
- Windows Functions- LAG and LEAD in SQL
- Factory Patterns
- MlLib
For any query, ping me on
- LinkedIn: @jabhij
- Twitter: @jabhij
- Web: LetUsTweak
Hope, it helps!! ใ