- a Data Lake which receives scraped real estate data. PySpark will be processing this data and load it into a DWH
- MinIO
- Apache Airflow
- Scrapy
- Marquez + PostgreSQL
stejul Goto Github PK
Name: Stefan Mikic
Type: User
Company: Rhomberg Sersa Rail Group Holding GmbH
Bio: Developer with interests in Data
Blog: https://smikic.com