View Code? Open in Web Editor
NEW
This project forked from mahmoudparsian/data-algorithms-with-spark
Data Algorithms with Spark
Python 69.42%
Shell 30.58%
data-algorithms-with-spark's Introduction
Data Algorithms with Spark
- Author: Mahmoud Parsian ([email protected])
- This new book (to be published by O'Reilly) is the 2nd Edition of
Data Algorithms
(published by O'Reilly)
- The first edition used Java for Spark, but for the new book, I will use PySpark (much simpler and readable)
- The goal is to use the latest version of Spark (Spark-3.0.0)
- This GitHub repository will host all source code and scripts for Data Algorithms with Spark
- Estimated Publication date: July 2021