Name: Sanjyot Bandal
Type: User
Company: Thoughtworks Inc.
Bio: Sr. Data Engineer (EPFL-certified Scala developer) with 7.5+ years of experience in Scala, Java, and Akka, and with Big Data technologies such as Spark, Kafka, and AWS.
Location: Pune
Sanjyot Bandal's Projects
Repo to help you set up a basic AWS environment with an EMR cluster behind a VPC.
This repository has been built to help people learn how to do basic transformations on a single DataFrame in Spark + Scala.
A complete (distributed) BigData stack, running in containers
A repository that analyzes crime data using Spark + Scala
Starter code base for a Spark + Scala project.
Secure Hadoop Docker image
A robot-powered training repository
Just a sample repository
This repository will walk you through several katas for learning how to do joins with Spark+Scala.
A distributed publish/subscribe messaging service
Apache Kafka examples
Demo applications and code examples for Apache Kafka's Streams API.
All the Git-it Workshop completers!
Project for James' Apache Spark with Scala course
This repository has been built to help people learn how to work with semi-structured data sources with Spark+Scala.
Word count example to demonstrate Spark Streaming
Open source, on-demand courses and cheat sheets for Git and GitHub