Demonstrates analytics on Streaming data simulated via a Kafka topic. Sample data used is Airline Ontime data.
- Kafka running on localhost with a topic named
testspark
- Cassandra running on localhost with a keyspace named
cloudpoc
mvn package -DskipTests=true
{project-dir}$ spark-submit --packages org.apache.spark:spark-streaming-kafka-0-8_2.11:2.2.0 --class org.pgmx.cloud.poc.poc1.KafkaReaderTest target/poc1-1.0.jar