Demonstrates filtering on Streaming data simulated via a Kafka topic. Sample data used is Airline Ontime data. The (raw) data is filtered by Storm, and the cleaned-up data is routed to 2 destinations:
- Another Kafka Topic (for further processing)
- Cassandra (dump)
- Kafka running on localhost with a topic named
raw
andproc
- Cassandra running on localhost with keyspace
cloudpoc
mvn package -DskipTests=true
{project-dir}$ storm jar target/poc3-1.1.1.jar org.pgmx.cloudpoc.poc3.topology.KafkaLocalReaderTopology