Twitter stream analysis using Apache spark, Apache kafka and Postgresql
- Collect twitter feed for 30 minutes with any 5 hash tags.
- Ingest into spark.
- Use 6 tumbling windows of 5 minutes each.
- Aggregate count of each of these hash tags for each of the 5 minute windows.
- Publish them as topics into Kafka.
- Export topic wise count by window into any database.