kylasam/KafkaRealtimeDataStreaming

KafkaRealtimeDataStreaming

Implementing Kafka data streaming: a data streaming application built with Kafka and Spark processing.

This is a standalone application that streams data from a source, processes it in PySpark, and writes the results as JSON to a local directory.
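As a rough illustration of that flow, the sketch below reads a Kafka topic with Spark Structured Streaming and appends the decoded messages as JSON files to a local directory. It is not the repository's actual code: the function names, broker address, topic, and directory paths are all hypothetical, and it assumes Spark is launched with the `spark-sql-kafka` connector available.

```python
def kafka_source_options(bootstrap_servers: str, topic: str) -> dict:
    """Build options for Spark's Kafka source.

    The option keys are Spark's own; the values passed in are placeholders.
    """
    return {
        "kafka.bootstrap.servers": bootstrap_servers,
        "subscribe": topic,
        "startingOffsets": "latest",
    }


def start_pipeline(spark, options: dict, output_dir: str, checkpoint_dir: str):
    """Read from Kafka, decode each message value, and write JSON files.

    `spark` is an existing SparkSession. Returns the running StreamingQuery.
    """
    # Imported lazily so the helper above is usable without PySpark installed.
    from pyspark.sql.functions import col

    raw = spark.readStream.format("kafka").options(**options).load()

    # Kafka delivers the payload as bytes; cast it to a string column.
    decoded = raw.select(
        col("value").cast("string").alias("value"),
        col("timestamp"),
    )

    # The file sink requires a checkpoint location and append mode.
    return (
        decoded.writeStream
        .format("json")
        .option("path", output_dir)
        .option("checkpointLocation", checkpoint_dir)
        .outputMode("append")
        .start()
    )
```

A caller would typically build the options with something like `kafka_source_options("localhost:9092", "events")`, pass them to `start_pipeline` together with a `SparkSession`, and then call `awaitTermination()` on the returned query.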

The data pipeline is shown below:

Script Flow Diagram:

Execution Steps: Run Applciation.py. A GUI opens, and you click Start application, as shown below:

After you click Start application, the process starts as follows:

Dashboard View: This is a temporary project, so its parameters still need fine-tuning for good performance and a better GUI/UX. For now, the dashboard (HTML) must be refreshed manually in the browser. Because of the limits of my personal laptop's configuration, it is not linked to any real-time reporting.
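One lightweight way to avoid the manual refresh, which the project does not currently do, is to regenerate the HTML page with a `meta refresh` tag so the browser reloads it on a timer. The sketch below uses only the standard library and assumes the pipeline output is a directory of JSON files with one object per line (the layout Spark's JSON sink produces). The function names and the default interval are hypothetical.

```python
import html
import json
from pathlib import Path


def load_records(output_dir):
    """Collect records from JSON-lines files in a directory."""
    records = []
    for path in sorted(Path(output_dir).glob("*.json")):
        for line in path.read_text(encoding="utf-8").splitlines():
            if line.strip():
                records.append(json.loads(line))
    return records


def render_dashboard(records, refresh_seconds=5):
    """Render records as an HTML table that reloads itself periodically."""
    # Use the union of all keys so records with missing fields still fit.
    columns = sorted({key for rec in records for key in rec})
    header = "".join(f"<th>{html.escape(c)}</th>" for c in columns)
    rows = "".join(
        "<tr>"
        + "".join(
            f"<td>{html.escape(str(rec.get(c, '')))}</td>" for c in columns
        )
        + "</tr>"
        for rec in records
    )
    return (
        "<!DOCTYPE html><html><head>"
        f'<meta http-equiv="refresh" content="{int(refresh_seconds)}">'
        "<title>Dashboard</title></head><body>"
        f"<table><tr>{header}</tr>{rows}</table>"
        "</body></html>"
    )
```

Writing the result of `render_dashboard(load_records(...))` to the dashboard file on a schedule keeps the page current without real-time reporting infrastructure, at the cost of re-reading every output file on each pass.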

SNAPSHOT 2:

The code is designed to be robust and easy to extend to other applications or to integrate with new software and frameworks. Most of it is close to plug-and-play.
