Skip to content

Latest commit

 

History

History
13 lines (9 loc) · 1.71 KB

File metadata and controls

13 lines (9 loc) · 1.71 KB

Spark Streaming

  • Spark Streaming supports highly scalable, fault-tolerant continuous stream processing which is mostly used in cases like fraud detection, website monitoring, website click baits, IoT (Internet of Things) sensors, etc.

Features

Feature Remarks
Dstreams or Discretized Streams Spark Streaming first divides the data from the data stream into batches of X seconds which are called Dstreams or Discretized Streams.
Data Sources Data from sources like Kafka, Kinesis etc. are processed and pushed to various destinations like databases, dashboards, machine learning APIs or as simple as file systems.

Architecture