- Spark Streaming supports scalable, fault-tolerant processing of continuous data streams. Typical use cases include fraud detection, website monitoring, clickstream analysis, and IoT (Internet of Things) sensor data.
Feature | Remarks |
---|---|
DStreams (Discretized Streams) | Spark Streaming divides the incoming data stream into batches of X seconds, where X is the batch interval. The resulting sequence of batches is called a DStream, or Discretized Stream; internally, each batch is an RDD. |
Data Sources | Data from sources such as Kafka and Kinesis is processed and pushed to destinations such as databases, dashboards, machine learning APIs, or plain file systems. |
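
To make the idea of discretization concrete, here is a minimal pure-Python sketch of how a stream of timestamped events could be cut into batches of X seconds. It does not use Spark or its API: the names `discretize`, `count_per_batch`, and `batch_interval`, and the `(timestamp, payload)` event shape, are assumptions chosen for illustration only.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

# Hypothetical event shape: (arrival time in seconds, payload)
Event = Tuple[float, str]


def discretize(events: Iterable[Event], batch_interval: float) -> List[List[str]]:
    """Group timestamped events into consecutive batches of fixed width."""
    if batch_interval <= 0:
        raise ValueError("batch_interval must be positive")

    # Map each event to the index of the interval it falls into.
    buckets: Dict[int, List[str]] = defaultdict(list)
    for timestamp, payload in events:
        buckets[int(timestamp // batch_interval)].append(payload)

    if not buckets:
        return []

    # Emit one batch per interval, in order, including empty intervals.
    first, last = min(buckets), max(buckets)
    return [buckets.get(i, []) for i in range(first, last + 1)]


def count_per_batch(batches: List[List[str]]) -> List[int]:
    """A simple per-batch computation: the number of events in each batch."""
    return [len(batch) for batch in batches]


if __name__ == "__main__":
    events = [(0.5, "click"), (1.2, "view"), (2.1, "click"), (6.3, "purchase")]
    batches = discretize(events, batch_interval=2.0)
    print(batches)
    print(count_per_batch(batches))
```

With a 2-second interval, the four sample events fall into the intervals starting at 0, 2, and 6 seconds, and the interval starting at 4 seconds yields an empty batch. In Spark Streaming the same grouping is done by the engine as data arrives, and each batch is processed as an RDD rather than a Python list.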