Skip to content
This repository has been archived by the owner on Mar 25, 2024. It is now read-only.

A step by step guide to building a highly scalable data streaming pipeline in Python.

Notifications You must be signed in to change notification settings

iam-mhaseeb/Data-Streaming-Pipeline

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data-Streaming-Pipeline

A step by step guide to building a highly scalable data streaming pipeline in Python. Article is avialable here

Architecture

image

Installation

  1. Install the Redis and run it locally.

  2. Clone the repository.

    git clone https://github.com/iam-mhaseeb/Data-Streaming-Pipeline.git
    
  3. Install the requirements.

    pip install -r requirements.txt
    
  4. You are good to go!

Quick start

  1. Start the producer quotes_spider:
    cd producer
    scrapy crawl quotes
    
  2. Start the consumer quotes_consumer:
    cd consumer
    python quotes_consumer.py
    
  3. See data flowing between pipeline as soon as data is generated from the producer.

Authors

Licensing

The project is MIT Licenced.

About

A step by step guide to building a highly scalable data streaming pipeline in Python.

Resources

Stars

Watchers

Forks

Languages