Skip to content

An end-to-end data engineering pipeline that fetches real-time YouTube analytics and streams them through Kafka for processing with ksqlDB. The processed analytics data is then sent to Telegram for real-time notifications.

Notifications You must be signed in to change notification settings

airscholar/YoutubeAnalytics

Repository files navigation

Real-Time YouTube Analytics Streamed to Telegram

This Python-based project aims to fetch real-time YouTube metrics like likes, views, comments, and favorites, and then streams this data via Kafka. Also, KSqlDB was used for stream processing and the processed data is then sent to a Telegram bot for real-time notifications.

Table of Contents

  1. System Architecture
  2. Requirements
  3. Getting Started
  4. Configuration
  5. Running the Code
  6. How It Works
  7. Contributing
  8. Video

System Architecture

YoutubeAnalytics architecture.png

Requirements

  • Python 3.10 (minimum)
  • Kafka
  • Telegram API
  • Docker
  • Confluent Containers (Zookeeper, Kafka, Schema Registry, Connect, ksqlDB, Control Center)

Getting Started

  1. Clone the repository.

    git clone https://github.com/airscholar/YoutubeAnalytics.git
  2. Install Python dependencies.

    pip install -r requirements.txt
  3. Make sure you have Docker and Confluent containers set up.

Configuration

  1. Open config/config.local and set the following:

    • YOUTUBE_API_KEY: Your YouTube API Key
    • PLAYLIST_ID: The YouTube playlist ID you want to track
  2. Set up your Kafka server address in the main script, by default, it's set to localhost:9092.

Running the Code

  1. Start your Kafka and other Confluent services on Docker with
    docker compose up -d
  2. Run the Python script.
    python YoutubeAnalytics.py

How It Works

  1. Fetches data from YouTube API using the given playlist ID.
  2. Sends this data to Kafka.
  3. You should have another component (not included here but in the video) that reads from this Kafka topic and performs real-time analytics using ksqlDB.
  4. The analytics results are then sent to Telegram for real-time notifications.

Contributing

Pull requests are welcome! For major changes, please open an issue first to discuss what you would like to change.

Video

YoutubeAnalytics

About

An end-to-end data engineering pipeline that fetches real-time YouTube analytics and streams them through Kafka for processing with ksqlDB. The processed analytics data is then sent to Telegram for real-time notifications.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published