Skip to content
View coderRaj07's full-sized avatar
🏠
Working from home
🏠
Working from home

Block or report coderRaj07

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
coderRaj07/readme.md

Software Engineer building distributed systems & AI-native infrastructure

I design systems that scale under load and stay correct under concurrency.

         

🧠 What I Build

I design and ship production-grade backend systems focused on:

  • High-concurrency distributed architectures
  • Idempotent APIs & transactional guarantees
  • Event-driven AWS infrastructure
  • Database performance at scale
  • AI-native pipelines (LLM + RAG + document intelligence)

I care deeply about correctness, scalability, and business impact.


🛠 Core Technologies


NodeJS   Django   Python   JavaScript   Docker   Redis   PostgreSQL   MongoDB   AWS


📊 GitHub Metrics


🚀 Products & Systems

Scalable AWS Ingestion Pipeline

Architecture: S3 → Lambda → SQS → Lambda → PostgreSQL
Scale: 10,000+ async jobs/day

Designed and implemented a fault-tolerant, event-driven ingestion pipeline to process large CSV datasets into PostgreSQL.

Engineering Highlights

  • Queue-based isolation for failure containment
  • Idempotent job processing to prevent duplicate writes
  • Transaction-safe inserts with DB constraints
  • Designed scaling roadmap for 10M+ rows using Step Functions + Glue

Focus Areas: Reliability • Concurrency Control • Horizontal Scalability

📖 Architecture Blog🎥 Demo


CyberIntel – AI PDF Intelligence Platform

An AI-powered system that extracts structured quantitative metrics from complex PDF reports (tables + vector charts).

Stack: FastAPI • Celery • PostgreSQL • Layout Parsing Engine • LLM Orchestration

Core Capabilities

  • Layout-aware PDF parsing engine
  • Hybrid extraction pipeline (tables + vector-based charts)
  • Confidence scoring for extracted metrics
  • Source-of-truth tagging with page-level traceability
  • Modular pipeline architecture (worker + service layer separation)

Designed for extensibility and large-scale document ingestion workflows.

GitHubDemo


SignalLens – AI Competitive Monitoring

A monitoring platform that tracks website changes and generates AI summaries for competitive intelligence.

Stack: React • FastAPI • PostgreSQL • Snapshot Engine

System Design Highlights

  • Versioned website snapshots
  • Structured diff detection engine
  • AI-generated change summaries
  • Scalable content monitoring workflow

Focused on automated intelligence extraction from dynamic web content.

🌐 LiveGitHub


Dataseeder – AI CLI Data Generator

A developer productivity CLI tool for schema-driven dataset generation using LLM workflows.

Stack: Python • LLM APIs • CLI Automation

Capabilities

  • Schema-based synthetic data generation
  • Automated dataset creation for rapid prototyping
  • Extensible command-line interface
  • AI-assisted developer workflow acceleration

GitHub


Payment System Microservices

Distributed payment architecture with strong transactional guarantees.

Stack: Java • Spring Boot • PostgreSQL

Engineering Highlights

  • ACID-compliant transaction management
  • Service isolation for payment consistency
  • Modular microservice architecture
  • Database-level consistency enforcement

Designed to simulate production-grade financial backend behavior.

GitHub


Social Media Backend

High-read performance backend optimized for feed delivery at scale.

Stack: Node.js • PostgreSQL • Redis

System Highlights

  • Feed query optimization
  • Redis-based caching strategy
  • Normalized relational schema design
  • Improved read performance under load

Built with scalability and cache efficiency in mind.

GitHub


🎯 Open To

Backend Engineering • Distributed Systems • AI Infrastructure
Remote-first teams • UAE product companies

Pinned Loading

  1. cyber_intel cyber_intel Public

    Automated document intelligence pipeline that extracts, normalizes, and structures quantitative metrics from complex PDFs (tables, charts, and text) with source-of-truth metadata.

    Python

  2. LLM-Powered-Cli-Tool LLM-Powered-Cli-Tool Public

    It is a cli tool that uses LLM to generate realistic data for formbricks

    Python

  3. signal_lens signal_lens Public

    SignalLens is a lightweight competitive intelligence tool that monitors competitor websites, tracks content changes with diff analysis and AI summaries, and provides recent activity insights via a …

    Python

  4. booking_system_microservices_ booking_system_microservices_ Public

    This application is built using Node.js and Sequelize for MySQL database interaction. We've divided our application into several microservices to make it more scalable and maintainable.

    JavaScript 1

  5. payment-system-microservice payment-system-microservice Public

    A PayPal-like payment system built with Spring Boot microservices, featuring user, transaction, reward, and notification services, integrated via API Gateway with Redis caching, Kafka event streami…

    Java

  6. social_media_backend social_media_backend Public

    Scalable social media backend with user authentication, posts, comments, likes, follows, and personalized feeds. Supports asynchronous feed updates, caching, and pagination for high-performance and…

    TypeScript