GhoshSrinjoy/README.md

👨‍💻 About Me

I'm Srinjoy Ghosh, a Data Scientist and AI/ML Engineer with a Master's in Data Science from Friedrich-Alexander-Universität Erlangen-Nürnberg, specializing in Artificial Intelligence, Machine Learning, and Software Development.

🔭 What I’m Working On

  • Software Development and Testing and AI/DS Consulting:
    • Software testing and development of communication protocols, resource managers, and related components for edge-based medical devices.
    • Currently developing version 2 of a multi-agent system for end-to-end black-box testing (from requirement analysis to feature-file generation) for XpressRT. Version 1 is deployed using GitLab CI/CD and Runpod compute and is in testing.

💼 Professional Experience

  • Software Development and Testing and AI/DS Consulting at RX-Systems, Berlin, Germany
  • AI/ML Engineer (HIWI) at RX-Systems, Berlin, Germany
  • Data Scientist (HIWI) at Fraunhofer IISB, Erlangen, Germany
  • Python Django Trainee at Golden Eagle IT Technology, Kolkata, India
  • Data Analyst Intern at Ausavi AI Pvt. Ltd., Bangalore, India

📊 Academic Projects

🏆 Master's Thesis: RAG to RICHES - Multimodal Cross-Lingual RAG System

  • Designed and implemented RICHES (RAG with Intelligent Cross-Document Clustering and Hybrid Ensemble Search), a novel self-hosted multilingual Retrieval-Augmented Generation system for secure document processing across English and German.
  • Developed a novel multimodal PDF parser achieving state-of-the-art extraction of text, tables, images, and mathematical equations using layout detection (DETR) and specialized OCR, operating entirely locally without VLM dependencies.
  • Engineered a three-pipeline architecture (Ingestor, Retriever, Generator) with progressive multi-strategy retrieval: Simple Search for factual queries, Chain-of-Thought for comparative analysis, and Agentic Planning for complex multi-step inquiries.
  • Implemented ingestion-time K-Means clustering with Elbow Method optimization, turning brute-force search into an efficient two-stage lookup (cluster centroid → members) that reduces query latency while maintaining accuracy.
  • Achieved a 93.7% completion rate vs. 30% for OpenAI Playground on heterogeneous benchmarks, processing 100 files across PDFs, DOCX, images, and structured data with specialized parsers within a 24 GB VRAM constraint.
  • Built hybrid parallel retrieval combining vector search, BM25 lexical search, SQL metadata search, and image similarity using Reciprocal Rank Fusion (RRF), with language-consistent generation and inline citation traceability.
  • Completed the full ingestion pipeline in 10 min 46 s with a predictable cost distribution and sub-minute query response times, demonstrating 3.1× better search accuracy than commercial alternatives.
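
As an illustration only (this is not the RICHES code), the two retrieval ideas named above can be sketched in a few lines of Python: a two-stage lookup that picks the nearest cluster centroid and then ranks only that cluster's members, and Reciprocal Rank Fusion, which merges several ranked lists by summing 1 / (k + rank). The function names, the toy 2-D vectors, the Euclidean distance, and the constant k = 60 are all assumptions made for this sketch.

```python
import math


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document IDs into one ranking.

    Each document scores sum(1 / (k + rank)) over the lists that contain it,
    with rank starting at 1. k=60 is a commonly used default, assumed here.
    """
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    # Highest fused score first; ties are broken by ID for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))


def two_stage_search(query, centroids, members, top_k=3):
    """Pick the nearest centroid, then rank only that cluster's members.

    centroids: {cluster_id: vector}
    members:   {cluster_id: {doc_id: vector}}
    """
    nearest = min(centroids, key=lambda c: math.dist(query, centroids[c]))
    candidates = members[nearest]
    ranked = sorted(candidates, key=lambda d: (math.dist(query, candidates[d]), d))
    return ranked[:top_k]


# Toy data (hypothetical): two clusters of 2-D "embeddings".
centroids = {"a": (0.0, 0.0), "b": (10.0, 10.0)}
members = {
    "a": {"doc1": (0.1, 0.2), "doc2": (1.0, 1.0)},
    "b": {"doc3": (9.0, 9.5), "doc4": (11.0, 10.0)},
}
vector_hits = two_stage_search((0.0, 0.1), centroids, members)
bm25_hits = ["doc2", "doc3", "doc1"]  # stand-in for a lexical ranking
fused = reciprocal_rank_fusion([vector_hits, bm25_hits])
print(fused)
```

In this toy run only cluster "a" is scanned for the vector ranking, and a document that appears in both lists rises above one that appears in a single list.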
📂 Other Projects (Workflow Automation, FPGA, Smart Lock, etc.)
  • Workflow Automation using LLMs:
    • Developed a system that analyzes text from emails and user conversations to extract project descriptions and deploy them as Argo Workflows.
    • Automated the deployment of workflows in Argo by generating YAML files using GPT-3.5 Turbo integrated with a Retrieval-Augmented Generation (RAG) pipeline.
    • Designed a user-friendly interface with Chainlit for seamless user interaction.
  • Deploying ML Models on FPGA:
    • Explored model quantization and optimization techniques to deploy advanced ML models such as BERT and YOLOv3 on the Xilinx Kria KV260 Vision AI Starter Kit.
    • Addressed challenges related to FPGA hardware constraints and toolchain compatibility.
    • Leveraged Vitis AI for efficient model deployment.
  • Knowledge Graph Website for Architects and Planners:
    • Extracted and preprocessed text from PDFs to identify latent topics and related keywords.
    • Built a graph database, enabling visualization and querying of relationships between themes and keywords in sustainable architectural planning.
  • Voice Assistant for the Visually Impaired:
    • Created a voice-controlled assistant using VOSK speech recognition and pyttsx3 text-to-speech for enhanced accessibility.
    • Enabled features like retrieving time, date, location, and system commands, such as shutdown, tailored to improve user productivity.
  • Smart Lock Using Facial Recognition:
    • Built a smart lock system with a Raspberry Pi to identify and authenticate faces.
    • Successfully integrated a magnetic lock for automated access control.
  • Sound Mixer Using Arduino:
    • Programmed a sound mixer using Arduino UNO and piezo buzzers to reproduce music, including Metallica's "Enter Sandman".
    • Customized sound effects with the NOTE library.

🛠️ Technical Skills



Fun Fact: I recreated Metallica’s “Enter Sandman” using Arduino and piezo buzzers. 🎵
Let’s collaborate and innovate together! 🚀

🌐 Languages

• English • Bengali • Hindi • German


🐍 Contribution Snake

Pinned

  1. Universal-Report-Generator (Public)

    A universal HTML report generator that can intercept logs, debug messages, and prints to create beautiful HTML reports with plots and tables. This will be a Python package that can be imported and…

    Python

  2. Web_search_mcp (Public)

    A comprehensive Model Context Protocol (MCP) server that provides intelligent web search and content extraction capabilities using SearXNG, Trafilatura, and Redis caching.

    Python 1

  3. Overleaf-mcp (Public)

    JavaScript 7 2

  4. DeepseekOCR (Public)

    Deploying Deepseek_ocr in Windows Using Docker

    HTML 1 1

  5. SENSATION-Voice-assistant (Public)

    Sensation is a voice-controlled assistant built in Python. It can perform various tasks such as providing the current time and date, fetching your location, taking pictures, and shutting down your …

    Python 3

  6. linkedin-job-mcp (Public)

    JavaScript 1