Skip to content
View AmirQaderi's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report AmirQaderi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
AmirQaderi/README.md

Amirhossein Qaderi

AI & Data Engineer Β· I build systems that make machines understand documents β€” and data teams understand their business.

LinkedIn Email GitHub


What I Build

I sit at the intersection of AI engineering and data infrastructure β€” deploying LLM-powered retrieval systems, architecting modern data stacks, and translating raw transactional data into executive-ready insight.

  • πŸ€– RAG & LLM Systems β€” Production Persian chatbot: +38% retrieval precision, ~60% hallucination reduction (FastAPI Β· Qdrant Β· OpenAI API)
  • πŸ—οΈ Data Engineering β€” End-to-end ELT pipelines with orchestration, transformation, and quality contracts (Airflow Β· dbt Β· PostgreSQL Β· Docker)
  • πŸ“Š Analytics & ML β€” Churn prediction, RFM segmentation, CLV modelling β†’ Power BI dashboards for stakeholders

Tech Stack

Languages & Querying

Python SQL PostgreSQL MySQL

Data Engineering

Apache Airflow Apache Spark Apache Kafka dbt Docker Docker Compose FastAPI

AI Β· LLM Β· NLP

OpenAI Qdrant Sentence Transformers RAG

Analytics & ML

scikit-learn Pandas NumPy Power BI PySpark


Featured Projects

πŸ€– Persian RAG Chatbot Platform (Production Β· Barmana)

Context-aware document retrieval system deployed for internal operations and external end-users.

Built a full-stack RAG pipeline over a mixed Persian/English document corpus. Implemented a retrieval evaluation framework (Precision@K, LLM-as-judge faithfulness scoring) across 1,000+ documents and 150 domain-specific queries.

Results: Retrieval precision 61% β†’ 84% (+38%) Β· Hallucination rate reduced ~60%

FastAPI Qdrant OpenAI API OCR Docker Python Sentence Transformers Multilingual Embeddings


⚑ Real-Time Fraud Detection Pipeline

End-to-end streaming architecture for real-time financial transaction monitoring and fraud alerting.

Built a decoupled Kafka β†’ Spark β†’ PostgreSQL pipeline capable of handling high-throughput transaction streams. Implemented a tier-one deterministic rule engine with micro-batch processing, secure credential injection via environment variables, and a JDBC sink writing flagged records to an auditable fraud_alerts table. Containerized full infrastructure with single-command spin-up via Docker Compose.

Apache Spark Apache Kafka Confluent Platform PostgreSQL PySpark Docker Docker Compose Python


Production-grade ELT pipeline with automated orchestration and data quality contracts.

Architected a full modern data stack on the Olist e-commerce dataset. Modular dbt project with staging + mart layers, schema tests (not_null, unique, accepted_values), and source freshness checks. Single-command environment spin-up via Docker Compose.

Apache Airflow dbt PostgreSQL Docker Git


End-to-end customer analytics on 100K+ real orders across 9 relational tables (2016–2018).

Built a Python ETL pipeline into PostgreSQL, RFM segmentation framework (Champions / At-Risk / Lost cohorts), Logistic Regression churn classifier (Precision: 0.80, Recall: 1.00), and CLV scoring β€” all surfaced through four cross-filtered Power BI dashboards.

Python PostgreSQL scikit-learn SQLAlchemy Power BI DAX


Published open-source dataset capturing demographics, purchasing patterns, device usage, and satisfaction metrics β€” designed for ML, analytics, and educational use.

Python Pandas Open Source


GitHub Stats

Top Languages

GitHub Streak


Currently

  • πŸŽ“ M.Sc. in IT Management (E-Business) Β· Shahid Beheshti University, Tehran
  • πŸ’Ό AI Developer @ Barmana β€” building LLM retrieval systems in production
  • πŸ” Open to Data Engineering, AI/ML Engineering, and Analytics Engineering opportunities

If you're working on intelligent data products, let's talk.

LinkedIn Email

Pinned Loading

  1. Olist-Churn-CLV-Analytics Olist-Churn-CLV-Analytics Public

    End-to-end e-commerce analytics project using the Olist dataset. Built churn prediction and Customer Lifetime Value (CLV) models with Python, SQL, PostgreSQL, scikit-learn, and Power BI. Includes R…

    Python

  2. ecommerce-customer-behavior-analysis ecommerce-customer-behavior-analysis Public

    End-to-end exploratory analysis of e-commerce customer behavior and sales data using a public Kaggle dataset (v2, multi-order).

    HTML 1

  3. olist-modern-data-stack olist-modern-data-stack Public

    End-to-end modern data stack using Airflow, dbt, PostgreSQL, Docker, and Power BI for e-commerce analytics.

    Python

  4. real-time-fraud-detection-pipeline real-time-fraud-detection-pipeline Public

    Real-time fraud detection pipeline using Kafka, PySpark Structured Streaming, PostgreSQL, and Docker Compose.

    Python