Skip to content
View riya0920's full-sized avatar

Block or report riya0920

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
riya0920/README.md

Riya Soni

Building production ML/AI systems. MS Computer Science @ Stevens Institute of Technology (May 2026).

I work on retrieval pipelines, agent architectures, and applied interpretability. Currently Data Analyst at Giraffe Media Group, where I build SMS analytics infrastructure on Redshift and ship internal AI tooling. Mechanistic interpretability research on GPT-2 advised by Matthew Finch (Senior ML Engineer, Bloomberg LP).

Open to ML/AI Engineer roles starting May 2026.

Featured projects

Earnings Intelligence Platform RAG system over SEC 10-K filings with rigorous evaluation across 12 retrieval configurations. Best composite score 0.870 using semantic chunking with hybrid retrieval and a cross-encoder reranker, evaluated via RAGAS on AAPL, MSFT, GOOGL, NVDA, META. ChromaDB BM25 GPT-4o-mini MLflow sentence-transformers SEC EDGAR API

Self-Healing RL Pipeline Four-agent A2A architecture (Monitor, Diagnostics, Repair, Verification) using DQN for autonomous recovery from data drift, tested across simulated subreddit phase shifts. PyTorch FastAPI LangSmith

Job Hunter Automated job discovery scraping 55+ sources with H-1B sponsorship filtering, match scoring, Google Sheets output, Gmail digests, and push notifications. Deployed on GitHub Actions. Python GitHub Actions ntfy.sh Google Workspace API

Currently exploring

Mechanistic interpretability of decoder-only transformers. Greedy sequential SVD pruning of GPT-2 attention heads achieving 6.15% loss improvement with zero retraining; scaling experiments to GPT-2 Medium.

Stack

Python PyTorch LangChain LangGraph ChromaDB FastAPI TypeScript SQL (Redshift, PostgreSQL) AWS Docker GitHub Actions

Reach me

LinkedIn · Email · Hoboken, NJ

Pinned Loading

  1. earnings-intelligence-platform earnings-intelligence-platform Public

    Self-evaluating RAG benchmark over SEC 10-K filings. 12 configurations evaluated with RAGAS, MLflow, and a hybrid retriever with cross-encoder reranking.

    Python

  2. self-healing-rl-pipeline self-healing-rl-pipeline Public

    Autonomous self-healing recommendation system. Multi-agent A2A architecture with Monitor, Diagnostics, Repair, and Verification agents that detect drift, diagnose root cause, and retrain a DQN poli…

    Python

  3. job-hunter job-hunter Public

    Automated AI/ML job discovery and alerting system. Scrapes 30+ sources, scores against your resume, deduplicates, and pushes notifications. Runs serverlessly for $0/month on GitHub Actions, GCF, or…

    Python 1