Skip to content
View shivamshinde123's full-sized avatar
💭
At School🏫
💭
At School🏫

Highlights

  • Pro

Block or report shivamshinde123

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
shivamshinde123/README.md

📍 Worcester, MA | currently doing my MS in Data Science at WPI and figuring out how far you can push LLMs before they do something unexpected

Python PyTorch LangChain LangGraph LLM Fine-tuning RAG Multi--Agent Systems MCP MLflow FastAPI React Claude Code

I like building things where AI actually does something useful, not just answers questions. Lately that means getting multiple LLMs to talk to each other, building MCP servers so AI assistants can reach into external tools, and sometimes just making something fun to see if it works.


What I'm Up To

  • 🤖 Deep in multi-agent AI right now, I find it fascinating when you give LLMs different roles and let them argue with each other to produce something
  • 🔬 Built AgentML, an autonomous ML experimentation loop that runs overnight, tries new models and hyperparameters, and only keeps the changes that actually improved the metric
  • 🔌 Building MCP servers to plug real-world data and tools into AI assistants, which is a lot more interesting than it sounds
  • ✍️ Writing about whatever I am learning on Medium, 33+ articles so far and counting

Current Projects 🚀

  • 🥊 Adversarial-LLM-Wiki : Adversarial Wiki compiles two opposing wikis (pro and con) for any topic from your sources or autonomous web research. It debates them on demand, surfacing each side’s hidden assumptions and guiding decisions with iterative clarifying questions. For example: For “remote work,” it builds pro/con wikis and debates “Should our team go fully remote?” to reveal the trade‑offs and hidden assumptions. It debates them on demand, surfacing each side’s hidden assumptions and guiding decisions with iterative clarifying questions.
  • 🕵️‍♂️ AgentPulse : A full-stack analytics system that monitors Claude Code sessions locally, computes metrics, and displays interactive dashboards.
  • 🌌 Shadesmar GitHub Visualizer : Self-contained 3D visualization of any GitHub user's repositories, repos as crystalline trees, contributors as flickering flames, all in a beautiful Shadesmar-inspired scene (from Brandon Sanderson's Stormlight Archive) built with Three.js. Just open the HTML file and enter a username!
  • 🔭 CodeScope : GitHub Action that auto-generates color-coded Mermaid diagrams for pull requests so reviewers can understand the scope of changes at a glance, powered by Claude
  • 🔁 AutoRAGResearch : Autonomous RAG pipeline optimizer that tunes chunk sizes, embedding models, and retrieval configs on its own using a closed LLM feedback loop tracked in MLflow
  • 🤖 AgentML : Autonomous ML experimentation loop where an AI agent iterates on model selection and hyperparameter tuning overnight, commits the wins, and reverts the losses, all tracked in MLflow
  • 🧠 PRLens : Enter any public GitHub PR URL and get an instant LLM-powered report on complexity, effort, AI attribution, and review quality
  • 🦑 MyAnimeList MCP Server : MCP server that plugs anime and manga intelligence directly into AI assistants like Claude Desktop
  • Mythology Worldbuilding Pantheon : Six specialized AI agents that argue, collaborate, and co-author full mythology universes
  • 🎮 Git Visual Playground : Git stops being scary when you can see every branch, merge, and commit happening live
  • 🧩 Custom Agent Skills : Self-contained skill modules that extend Claude, Cursor, and other AI assistants with specialized workflows

Legacy Work


Tech and Tools 🧰

AI, Data Science and Analytics

Development, Infrastructure and Cloud


Latest Tech Blogs 📢

Recent Article 0

Recent Article 1

Recent Article 2


A Few Random Facts

  • 🍜 Big anime fan, ask me for a recommendation and I will send you down a rabbit hole
  • 🎬 Watches animated and comedy movies, nothing too serious
  • 📚 Currently hooked on fantasy novels, ask me for a recommendation (Brandon Sanderson, Robin Hobb, Pierce Brown)
  • 📈 Ranked under 1000 on Kaggle
  • 💬 Happy to talk ML, LLMs, visualizations, anime, or fantasy books, whichever you need

Connect 🤝

LinkedIn Kaggle Gmail Portfolio


Check the repositories and don't forget to drop a star. 👇

⭐ From ShivamShinde

Pinned Loading

  1. AgentML-Project AgentML-Project Public

    Autonomous ML experimentation loop inspired by Andrej Karpathy's autoresearch. An AI agent iterates on model selection and hyperparameter tuning overnight by modifying a single file (train.py), run…

    Jupyter Notebook

  2. Auto-RAG-Research Auto-RAG-Research Public

    Autonomous RAG pipeline optimization system inspired by Andrej Karpathy's autoresearch project. Runs a closed-loop experimentation cycle: tries different RAG hyperparameter configurations, evaluate…

    Python

  3. MyAnimeList_MCP_Server MyAnimeList_MCP_Server Public

    A Model Context Protocol (MCP) server that provides comprehensive access to MyAnimeList data through the Jikan API. This server enables AI assistants to search, retrieve, and analyze anime and mang…

    Python 1

  4. AgentPulse AgentPulse Public

    A full-stack analytics system that monitors Claude Code sessions locally, computes metrics, and displays interactive dashboards

    Python

  5. Battery-Features-Predictions Battery-Features-Predictions Public

    A comprehensive machine learning pipeline for predicting and analyzing battery state of charge (SoC). This project leverages modern ML and experiment tracking workflows to extract insights from raw…

    Jupyter Notebook

  6. PR-Intelligence-Dashboard-Prototype PR-Intelligence-Dashboard-Prototype Public

    A sleek, single‑page dashboard that analyzes GitHub Pull Requests using LLM‑powered intelligence. Enter a public GitHub PR URL and instantly receive a structured intelligence report covering comple…

    Python