ChaoyuWang04

Follow

Chaoyu Wang ChaoyuWang04

Follow

17 followers · 22 following

03:29 (UTC -06:00)

Achievements

Achievements

ChaoyuWang04/README.md

Hi, I'm Chaoyu Wang 👋

I'm an AI/ML engineer focused on LLM alignment and fine-tuning,
currently self-studying at UC Berkeley and building toward AI research.

🔭 What I'm working on

Agent SFT pipelines with LoRA fine-tuning (Qwen3, custom tool-call alignment)
RAG systems with hybrid retrieval (BM25 + Dense, Cross-Encoder reranking)
GRPO/RLHF alignment for domain-specific LLMs

🛠️ Tech Stack

ML/AI: PyTorch · HuggingFace Transformers · vLLM · LLaMA Factory
Infra: RunPod · FastAPI · Docker
Dev: Python · Next.js · PostgreSQL

🎓 Background

M.S. Engineering & Applied Mathematics — Northwestern University (2025)
B.S. Applied Mathematics - University of California, San Diego (2024)
Ex-intern @ GuruGame HK

📫 Reach me

Pinned Loading

AdCampaignAgent-SFT AdCampaignAgent-SFT Public

A synthetic SFT dataset pipeline for training tool-calling agents for Mobile Game User Acquisition.

Python
FinRAG-GRPO FinRAG-GRPO Public

Reasoning Reward Model trained via GRPO on synthetic Chinese customer service preference data. Generates evaluation rationale before outputting preference labels, reducing reward hacking vs. scalar…

Python
promptgen-next promptgen-next Public

AI-powered prompt generation and image production system with template-driven workflows, multi-provider orchestration, and multilingual image stitching.

TypeScript 2
MonitorSysUA MonitorSysUA Public

Internal full-stack monitoring system for Google Ads operations, AppsFlyer cohort analytics, and evaluation-driven optimization workflows.

TypeScript 1
Chaoyu-Personal-Web Chaoyu-Personal-Web Public

Personal Web Page

TypeScript