ayushsi42

Follow

Ayush Singh ayushsi42

Follow

8 followers · 6 following

@adobe

Highlights

Pro

Organizations

ayushsi42/README.md

Hi, I'm Ayush Singh

Currently in my 2nd year at IIT Roorkee.

🔭 I'm currently working on improving the reasoning abilities of language models through RL techniques.
💬 Ask me about Large Language Models, and anything related to AI.
📫 Reach me here - ayushsingh73920@gmail.com

Read my papers here !!!

IPO: Your Language Model is Secretly a Preference Classifier - Paper
- ACL MAIN CONFERENCE, 2025
LoRA-Mini : Adaptation Matrices Decomposition and Selective Training - Paper
- AAAI CoLoRAi Workshop, 2024
Adaptive Urban Planning: A Hybrid Framework for Balanced City Development - Paper
- AAAI AI4UP Workshop, 2024

Connect with me:

Pinned Loading

Gflownet-Guided-RLHF Gflownet-Guided-RLHF Public

Python 1
shivank21/Implicit_Preference_Optimization shivank21/Implicit_Preference_Optimization Public

https://arxiv.org/pdf/2502.16182v2

Python 4 1
vlgiitr/Are-VLMs-Really-Blind vlgiitr/Are-VLMs-Really-Blind Public

Python 4