Skip to content
View ayushsi42's full-sized avatar

Highlights

  • Pro

Organizations

@vlgiitr

Block or report ayushsi42

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ayushsi42/README.md

Hi, I'm Ayush Singh

Currently in my 2nd year at IIT Roorkee.

ayush-singh-iitr

  • 🔭 I'm currently working on improving the reasoning abilities of language models through RL techniques.

  • 💬 Ask me about Large Language Models, and anything related to AI.

  • 📫 Reach me here - ayushsingh73920@gmail.com

Read my papers here !!!

  • IPO: Your Language Model is Secretly a Preference Classifier - Paper

    • ACL MAIN CONFERENCE, 2025
  • LoRA-Mini : Adaptation Matrices Decomposition and Selective Training - Paper

    • AAAI CoLoRAi Workshop, 2024
  • Adaptive Urban Planning: A Hybrid Framework for Balanced City Development - Paper

    • AAAI AI4UP Workshop, 2024

Connect with me:

ayush_singh_iitr ayush-singh-iitr

Pinned Loading

  1. Gflownet-Guided-RLHF Gflownet-Guided-RLHF Public

    Python 1

  2. shivank21/Implicit_Preference_Optimization shivank21/Implicit_Preference_Optimization Public

    https://arxiv.org/pdf/2502.16182v2

    Python 4 1

  3. vlgiitr/Are-VLMs-Really-Blind vlgiitr/Are-VLMs-Really-Blind Public

    Python 4