Stars

LLM

628 repositories

Fine-Tuning LLM and embedding models

Jupyter Notebook 27 7 Updated Sep 12, 2023

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Python 2,693 293 Updated Aug 14, 2024
Jupyter Notebook 471 82 Updated Apr 4, 2021

Repo accompanying PEFT/LoRA article.

Jupyter Notebook 9 5 Updated Apr 29, 2024

FinSight - Financial Insights at Your Fingertip: FinSight is a cutting-edge AI assistant tailored for portfolio managers, investors, and finance enthusiasts. It streamlines the process of gaining c…

Jupyter Notebook 212 75 Updated Jul 10, 2024

DSPy: The framework for programming—not prompting—language models

Python 32,368 2,642 Updated Feb 23, 2026

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python 9,031 883 Updated Feb 21, 2026

Minimal keyword extraction with BERT

Python 4,114 376 Updated Feb 3, 2026

Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.

Go 163,256 14,649 Updated Feb 24, 2026

LLM papers I'm reading, mostly on inference and model compression

750 38 Updated Dec 21, 2023

AirLLM 70B inference with a single 4GB GPU

Jupyter Notebook 12,513 1,156 Updated Sep 3, 2025

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

75,588 8,705 Updated Feb 5, 2026

Streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

Python 2,659 221 Updated Feb 23, 2026

Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

Python 6,184 571 Updated Aug 22, 2025

Distribute and run LLMs with a single file.

C 23,742 1,266 Updated Feb 23, 2026

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,795 3,343 Updated Feb 24, 2026

A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.

Jupyter Notebook 2,980 195 Updated Feb 6, 2026

Examples in the MLX framework

Python 8,263 1,124 Updated Feb 12, 2026

Code for the video on feed-forward language model

Python 73 2 Updated Jan 17, 2024

The LLM Evaluation Framework

Python 13,783 1,257 Updated Feb 23, 2026

[ECCV 2024] Tokenize Anything via Prompting

Jupyter Notebook 603 25 Updated Dec 11, 2024

[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling

Python 1,825 126 Updated Jul 10, 2024

WhisperPlus: Faster, Smarter, and More Capable 🚀

Python 1,937 147 Updated Dec 1, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 67,495 8,220 Updated Feb 24, 2026

Best practices for distilling large language models.

Jupyter Notebook 606 52 Updated Feb 1, 2024

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, load balancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…

Python 36,718 5,943 Updated Feb 24, 2026

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Python 3,727 309 Updated May 21, 2025

A lightweight UI for interacting with the Zoo Text-to-CAD API.

Svelte 258 35 Updated Jan 16, 2026

This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resulting model is meant to follow instructions and chat in Hindi and…

Jupyter Notebook 23 3 Updated Dec 23, 2023

Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"

Python 1,065 77 Updated Mar 7, 2024