Skip to content
View rmusser01's full-sized avatar
💯
¯\_(ツ)_/¯
💯
¯\_(ツ)_/¯

Block or report rmusser01

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

LLM-Research

88 repositories

LLM for Long Text Summary (Comprehensive Bulleted Notes)

Python 615 48 Updated Jul 5, 2025

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

Python 74,780 8,344 Updated Mar 11, 2026

A curated compilation of AI-driven generative music resources and projects. Explore the blend of machine learning algorithms and musical creativity.

445 36 Updated Dec 23, 2025

1.58 Bit LLM on Apple Silicon using MLX

Python 248 31 Updated May 10, 2024

Your buddy in the (L)LM space.

Python 64 5 Updated Sep 20, 2024

A framework for few-shot evaluation of language models.

Python 11,654 3,084 Updated Mar 5, 2026

OpenChat: Advancing Open-source Language Models with Imperfect Data

Python 5,475 433 Updated Sep 13, 2024

Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.

3,527 238 Updated Jan 26, 2026

Erasing concepts from neural representations with provable guarantees

Python 244 15 Updated Jan 27, 2025

[CVPR 2024] "MACE: Mass Concept Erasure in Diffusion Models" (Official Implementation)

Jupyter Notebook 393 31 Updated Jun 2, 2025
Jupyter Notebook 12 1 Updated Oct 23, 2022
Jupyter Notebook 36 4 Updated Jul 14, 2022

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 3,458 302 Updated Jul 17, 2025

Public repo to document some SPR stuff

793 148 Updated Nov 7, 2023

[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.

Python 5,902 351 Updated Oct 28, 2025

A vector search engine on top of Lucene

Python 20 1 Updated May 7, 2023

[CCS'24] A dataset consists of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).

Jupyter Notebook 3,594 320 Updated Dec 24, 2024

A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (p…

2,333 231 Updated Jan 29, 2026

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

Python 1,473 124 Updated Nov 13, 2025

High-speed Large Language Model Serving for Local Deployment

C++ 8,797 500 Updated Jan 24, 2026

Implementation for MatMul-free LM.

Python 3,058 199 Updated Dec 2, 2025

[NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.

Python 134 9 Updated Mar 21, 2025

[ACL 2024] Progressive LLaMA with Block Expansion.

Python 514 40 Updated May 20, 2024

WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.

Python 1,555 140 Updated Jan 31, 2026

A pure and fast NumPy implementation of Mamba with cache support.

Python 18 1 Updated Jun 16, 2024

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python 2,085 117 Updated Jul 29, 2024

Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'

Python 234 26 Updated Jul 19, 2025

Code for the paper 🌳 Tree Search for Language Model Agents

Python 220 24 Updated Jul 25, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 22,041 2,690 Updated Jan 23, 2026

[Paper List] Papers integrating knowledge graphs (KGs) and large language models (LLMs)

2,151 152 Updated Mar 2, 2026