oroszgy

Organizations

@ec-doris @huspacy

Stars

LLM

72 repositories

A language for constraint-guided and efficient LLM programming.

Python 4,154 219 Updated May 22, 2025

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

10,161 785 Updated May 31, 2024

LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development

Python 21 5 Updated Jul 24, 2023

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 71,138 13,688 Updated Feb 25, 2026

[Unmaintained, see README] An ecosystem of Rust libraries for working with large language models

Rust 6,150 373 Updated Jun 24, 2024

LLM inference in C/C++

C++ 95,825 15,061 Updated Feb 25, 2026

Python bindings for llama.cpp

Python 10,003 1,311 Updated Aug 15, 2025

Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback

Python 96 3 Updated Aug 18, 2023

Structured Outputs

Python 13,456 667 Updated Feb 13, 2026

🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

MDX 70,781 7,542 Updated Feb 1, 2026

Tensor library for machine learning

C++ 14,131 1,495 Updated Feb 25, 2026

Extend existing LLMs way beyond the original training length with constant memory usage, without retraining

Python 737 44 Updated Apr 10, 2024

A repository for research on medium sized language models.

Python 533 78 Updated Jun 6, 2025

Explore and interpret large embeddings in your browser with interactive visualization! 📍

TypeScript 518 34 Updated Jan 22, 2026

Weak Labeling (NER) using ChatGPT

Python 37 7 Updated Mar 28, 2023

Machine Learning Engineering Open Book

Python 17,091 1,072 Updated Feb 21, 2026

Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing…

MDX 24,309 2,622 Updated Feb 25, 2026

Data and tools for generating and inspecting OLMo pre-training data.

Python 1,412 167 Updated Nov 5, 2025

🛰️ An approximate nearest-neighbor search library for Python and Java with a focus on ease of use, simplicity, and deployability.

C++ 1,546 78 Updated Feb 24, 2026

GitHub repository for the paper "Is ChatGPT the Ultimate Data Augmentation Algorithm" published in Findings of EMNLP 2023

Python 1 Updated Oct 17, 2023

Adala: Autonomous DAta (Labeling) Agent framework

Python 1,364 124 Updated Feb 24, 2026

[EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.

Python 111 12 Updated May 16, 2024

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 14,374 988 Updated Feb 21, 2026

OpenChat: Advancing Open-source Language Models with Imperfect Data

Python 5,472 433 Updated Sep 13, 2024

Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with co…

TypeScript 10,633 941 Updated Feb 25, 2026

Mamba SSM architecture

Python 17,239 1,598 Updated Feb 18, 2026

Enhanced ChatGPT Clone: Features Agents, MCP, DeepSeek, Anthropic, AWS, OpenAI, Responses API, Azure, Groq, o1, GPT-5, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message…

TypeScript 34,105 6,885 Updated Feb 25, 2026

Robust recipes to align language models with human and AI preferences

Python 5,506 471 Updated Sep 8, 2025

Go ahead and axolotl questions

Python 11,332 1,257 Updated Feb 25, 2026

Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali

Python 2,680 179 Updated Feb 5, 2026