📈
Putting the ML back in AI

Stars

🦙 LLMs

Utilities, runtimes, local inference, etc.
57 repositories

LongRoPE is a novel method that extends the context window of pre-trained LLMs to an impressive 2048k tokens.

Python 279 22 Updated Oct 28, 2025

High-performance In-browser LLM Inference Engine

TypeScript 17,432 1,209 Updated Feb 18, 2026

Python 1,520 174 Updated Nov 9, 2023

A playbook for systematically maximizing the performance of deep learning models.

29,862 2,419 Updated Jun 18, 2024

Build applications that make decisions (chatbots, agents, simulations, etc.). Monitor, trace, persist, and execute on your own infrastructure.

Python 1,929 113 Updated Feb 23, 2026

A language for constraint-guided and efficient LLM programming.

Python 4,154 219 Updated May 22, 2025

🦙 Integrating LLMs into structured NLP pipelines

Python 1,365 105 Updated Jan 8, 2025

Official inference framework for 1-bit LLMs

Python 28,600 2,351 Updated Feb 3, 2026

LLM inference in C/C++

C++ 95,894 15,074 Updated Feb 26, 2026

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 12,947 2,127 Updated Feb 26, 2026

An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.

Python 5,026 531 Updated Apr 11, 2025

A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning

20,181 2,521 Updated Feb 9, 2026

📋 A list of open LLMs available for commercial use.

12,646 957 Updated Feb 13, 2025

Use your Neovim like using Cursor AI IDE!

Lua 17,436 803 Updated Feb 23, 2026

A diverse, simple, and secure all-in-one LLMOps platform

Go 109 27 Updated Sep 21, 2024

DSPy: The framework for programming—not prompting—language models

Python 32,393 2,650 Updated Feb 26, 2026

Composio powers 1000+ toolkits, tool search, context management, authentication, and a sandboxed workbench to help you build AI agents that turn intent into action.

TypeScript 27,191 4,457 Updated Feb 26, 2026

An AI memory layer with short- and long-term storage, semantic clustering, and optional memory decay for context-aware applications.

Python 682 59 Updated Jan 16, 2025

Open source Claude Artifacts – built with Llama 3.1 405B

TypeScript 6,877 1,644 Updated Feb 23, 2026

A self-organizing file system with Llama 3

TypeScript 5,715 386 Updated Aug 8, 2025

[EMNLP 2023] Adapting Language Models to Compress Long Contexts

Python 330 26 Updated Sep 9, 2024

Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.

Python 21,263 2,221 Updated Feb 24, 2026

A very simple tool to build LLM prompts from your code repositories.

Shell 155 11 Updated Jul 17, 2025

LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progres…

Go 5,962 368 Updated Dec 11, 2025

Fast State-of-the-Art Static Embeddings

Python 2,003 116 Updated Feb 13, 2026

Embed machine learning models in your Dockerfile

TypeScript 102 8 Updated Feb 2, 2026

Enforce the output format (JSON Schema, regex, etc.) of a language model

Python 1,992 82 Updated Aug 24, 2025

✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows

TypeScript 87,378 60,328 Updated Dec 2, 2025

Local AI API Platform

C++ 2,758 180 Updated Jul 4, 2025