-
Solopreneur (AI/ML, SaaS, B2B)
- Kansas City, MO
-
21:38
(UTC -06:00) - @GoDjMike
- in/mjbrummett
🦙 LLMs
LongRoPE is a novel method that can extends the context window of pre-trained LLMs to an impressive 2048k tokens.
High-performance In-browser LLM Inference Engine
A playbook for systematically maximizing the performance of deep learning models.
Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastructure.
A language for constraint-guided and efficient LLM programming.
🦙 Integrating LLMs into structured NLP pipelines
Official inference framework for 1-bit LLMs
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
📋 A list of open LLMs available for commercial use.
A diverse, simple, and secure all-in-one LLMOps platform
DSPy: The framework for programming—not prompting—language models
Composio powers 1000+ toolkits, tool search, context management, authentication, and a sandboxed workbench to help you build AI agents that turn intent into action.
An AI memory layer with short- and long-term storage, semantic clustering, and optional memory decay for context-aware applications.
Open source Claude Artifacts – built with Llama 3.1 405B
A self-organizing file system with llama 3
[EMNLP 2023] Adapting Language Models to Compress Long Contexts
Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.
A very simple tool to build LLM prompts from your code repositories.
LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progres…
Embed machine learning models in your Dockerfile
Enforce the output format (JSON Schema, Regex etc) of a language model
✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows


