Stars
No fortress, purely open ground. OpenManus is Coming.
An AI web browsing framework focused on simplicity and extensibility.
Make websites accessible for AI agents
Simple package to extract text with coordinates from programmatic PDFs
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
A terminal workspace with batteries included
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
DSPy: The framework for programming—not prompting—language models
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
High-speed Large Language Model Serving for Local Deployment
NeuroSurgeon is a package that enables researchers to uncover and manipulate subnetworks within models in Huggingface Transformers
A collection of GPT system prompts and various prompt injection/leaking knowledge.
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning
YaRN: Efficient Context Window Extension of Large Language Models
ripgrep recursively searches directories for a regex pattern while respecting your gitignore
NeoVim dark colorscheme inspired by the colors of the famous painting by Katsushika Hokusai.
Fast inference engine for Transformer models
Sync your Kindle notes and highlights directly into your Obsidian vault
Reverse Instructions to generate instruction tuning data with corpus examples
A collection of open-source datasets to train instruction-following LLMs (ChatGPT, LLaMA, Alpaca)
Using the Whisper API to transcribe YouTube videos.