Stars
An open-source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
Simplifying reinforcement learning for complex game environments
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
Training Large Language Models to Reason in a Continuous Latent Space
A command-line tool to generate GitHub and GitLab activity graphs.
[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
A comprehensive repository of reasoning tasks for LLMs (and beyond)
Decentralized deep learning in PyTorch. Built to train models on thousands of volunteers across the world.
Large-scale LLM inference engine
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in PyTorch
synapse-alpha / mathgenerator
Forked from lukew3/mathgenerator
A math problem generator, created for the purpose of giving self-studying students and teaching organizations the means to easily get access to high-quality, generated math problems to suit their n…
Submodule of evalverse forked from [google-research/instruction_following_eval](https://github.com/google-research/google-research/tree/master/instruction_following_eval)
An MIT-licensed, deployable starter kit for building and customizing your own version of AI town - a virtual town where AI characters live, chat and socialize.
Repository containing the SPIN experiments on the DIBT 10k ranked prompts
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vectors
Tools for merging pretrained large language models.
Just a bunch of benchmark logs for different LLMs
DSPy: The framework for programming—not prompting—language models