Stars
Sky-T1: Train your own O1 preview model within $450
Scalable RL solution for advanced reasoning of language models
A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/
Fantastic Data Engineering for Large Language Models
Recipes to scale inference-time compute of open models
Ongoing research training transformer models at scale
Build resilient language agents as graphs.
🦜🔗 Build context-aware reasoning applications
Simple, unified interface to multiple Generative AI providers
A curated, but incomplete, list of data-centric AI resources.
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement …
🤖 MLE-Agent: Your intelligent companion for seamless AI engineering and research. 🔍 Integrate with arxiv and paper with code to provide better code/research plans 🧰 OpenAI, Anthropic, Ollama, etc s…
150+ quantitative finance Python programs to help you gather, manipulate, and analyze stock market data
A curated list of insanely awesome libraries, packages and resources for Quants (Quantitative Finance)
Official code for paper: Chain of Ideas: Revolutionizing Research via Novel Idea Development with LLM Agents
DSBench: How Far are Data Science Agents from Becoming Data Science Experts?
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are commi…
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
AIDE: AI-Driven Exploration in the Space of Code. State of the Art machine Learning engineering agents that automates AI R&D.
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
A platform for developers to simulate collaborative research activities
A modular high-level library to train embodied AI agents across a variety of tasks and environments.
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)