Stars
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Access large language models from the command-line
DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking
What are the differences between the transaction isolation levels in databases? This is a suite of test cases which differentiate isolation levels.
The Art of Problem-Solving in Software Engineering: How to Make MySQL Better
[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
GiGL is an open-source library for training and inference of Graph Neural Networks at very large (billion) scale.
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
MLGym A New Framework and Benchmark for Advancing AI Research Agents
IIG-RL-Benchmark is a library for training and evaluating game theoretical or deep RL algorithms on OpenSpiel games.
Exploitability calculation for imperfect-information game benchmarks
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
The repository to showcase the best framework for tabular data - the Awesome CatBoost
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
An open-source RAG-based tool for chatting with your documents.
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
RLHF implementation details of OAI's 2019 codebase
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
An extremely fast Python package and project manager, written in Rust.
An extremely fast Python linter and code formatter, written in Rust.
corl-team / katakomba
Forked from tinkoff-ai/katakombaData-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)
corl-team / CORL
Forked from tinkoff-ai/CORLHigh-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
Orchestrator for running Tart Virtual Machines on a cluster of Apple Silicon devices
A guided tour on how to use HuggingFace large language models on Macs with Apple Silicon
CoreNet: A library for training deep neural networks
Fully open data curation for reasoning models