j-sperling

j-sperling

Achievements

Stars

dennybritz / reinforcement-learning

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Jupyter Notebook 21,112 6,103 Updated Jul 13, 2023

donnemartin / system-design-primer

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 294,911 49,034 Updated Dec 2, 2024

simonw / llm

Access large language models from the command-line

Python 6,786 391 Updated Mar 28, 2025

adrialopezescoriza / demo3

Official implementation of DEMO3

Python 37 Updated Mar 18, 2025

Li-Z-Q / DeepSolution

DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking

Python 40 6 Updated Mar 4, 2025

ept / hermitage

What are the differences between the transaction isolation levels in databases? This is a suite of test cases which differentiate isolation levels.

2,549 189 Updated Oct 14, 2024

enhancedformysql / The-Art-of-Problem-Solving-in-Software-Engineering_How-to-Make-MySQL-Better

The Art of Problem-Solving in Software Engineering: How to Make MySQL Better

1,505 134 Updated Mar 1, 2025

NVlabs / MambaVision

[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone

Python 1,209 62 Updated Mar 29, 2025

snap-research / GiGL

GiGL is an open-source library for training and inference of Graph Neural Networks at very large (billion) scale.

Scala 36 1 Updated Mar 29, 2025

anthropics / claude-code

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 6,935 356 Updated Mar 11, 2025

facebookresearch / MLGym

MLGym A New Framework and Benchmark for Advancing AI Research Agents

Python 459 44 Updated Mar 28, 2025

nathanlct / IIG-RL-Benchmark

IIG-RL-Benchmark is a library for training and evaluating game theoretical or deep RL algorithms on OpenSpiel games.

Python 12 Updated Feb 14, 2025

gabrfarina / exp-a-spiel

Exploitability calculation for imperfect-information game benchmarks

C++ 23 2 Updated Feb 20, 2025

google-deepmind / open_spiel

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

C++ 4,457 975 Updated Mar 28, 2025

valeman / Awesome_CatBoost

The repository to showcase the best framework for tabular data - the Awesome CatBoost

231 24 Updated Mar 23, 2025

triton-inference-server / server

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 8,994 1,550 Updated Mar 27, 2025

ray-project / ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 36,274 6,167 Updated Mar 29, 2025

Cinnamon / kotaemon

An open-source RAG-based tool for chatting with your documents.

Python 21,833 1,718 Updated Feb 14, 2025

pytorch / examples

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

Python 22,893 9,610 Updated Feb 9, 2025

valeman / Awesome_Math_Books

1,523 141 Updated Mar 28, 2025

vwxyzjn / lm-human-preference-details

RLHF implementation details of OAI's 2019 codebase

Python 184 9 Updated Jan 14, 2024

vwxyzjn / ppo-implementation-details

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

Python 733 109 Updated Mar 23, 2024

astral-sh / uv

An extremely fast Python package and project manager, written in Rust.

Rust 46,969 1,319 Updated Mar 29, 2025

astral-sh / ruff

An extremely fast Python linter and code formatter, written in Rust.

Rust 37,306 1,268 Updated Mar 29, 2025

corl-team / katakomba

Forked from tinkoff-ai/katakomba

Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)

Python 39 1 Updated Aug 22, 2023

corl-team / CORL

Forked from tinkoff-ai/CORL

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 532 26 Updated Feb 10, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

j-sperling

Achievements

Achievements

Block or report j-sperling

Stars

dennybritz / reinforcement-learning

donnemartin / system-design-primer

simonw / llm

adrialopezescoriza / demo3

Li-Z-Q / DeepSolution

ept / hermitage

enhancedformysql / The-Art-of-Problem-Solving-in-Software-Engineering_How-to-Make-MySQL-Better

NVlabs / MambaVision

snap-research / GiGL

anthropics / claude-code

facebookresearch / MLGym

nathanlct / IIG-RL-Benchmark

gabrfarina / exp-a-spiel

google-deepmind / open_spiel

valeman / Awesome_CatBoost

triton-inference-server / server

ray-project / ray

Cinnamon / kotaemon

pytorch / examples

valeman / Awesome_Math_Books

vwxyzjn / lm-human-preference-details

vwxyzjn / ppo-implementation-details

astral-sh / uv

astral-sh / ruff

corl-team / katakomba

corl-team / CORL

cirruslabs / orchard

domschl / HuggingFaceGuidedTourForMac

apple / corenet

open-thoughts / open-thoughts