Skip to content
View j-sperling's full-sized avatar

Block or report j-sperling

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Jupyter Notebook 21,112 6,103 Updated Jul 13, 2023

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 294,911 49,034 Updated Dec 2, 2024

Access large language models from the command-line

Python 6,786 391 Updated Mar 28, 2025

Official implementation of DEMO3

Python 37 Updated Mar 18, 2025

DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking

Python 40 6 Updated Mar 4, 2025

What are the differences between the transaction isolation levels in databases? This is a suite of test cases which differentiate isolation levels.

2,549 189 Updated Oct 14, 2024

The Art of Problem-Solving in Software Engineering: How to Make MySQL Better

1,505 134 Updated Mar 1, 2025

[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone

Python 1,209 62 Updated Mar 29, 2025

GiGL is an open-source library for training and inference of Graph Neural Networks at very large (billion) scale.

Scala 36 1 Updated Mar 29, 2025

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 6,935 356 Updated Mar 11, 2025

MLGym A New Framework and Benchmark for Advancing AI Research Agents

Python 459 44 Updated Mar 28, 2025

IIG-RL-Benchmark is a library for training and evaluating game theoretical or deep RL algorithms on OpenSpiel games.

Python 12 Updated Feb 14, 2025

Exploitability calculation for imperfect-information game benchmarks

C++ 23 2 Updated Feb 20, 2025

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

C++ 4,457 975 Updated Mar 28, 2025

The repository to showcase the best framework for tabular data - the Awesome CatBoost

231 24 Updated Mar 23, 2025

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 8,994 1,550 Updated Mar 27, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 36,274 6,167 Updated Mar 29, 2025

An open-source RAG-based tool for chatting with your documents.

Python 21,833 1,718 Updated Feb 14, 2025

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

Python 22,893 9,610 Updated Feb 9, 2025

RLHF implementation details of OAI's 2019 codebase

Python 184 9 Updated Jan 14, 2024

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

Python 733 109 Updated Mar 23, 2024

An extremely fast Python package and project manager, written in Rust.

Rust 46,969 1,319 Updated Mar 29, 2025

An extremely fast Python linter and code formatter, written in Rust.

Rust 37,306 1,268 Updated Mar 29, 2025

Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)

Python 39 1 Updated Aug 22, 2023

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 532 26 Updated Feb 10, 2024

Orchestrator for running Tart Virtual Machines on a cluster of Apple Silicon devices

Go 216 17 Updated Mar 27, 2025

A guided tour on how to use HuggingFace large language models on Macs with Apple Silicon

Jupyter Notebook 152 17 Updated Mar 21, 2025

CoreNet: A library for training deep neural networks

Jupyter Notebook 6,999 545 Updated Oct 14, 2024

Fully open data curation for reasoning models

Python 1,592 137 Updated Mar 16, 2025
Next
Showing results