Sea-Snell
🍊 hello


The AI Code Editor

28,957 1,811 Updated Oct 13, 2024
Python 87 10 Updated Mar 13, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,427 1,445 Updated Mar 10, 2025

The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark" [NeurIPS 2024]

Python 212 34 Updated Feb 28, 2025

Aidan Bench attempts to measure <big_model_smell> in LLMs.

Python 288 11 Updated Mar 5, 2025

Orbax provides common checkpointing and persistence utilities for JAX users

Python 355 45 Updated Mar 29, 2025

Recipes to scale inference-time compute of open models

Python 1,050 107 Updated Feb 25, 2025

Large Context Attention

Python 696 53 Updated Jan 24, 2025

Tools for merging pretrained large language models.

Python 5,485 521 Updated Mar 28, 2025

Python logging made (stupidly) simple

Python 21,237 724 Updated Mar 1, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,488 243 Updated Feb 20, 2025

A bibliography and survey of the papers surrounding o1

TeX 1,183 50 Updated Nov 16, 2024

Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"

Python 149 14 Updated Nov 11, 2024

Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

Python 19,892 2,497 Updated Mar 30, 2025

aider is AI pair programming in your terminal

Python 30,245 2,739 Updated Mar 30, 2025

Minimal transformer for arbitrary data (i.e. bio stuff!)

Jupyter Notebook 22 1 Updated Dec 8, 2024

An extremely fast Python package and project manager, written in Rust.

Rust 47,059 1,322 Updated Mar 29, 2025

A plotting tool that outputs Line Rider maps, so you can watch a man on a sled scoot down your loss curves. 🎿

Python 326 5 Updated Aug 23, 2024

Official inference repo for FLUX.1 models

Python 21,095 1,491 Updated Feb 6, 2025

A set of Python scripts that makes your experience on TPU better

Python 50 2 Updated Jul 3, 2024

TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.

Python 20 Updated Jun 24, 2024

Minimal but scalable implementation of large language models in JAX

Python 34 Updated Nov 2, 2024
Jupyter Notebook 83 9 Updated Jan 25, 2025

Read Google Cloud Storage, Azure Blobs, and local paths with the same interface

Python 63 28 Updated Aug 27, 2024

Turn jitted jax functions back into python source code

Python 22 1 Updated Dec 16, 2024

SWE-bench [Multimodal]: Can Language Models Resolve Real-world Github Issues?

Python 2,700 457 Updated Mar 28, 2025

lightweight, standalone C++ inference engine for Google's Gemma models.

C++ 6,322 533 Updated Mar 28, 2025
Python 413 40 Updated Jul 11, 2024