n0w0f
A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 12,370 11,628 Updated Mar 2, 2025

An ecosystem for digital reticular chemistry

Python 46 7 Updated Sep 10, 2024

📦🚀 Fully automated version management and package publishing

JavaScript 21,555 1,707 Updated Mar 7, 2025

Find unused, missing and transitive dependencies in a Python project.

Python 1,009 23 Updated Mar 1, 2025

Code for the Molmo Vision-Language Model

Python 316 20 Updated Dec 12, 2024

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Python 411 36 Updated Feb 25, 2025

Geometric Deep Learning @ University of Cambridge

Jupyter Notebook 17 2 Updated Mar 2, 2025

This repository contains the Hugging Face Agents Course.

Jupyter Notebook 13,905 842 Updated Mar 7, 2025

Democratizing Reinforcement Learning for LLMs

Python 1,925 170 Updated Feb 16, 2025

Ranking LLMs on agentic tasks

Jupyter Notebook 95 8 Updated Feb 11, 2025

s1: Simple test-time scaling

Python 5,877 672 Updated Mar 6, 2025

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python 696 85 Updated Mar 7, 2025

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,276 169 Updated Mar 4, 2025

Python 482 15 Updated Feb 27, 2025

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,273 511 Updated May 3, 2024

Train transformer language models with reinforcement learning.

Python 12,305 1,661 Updated Mar 7, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 4,449 415 Updated Mar 7, 2025

Gymnasium framework for training language model agents on constructive tasks

Python 150 18 Updated Mar 3, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,016 1,403 Updated Feb 1, 2025

Synthetic data curation for post-training and structured data extraction

Python 935 64 Updated Mar 7, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 5,375 531 Updated Mar 7, 2025

A replication of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 3,085 227 Updated Feb 19, 2025

Make PyTorch models up to 40% faster! Thunder is a source-to-source compiler for PyTorch. It enables using different hardware executors at once, across one or thousands of GPUs.

Python 1,297 89 Updated Mar 7, 2025

Fully open reproduction of DeepSeek-R1

Python 22,325 2,001 Updated Mar 7, 2025

🐫 CAMEL: Finding the Scaling Law of Agents. The first and the best multi-agent framework. https://www.camel-ai.org

Python 7,094 815 Updated Mar 7, 2025

OpenAI-style proxy server for enabling tool use for models that don't support it natively (like Deepseek R1)

Python 4 Updated Jan 24, 2025

The easiest tool for fine-tuning LLM models, synthetic data generation, and collaborating on datasets.

Python 3,091 209 Updated Mar 7, 2025

Official implementation of MatterGen -- a generative model for inorganic materials design across the periodic table that can be fine-tuned to steer the generation towards a wide range of property c…

Python 1,206 170 Updated Mar 6, 2025