- Germany
-
00:02
- 1h ahead - in/nawaf-alampara-78731612b
- @Iam_Nawaf_
Highlights
- Pro
Lists (14)
Sort Name ascending (A-Z)
Stars
A beautiful, simple, clean, and responsive Jekyll theme for academics
An ecosystem for digital reticular chemistry
📦🚀 Fully automated version management and package publishing
Find unused, missing and transitive dependencies in a Python project.
MLGym A New Framework and Benchmark for Advancing AI Research Agents
Geometric Deep Learning @ University of Cambridge
This repository contains the Hugging Face Agents Course.
Democratizing Reinforcement Learning for LLMs
Ranking LLMs on agentic tasks
vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Train transformer language models with reinforcement learning.
verl: Volcano Engine Reinforcement Learning for LLMs
Gymnasium framework for training language model agents on constructive tasks
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
Synthetic data curation for post-training and structured data extraction
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data
Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors at once; across one or thousands of GPUs.
Fully open reproduction of DeepSeek-R1
🐫 CAMEL: Finding the Scaling Law of Agents. The first and the best multi-agent framework. https://www.camel-ai.org
OpenAI-style proxy server for enabling tool use for models that don't support it natively (like Deepseek R1)
The easiest tool for fine-tuning LLM models, synthetic data generation, and collaborating on datasets.
Official implementation of MatterGen -- a generative model for inorganic materials design across the periodic table that can be fine-tuned to steer the generation towards a wide range of property c…