marcodelpin

Follow

marcodelpin

Follow

15 followers · 56 following

Stars

Distill

5 repositories

MinishLab / model2vec

Fast State-of-the-Art Static Embeddings

Python 1,095 49 Updated Mar 2, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 4,737 456 Updated Mar 14, 2025

deepseek-ai / ESFT

Expert Specialized Fine-Tuning

Python 586 244 Updated Sep 22, 2024

Om-Alve / smolGPT

Python 1,311 101 Updated Feb 15, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 12,486 1,687 Updated Mar 13, 2025