From 5aa7ec651df3989d6e0bb1ce14ac7285d3562e89 Mon Sep 17 00:00:00 2001 From: Joe Cummings Date: Wed, 15 Oct 2025 18:19:46 -0400 Subject: [PATCH 1/2] Fix typo in key features section --- docs/source/index.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/source/index.md b/docs/source/index.md index de594e342..802d62baa 100644 --- a/docs/source/index.md +++ b/docs/source/index.md @@ -7,7 +7,7 @@ Key Features ------------ * **Post-Training Focus**: Specializes in techniques - like Supervised Fine-Tuning (SFT) and Generalized Reward Policy Optimization (GRPO) + like Supervised Fine-Tuning (SFT) and Group Relative Policy Optimization (GRPO) * **PyTorch Integration**: Built natively on PyTorch with dependencies on [PyTorch nightly](https://pytorch.org/get-started/locally/), [Monarch](https://meta-pytorch.org/monarch), [vLLM](https://docs.vllm.ai/en/latest/), From 6691810790e0a17572abbe1315391c4b1dacf428 Mon Sep 17 00:00:00 2001 From: Joe Cummings Date: Wed, 15 Oct 2025 18:22:38 -0400 Subject: [PATCH 2/2] Correct GRPO to Group Relative Policy Optimization --- docs/source/getting_started.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/source/getting_started.md b/docs/source/getting_started.md index 57e1b63c8..3fe46de7e 100644 --- a/docs/source/getting_started.md +++ b/docs/source/getting_started.md @@ -5,5 +5,5 @@ Welcome to TorchForge! This guide will help you get up and running with TorchFor TorchForge specializes in post-training techniques for large language models, including: - **Supervised Fine-Tuning (SFT)**: Adapt pre-trained models to specific tasks using labeled data -- **Generalized Reward Policy Optimization (GRPO)**: Advanced reinforcement learning for model alignment +- **Group Relative Policy Optimization (GRPO)**: Advanced reinforcement learning for model alignment - **Multi-GPU Distributed Training**: Efficient scaling across multiple GPUs and nodes