minor type fix

alexhernandezgarcia · Jun 6, 2024 · 7fcbbd7 · 7fcbbd7
1 parent a5849c3
commit 7fcbbd7
Showing 1 changed file with 1 addition and 1 deletion.
diff --git a/README.md b/README.md
@@ -134,7 +134,7 @@ So far, we've discussed how to manually set actions or use random actions in the
 
 As the agent interacts with the environment, it collects data about the outcomes of its actions. This data is used to train a policy network, which models the probability distribution of possible actions given the current state. Over time, the policy network learns to favor actions that lead to more successful outcomes with higher reward, optimizing the agent's performance.
 
-8. Sample a batch of trajectories from a trained agent 
+9. Sample a batch of trajectories from a trained agent 
 
 ```python 
 batch, _ = gflownet.sample_batch(n_forward=3,  train=False)