Skip to content

Commit

Permalink
minor type fix
Browse files Browse the repository at this point in the history
  • Loading branch information
engmubarak48 committed Jun 6, 2024
1 parent a5849c3 commit 7fcbbd7
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -134,7 +134,7 @@ So far, we've discussed how to manually set actions or use random actions in the

As the agent interacts with the environment, it collects data about the outcomes of its actions. This data is used to train a policy network, which models the probability distribution of possible actions given the current state. Over time, the policy network learns to favor actions that lead to more successful outcomes with higher reward, optimizing the agent's performance.

8. Sample a batch of trajectories from a trained agent
9. Sample a batch of trajectories from a trained agent

```python
batch, _ = gflownet.sample_batch(n_forward=3, train=False)
Expand Down

0 comments on commit 7fcbbd7

Please sign in to comment.