Early environment resets based on agents' respawn status. #167

riccardosavorgnan · 2025-12-05T17:26:14Z

This PR introduces the ability to reset an environment once all agents have respawned at least once. The behavior is controlled through the termination_mode parameter (0 for "Perform H steps, where H is the set max episode length", 1 for "Terminate once all agents have respawned").

The change improves* convergence speed in early training stages and provides slightly better asymptotic performance.

*based on the limited number of experiments ran so far.

…ts in an episode

greptile-apps

Additional Comments (2)

pufferlib/ocean/drive/drive.py, line 253 (link)

logic: termination_mode parameter missing in resampling env_init call (line 234-260), but present in initial call (line 167-189)
pufferlib/resources/drive/puffer_drive_weights.bin

logic: This 2.4MB binary file appears to have been accidentally deleted - it exists on the base branch but is unrelated to the early termination feature

_{5 files reviewed, 3 comments}

_{Edit Code Review Agent Settings | Greptile}

greptile-apps · 2025-12-05T17:30:00Z

pufferlib/ocean/drive/drive.py

                offroad_behavior=self.offroad_behavior,
                dt=dt,
                scenario_length=(int(scenario_length) if scenario_length is not None else None),
+                termination_mode=int(self.termination_mode),


logic: int(None) raises TypeError when termination_mode is not provided

Suggested change

termination_mode=int(self.termination_mode),

termination_mode=(int(self.termination_mode) if self.termination_mode is not None else 0),

Prompt To Fix With AI

This is a comment left during a code review. Path: pufferlib/ocean/drive/drive.py Line: 184:184 Comment: **logic:** `int(None)` raises TypeError when `termination_mode` is not provided ```suggestion termination_mode=(int(self.termination_mode) if self.termination_mode is not None else 0), ``` How can I resolve this? If you propose a fix, please make it concise.

daphne-cornelisse

Looks great!

… early-termination-ricky

eugenevinitsky · 2025-12-08T23:24:11Z

@riccardosavorgnan a note, though it shouldn't block merging. When you suddenly reset the environment like this, it means that the agents don't know when the environment is going to end so it converts it into a problem with a stochastic, unobservable terminal condition. This may work better if we add value truncation when that happens.

daphne-cornelisse · 2025-12-08T23:42:53Z

@riccardosavorgnan a note, though it shouldn't block merging. When you suddenly reset the environment like this, it means that the agents don't know when the environment is going to end so it converts it into a problem with a stochastic, unobservable terminal condition. This may work better if we add value truncation when that happens.

Agree that we should add value truncation. Just wanted to comment that I think the agents currently don’t know when the environment is going to end either.

… early-termination-ricky

…ab/PufferDrive into early-termination-ricky

…for at least num_agents.

… early-termination-ricky

riccardosavorgnan added 2 commits December 5, 2025 17:15

Added early termination parameter based on respawn status of all agen…

1fead26

…ts in an episode

Merge branch 'main' into early-termination-ricky

ccfcd60

greptile-apps bot reviewed Dec 5, 2025

View reviewed changes

riccardosavorgnan assigned daphne-cornelisse, eugenevinitsky and julianh65 Dec 5, 2025

daphne-cornelisse self-requested a review December 5, 2025 22:25

daphne-cornelisse reviewed Dec 6, 2025

View reviewed changes

Merge branch 'main' of https://github.com/Emerge-Lab/PufferDrive into…

74898f8

… early-termination-ricky

daphne-cornelisse approved these changes Dec 6, 2025

View reviewed changes

Emerge-Lab deleted a comment from greptile-apps bot Dec 6, 2025

riccardosavorgnan added 2 commits December 7, 2025 20:10

pre-commit fix

c09345a

fix test

6cfc6f7

daphne-cornelisse added 5 commits December 11, 2025 19:45

Merge branch 'main' of https://github.com/Emerge-Lab/PufferDrive into…

eac5daf

… early-termination-ricky

Merge branch 'early-termination-ricky' of https://github.com/Emerge-L…

9260046

…ab/PufferDrive into early-termination-ricky

Apply precommit.

ea730af

Reduce variance in aggregate metrics by logging only if we have data …

ad1c9a6

…for at least num_agents.

Merge branch 'main' of https://github.com/Emerge-Lab/PufferDrive into…

e9d3ec6

… early-termination-ricky

daphne-cornelisse merged commit 19b5eb6 into main Dec 13, 2025
14 checks passed

daphne-cornelisse deleted the early-termination-ricky branch December 13, 2025 23:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Early environment resets based on agents' respawn status. #167

Early environment resets based on agents' respawn status. #167

Uh oh!

riccardosavorgnan commented Dec 5, 2025 •

edited by daphne-cornelisse

Loading

Uh oh!

greptile-apps bot left a comment •

edited

Loading

Uh oh!

greptile-apps bot Dec 5, 2025

Uh oh!

daphne-cornelisse left a comment •

edited

Loading

Uh oh!

eugenevinitsky commented Dec 8, 2025

Uh oh!

daphne-cornelisse commented Dec 8, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

	termination_mode=int(self.termination_mode),
	termination_mode=(int(self.termination_mode) if self.termination_mode is not None else 0),

Early environment resets based on agents' respawn status. #167

Early environment resets based on agents' respawn status. #167

Uh oh!

Conversation

riccardosavorgnan commented Dec 5, 2025 • edited by daphne-cornelisse Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

greptile-apps bot left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Additional Comments (2)

Uh oh!

greptile-apps bot Dec 5, 2025

Choose a reason for hiding this comment

Uh oh!

daphne-cornelisse left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

eugenevinitsky commented Dec 8, 2025

Uh oh!

daphne-cornelisse commented Dec 8, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

riccardosavorgnan commented Dec 5, 2025 •

edited by daphne-cornelisse

Loading

greptile-apps bot left a comment •

edited

Loading

daphne-cornelisse left a comment •

edited

Loading