[Proposal/Bug Fix] Change truncation to termination in Car Racing after finishing a lap #106

RedTachyon · 2022-11-02T17:19:30Z

Proposal

Currently in Car Racing, when the agent finishes a lap, the environment is marked as truncated instead of terminated. This seems like a really odd choice to me.

Gymnasium/gymnasium/envs/box2d/car_racing.py

Lines 557 to 561 in cc7d8dd

    
           if self.tile_visited_count == len(self.track) or self.new_lap: 
        
               # Truncation due to finishing lap 
        
               # This should not be treated as a failure 
        
               # but like a timeout 
        
               truncated = True

This was added in openai/gym#2890 alongside an actual fix to the environment logic. I suspect the review focused on the bug fix, and omitted the undiscussed change, so it slipped through the cracks. (BTW now you see why I'm always being annoying about out-of-scope changes in PRs and similar stuff)

Finishing a lap is a very clear example for episode termination. You reach a terminal state after making a full loop, and the episode ends. It should never have been marked as truncation.

The annoying part the environment version was bumped for this (but also for the actual bug), so we'll have to bump it again. But I can't really see any justification for keeping this marked as truncation, which is inconsistent with the entire rationale for what truncation is meant to be (reaching the time limit). The explanation in some of the comments was that finishing the lap shouldn't be treated as a failure, but termination does not imply failure. Failure or success is defined by the reward. Termination says "You're done, nothing more to do". Truncation says "You took too long, try again".

@pseudo-rnd-thoughts @jkterry1 @araffin

Markus28 · 2022-11-03T19:22:28Z

I don't think it's an MDP if you change it to be a termination since the observation doesn't really carry any information about whether a lap has been completed/how much of a lap has been completed (or does it, idk?). So making it a termination may be problematic, at least that would be my understanding.

RedTachyon · 2022-11-03T20:34:43Z

I don't think it's an MDP if you change it to be a termination

I don't really buy this. The observation is a pixel rendering in a radius around the environment, so it's hardly an MDP in the first place.

RedTachyon added enhancement New feature or request bug Something isn't working labels Nov 2, 2022

RedTachyon mentioned this issue Nov 2, 2022

Re-add timelimit truncation information #101

Closed

10 tasks

pseudo-rnd-thoughts mentioned this issue Nov 12, 2022

[Question] The params self.new_lap seems not used on the truncated condition in CarRacing-v2? openai/gym#3149

Open

Kallinteris-Andreas mentioned this issue Dec 14, 2023

Change end-of-episode in CarRacing to termination as opposed to truncation #813

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Proposal/Bug Fix] Change truncation to termination in Car Racing after finishing a lap #106

[Proposal/Bug Fix] Change truncation to termination in Car Racing after finishing a lap #106

RedTachyon commented Nov 2, 2022 •

edited

Markus28 commented Nov 3, 2022 •

edited

RedTachyon commented Nov 3, 2022

[Proposal/Bug Fix] Change truncation to termination in Car Racing after finishing a lap #106

[Proposal/Bug Fix] Change truncation to termination in Car Racing after finishing a lap #106

Comments

RedTachyon commented Nov 2, 2022 • edited

Proposal

Markus28 commented Nov 3, 2022 • edited

RedTachyon commented Nov 3, 2022

RedTachyon commented Nov 2, 2022 •

edited

Markus28 commented Nov 3, 2022 •

edited