Fix/reset before collect in procedural examples, tests and hl experiment #1100

maxhuettenrauch · 2024-04-05T08:29:21Z

I have added the correct label(s) to this Pull Request or linked the relevant issue(s)
I have provided a description of the changes in this Pull Request
I have added documentation for my changes
If applicable, I have added tests to cover my changes.
I have reformatted the code using poe format
I have checked style and types with poe lint and poe type-check
(Optional) I ran tests locally with poe test (or a subset of them with poe test-reduced), and they pass
(Optional) I have tested that documentation builds correctly with poe doc-build

Calling collector.collect currently fails if the collector has not been reset before collecting. I'm not sure whether there is also a bug in the collector docstring: for the reset_before_collect kwarg it says "...It has only an effect if n_episode is not None...", but the failure also happens when collecting with n_step.
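
A minimal sketch of the failure mode described above. `ToyCollector` is a hypothetical stand-in written only for illustration; it is not Tianshou's `Collector`, and its error message and internals are assumptions. It only mirrors the idea that collecting requires a prior reset, either through an explicit `reset()` call or through a `reset_before_collect` flag.

```python
class ToyCollector:
    """Hypothetical stand-in for a collector; NOT Tianshou's implementation."""

    def __init__(self) -> None:
        self._is_reset = False
        self.steps_collected = 0

    def reset(self) -> None:
        # Prepare internal state (e.g. initial observations) for collection.
        self._is_reset = True
        self.steps_collected = 0

    def collect(self, n_step: int, reset_before_collect: bool = False) -> int:
        # Optionally reset first, so callers outside a trainer cannot forget it.
        if reset_before_collect:
            self.reset()
        if not self._is_reset:
            raise RuntimeError("collector must be reset before collecting")
        self.steps_collected += n_step
        return self.steps_collected


# Collecting without a reset fails, regardless of how much is requested.
collector = ToyCollector()
try:
    collector.collect(n_step=10)
    failed = False
except RuntimeError:
    failed = True

# Fix 1: reset explicitly before the first collect (what the examples now do).
collector.reset()
after_explicit_reset = collector.collect(n_step=10)

# Fix 2: let collect perform the reset itself.
fresh = ToyCollector()
after_kwarg_reset = fresh.collect(n_step=10, reset_before_collect=True)
```

In this sketch the explicit reset keeps the intent visible at the call site, which is useful in procedural scripts where no trainer performs the reset on the caller's behalf.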

maxhuettenrauch · 2024-04-05T08:31:30Z

@bordeauxred, @MischaPanch fyi

examples/box2d/lunarlander_dqn.py

bordeauxred · 2024-04-05T08:45:47Z

@MischaPanch the examples are not covered by the tests. Does it make sense to include them in CI/CD, or to run them once a day (if there was at least one commit that day), or something similar?

MischaPanch · 2024-04-05T09:04:54Z

> @MischaPanch the examples are not covered by the tests. Does it make sense to include them in CI/CD, or to run them once a day (if there was at least one commit that day), or something similar?

Once this is merged, we will follow up and write a script which generates the benchmarks that are currently stored in the examples dir and displayed in the docs. Then we can have a manually triggered GH Action that we trigger on each release. Running examples on each PR/daily is too expensive and unnecessary.

MischaPanch

Thanks for spotting!

test/discrete/test_bdq.py

examples/atari/atari_dqn.py

codecov-commenter · 2024-04-05T09:13:55Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 88.22%. Comparing base (8a0629d) to head (9bac3c7).

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #1100      +/-   ##
==========================================
+ Coverage   88.21%   88.22%   +0.01%     
==========================================
  Files         100      100              
  Lines        8304     8305       +1     
==========================================
+ Hits         7325     7327       +2     
+ Misses        979      978       -1

Flag	Coverage Δ
unittests	`88.22% <100.00%> (+0.01%)`	⬆️

tianshou/data/collector.py

MischaPanch · 2024-04-05T09:18:19Z

@maxhuettenrauch I thought the sac_with_il seeding problem was fixed now. Is your branch based on the updated master? Or is there some other source of randomness that we're not controlling yet?

maxhuettenrauch · 2024-04-05T09:19:57Z

> @maxhuettenrauch I thought the sac_with_il seeding problem was fixed now. Is your branch based on the updated master? Or is there some other source of randomness that we're not controlling yet?

It doesn't fail on my machine... and it hasn't failed on other PRs, or has it?

MischaPanch · 2024-04-05T09:22:51Z

>> @maxhuettenrauch I thought the sac_with_il seeding problem was fixed now. Is your branch based on the updated master? Or is there some other source of randomness that we're not controlling yet?

> It doesn't fail on my machine... and it hasn't failed on other PRs, or has it?

It fails occasionally :D, the worst kind of failing. We'll see with your next pushed commit.

maxhuettenrauch · 2024-04-05T09:34:50Z

>>> @maxhuettenrauch I thought the sac_with_il seeding problem was fixed now. Is your branch based on the updated master? Or is there some other source of randomness that we're not controlling yet?

>> It doesn't fail on my machine... and it hasn't failed on other PRs, or has it?

> It fails occasionally :D, the worst kind of failing. We'll see with your next pushed commit.

Weird. It's giving me deterministic results on my machine... but maybe another machine produces different numbers for that seed.

maxhuettenrauch · 2024-04-05T09:48:00Z

Well, I think this still needs some time. The atari_dqn example is broken in another place due to a recent update on master (at least for me).

MischaPanch · 2024-04-05T09:53:53Z

@maxhuettenrauch the L0 notebook fails for some reason, could you check locally pls?

maxhuettenrauch · 2024-04-05T13:51:57Z

> @maxhuettenrauch the L0 notebook fails for some reason, could you check locally pls?

poe doc-build and poe doc-spellcheck seem to run through on my end. Not sure why it's failing.

MischaPanch · 2024-04-15T16:20:03Z

@maxhuettenrauch this is finished, right? I'd merge it

Do you know whether it's related to (or even solves) #1111?

maxhuettenrauch · 2024-04-16T07:17:20Z

This should (hopefully) cover all instances of calling collect before training starts, yes.

Regarding the other issue, I think #1077 broke it.

Maximilian Huettenrauch added 4 commits April 5, 2024 10:05

explicitly call reset on collector before collecting outside of trainer (examples, tests, hl experiment)

4311351

Merge branch 'thuml_master' into fix/collect-in-procedural-examples

bb689f2

Merge branch 'thuml_master' into fix/collect-in-procedural-examples

df24d57

Merge remote-tracking branch 'origin/fix/collect-in-procedural-examples' into fix/collect-in-procedural-examples

9bac3c7

bordeauxred reviewed Apr 5, 2024

examples/box2d/lunarlander_dqn.py (resolved)

MischaPanch reviewed Apr 5, 2024

test/discrete/test_bdq.py (resolved)

examples/atari/atari_dqn.py (resolved)

MischaPanch reviewed Apr 5, 2024

tianshou/data/collector.py (outdated, resolved)

remove comment from doc string in collector.py

03387ff

MischaPanch marked this pull request as ready for review April 5, 2024 09:39

MischaPanch marked this pull request as draft April 5, 2024 10:00

MischaPanch mentioned this pull request Apr 14, 2024

Revisit "warm-up" phase in examples #1112

Open

MischaPanch and others added 4 commits April 15, 2024 00:45

Merge branch 'master' into fix/collect-in-procedural-examples

69988ee

Merge branch 'thuml_master' into fix/collect-in-procedural-examples

6e89715

Merge remote-tracking branch 'origin/fix/collect-in-procedural-examples' into fix/collect-in-procedural-examples

b7a88e5

increase max epochs in test_sac_with_il.py

cf36f10

MischaPanch marked this pull request as ready for review April 15, 2024 16:18

MischaPanch approved these changes Apr 15, 2024

MischaPanch merged commit 60d1ba1 into thu-ml:master Apr 16, 2024
4 checks passed
