Backward sampling for continuous torus (ctorus) #193
Conversation
…implement it with step_random(); Remove get_parents test from continuous.
…step() which is called by both forward and backward step().
… add states_from and is_backward args needed for continuous envs backward sampling 3) Rename mask_invalid_actions -> mask 4) Add docstring.
… tests and use instead of get_uniform_... for the tetris env.
# Conflicts:
#	config/logger/base.yaml
#	gflownet/envs/tree.py
Co-authored-by: Michał Koziarski <michal.koziarski@gmail.com>
…ezgarcia/gflownet into backward-sampling-continuous
…ntinuous-mk Changes to continuous backward sampling
…ezgarcia/gflownet into backward-sampling-continuous
```diff
@@ -143,54 +173,106 @@ def get_parents(
         parents = [state]
         return parents, [action]

-    def sample_actions(
+    def action2representative(self, action: Tuple) -> Tuple:
```
If this method returns `self.representative_action` regardless of the action, why is it `action2representative` with an argument `action`, and not just `get_representative_action` (without any arguments)?
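For illustration, the reviewer's suggestion would look something like the sketch below (hypothetical code; storing the representative action as an attribute like `self.representative_action` is an assumption based on the comment above):

```python
from typing import Tuple


class CTorusSketch:
    # Hypothetical attribute holding the single representative action.
    representative_action: Tuple = (0, 0.0)

    def get_representative_action(self) -> Tuple:
        """Return the representative action; no `action` argument is
        needed because the result does not depend on it."""
        return self.representative_action
```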
```python
if self.done:
    assert action == self.eos
    self.done = False
    self.n_actions += 1
```
Why is `self.n_actions` incremented in the backward step, not decremented? I thought it should be the same as the last dimension of the state.
No, it is not the same. `self.n_actions` counts the number of (valid) actions in a trajectory, regardless of whether it is forward or backward.
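A toy sketch of this convention (hypothetical code, not the repository's actual implementation): `n_actions` counts valid actions taken so far in the trajectory, so both directions increment it.

```python
class CounterSketch:
    """Minimal skeleton illustrating the n_actions convention."""

    def __init__(self):
        self.n_actions = 0  # valid actions taken so far in the trajectory

    def step(self, action):
        # ... apply the action in the forward direction ...
        self.n_actions += 1  # one more valid action

    def step_backwards(self, action):
        # ... undo the action ...
        # Also incremented, not decremented: the counter tracks how many
        # valid actions have been sampled, regardless of direction.
        self.n_actions += 1
```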
I didn't get all the details, but overall it looks good to me.
The main goal of this PR is to enable backward sampling in the Continuous Torus (`ctorus`) environment. This is needed to sample from the replay buffer and to compute evaluation metrics related to the likelihood of test data.

Enabling backward sampling in the CTorus has required changing, among other things, the arguments of the (formerly called) `sample_actions()` method of the environments. Specifically, it needs to know the states from which the actions are sampled (`states_from`) and whether the sampling is backward (`is_backward`). Therefore, this has required changes wherever this method is used.
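Based on the commit messages above, the updated method presumably looks roughly like the sketch below; the class name, argument types, and defaults here are assumptions for illustration, not the repository's exact signature.

```python
from typing import List, Optional, Tuple

import torch


class EnvSketch:
    def sample_actions_batch(
        self,
        policy_outputs: torch.Tensor,
        mask: Optional[torch.Tensor] = None,  # renamed from mask_invalid_actions
        states_from: Optional[List] = None,  # states from which actions are sampled
        is_backward: bool = False,  # True when sampling backward
    ) -> Tuple[List[Tuple], torch.Tensor]:
        """Sample a batch of actions, forward or backward, given policy outputs."""
        raise NotImplementedError
```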
I have taken the chance to make other changes in this method:

- Rename it to `sample_actions_batch()`.
- Rename `mask_invalid_actions` -> `mask`.
- Add a docstring.
More things:

- `set_state(state)` now copies the state before setting it to `self.state`. This was the source of painful hidden errors (see the sketch after this list).
- Backward sampling changes in the `ctorus` environment (`ctorus.py`).
- `statebatch2policy()` of the tori calls `statetorch2proxy` instead of doing numpy-based transformations.
- Remove `get_uniform_terminating_states()` of Tetris, since it is outdated and it can use the base env's `get_random_terminating_states()`.
- `get_uniform_terminating_states()`: see the important note about this below.
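A minimal sketch of the `set_state` fix mentioned in the first item above (hypothetical code; the actual method and its copy strategy may differ, e.g. a shallow copy for flat list states):

```python
import copy


class EnvSketch:
    def set_state(self, state, done: bool = False):
        # Copy the incoming state so that later in-place mutations of
        # self.state cannot silently mutate the caller's object; this
        # aliasing was the source of the "painful hidden errors".
        self.state = copy.deepcopy(state)
        self.done = done
        return self
```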
Important note: the following error is currently raised: `Tried to execute action (2, 0.5769401788711548) not present in action space.` I think this may be fixed in another PR.

Sanity checks:

- wandb sanity check runs