Log likelihood estimation #167

alexhernandezgarcia · 2023-07-29T00:28:49Z

Summary

Code to estimate the log likelihood of sampling data points with the current GFLowNet policy, which can be used to compute potentially insightful metrics. This PR includes the calculation of:

The correlation between the estimated log-likelihood of test data and the rewards of the data.
The mean negative log likelihood of the test data.

You can see one example on wandb.

Estimation of the log-likelihood

The log-likelihood is estimated by sampling backward trajectories according to the backward policy, then calculating the log probability of a sample with importance sampling, where the weights are the backward transition probabilities of the trajectories. In particular:

$\log p_T(x) = \int_{x \in \tau} P_F(\tau)d\tau$

$= \log \mathbb{E}_{P_B(\tau|x)} \frac{P_F(x)}{P_B(\tau|x)}$

$\approx \log \frac{1}{N} \sum_{i=1}^{N} \frac{P_F(x_i)}{P_B(\tau|x_i)}, x_i \sim P_B(\tau|x_i)$

Other notes

For convenience, I have implemented a new method GFlowNetAgent.compute_logprobs_trajectories(), which is used now too by the trajectory balance loss method.
The code of estimate_logprobs_data()may be simplified and modularised a bit, but it's not a big deal anyway.

The core of this PR is ready to review but note that the code to calculate the metrics in test() has become even messier and there may be issues before/after merging. A future PR should organise the evaluation code.

Things in this PR not directly related to the goal

set_state() for the Tetris environment, to catch cases where the state and done are incompatible.
I had to implement a couple of methods in the Batch to make the trajectory indices consecutive.

…mation

…ckward, depending on argument.

…ity-checked on 10x10 Grid)

michalkoziarski

I reviewed it, it seems ok to me (minus some minor comments). I would love to get one more review from someone else (at least for the estimate_logprobs_data function, the rest should be fine), but I guess that will be unlikely to get at the moment in a reasonable time.

One question: did you confirm that the results with this are comparable to previous versions on your test environments (since technically you change the TB loss)?

gflownet/utils/batch.py

gflownet/gflownet.py

gflownet/utils/batch.py

AlexandraVolokhova

Thank you for this great job! I left a couple of comments for simplifying / making more readable the code, but in overall, it looks good to me.

gflownet/gflownet.py

AlexandraVolokhova · 2023-08-31T00:12:12Z

One more thing: I'd add computing and tracking two variances of the log probs:

variance over samples of logprobs_estimates (to understand better the behaviour of the correlation coefficient over the training)
median over samples of the variances of the logprobs_estimates over trajectories for each sample (to get a sense of how noisy the estimation is). The math is a bit tricky here as we use log mean as an estimation, not just the mean. But there're some work around: https://stats.stackexchange.com/questions/418313/variance-of-x-and-variance-of-logx-how-to-relate-them
But in any case, we will need to compute empirical var(P_F(tau) / P_B (tau)) / n_traj for each sample and then play around a bit with it to get variance for the log mean estimation.

alexhernandezgarcia · 2023-08-31T14:25:34Z

I did compare the results with previous versions, at least with the Grid, and I have later run more instances. See this report on the sanity checks project on wandb.

…d of each test data point to the test config.

…points have n_trajectories.

…ent Batch.make_indices_consecutive().

…cutive traj. indices (without changing the batch).

alexhernandezgarcia · 2023-09-01T23:58:38Z

One more thing: I'd add computing and tracking two variances of the log probs:

1. variance over samples of logprobs_estimates (to understand better the behaviour of the correlation coefficient over the training)

2. median over samples of the variances of the logprobs_estimates over trajectories for each sample (to get a sense of how noisy the estimation is). The math is a bit tricky here as we use log mean as an estimation, not just the mean. But there're some work around: https://stats.stackexchange.com/questions/418313/variance-of-x-and-variance-of-logx-how-to-relate-them
   But in any case, we will need to compute empirical var(P_F(tau) / P_B (tau)) / n_traj for each sample and then play around a bit with it to get variance for the log mean estimation.

I agree. I will not do this in this PR so as to merge asap. I have opened an issue instead: #192

michalkoziarski

Thank you for the changes and the tests - looks good to me!

alexhernandezgarcia and others added 30 commits June 22, 2023 09:52

Merge branch 'double-check-valid-actions' into log-likelihood-estimation

e253466

equal method in base env to account for different state types

91bbd33

set state for tetris

b9b804b

wip: gflownet sample_backwards method

09ec5aa

Add optional mask argument to Batch.add_to_batch

fb9cadf

Make GFlowNet.sample_actions() return the sampling masks

4ac2ef6

Provide the sampling masks to Batch.add_to_batch() when possible

ddef4d6

basic start of implementation of avoid repeated trajectories

b0959dd

basic test of sample_backwards

a4feafa

add tetris to test_likelihood

478ea3f

add missing import

a9625c0

format warning string

f146956

remove unused import

0de74ec

fix tetris likelihood test

01de6b7

merge main and resolve conflicts

58b59a4

merge main and fix conflict

5e3aef4

lil fix

7e250e1

Merge branch 'remove_batch_class_mask_calls' into log-likelihood-esti…

c0397e4

…mation

resolve conflicts and merge batch-loss-decoupling

e88e022

resolve hanging conflict

329ec08

conflicts

aaae00e

fix issue with merge2

1c7f033

wip

4db272a

docstring

a8235b3

wip: implement method to estimate logprobs; has ipdb

18095fe

GFlowNetAgent.get_logprobs_trajectories() computes only forward or ba…

0d2f0b3

…ckward, depending on argument.

Merge branch 'main' into log-likelihood-estimation

34295f3

Functionality to make the indices of a batch consecutive

fb5f540

fix bug and check if indices are consecutive before sorting

2b07d6e

Rewrite TB loss method by using self.get_logprobs_trajectories() (san…

370f9ae

…ity-checked on 10x10 Grid)

alexhernandezgarcia added 2 commits August 25, 2023 16:59

Revert to commit 15917eb

23df012

Remove tests/gflownet/gflownet/test_likelihood.py

6c6303f

michalkoziarski reviewed Aug 30, 2023

View reviewed changes

gflownet/utils/batch.py Show resolved Hide resolved

gflownet/utils/batch.py Outdated Show resolved Hide resolved

gflownet/gflownet.py Outdated Show resolved Hide resolved

gflownet/gflownet.py Outdated Show resolved Hide resolved

gflownet/gflownet.py Outdated Show resolved Hide resolved

AlexandraVolokhova reviewed Aug 30, 2023

View reviewed changes

gflownet/utils/batch.py Show resolved Hide resolved

AlexandraVolokhova approved these changes Aug 30, 2023

View reviewed changes

gflownet/gflownet.py Outdated Show resolved Hide resolved

gflownet/gflownet.py Outdated Show resolved Hide resolved

gflownet/gflownet.py Show resolved Hide resolved

alexhernandezgarcia added 17 commits August 31, 2023 10:50

Make max_data (now max_data_size) and argument and add to logger config.

a88e055

Add the number of backward trajectories to estimate the log likelihoo…

5675335

…d of each test data point to the test config.

Remove unused variables and add comment line.

0a4db4c

Minor: convert to list instead of iterating.

6883bfd

Fix typo

1b68f1e

Simplify code to estimate logprobs since we can assume that all data …

698ba2c

…points have n_trajectories.

Move assert up.

1dadcf2

Add note in docstring about indexing.

af8043c

Fix traj_indices_are_consecutive() list() <- set()

01a605e

Add consecutive option to Batch.get_trajectory_indices() and reimplem…

b28ded1

…ent Batch.make_indices_consecutive().

In TB loss, replace batch.make_indices_consecutive() by getting conse…

27a5893

…cutive traj. indices (without changing the batch).

Tests for make_indices_consecutive

28fa35c

Note that make_indices_consecutive() is unused.

627ee04

Compute correlation between np.exp(logprobs) and rewards.

7283dff

Implement Batch.get_unique_trajectory_indices() and small fix.

6bd8440

Simplify code and ensure correct indexing of trajectories.

3599dbf

Mini-fix

abc94e9

alexhernandezgarcia mentioned this pull request Sep 1, 2023

Compute and log variances of the log probs #192

Open

michalkoziarski approved these changes Sep 6, 2023

View reviewed changes

Merge branch 'main' into log-likelihood-estimation

1323b1a

alexhernandezgarcia merged commit ca157c1 into main Sep 6, 2023
1 check passed

josephdviviano deleted the log-likelihood-estimation branch January 31, 2024 21:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Log likelihood estimation #167

Log likelihood estimation #167

alexhernandezgarcia commented Jul 29, 2023 •

edited

Loading

michalkoziarski left a comment

AlexandraVolokhova left a comment

AlexandraVolokhova commented Aug 31, 2023

alexhernandezgarcia commented Aug 31, 2023

alexhernandezgarcia commented Sep 1, 2023

michalkoziarski left a comment

Log likelihood estimation #167

Log likelihood estimation #167

Conversation

alexhernandezgarcia commented Jul 29, 2023 • edited Loading

Summary

Estimation of the log-likelihood

Other notes

Things in this PR not directly related to the goal

michalkoziarski left a comment

Choose a reason for hiding this comment

AlexandraVolokhova left a comment

Choose a reason for hiding this comment

AlexandraVolokhova commented Aug 31, 2023

alexhernandezgarcia commented Aug 31, 2023

alexhernandezgarcia commented Sep 1, 2023

michalkoziarski left a comment

Choose a reason for hiding this comment

alexhernandezgarcia commented Jul 29, 2023 •

edited

Loading