
Conversation


@rlouf rlouf commented Sep 29, 2021

Adds the dual averaging algorithm that is commonly used to adapt the step size of HMC algorithms. Closes #34.

This one is completely independent from the rest of the code so I'll be working on it while we solve the current issues with HMC and NUTS.

Reference: http://webdoc.sub.gwdg.de/ebook/serien/e/CORE/dp2005_67.pdf
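For context, below is a minimal NumPy sketch of the dual averaging update in the parameterization commonly used for HMC step-size adaptation (Hoffman & Gelman, 2014), which builds on the Nesterov paper referenced above. The function name, signature, and default values (including `mu = log(10.0)`, i.e. an initial step size of 1) are illustrative assumptions and do not reflect the API added in this PR.

```python
# Hedged sketch of the dual averaging update for HMC step-size adaptation.
# Names, signature, and defaults are illustrative, not this PR's interface.
import numpy as np


def dual_averaging_step(
    state, alpha, target=0.65, gamma=0.05, t0=10, kappa=0.75, mu=np.log(10.0)
):
    """One update of the running statistic and the (log) step size.

    `state` is (t, log_eps, log_eps_avg, h_avg); `alpha` is the acceptance
    probability observed at the current iteration.
    """
    t, log_eps, log_eps_avg, h_avg = state
    t = t + 1
    eta = 1.0 / (t + t0)
    h_avg = (1.0 - eta) * h_avg + eta * (target - alpha)  # running average of (target - acceptance)
    log_eps = mu - np.sqrt(t) / gamma * h_avg             # proposed log step size
    w = t ** (-kappa)
    log_eps_avg = w * log_eps + (1.0 - w) * log_eps_avg   # averaged iterate used after warmup
    return t, log_eps, log_eps_avg, h_avg
```

In practice one would feed each warmup iteration's acceptance probability into this update and switch to the averaged step size, `exp(log_eps_avg)`, once adaptation stops.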

@rlouf rlouf added the enhancement New feature or request label Sep 29, 2021
@rlouf rlouf self-assigned this Sep 29, 2021

rlouf commented Sep 29, 2021

@brandonwillard As you can see in test_algorithm.py, I have to set the type of x_init to float64. If I set it to float32 I get an error because `scan` upcasts the parameters.

This also happens if I set `aesara.config.floatX = "float32"`. This is not the first time I have run into this upcasting annoyance, and I am wondering whether it is something you would be willing to address on the Aesara side?


codecov bot commented Sep 29, 2021

Codecov Report

Merging #35 (fa43113) into main (0d1d7c1) will not change coverage.
The diff coverage is 100.00%.

❗ Current head fa43113 differs from pull request most recent head 416ba7d. Consider uploading reports for the commit 416ba7d to get more accurate results

```
@@            Coverage Diff            @@
##              main       #35   +/-   ##
=========================================
  Coverage   100.00%   100.00%           
=========================================
  Files            9        10    +1     
  Lines          345       361   +16     
  Branches        14        14           
=========================================
+ Hits           345       361   +16     
```

| Impacted Files | Coverage Δ |
| --- | --- |
| aehmc/algorithms.py | 100.00% <100.00%> (ø) |

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data

@brandonwillard
Member

> This also happens if I set `aesara.config.floatX = "float32"`. This is not the first time I have run into this upcasting annoyance, and I am wondering whether it is something you would be willing to address on the Aesara side?

We can definitely address this; however, I first need to understand the entire context of the upcasting. In general, there are some casting configuration options (e.g. cast_policy) and defaults that we need to revisit.

brandonwillard previously approved these changes Sep 29, 2021
```python
gradient = aesara.grad(value, x)
return update(gradient, step, x, x_avg, gradient_avg)

x_init = at.as_tensor(0.0, dtype="float64")
```
@brandonwillard brandonwillard Sep 29, 2021

Does this fail if you use aesara.config.floatX?

I'm guessing that the (CI and local) tests are running with aesara.config.floatX set to "float64", so that's why an explicit "float64" is fine, but setting this to aesara.config.floatX and changing that config option to "float32" (before loading Aesara) should not cause a problem.

If that's the upcasting issue you described, then, yes, that sounds like an Aesara issue. Just don't forget to set that option before loading Aesara; otherwise, it won't be active and you will get confusing casting issues. In other words, that option can't be properly changed after loading Aesara, because sometimes that value is used during class/type/object creation.

@rlouf (Member Author)

If I set aesara.config.floatX = "float32" right after the imports I still get an error if I don't specify x_init's type or set it to float32.

But I’m not sure what pytest does with the code so I'll try in a stand-alone script.

@brandonwillard brandonwillard Sep 29, 2021

Try it with a conftest.py setup like Aesara's, but with floatX set, of course, or try the same with the AESARA_FLAGS env variable. That will make sure the setting is available before the imports.
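A conftest.py along these lines could look like the sketch below; this is a minimal illustration of the env-variable approach rather than the contents of Aesara's own conftest.py, and the flag value is just the one being tested here.

```python
# conftest.py at the repository root -- a minimal sketch.
# Setting the environment variable here runs before any test module is
# imported, so Aesara picks up floatX the first time it is loaded.
import os

os.environ["AESARA_FLAGS"] = "floatX=float32"
```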

@rlouf rlouf Sep 30, 2021

I added the conftest.py file with floatX set to float32; I checked that print(aesara.config.floatX) in the test_dual_averaging function prints float32. However, the test still fails with the following error:

```
ValueError: When compiling the inner function of scan the following error has been encountered: The initial state (`outputs_info` in scan nomenclature) of variable IncSubtensor{Set;:int64:}.0 (argument number 1) has dtype float32, while the result of the inner function (`fn`) has dtype float64. This can happen if the inner function of scan results in an upcast or downcast.
```

@rlouf (Member Author)

I found this related question with a simpler example on SO.

@brandonwillard (Member)

I just provided an answer to that question.


rlouf commented Oct 4, 2021

The type error has nothing to do with `scan`; it is a result of Aesara's default casting behavior. I don't know whether that behavior is good or bad, but it is surprising. The following piece of code:

```python
import aesara.tensor as at

t0 = 10
step = at.as_tensor(0, dtype="int32")
eta = 1.0 / (step + t0)
print(eta.dtype)
```

will print `float64`, even if `floatX` is set to `float32`.

On the other hand, the following code

```python
import aesara.tensor as at

t0 = 10
step = at.as_tensor(0, dtype="int32")
eta = (1.0 / (step + t0)).astype("floatX")
print(eta.dtype)
```

will print whatever `floatX` is set to. I would expect `eta` to be cast to `floatX` in the first example as well.

Another thing that I find very confusing is that

```python
import aesara.tensor as at

x_init = at.as_tensor(1.0)
print(x_init.dtype)
```

will print `float32` regardless of the value of the `floatX` flag.


rlouf commented Oct 4, 2021

I ended up adding `astype("floatX")` wherever needed. Good to merge if the tests pass.

@rlouf rlouf force-pushed the dual-averaging branch 2 times, most recently from eb475b6 to 7e1635e on October 4, 2021 14:44
@rlouf rlouf force-pushed the dual-averaging branch 3 times, most recently from fa43113 to ce59d26 on October 4, 2021 15:36

brandonwillard commented Oct 4, 2021

> The type error has nothing to do with `scan`; it is a result of Aesara's default casting behavior. I don't know whether that behavior is good or bad, but it is surprising. The following piece of code:
>
> ```python
> import aesara.tensor as at
>
> t0 = 10
> step = at.as_tensor(0, dtype="int32")
> eta = 1.0 / (step + t0)
> print(eta.dtype)
> ```
>
> will print `float64`, even if `floatX` is set to `float32`.

In this instance, 1.0 is a floating point number and the denominator is an integer (possibly int64), so, according to the casting/promotion rules, Aesara will promote the result to a "larger" floating point type—regardless of aesara.config.floatX.

NumPy has the same behavior:

```python
import numpy as np

# float32 combined with int32 is promoted to float64
(np.array(1.0, dtype=np.float32) / np.array(10, dtype=np.int32)).dtype
# dtype('float64')
```

> Another thing that I find very confusing is that
>
> ```python
> import aesara.tensor as at
>
> x_init = at.as_tensor(1.0)
> print(x_init.dtype)
> ```
>
> will print `float32` regardless of the value of the `floatX` flag.

I'm not seeing that locally:

```python
import os

os.environ["AESARA_FLAGS"] = "floatX=float32"

import aesara
import aesara.tensor as at


assert aesara.config.floatX == "float32"

x_init = at.as_tensor(1.0)
x_init.dtype
# 'float32'
```

@brandonwillard brandonwillard left a comment

In general, we need to go along with the configured Aesara upcasting/promotion rules, and this often necessitates the use of dtypes from user-provided graphs. I'll try running this again locally to see where that could be done.
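One way this could look in practice is sketched below. It is a hedged illustration of "use the dtype from the user-provided graph": the variable names mirror the test discussed above, and this is not necessarily how the fix was actually written in aehmc.

```python
import aesara.tensor as at

# Reuse the dtype of the user-supplied initial value instead of hard-coding
# "float64" (or "floatX") for intermediate quantities.
x_init = at.as_tensor(0.0)                       # dtype follows the user's input / floatX
step = at.as_tensor(0, dtype="int32")
t0 = 10

eta = (1.0 / (step + t0)).astype(x_init.dtype)   # stays consistent with the user's graph
```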


rlouf commented Oct 5, 2021

Ok, that makes a lot more sense. I'll try to see how we could use the dtypes from the graphs the user provides.

As for the previous example, I get the same thing as you do on my machine. The following is confusing though:

```python
import os

os.environ["AESARA_FLAGS"] = "floatX=float64"

import aesara
import aesara.tensor as at


assert aesara.config.floatX == "float64"

x_init = at.as_tensor(1.0)
print(x_init.dtype)
# 'float32'
```

@rlouf rlouf merged commit 9011be2 into main Oct 14, 2021
@rlouf rlouf deleted the dual-averaging branch October 14, 2021 10:53