[Frontend] Add AutoGraph support for Python for loops #258

dime10 · 2023-08-23T12:54:28Z

Introduces support for Python for .. in ...: statement capture as part of the compiled program.
Similar to PR #235, AutoGraph is used to convert such statements into the equivalent Catalyst version before tracing occurs. Specifically, the following constructs are supported via AutoGraph:

for elem in iterable: - These get converted into a for_loop(0, len(iterable), 1) with elem = iterable[i] automatically assigned using the iteration index, assuming iterable is convertible to a JAX array. If this is not the case, the loop is executed as is in Python.
for i in range(start, stop, step): These are converted directly into their equivalent for_loop(start, stop, step). Contrary to the default Python range, when AutoGraph is enabled range can also accept dynamic tracers as start, stop, step values. If any exception is raised during the tracing of the for_loop body, Catalyst will fall back to Python with a warning.
for i, elem in enemurate(iterable): - These get converted into for_loop(0, len(iterable), 1) with the iteration index assigned to the variable chosen by the user (in this case i), and elem = iterable[i]. This also assumes that iterable is convertible to an array, and that the loop body traces without exception, otherwise the loop is executed in Python.

Note that a warning is raised when when a Python fallback is triggered due to a tracing exception. Python fallbacks caused by the iterable not being convertible to array are silent.

[sc-41287]

codecov · 2023-08-23T13:06:11Z

Codecov Report

Patch coverage: 100.00% and project coverage change: +0.16% 🎉

Comparison is base (ffdd0ab) 99.31% compared to head (90b1a49) 99.47%.
Report is 2 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #258      +/-   ##
==========================================
+ Coverage   99.31%   99.47%   +0.16%     
==========================================
  Files          41       41              
  Lines        7141     7284     +143     
  Branches      377      393      +16     
==========================================
+ Hits         7092     7246     +154     
+ Misses         27       20       -7     
+ Partials       22       18       -4

Files Changed	Coverage Δ
frontend/catalyst/compilation_pipelines.py	`100.00% <ø> (ø)`
frontend/catalyst/pennylane_extensions.py	`99.14% <ø> (+<0.01%)`	⬆️
frontend/catalyst/__init__.py	`95.83% <100.00%> (+0.37%)`	⬆️
frontend/catalyst/ag_primitives.py	`100.00% <100.00%> (ø)`
frontend/catalyst/autograph.py	`100.00% <100.00%> (ø)`
frontend/catalyst/jax_primitives.py	`97.04% <100.00%> (ø)`
frontend/catalyst/jax_tracer.py	`99.27% <100.00%> (+3.79%)`	⬆️

... and 4 files with indirect coverage changes

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

Catalyst for loop conversion works directly when iterating over arrays. Non-array iteration targets will attempt a conversion to array, if this fails we fall back to Python loops.

A custom range class is added that acts identically to the Python range class, however it allows tracers to be used as arguments for start, stop, and step. The for loop conversion looks for the new range class and can automatically incorporate it into the iteration bounds of the Catalyst `for_loop` function. Some advantages include that a static Python range does not need to be materialized into a constant array, and dynamic ranges are now also supported. A downside is that currently a conversion to Catalyst's for_loop always takes place when a range is encountered. However, the user may use the indices obtained from the range to index non-array objects, which is not detectable by us and would lead to a tracing error.

.. when any exception is raised during tracing of the Catalyst loop. A warning is raised when an exception occurs within the Catalyst loop, allowing users to correct certain mistakes (such as wrapping a list into an array). However, the conversion remains safe in the sense that it always falls back on the code as the user wrote it.

.. in warnings/errors arising inside of converted code.

No longer requires the pytest-mock package dependency.

The tests were more precise this way, but too brittle as the line numbers can change frequently.

josh146

Thanks @dime10, great work.

I'm approving a bit early because I don't want to be a bottleneck here with my upcoming travel, but I only reviewed the tests and the docs, so the implementation should still be reviewed by someone else :)

Minor docs comment, but the qjit docstring should also be updated to specify for is now supported:

autograph (bool): Experimental support for automatically converting Python control
    flow statements to Catalyst-compatible control flow. Currently only supports Python
    ``if``, ``elif``, and ``else`` statements. Note that this feature requires an
    available TensorFlow installation.

doc/dev/installation.rst

frontend/catalyst/ag_primitives.py

frontend/test/pytest/test_autograph.py

dime10 · 2023-09-13T17:15:56Z

Thanks for the review @josh146 :) Good point on the documentation, I'll have to polish it up a bit before merging!

rmoyard

Great PR 👍 I am just wondering if CRange could inherit from range like you did with enumerate. This would remove issues with testing. Happy to approve after that

.github/workflows/check-catalyst.yaml

frontend/catalyst/ag_primitives.py

- catalyst.autograph_ignore_fallbacks: Silences warnings resulting from Python fallbacks. This is useful if the user doesn't want to see any warnings, or in situations where the warning cannot be fixed. - catalyst.autograph_strict_conversion: Turns Python fallback warnings into errors and produces the full traceback of the error that caused the fallback. This can be useful when debugging why a fallback happened.

Depending on when the tracing fails, variables tracked by autograph could have been modified already. To prevent an invalid state, we restore all autograph tracked variables to their original state before running the loop in Python.

This bug resulted in loop carried values (like a sum value) not being updated during the execution of the loop. This commit also adds a variety of tests around various uses of "iteration arguments".

dime10 · 2023-09-19T20:49:49Z

I've added an additional user facing feature here: 8219ffa

It provides two flags that the user can set around the conversion strictness (I used them during debugging and in the tests):

catalyst.autograph_ignore_fallbacks: Silences warnings resulting from Python fallbacks.
This is useful if the user doesn't want to see any warnings, or in situations where the warning cannot be fixed.
catalyst.autograph_strict_conversion: Turns Python fallback warnings into errors and produces the full
traceback of the error that caused the fallback.
This can be useful when debugging why a fallback happened.

josh146 · 2023-09-19T21:24:20Z

Thanks @dime10! I think these are good options to include.

Non-blocking for this PR, but for end-users I worry about having these new options be module variables as opposed to an option you pass to @qjit.

In Strawberry Fields, we used to have strawberryfields.hbar be a module level variable that users could set, and we ran into a tonne of problems and edge cases due to this global state/the fact that functions weren't pure.

It mainly showed up in several places:

Parallelization. E.g., users using threads, or dask, or even pytest running tests in parallel, would cause race conditions.
there would sometimes be bugs in tests because the preceding test wouldn't properly do a tear down
developers would sometimes alter these values internally, and forget to change them back. Or there would be an exception before it could change back (and the developer didn't use try-finally).

So before usage gets baked in, might be good to turn these into arguments.

rmoyard

It looks good to me 💯 I suggest a autograph configuration kwarg for qjit but not blocking

dime10 · 2023-09-19T23:09:39Z

I was originally thinking of similar configurations in JAX which are also global, like jax.config.update("jax_debug_nans", True). However, you raise some good points so I'm thinking we could add something like this:

@qjit(autograph=True, ag_mode=...)

Where ag_mode can be one of the following:

"strict": Always raise an error with the full traceback when a conversion fails. Enable this
to debug why a conversion may have failed, or when successful conversion is critical.
"permissive": The default & safe option. Catalyst will never abort compilation because
an AutoGraph conversion failed, but some errors are re-raised as warnings.
"silent": Catalyst will not raise any warnings or errors and will fall back to Python
silently whenever AutoGraph conversion fails.

However, there is a problem because there is nothing actually linking the qjit object to the eventual invocation of the autograph transformed for loop. I think this is true in general for all our primitives, so there is no obvious way to pass compilation options into any of them.
(I just discovered another use of a global here actually, but it's only used internally:

catalyst/frontend/catalyst/jax_tracer.py

Line 114 in 64be9d2

JaxTape.device = device

)

josh146 · 2023-09-21T16:56:33Z

"strict": Always raise an error with the full traceback when a conversion fails. Enable this to debug why a conversion may have failed, or when successful conversion is critical.

"permissive": The default & safe option. Catalyst will never abort compilation because an AutoGraph conversion failed, but some errors are re-raised as warnings.

"silent": Catalyst will not raise any warnings or errors and will fall back to Python silently whenever AutoGraph conversion fails.

I really like this 🙌 I'll add it to Q4 as a potential story.

dime10 force-pushed the python_for branch from 7d856ca to 0165e01 Compare August 23, 2023 23:46

Base automatically changed from python_cf to main August 25, 2023 19:27

dime10 force-pushed the python_for branch from 0165e01 to 5bec7f4 Compare August 25, 2023 19:30

dime10 added 9 commits September 12, 2023 10:45

Add basic support for Python for loops

e9a2154

Catalyst for loop conversion works directly when iterating over arrays. Non-array iteration targets will attempt a conversion to array, if this fails we fall back to Python loops.

Add reference to for_loop function

3b94c0d

Fix construction of captured ranges

e5a9965

Add support for enumerate inside of for loops

e9c91e1

Test support for unpacking loop values

a08ca7a

Fix various issues reported by Pylint

53e2848

Typos

63d2980

dime10 force-pushed the python_for branch from 5bec7f4 to 7beec57 Compare September 12, 2023 17:19

dime10 added 3 commits September 12, 2023 19:29

Add extraction of original source code information

78a31eb

.. in warnings/errors arising inside of converted code.

Add tensorflow-cpu to requirements.txt

85e5a6a

Fix missing autograph_artifact in ag_primitives

1f2f2e9

dime10 force-pushed the python_for branch from 7beec57 to 0659722 Compare September 12, 2023 23:54

dime10 added 2 commits September 12, 2023 20:07

Linting

3274162

Simplify pytest mocking

8a1b854

No longer requires the pytest-mock package dependency.

dime10 force-pushed the python_for branch from 0659722 to 8a1b854 Compare September 13, 2023 00:07

Remove hardcoded line numbers in tests

4a158ba

The tests were more precise this way, but too brittle as the line numbers can change frequently.

dime10 marked this pull request as ready for review September 13, 2023 00:22

dime10 requested review from josh146 and rmoyard September 13, 2023 00:22

josh146 approved these changes Sep 13, 2023

View reviewed changes

rmoyard reviewed Sep 13, 2023

View reviewed changes

.github/workflows/check-catalyst.yaml Show resolved Hide resolved

frontend/catalyst/ag_primitives.py Show resolved Hide resolved

frontend/catalyst/ag_primitives.py Outdated Show resolved Hide resolved

frontend/catalyst/ag_primitives.py Outdated Show resolved Hide resolved

dime10 added 2 commits September 15, 2023 16:57

Improve clarity of code comment

515dbcf

Improve changelog & docstrings

063e547

dime10 added 6 commits September 15, 2023 18:36

Improve patch coverage to 100%

5341c3a

Restore the initial variable state before fallback

baec087

Depending on when the tracing fails, variables tracked by autograph could have been modified already. To prevent an invalid state, we restore all autograph tracked variables to their original state before running the loop in Python.

Fix variable tracking in loop body tracing

dfd98a7

This bug resulted in loop carried values (like a sum value) not being updated during the execution of the loop. This commit also adds a variety of tests around various uses of "iteration arguments".

Add error checks around uninitialized loop values

8d2a40c

Add test involving if/cond/for/for_loop

c86b957

rmoyard approved these changes Sep 19, 2023

View reviewed changes

dime10 force-pushed the python_for branch from 691c70e to 1c4aaef Compare September 19, 2023 22:41

dime10 added 2 commits September 19, 2023 19:00

Satisfy Pylint

3bd9ed2

Fix patch coverage

90b1a49

dime10 force-pushed the python_for branch from 1c4aaef to 90b1a49 Compare September 19, 2023 23:00

dime10 merged commit 8106beb into main Sep 21, 2023
18 checks passed

dime10 deleted the python_for branch September 21, 2023 16:37

dime10 mentioned this pull request Feb 21, 2024

Rework QJIT class into distinct compilation stages #531

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Frontend] Add AutoGraph support for Python for loops #258

[Frontend] Add AutoGraph support for Python for loops #258

dime10 commented Aug 23, 2023 •

edited

Loading

codecov bot commented Aug 23, 2023 •

edited

Loading

josh146 left a comment

dime10 commented Sep 13, 2023

rmoyard left a comment

dime10 commented Sep 19, 2023

josh146 commented Sep 19, 2023 •

edited

Loading

rmoyard left a comment

dime10 commented Sep 19, 2023 •

edited

Loading

josh146 commented Sep 21, 2023 •

edited

Loading

[Frontend] Add AutoGraph support for Python for loops #258

[Frontend] Add AutoGraph support for Python for loops #258

Conversation

dime10 commented Aug 23, 2023 • edited Loading

codecov bot commented Aug 23, 2023 • edited Loading

Codecov Report

josh146 left a comment

Choose a reason for hiding this comment

dime10 commented Sep 13, 2023

rmoyard left a comment

Choose a reason for hiding this comment

dime10 commented Sep 19, 2023

josh146 commented Sep 19, 2023 • edited Loading

rmoyard left a comment

Choose a reason for hiding this comment

dime10 commented Sep 19, 2023 • edited Loading

josh146 commented Sep 21, 2023 • edited Loading

dime10 commented Aug 23, 2023 •

edited

Loading

codecov bot commented Aug 23, 2023 •

edited

Loading

josh146 commented Sep 19, 2023 •

edited

Loading

dime10 commented Sep 19, 2023 •

edited

Loading

josh146 commented Sep 21, 2023 •

edited

Loading