v0.11.4 #654
Merged

Conversation
* add new padding options for Conv and ConvTranspose
* Update _conv.py
* Add tests for the padding of `Conv` and `ConvTranspose`
* Fix some type hints
* Fix the type of padding_t
* …ssigning a jax-transformed layer.
* rope embeddings added
* added sinusoidal embedding
* added rope to mha
* added caching and compute-on-the-fly approach if no max_seq_len given, and added process heads to MHA
* remove `use_rope_embedding` flag
* fixed merge related errors
* removed unnecessary state_len flag and placed shape checking in if-clause
* worked in review
* export new embeddings
* removed state len again, oops
* add ensure_compile_time_eval
* remove max_seq_len completely
* removed unnecessary if check
* improved docstrings
* better mem, adhering to strict jax config
* fixed dtype promotion
* removed dtype float and use float(seq_len) instead
* jnp.arange(0.0, ...) to force floats
* Adjustments to RoPE:
  - Changed how the rotation is done to match the ESM2 implementation.
  - Lots of doc tidy-ups.
  - Removed SinusoidalPositionalEmbedding. I think I want to be more certain that this is correct before merging it.
* added rope tests
* typo
* fixed tests and annotations
* removed internal_sinus cache

Co-authored-by: Patrick Kidger <33688385+patrick-kidger@users.noreply.github.com>
* added dtypes
* fixed norms and added info to _spectral_norm
…erter` annotation if available. In particular this should mean that Equinox modules are now compatible with beartype decorators.
* eqx.filter_shard; test + example
* fixed line lengths
* fixed?
* double checking..
The main improvement here is that a checkpointed while loop will now only propagate perturbations for those outputs that are actually perturbed. Whilst this doesn't affect the backward pass at all (we were already trimming the cotangents according to this criterion), it means that any calls to `eqx.nondifferentiable`, or any primitive without a JVP rule, will no longer throw an error. In addition, this commit includes a couple of crash fixes (needed to pass the new test).
Features

- Added `eqx.filter_shard`. This lowers to `jax.lax.with_sharding_constraint` as a single way to transfer data, or reshard data, both inside and outside of JIT! (No more `jax.device_put`.) In addition, the parallelism example (`examples/parallelism.ipynb`) has been updated to use this simpler new functionality. (Thanks @homerjed and @dlwh! #688, #691)
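A minimal sketch of how this is used (the mesh and shapes here are illustrative, assuming the local device count divides the batch dimension):

```python
import equinox as eqx
import jax
import jax.numpy as jnp
import numpy as np
from jax.sharding import Mesh, NamedSharding, PartitionSpec

# Illustrative: a 1D mesh over all local devices.
mesh = Mesh(np.array(jax.devices()), ("batch",))
sharding = NamedSharding(mesh, PartitionSpec("batch"))

xs = jnp.zeros((8, 16))
xs = eqx.filter_shard(xs, sharding)  # outside JIT: moves/reshards the data

@eqx.filter_jit
def f(xs):
    # inside JIT: lowers to jax.lax.with_sharding_constraint
    return eqx.filter_shard(xs, sharding)
```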
- Added `eqx.filter_{jacfwd,jacrev,hessian}`. These do what you expect! (Thanks @lockwo! #677)
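For example (a quick sketch):

```python
import equinox as eqx
import jax.numpy as jnp

def f(x):
    return jnp.sum(x ** 3)

x = jnp.arange(3.0)
jac = eqx.filter_jacfwd(f)(x)    # the gradient 3 * x**2, shape (3,)
hess = eqx.filter_hessian(f)(x)  # shape (3, 3), with 6 * x on the diagonal
```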
- Added `eqx.nn.RotaryPositionalEmbedding`. This is designed to be used in conjunction with the existing `eqx.nn.MultiheadAttention`. (Thanks @Artur-Galstyan! #568)
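A sketch of the intended combination, using the (new) `process_heads` hook on `eqx.nn.MultiheadAttention`; the sizes here are illustrative:

```python
import equinox as eqx
import jax
import jax.numpy as jnp

key = jax.random.PRNGKey(0)
mha = eqx.nn.MultiheadAttention(num_heads=4, query_size=64, key=key)
rope = eqx.nn.RotaryPositionalEmbedding(embedding_size=16)  # 64 / 4 per head

def process_heads(query_heads, key_heads, value_heads):
    # Rotate queries and keys; vmap over the heads axis.
    query_heads = jax.vmap(rope, in_axes=1, out_axes=1)(query_heads)
    key_heads = jax.vmap(rope, in_axes=1, out_axes=1)(key_heads)
    return query_heads, key_heads, value_heads

x = jnp.zeros((10, 64))  # (sequence length, embedding size)
out = mha(x, x, x, process_heads=process_heads)
```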
- Added support for `padding='VALID'`, `padding='SAME'`, and `padding='SAME_LOWER'` to the convolutional layers: `eqx.nn.{Conv, ...}`. (Thanks @ChenAo-Phys! #658)
- Added support for `padding_mode='ZEROS'`, `padding_mode='REFLECT'`, `padding_mode='REPLICATE'`, and `padding_mode='CIRCULAR'` to the convolutional layers: `eqx.nn.{Conv, ...}`. (Thanks @ChenAo-Phys! #658)
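Both options, in one illustrative sketch:

```python
import equinox as eqx
import jax
import jax.numpy as jnp

key = jax.random.PRNGKey(0)

# String padding: "SAME" preserves the spatial size at stride 1.
conv_same = eqx.nn.Conv2d(3, 8, kernel_size=3, padding="SAME", key=key)

# padding_mode controls what the padded values are filled with.
conv_circular = eqx.nn.Conv2d(3, 8, kernel_size=3, padding=1,
                              padding_mode="CIRCULAR", key=key)

x = jnp.zeros((3, 32, 32))
assert conv_same(x).shape == (8, 32, 32)
assert conv_circular(x).shape == (8, 32, 32)
```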
- Added a `dtype` argument to `eqx.nn.{MultiheadAttention, Linear, Conv, ...}` for specifying the dtype of their parameters. In addition, `eqx.nn.BatchNorm` will now also use its `dtype` argument to determine the dtype of its weights and bias, not just the dtype of its moving statistics. (Thanks @Artur-Galstyan and @AakashKumarNain! #680, #689)
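For example:

```python
import equinox as eqx
import jax
import jax.numpy as jnp

key = jax.random.PRNGKey(0)
linear = eqx.nn.Linear(4, 4, dtype=jnp.float16, key=key)
assert linear.weight.dtype == jnp.float16
```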
Compatibility

- `eqx.error_if` is now compatible with JAX 0.4.26. (Which changed JAX's own reporting of error messages slightly.)
- Added a warning that checks for doing something like:
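The pattern in question looks roughly like this (a sketch; names are illustrative):

```python
import equinox as eqx
import jax
from typing import Callable

class Model(eqx.Module):
    some_fn: Callable

key = jax.random.PRNGKey(0)
# Bad: the vmap'd layer is an opaque callable rather than a PyTree, so
# the Linear layer's parameters disappear from Model's PyTree structure.
model = Model(some_fn=jax.vmap(eqx.nn.Linear(3, 3, key=key)))
```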
As this is an easy source of bugs. (The vmap'd function is not a PyTree, so will not propagate anything in the PyTree structure of `some_fn`.)

Technical internal stuff
- `eqx.internal.while_loop(..., kind="checkpointed")` will now only propagate forward JVP tracers for those outputs which are perturbed due to the input to the loop being perturbed. (Rather than all of them.) This change just means that later calls to a nondifferentiable operation, like `jax.pure_callback` or `eqx.internal.nondifferentiable`, will no longer crash at trace time. (See diffrax#396.)
- `eqx.internal.while_loop(..., kind="bounded")` will now handle certain vmap+grad combinations without crashing. (It seems like JAX is adding some spurious batch tracers.) (See optimistix#48 (comment).)
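As context, an illustrative call of this internal API (the signature here is assumed from the Equinox source, so treat it as a sketch):

```python
import equinox.internal as eqxi
import jax.numpy as jnp

def cond_fun(carry):
    step, _ = carry
    return step < 5

def body_fun(carry):
    step, x = carry
    return step + 1, 2 * x

init = (0, jnp.array(1.0))
# A checkpointed while loop; max_steps bounds the reverse-mode work.
final = eqxi.while_loop(cond_fun, body_fun, init,
                        max_steps=16, kind="checkpointed")
```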
- The transpose rule for `eqx.internal.create_vprim` now understands symbolic zeros, fixing a crash for grad-of-vmap-of-`<lineax.linear_solve that we only use some outputs from>`. (See optimistix#48.)
- The type annotation for the input of any converter function used in `eqx.field(converter=...)` will now be used as the type annotation in any `dataclass`-autogenerated `__init__` functions. In particular this should mean such functions are now compatible with runtime type checkers like beartype. (jaxtyping users, you were already covered: this checks the assigned annotations instead.)
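A sketch of what this enables (names are illustrative):

```python
import equinox as eqx
import jax.numpy as jnp
from jaxtyping import Array, ArrayLike

def as_array(x: ArrayLike) -> Array:
    return jnp.asarray(x)

class MyModule(eqx.Module):
    weight: Array = eqx.field(converter=as_array)

# The dataclass-autogenerated __init__ is now annotated as
# `__init__(self, weight: ArrayLike)`, so a runtime type checker
# like beartype will accept e.g.
m = MyModule(weight=1.0)
```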