Formalize operator dtypes #1697

wdphy16 · 2024-01-19T18:12:39Z

The discussion started in #1543, and now I finally have some time to check it thoroughly.

I think the most important intuition is that, if dtype is specified in __init__, the property operator.dtype should return the same value. (The one exception: when the specified dtype is 64-bit and NETKET_ENABLE_X64 is disabled, it becomes 32-bit.) This behavior is consistent with jnp.array.
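
A minimal sketch of this canonicalization rule, written with NumPy only so that it runs without JAX. The helper `canonicalize` and its `x64_enabled` flag are hypothetical names used for illustration; in JAX the corresponding function is `jax.dtypes.canonicalize_dtype`, which reads the global x64 setting instead of taking a flag.

```python
import numpy as np

# 64-bit dtypes and their 32-bit counterparts.
_TO_32 = {
    np.dtype("float64"): np.dtype("float32"),
    np.dtype("complex128"): np.dtype("complex64"),
    np.dtype("int64"): np.dtype("int32"),
    np.dtype("uint64"): np.dtype("uint32"),
}


def canonicalize(dtype, x64_enabled):
    """Hypothetical helper: keep dtype if x64 is enabled, else narrow it to 32-bit."""
    dtype = np.dtype(dtype)
    return dtype if x64_enabled else _TO_32.get(dtype, dtype)


print(canonicalize(np.float64, x64_enabled=True))
print(canonicalize(np.complex128, x64_enabled=False))
```

Under this rule, operator.dtype equals the requested dtype whenever x64 is enabled, and its 32-bit counterpart otherwise.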

If not specified in __init__, it will be inferred from all other arguments, and whether the operator is required to be complex, using JAX's promotion rules. If no other argument can be used to infer it, it defaults to float64 if NETKET_ENABLE_X64 is enabled, and float32 otherwise.

If the inferred dtype is different from the specified one, __init__ may cast it to the specified dtype, or raise a TypeError if it's unphysical or hard to implement.

If the inferred dtype is lower than the specified one, we silently upcast it as in numpy.
If the inferred dtype is higher than the specified one, and both are real or both are complex, we silently truncate it as in numpy. (I guess people don't like to see a lot of warnings when they already decide to specify 32-bit...)
If the inferred dtype is float64 but the specified one is complex64, we also silently truncate it as in numpy.
If the inferred dtype is complex but the specified one is real, we may give a warning and discard the imaginary part, which numpy may already do. Alternatively, we may raise an error.
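
The four cases above can be summarized as a small decision function. This is a sketch under assumptions: `cast_action` and its return strings are hypothetical and are not part of NetKet; only `np.can_cast` and `np.issubdtype` are real NumPy calls.

```python
import numpy as np


def cast_action(inferred, specified):
    """Hypothetical summary of what __init__ does with a dtype mismatch."""
    inferred, specified = np.dtype(inferred), np.dtype(specified)
    if inferred == specified:
        return "keep"
    inferred_complex = np.issubdtype(inferred, np.complexfloating)
    specified_complex = np.issubdtype(specified, np.complexfloating)
    if inferred_complex and not specified_complex:
        # The imaginary part would be lost.
        return "warn_or_raise"
    if np.can_cast(inferred, specified, casting="safe"):
        # No information is lost, e.g. float32 -> float64.
        return "upcast"
    # Precision is lost, e.g. float64 -> float32 or float64 -> complex64.
    return "truncate"


print(cast_action(np.float32, np.float64))
print(cast_action(np.float64, np.complex64))
print(cast_action(np.complex128, np.float64))
```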

If the dtype is not specified and it's inferred to be int, we promote it to float, as suggested previously by this test. (I think it's intuitive for physicists who are too lazy to type .0, rather than dtype lawyers.)
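
A sketch of the inference rule when dtype is not specified. It assumes x64 is enabled and uses NumPy's promotion as a stand-in for JAX's rules, which differ in some mixed-precision cases; `infer_dtype` is a hypothetical helper, not a NetKet function.

```python
from functools import reduce

import numpy as np


def infer_dtype(*values):
    """Hypothetical helper: infer an operator dtype from its numeric arguments."""
    default_float = np.dtype("float64")  # assumes x64 is enabled
    if not values:
        # Nothing to infer from: fall back to the default float.
        return default_float
    dtype = reduce(np.promote_types, (np.asarray(v).dtype for v in values))
    if dtype.kind in "biu":
        # bool, int and uint are promoted to the default float.
        return default_float
    return dtype


print(infer_dtype(1, 2))
print(infer_dtype(1.0, 1j))
```

With this rule, writing J=1 and J=1.0 gives the same operator dtype.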

It's also possible to specify an int dtype, and it just works in many cases. In the future we can make it work with PauliStrings, and raise an error for uint and other dtypes, if someone really needs that.

Note that when doing in-place arithmetic, some operators actually modify the underlying arrays, so their dtypes never change, and they raise a TypeError if casting complex to real. Other operators just call the out-of-place methods, so their dtypes may change. This PR only cleans up __init__ and does not touch those methods.
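
Plain NumPy arrays show the same distinction, which may help build intuition. This is an analogy only and does not use NetKet operators.

```python
import numpy as np

a = np.ones(2, dtype=np.float64)

# Out-of-place: a new array is created, so the dtype can change.
b = a + 1j

# In-place: the result must fit in the existing float64 buffer.
failed = False
try:
    a += 1j
except TypeError:
    # NumPy refuses to cast the complex result back to float64.
    failed = True

print(b.dtype, a.dtype, failed)
```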

Discrete operators

Their dtype is the dtype of matrix elements, and we already explicitly ensured that in most cases. The dtype of expectation values will be inferred from both the matrix elements and the wave function.

For Ising and IsingJax, previously the dtype was inferred to be int if J and h are ints, now it's promoted to float.

For Heisenberg, previously there was no argument dtype in __init__, now we've added it and it's handled by LocalOperator.

For BoseHubbard, previously we didn't cast it to x32 when x64 is disabled, now we cast it. Although this operator doesn't have a JAX version yet, we still do the cast to make things more consistent.

For LocalLiouvillian, we also cast it to x32 when x64 is disabled. Previously we cast the specified real dtype to complex, now we raise a TypeError because it's unphysical.

Pauli strings

Now we infer that the dtype is complex if any string has an odd number of Y (previously if it has any Y), or if any weight is complex. Previously we cast the specified real dtype to complex when needed, now we raise a TypeError because it's unphysical.
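
The parity rule follows from the fact that a product of two Y matrices is real. A minimal NumPy sketch; `needs_complex` is a hypothetical helper that mirrors the rule described above and is not the NetKet implementation.

```python
import numpy as np

Y = np.array([[0, -1j], [1j, 0]])


def needs_complex(strings, weights):
    """Hypothetical check: complex if any string has an odd number of Y
    or the weights have a complex dtype."""
    odd_y = any(s.count("Y") % 2 == 1 for s in strings)
    return odd_y or bool(np.iscomplexobj(weights))


# Y (x) Y has only real entries, so "YY" alone does not force a complex dtype.
YY = np.kron(Y, Y)
print(np.isreal(YY).all())
print(needs_complex(["YY", "XZ"], [1.0, 0.5]))
print(needs_complex(["XY"], [1.0]))
```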

I've added a _reduce_pauli_string in __init__. After that, those strings with Y cannot cancel out.

It's still possible that a term has an odd number of Y and a purely imaginary weight, so that the whole Hamiltonian is real and non-Hermitian, and we have to work with a complex array of weights. We don't specially handle that for now.
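
The simplest instance of this corner case is a single Y with weight 1j, shown here with plain NumPy as an illustration:

```python
import numpy as np

Y = np.array([[0, -1j], [1j, 0]])

# One Y (odd count) with a purely imaginary weight.
H = 1j * Y

print(np.isreal(H).all())            # every entry has zero imaginary part
print(np.allclose(H, H.conj().T))    # Hermiticity check
print(H.dtype)                       # the array itself is still complex
```

The matrix entries are real, but the matrix is not Hermitian, and the weight that produces it has to be stored as a complex number.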

Also, there is a (maybe subtle) change: When dtype is not specified and cannot be inferred from weights, previously it defaulted to x32 because of this line, now it defaults to x64 when x64 is enabled.

Continuous operators

They don't compute matrix elements, and we never explicitly cast the output dtype, so I think their dtype should behave like param_dtype in Flax.

For PotentialEnergy and KineticEnergy, the dtype is only used to cast coefficient and mass, so nothing is changed.

For SumOperator, there is a subtle change: Previously we only inferred the dtype from the operators, now we also infer it from the coefficients. Note that the dtype is only the dtype of coefficients, and the dtypes of operators are unaffected. The reason is like how we define Flax modules: If we write

small_module_1 = SmallModule(param_dtype=dtype1)
small_module_2 = SmallModule(param_dtype=dtype2)
big_module = BigModule(small_module_1, small_module_2, param_dtype=dtype3)

then we usually expect that BigModule will not change the dtypes of parameters in small_module_1 and small_module_2, and only the parameters newly defined in BigModule have dtype3. On the contrary, if we write

big_module = BigModule(module_type="SmallModule", param_dtype=dtype3)

then we expect that it constructs some SmallModule using dtype3.
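
The first convention can be sketched with toy classes. `ToyOperator` and `ToySum` are hypothetical stand-ins for illustration and do not reflect the actual SumOperator implementation; the point is only that the dtype passed to the sum affects its own coefficients and leaves the wrapped operators untouched.

```python
from dataclasses import dataclass

import numpy as np


@dataclass
class ToyOperator:
    coefficient: np.ndarray


class ToySum:
    def __init__(self, operators, coefficients, dtype=None):
        # The operators are stored as-is and are never recast.
        self.operators = tuple(operators)
        # Only the coefficients of the sum use the requested dtype.
        self.coefficients = np.asarray(coefficients, dtype=dtype)

    @property
    def dtype(self):
        return self.coefficients.dtype


op1 = ToyOperator(np.asarray(1.0, dtype=np.float32))
op2 = ToyOperator(np.asarray(2.0, dtype=np.float64))
total = ToySum([op1, op2], [0.5, 0.5], dtype=np.complex64)

print(total.dtype, op1.coefficient.dtype, op2.coefficient.dtype)
```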

codecov · 2024-01-19T18:40:25Z

Codecov Report

Attention: 2 lines in your changes are missing coverage. Please review.

Comparison is base (bb86b02) 82.53% compared to head (e86e625) 82.75%.
Report is 3 commits behind head on master.

❗ Current head e86e625 differs from pull request most recent head 124d2a2. Consider uploading reports for the commit 124d2a2 to get more accurate results

Files	Patch %	Lines
netket/utils/numbers.py	60.00%	1 Missing and 1 partial ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #1697      +/-   ##
==========================================
+ Coverage   82.53%   82.75%   +0.21%     
==========================================
  Files         298      298              
  Lines       18304    18224      -80     
  Branches     2763     3504     +741     
==========================================
- Hits        15107    15081      -26     
+ Misses       2512     2468      -44     
+ Partials      685      675      -10


PhilipVinc

Thank you @wdphy16, I think this is very good, and it was long needed.

My main thoughts are the following:

I don't want netket to throw tons of warnings if using numba operators with x64 disabled.
I'm not sure it makes sense to talk about weak types for the dtype of numba operators, as those will be arrays and jax will not tag them as weak (I think).

Therefore, I think that we should somehow default to having single-precision operators (where relevant) if x64 is disabled, and double-precision operators if x64 is enabled.

What do you think?

netket/operator/_continuous_operator.py

netket/operator/_local_operator/convert.py

netket/operator/_local_operator/helpers.py

test/operator/test_continuous_operator.py


PhilipVinc

Thank you @wdphy16, I think this is good to be merged.

As tests pass, please feel free to add one line to the CHANGELOG and then merge.

netket/operator/_ising/base.py

PhilipVinc · 2024-01-22T23:12:06Z

@wdphy16 Ah sorry, before merging, I forgot: can you double-check that operators in experimental (fermions) are addressed as well?

PhilipVinc · 2024-01-22T23:31:35Z

@wdphy16 I added the changelog and commit a3d1c54 should fix fermions as well. Can you confirm?

netket/jax/_utils_dtype.py

PhilipVinc

That's good for me. Can I merge?

wdphy16 added 6 commits January 19, 2024 16:31

Clean up continuous operators

dec2634

Clean up LocalLiouvillian

e8adbba

Add dtype to Heisenberg

8caab90

Minor fix

c28055d

Clean up PauliStrings

126832e

Ruff

1c4a325

wdphy16 added 5 commits January 19, 2024 22:47

Defaults to float

8749611

Test PauliStrings inplace operations

efcbf47

Add annotation for sign_rule

3062eae

Reduce Pauli strings in PauliStrings.__init__

217ddc5

Add annotation for sign_rule

cc57452

PhilipVinc reviewed Jan 20, 2024


wdphy16 added 7 commits January 21, 2024 12:18

Always use promote_types from jnp

e3c0535

Always use jax.dtypes.canonicalize_dtype for JAX-aware operators; always promote from float when dtype is None

1956dca

Convert complex to real in pack_internals

8e53323

Add test_enforce_float_Ising

263bfac

Use np.isreal

87aea1a

Use jax.dtypes.canonicalize_dtype for non-JAX-aware operators

b87d6d2

Test energy dtype in test_continuous_operator.py

e21016c

PhilipVinc approved these changes Jan 22, 2024


netket/operator/_ising/base.py

PhilipVinc added 4 commits January 23, 2024 00:26

minor reformats/docstring fixes

afc2b2f

fix dtypes for fermion operators

a3d1c54

add changelog

f602a5f

black .

9e81982

wdphy16 added 2 commits January 23, 2024 12:21

Add nk.jax.canonicalize_dtypes

d8b63be

Fix typo

e86e625

PhilipVinc reviewed Jan 24, 2024


netket/jax/_utils_dtype.py

fix docstring

124d2a2

PhilipVinc approved these changes Jan 24, 2024


PhilipVinc merged commit b2d0f88 into netket:master Jan 24, 2024
9 checks passed

wdphy16 deleted the operator_dtype branch January 24, 2024 09:05
