Various core and test changes #3835

tybug · 2024-01-10T20:36:19Z

Split from #3818. I could split this further if desired, with all the draw_bits changes in a single PR.

Changes:

move cu.choice to ConjectureData
improve minimal readability with nonlocal
migrate away from draw_bits in various test files in preparation for its removal. Or at least, taking a back seat relative to the ir.

There were some test changes where the extra boolean draw from > 24 bit integers caused the test to fail.

hypothesis/hypothesis-python/src/hypothesis/internal/conjecture/data.py

Lines 1260 to 1262 in 980042e

    
           if bits > 24 and self.draw_boolean( 
        
               7 / 8, forced=None if forced is None else False 
        
           ):

I mostly remedied this by reducing the bit size down to 20 for these tests. It does make me wonder if we're doing the appropriate thing by biasing the distribution in this way at the ir level, rather than at the strategy level (integers()).

…ced_floats

…void multiple draws

tybug · 2024-01-10T22:01:42Z

Looks like test_can_learn_to_normalize_the_unnormalized became flaky due to this (e.g.). Will look into it.

Zac-HD · 2024-01-11T04:21:12Z

I mostly remedied this by reducing the bit size down to 20 for these tests. It does make me wonder if we're doing the appropriate thing by biasing the distribution in this way at the ir level, rather than at the strategy level (integers()).

Hmm, good question! IIRC we used 2**24 as a special value because it ensured that unicode characters were chosen without (additional) bias. I'd probably leave it alone for now, but we should do an audit of this kind of heuristic once we're using the IR everywhere - there are probably a bunch we'll be able to clean up then.

General thoughts on this PR: looks good to me. Two approaches you can take to this kind of pulled-out stage are to (1) defer anything that we can't merge ~immediately to a future PR, or (2) keep working on it here until it passes to make future PRs easier. I generally favor a bit of (2) followed by (1) if I'm feeling stuck or just slowed down; it's basically the same question as pulling out part in the first place 😁

tybug · 2024-01-11T19:11:49Z

that approach makes sense 👍.

In this case I didn't get stuck for too long on test_can_learn_to_normalize_the_unnormalized. It was a rare case where a test is failing because we did too good of a job 😄. dfa.normalize only adds a new normalization if the shrinking differs within 100 calls. With draw_bits, the shrinking differed within that time frame. After using the IR, the shrinking was usually consistent (hence the flakiness). Which means we were doing a better job shrinking to a normalized example with the IR than without!

The solution? Make the property even harder to shrink to a normalized value.

Zac-HD

🎉

Zac-HD · 2024-01-11T04:01:51Z

hypothesis-python/tests/conjecture/test_optimiser.py

-            while data.draw_bits(2) == 3:
+            # TODO this test fails with data.draw_boolean(0.25). Does the hill
+            # climbing optimizer just not like the bit representation of boolean
+            # draws, or do we have a deeper bug here?
+            while data.draw_integer(0, 3) == 3:


I'm trusting you to track this; I'd probably link to this code from a comment on the IR issue after merging.

left a task in the pr description 👍 (less likely for me to lose it than a comment, and github doesn't easily let me leave reviews on unrelated files.)

tybug added 17 commits January 10, 2024 14:59

move cu.choice to ConjectureData

0690201

migrate draw_bits in test_shrinker

eb8035b

migrate draw_bits in test_optimiser

c63449d

remove test duplicated in test_pareto

350b868

migrate draw_bits in test_engine

417c682

migrate draw_bits in test_pareto

2c4ee91

migrate draw_bits in test_test_data

d7c9709

migrate draw_bits in test_shrinking_dfas

c2a6f3b

remove test_draw_write_round_trip. This is better covered by test_for…

ba90e9e

…ced_floats

improve minimal readability with nonlocal

f6b7349

some missed draw_bits migrations in various files

fcb6596

avoid 32 bit integers which draws more data

ecdeadf

increase test_last_block_length buffer to account for >24 bit integers

e494101

revert test changes which relied on changes in the other pr

9a5caf6

reduce test_does_not_keep_generating_when_multiple_bugs bit size to a…

6425301

…void multiple draws

linting

484ad62

fix test_child_indices for additional BIASED_COIN_LABEL

9c53888

tybug requested a review from Zac-HD as a code owner January 10, 2024 20:36

tybug mentioned this pull request Jan 10, 2024

Migrate DataTree to the new IR #3818

Merged

5 tasks

add release note

103020c

tybug added 2 commits January 11, 2024 09:57

mark y as unused

3a77ec0

make non_normalized_test_function even harder to normalize

a2513d4

fix floats canonlicalization test not testing what it should

5a77f14

Zac-HD approved these changes Jan 11, 2024

View reviewed changes

Zac-HD merged commit 00d19ca into HypothesisWorks:master Jan 11, 2024
47 checks passed

tybug deleted the various-core-touchups branch January 11, 2024 22:55

tybug mentioned this pull request Jan 15, 2024

Test touchups in preparation for ir migration #3844

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Various core and test changes #3835

Various core and test changes #3835

tybug commented Jan 10, 2024

tybug commented Jan 10, 2024

Zac-HD commented Jan 11, 2024

tybug commented Jan 11, 2024 •

edited

Loading

Zac-HD left a comment

Zac-HD Jan 11, 2024

tybug Jan 11, 2024

	if bits > 24 and self.draw_boolean(
	7 / 8, forced=None if forced is None else False
	):

Various core and test changes #3835

Various core and test changes #3835

Conversation

tybug commented Jan 10, 2024

tybug commented Jan 10, 2024

Zac-HD commented Jan 11, 2024

tybug commented Jan 11, 2024 • edited Loading

Zac-HD left a comment

Choose a reason for hiding this comment

Zac-HD Jan 11, 2024

Choose a reason for hiding this comment

tybug Jan 11, 2024

Choose a reason for hiding this comment

tybug commented Jan 11, 2024 •

edited

Loading