Order variables by closeness to executing statement in pure_eval #807

alexmojaki · 2020-08-31T21:50:35Z

Part of #805

The idea here is to prioritise values which appear closer to the executing statement. The dict of values is ordered so that trimming removes lower priority values later on.

However this doesn't work in python 3.5 because the ordered dict is actually converted to a regular dict which loses the order:

sentry-python/sentry_sdk/serializer.py

Lines 286 to 288 in 4d91fe0

    
           # Create temporary copy here to avoid calling too much code that 
        
           # might mutate our dictionary while we're still iterating over it. 
        
           obj = dict(iteritems(obj))

It seems that my options are:

Remove the above line and fix the errors that arise.
Keep the dict ordered when making the copy.
Preemptively trim in the pure_eval integration so order no longer matters.

What do you think @untitaker ?

vmarkovtsev · 2020-09-01T06:35:52Z

Note: this will automatically include the function arguments, because, for all the lines except the lowest in the call stack, they lie on an executing statement 👍 I am super excited and eager to try this 🎉

untitaker

tests fail on tox -e py3.5-pure_eval, but this looks good

untitaker · 2020-09-01T07:23:19Z

tests/integrations/pure_eval/test_pure_eval.py

+            "u",
+            "y",
+        ]
+        if sys.version_info[:2] == (3, 5):


Instead of this conditional can't we do assert set(frame_vars.keys()) == set(expected_keys)

I think you misunderstood. frame_vars.keys() is set-like and so assert frame_vars.keys() == set(expected_keys) would work in all versions. assert list(frame_vars.keys()) == expected_keys is a stronger assertion to make extra sure (when possible) that values are prioritised as expected. The tests are fine as they are, the reason for the failure in 3.5 is that the implementation actually doesn't work and the values are actually wrong.

The problem is that I return OrderedDict above so that the trimming in serialize will keep the first 10 items of the dict, but just before the trimming the OrderedDict gets converted to a dict (see the PR description) which breaks the order in 3.5 and thus the implementation (the wrong values get trimmed). I'm asking if I should just trim the values within the pure_eval integration and forget the OrderedDict, or try to preserve order in serialize.

To be clear both options should be equivalent right now, but if serialize changes its algorithm and starts trimming things more cleverly in the future, it will be better if it has the full dict of variables in order of priority rather than a dict that was eagerly/preemptively trimmed by pure_eval.

Ah right, I recommend going for option 3 but avoiding hardcoding the number 10 (instead importing the var). I don't think there's a safe alternative to dict outside of pure_eval context, so I'd like to avoid changes to serializer.

but if serialize changes its algorithm and starts trimming things more cleverly in the future

I doubt this will happen in serialization, let's assume pure_eval always knows better (seems likely given the amount of extra analysis it does)

untitaker · 2020-09-02T08:45:16Z

Excellent, thank you!

Order variables by closeness to executing statement in pure_eval

64da5c0

untitaker approved these changes Sep 1, 2020

View reviewed changes

Forget order, just trim the values in pure_eval

e004c51

untitaker merged commit 6541785 into getsentry:master Sep 2, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Order variables by closeness to executing statement in pure_eval #807

Order variables by closeness to executing statement in pure_eval #807

alexmojaki commented Aug 31, 2020

vmarkovtsev commented Sep 1, 2020

untitaker left a comment

untitaker Sep 1, 2020

alexmojaki Sep 1, 2020

alexmojaki Sep 1, 2020

untitaker Sep 1, 2020

untitaker Sep 1, 2020

alexmojaki Sep 1, 2020

untitaker commented Sep 2, 2020

	# Create temporary copy here to avoid calling too much code that
	# might mutate our dictionary while we're still iterating over it.
	obj = dict(iteritems(obj))

Order variables by closeness to executing statement in pure_eval #807

Order variables by closeness to executing statement in pure_eval #807

Conversation

alexmojaki commented Aug 31, 2020

vmarkovtsev commented Sep 1, 2020

untitaker left a comment

Choose a reason for hiding this comment

untitaker Sep 1, 2020

Choose a reason for hiding this comment

alexmojaki Sep 1, 2020

Choose a reason for hiding this comment

alexmojaki Sep 1, 2020

Choose a reason for hiding this comment

untitaker Sep 1, 2020

Choose a reason for hiding this comment

untitaker Sep 1, 2020

Choose a reason for hiding this comment

alexmojaki Sep 1, 2020

Choose a reason for hiding this comment

untitaker commented Sep 2, 2020