Optimize hook calling a bit #280

bluetech · 2020-06-26T16:04:59Z

~~Note: includes PR #279, please ignore the duplicate commits (will rebase once that one is settled).~~

This PR starts to optimize the hook calling path, mostly for the benefit of pytest.

The following pytest file serves as a useful benchmark of pytest overhead:

import pytest
@pytest.mark.parametrize("x", range(5000))
def test_foo(x): pass

Before: 10752102 function calls (10196645 primitive calls) in 9.270 seconds
After: 10351436 function calls (9896147 primitive calls) in 8.918 seconds
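
The "function calls ... in ... seconds" figures above have the shape of a cProfile summary line. The exact command used to collect them is not stated here, so the following is only a minimal sketch of how such a line can be obtained with the standard `cProfile` and `pstats` modules. The helper name `profile_summary` is hypothetical.

```python
import cProfile
import io
import pstats


def profile_summary(func, *args, **kwargs):
    """Run func under cProfile and return the summary line of the report.

    Hypothetical helper for illustration; not part of pluggy or pytest.
    """
    profiler = cProfile.Profile()
    profiler.enable()
    try:
        func(*args, **kwargs)
    finally:
        profiler.disable()

    # Render the pstats report into a string buffer instead of stdout.
    buf = io.StringIO()
    pstats.Stats(profiler, stream=buf).print_stats()

    # The summary line reads "N function calls [(M primitive calls)] in S seconds".
    for line in buf.getvalue().splitlines():
        if "function calls" in line:
            return line.strip()
    return ""


if __name__ == "__main__":
    print(profile_summary(sum, range(1000)))
```

The "(M primitive calls)" part only appears when the total and primitive (non-recursive) call counts differ.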

The main change stems from looking at the stack trace of a hook call. Before this PR it was this (pytest often has several of these nested):

File "/pytest/src/_pytest/python.py", line 1561, in runtest
  self.ihook.pytest_pyfunc_call(pyfuncitem=self)
File "/pytest/.tox/venv/lib/python3.8/site-packages/pluggy/hooks.py", line 286, in __call__
  return self._hookexec(self, self.get_hookimpls(), kwargs)
File "/pytest/.tox/venv/lib/python3.8/site-packages/pluggy/manager.py", line 93, in _hookexec
  return self._inner_hookexec(hook, methods, kwargs)
File "/pytest/.tox/venv/lib/python3.8/site-packages/pluggy/manager.py", line 84, in <lambda>
  self._inner_hookexec = lambda hook, methods, kwargs: hook.multicall(
File "/pytest/.tox/venv/lib/python3.8/site-packages/pluggy/callers.py", line 208, in _multicall
  return outcome.get_result()
File "/pytest/.tox/venv/lib/python3.8/site-packages/pluggy/callers.py", line 80, in get_result
  raise ex[1].with_traceback(ex[2])
File "/pytest/.tox/venv/lib/python3.8/site-packages/pluggy/callers.py", line 187, in _multicall
  res = hook_impl.function(*args)
File "/pytest/src/_pytest/python.py", line 177, in pytest_pyfunc_call
  result = testfunction(**testargs)

This PR removes the <lambda> frame, and follow-up PRs will try to remove the _hookexec frame and the duplicate _multicall frame (which is cosmetic).

bluetech · 2020-06-26T16:05:49Z

I forgot to mention, it is intended to be reviewed commit-by-commit.

goodboy · 2020-06-26T21:25:59Z

@bluetech oh nice!

I'll try to look at this over the weekend 👍

bluetech · 2020-06-29T12:50:54Z

Rebased, doesn't depend on other PRs now.

I also removed one of the micro-optimization commits. On second thought, it's probably not worth it and is distracting.

goodboy · 2020-06-30T13:44:54Z

src/pluggy/hooks.py

-        return self._hookexec(self, self.get_hookimpls(), kwargs)
+        # This is written to avoid expensive operations when not needed.
+        if self.spec:
+            for argname in self.spec.argnames:


Is looping through the args every time, hoping none are missing from the call, faster than always computing what's missing with a set difference?

It seems like this will be slower in the average case (assuming calls are written correctly most of the time).
I'm not sure I can think of a case where this is faster.
I don't think a for loop will ever be faster than a set difference, but I could be wrong.

bluetech · 2020-06-30

Here is an unscientific benchmark (Python 3.8, Arch Linux), which also includes a variant that uses issubset. It only checks the happy path.

import timeit

def old(argnames, kwargs):
    # Always build a set and compute the difference.
    if argnames:
        notincall = set(argnames) - kwargs.keys()
        if notincall:
            pass

def old_subset(argnames, kwargs):
    # Always build a set, but only test for containment.
    if argnames:
        if not set(argnames).issubset(kwargs.keys()):
            pass

def new(argnames, kwargs):
    # Plain membership tests; stop at the first missing argument.
    for argname in argnames:
        if argname not in kwargs:
            break

kwargs = {'a': 0, 'b': 1, 'c': 2, 'd': 3, 'e': 4}
argnames = list(kwargs)
print("old: ", timeit.timeit("old(argnames, kwargs)", "from __main__ import old, argnames, kwargs"))
print("old_subset:", timeit.timeit("old_subset(argnames, kwargs)", "from __main__ import old_subset, argnames, kwargs"))
print("new: ", timeit.timeit("new(argnames, kwargs)", "from __main__ import new, argnames, kwargs"))

Output:

old: 0.7920419139554724
old_subset: 0.6716860989108682
new: 0.2587350399699062

goodboy · 2020-06-30

You're of my kind 😸

new: 0.2587350399699062

So slick; I guess for the win 🏄‍♂️
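
For readers following the thread, the idea under discussion can be sketched as a check with a cheap happy path and an expensive path that only runs when something is actually missing. This is a hedged illustration based on the diff excerpt and benchmark above, not the code that was merged: the function name `check_call_args`, its return value and the warning text are all assumptions.

```python
import warnings


def check_call_args(argnames, kwargs):
    """Warn if any declared argument is absent from the call.

    Hypothetical helper: returns the missing names as a tuple (empty if none).
    """
    for argname in argnames:
        if argname not in kwargs:
            # Slow path, reached only on a bad call: compute the full
            # set of missing names once, then stop looping.
            missing = tuple(set(argnames) - kwargs.keys())
            warnings.warn(
                f"declared argument(s) {missing} not passed in this call",
                UserWarning,
                stacklevel=2,
            )
            return missing
    # Happy path: only dict membership tests were performed, no set was built.
    return ()


if __name__ == "__main__":
    print(check_call_args(["a", "b"], {"a": 1, "b": 2}))
    print(check_call_args(["a", "b", "c"], {"a": 1}))
```

The trade-off is that a correct call pays for one `in` test per declared argument and never allocates a set, which is consistent with the timings reported above for a five-argument call.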

goodboy · 2020-06-30T13:46:06Z

src/pluggy/hooks.py

+                    )
+                    break
+
+            firstresult = self.spec.opts.get("firstresult")


This is exactly what I had in mind :)

goodboy

Nice job @bluetech.

Really like the touches to the benchmark tests as well 👍

Really superb work.

bluetech mentioned this pull request Jun 26, 2020

Move tracing "intercept" methods to _HookCaller? #262

Open

bluetech force-pushed the optimize-call2 branch 2 times, most recently from 2d5426b to c7b3ba9 Compare June 29, 2020 12:47

bluetech added 4 commits June 29, 2020 15:49

Remove redundant set() on dict keys view

98f1b58

A dict keys view supports set-like operations.

Optimize argument check in _HookCaller.__call__

bc79719

This check is quite expensive, try to reduce its overhead.

Don't include setup overhead in benchmark measurement

d22672d

Avoid lambda in hookcall path

0aa2462

PluginManager adds an adapter lambda to the hook call path. This adds overhead and makes the stack trace more messy. Change the call convention such that the adaptation is not needed, and remove the lambda.
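
As a small illustration of the first commit message in this list ("A dict keys view supports set-like operations"), the sketch below shows a few such operations on a keys view without copying it into a `set` first. The variable names are illustrative only.

```python
kwargs = {"a": 0, "b": 1}
argnames = ["a", "b", "c"]

# Difference against the keys view directly; no set(kwargs.keys()) copy is needed.
missing = set(argnames) - kwargs.keys()

# Keys views can be compared with sets using subset/superset operators.
all_present = kwargs.keys() >= set(argnames)

# Intersection returns a regular set.
common = kwargs.keys() & set(argnames)

if __name__ == "__main__":
    print(missing, all_present, sorted(common))
```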

bluetech force-pushed the optimize-call2 branch from c7b3ba9 to 0aa2462 Compare June 29, 2020 12:49

goodboy approved these changes Jun 30, 2020

goodboy requested review from nicoddemus and RonnyPfannschmidt June 30, 2020 15:16

RonnyPfannschmidt approved these changes Jul 1, 2020

goodboy merged commit 0a064fe into pytest-dev:master Jul 2, 2020

bluetech deleted the optimize-call2 branch September 29, 2020 06:06
