fix: Numerous small performance and correctness issues #211
Conversation
```diff
 _Key = TypeVar("_Key")
 _Client = TypeVar("_Client")


 class ClientMultiplexer(Generic[_Key, _Client]):
-    _OpenedClientFactory = Callable[[], _Client]
+    _OpenedClientFactory = Callable[[_Key], _Client]
```
There was a lot of "self" time in get_or_create callers, spent constructing the factory closure to pass in. This is in the publish hot path, hence this change.
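The idea can be sketched as follows. This is a minimal illustration, not the library's actual `ClientMultiplexer` implementation: the factory takes the key directly, so callers construct it once instead of building a fresh closure on every `get_or_create` call.

```python
from typing import Callable, Dict, Generic, TypeVar

_Key = TypeVar("_Key")
_Client = TypeVar("_Client")


class ClientMultiplexer(Generic[_Key, _Client]):
    """Sketch: the factory is keyed, so no per-call lambda is needed."""

    def __init__(self, factory: Callable[[_Key], _Client]):
        self._factory = factory  # constructed once, reused for every key
        self._live_clients: Dict[_Key, _Client] = {}

    def get_or_create(self, key: _Key) -> _Client:
        client = self._live_clients.get(key)
        if client is None:
            client = self._factory(key)  # key passed in; no closure per call
            self._live_clients[key] = client
        return client
```

With the old `Callable[[], _Client]` signature, each caller had to allocate `lambda: make_client(key)` per publish; now the same factory object is reused for every key.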
```diff
     _closer: _ClientCloser
-    _lock: asyncio.Lock
-    _live_clients: Dict[_Key, _Client]
+    _live_clients: Dict[_Key, Awaitable[_Client]]
```
There was a lot of time spent acquiring the lock in the publish hot path; this change makes the lock unnecessary.
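A minimal sketch of why storing awaitables removes the need for an `asyncio.Lock` (illustrative only, not the actual implementation): dictionary access on the event loop never yields, so the future can be installed atomically before any `await` point, and concurrent callers share the same in-flight client.

```python
import asyncio
from typing import Awaitable, Callable, Dict, Generic, TypeVar

_Key = TypeVar("_Key")
_Client = TypeVar("_Client")


class ClientMultiplexer(Generic[_Key, _Client]):
    """Sketch: Dict[_Key, Awaitable[_Client]] makes the lock unnecessary."""

    def __init__(self, factory: Callable[[_Key], Awaitable[_Client]]):
        self._factory = factory
        self._live_clients: Dict[_Key, Awaitable[_Client]] = {}

    async def get_or_create(self, key: _Key) -> _Client:
        if key not in self._live_clients:
            # Installed synchronously, before any await: callers that race
            # here see the same future and never double-create a client.
            self._live_clients[key] = asyncio.ensure_future(self._factory(key))
        return await self._live_clients[key]
```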
```diff
@@ -40,22 +38,16 @@ class MultiplexedSubscriberClient(SubscriberClientInterface):
     _executor: ThreadPoolExecutor
     _underlying_factory: AsyncSubscriberFactory

-    _multiplexer: ClientMultiplexer[SubscriptionPath, StreamingPullFuture]
+    _lock: Lock
+    _live_clients: Set[StreamingPullFuture]
```
Previously, the subscriber enforced that there was only one open subscription stream per subscription per client, but there's actually no need for this restriction.
```diff
         return item.response_future

-    def should_flush(self) -> bool:
-        return self._tester.test(item.request for item in self._requests)
+    def size(self) -> BatchSize:
```
Most of the CPU time in the publish path was in should_flush calls, which had to construct an iterable and iterate over every buffered request on each call.
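The fix can be sketched like this (an illustrative reconstruction, not the library's actual batching code): keep a running `BatchSize` that is updated in O(1) on append, so checking the batch no longer walks every buffered request.

```python
from dataclasses import dataclass


@dataclass
class BatchSize:
    element_count: int = 0
    byte_count: int = 0

    def __add__(self, other: "BatchSize") -> "BatchSize":
        return BatchSize(
            self.element_count + other.element_count,
            self.byte_count + other.byte_count,
        )


class Batcher:
    """Sketch: running size replaces an O(n) walk per should_flush call."""

    def __init__(self) -> None:
        self._size = BatchSize()

    def append(self, request_byte_size: int) -> None:
        # O(1) incremental update instead of re-iterating every request.
        self._size += BatchSize(element_count=1, byte_count=request_byte_size)

    def size(self) -> BatchSize:
        return self._size
```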
```diff
             element_count=1, byte_count=PubSubMessage.pb(request).ByteSize()
         )

+    def _should_flush(self) -> bool:
```
If you'll note on the left, batching settings were previously ignored.
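A hedged sketch of what honoring the settings looks like. The threshold names (`max_messages`, `max_bytes`) are illustrative assumptions, not the library's actual `BatchingSettings` fields: the flush check compares the running batch size against the configured limits instead of ignoring them.

```python
from dataclasses import dataclass


@dataclass
class BatchingSettings:
    # hypothetical threshold names, for illustration only
    max_messages: int = 100
    max_bytes: int = 1_000_000


@dataclass
class BatchSize:
    element_count: int = 0
    byte_count: int = 0


def should_flush(size: BatchSize, settings: BatchingSettings) -> bool:
    # Flush once either configured limit is reached,
    # rather than ignoring the settings entirely.
    return (
        size.element_count >= settings.max_messages
        or size.byte_count >= settings.max_bytes
    )
```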
🤖 I have created a release \*beep\* \*boop\*

---

### [1.1.1](https://www.github.com/googleapis/python-pubsublite/compare/v1.1.0...v1.1.1) (2021-09-07)

### Bug Fixes

* Add workaround for grpc/grpc#25364 ([#213](https://www.github.com/googleapis/python-pubsublite/issues/213)) ([e417bf3](https://www.github.com/googleapis/python-pubsublite/commit/e417bf39fe32c995e5ac2e0a807a10fee3f37d9f))
* Numerous small performance and correctness issues ([#211](https://www.github.com/googleapis/python-pubsublite/issues/211)) ([358a1d8](https://www.github.com/googleapis/python-pubsublite/commit/358a1d8a429086ee75373260eb087a9dd171e3e6))

---

This PR was generated with [Release Please](https://github.com/googleapis/release-please). See [documentation](https://github.com/googleapis/release-please#release-please).