feat: Provide AsyncIO support for generated code #365

lidizheng · 2020-04-09T00:50:16Z

Let's see how the CI reacts to the change.
The tests might not pass until the changes in googleapis/python-api-core#22 are released.

software-dov

I know this is WIP, but can you also file an issue to track adding async client support to the ads templates?

software-dov · 2020-04-09T17:34:39Z

gapic/schema/wrappers.py

+    @utils.cached_property
+    def client_output_async(self):
+        """Return the output from the client layer.


It looks like there's a fair bit of duplication between client_output and client_output_async. Would it be worth combining and parameterizing them? It's not obvious to me just by eyeballing how they differ.

Good idea! I will create a function to hide the common logic. In the next pass, I will try to improve readability using a similar pattern.

Thanks. If it turns out to be tricky then no need to twist the logic too much.

I didn't find a way to unify this specific method without a regression.

The client_output is cached, so it can't accept parameters;

The output of client_output is frozen, so we can't modify the content in it (we can create a new one but it is worse than what we have now).

Keep client_output and client_output_async as the cached layer, then move the guts of the logic into an internal method that takes a parameter. The internal method should not be cached and should not be a property.

Created the common internal method in 4ecf34d.
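
The layering suggested above can be sketched roughly as follows. This is a minimal illustration, not the code from 4ecf34d: the `Method` class, the `_client_output` helper, and the `enable_asyncio` parameter are hypothetical names, and `functools.cached_property` stands in for the project's own `utils.cached_property`.

```python
from functools import cached_property
from typing import Tuple


class Method:
    """Stand-in for a wrapper class (hypothetical, not the real gapic type)."""

    def __init__(self, name: str) -> None:
        self.name = name
        self.calls = 0  # counts how often the shared logic actually runs

    def _client_output(self, enable_asyncio: bool) -> Tuple[str, str]:
        # Shared logic: a plain method, neither cached nor a property,
        # so it is free to take a parameter.
        self.calls += 1
        kind = "async" if enable_asyncio else "sync"
        return (self.name, kind)

    @cached_property
    def client_output(self) -> Tuple[str, str]:
        # Cached layer for the synchronous surface.
        return self._client_output(enable_asyncio=False)

    @cached_property
    def client_output_async(self) -> Tuple[str, str]:
        # Cached layer for the AsyncIO surface.
        return self._client_output(enable_asyncio=True)
```

Each public property is still computed once per instance, while the logic itself lives in a single place.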

lidizheng · 2020-04-14T00:19:15Z

This PR may need a smarter way to reuse existing logic. The newly generated class should not be a burden for incoming important features like #359.

lidizheng · 2020-04-14T20:00:26Z

To improve the maintainability of the microgenerator project, we might want to reduce the duplicated logic. Otherwise, new features will have to apply the same changes in multiple places, which is quite error-prone. Here are the options we have, for comparison:

1. Composition: each AsyncIO class contains an instance of the existing class;
2. Helper functions: promote logic duplicated between the existing class and the AsyncIO class to external functions;
3. Inheritance: create a common base class for both the existing class and the AsyncIO class (13fba42);
4. Duplication: duplicate the logic (a51bcd3).

Options 1, 2, 3 solve the maintainability issue, but have different drawbacks. I can change to either one, if you see fit. WDYT? @software-dov @busunkim96

For option 1, proxying methods from one class to another requires quite a lot of boilerplate. For example, the existing Client class has not only instance methods, but also class variables and class methods. The code looks like class_method_a = _comp_class.method_a, method_b = _comp_instance.method_b. The metaclass pattern makes this option even worse, since it requires its own composition logic. So, the implementation might look hacky.

For option 2, the issue with helper functions is that the duplicated logic usually modifies member variables. Passing self out and allowing an external function to update it is less than ideal. Also, helper functions don't solve the member variable initialization issue. The member variables still need to be defined explicitly.

For option 3, inheritance increases code complexity and challenges the scope of the interface abstraction. For example, it is unclear whether the MTLS feature should apply to all transports or only to the gRPC transport. It also makes it harder for users to understand the logic, since they need to read multiple classes instead of one.
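
To make the boilerplate concern for option 1 concrete, here is a minimal sketch of the composition pattern. The `Client` and `AsyncClient` classes, their members, and the endpoint string are all hypothetical stand-ins, not the generated code from this PR.

```python
import asyncio


class Client:
    """Stand-in for an existing synchronous client (hypothetical)."""

    DEFAULT_ENDPOINT = "example.googleapis.com"  # placeholder value

    def __init__(self, endpoint=None):
        self.endpoint = endpoint or self.DEFAULT_ENDPOINT

    @classmethod
    def from_endpoint(cls, endpoint):
        return cls(endpoint=endpoint)

    def describe(self):
        return f"client for {self.endpoint}"


class AsyncClient:
    """Composition: holds a Client and proxies the shared pieces by hand."""

    # Class-level members have to be forwarded one by one.
    DEFAULT_ENDPOINT = Client.DEFAULT_ENDPOINT

    def __init__(self, endpoint=None):
        self._client = Client(endpoint=endpoint)

    @classmethod
    def from_endpoint(cls, endpoint):
        return cls(endpoint=endpoint)

    @property
    def endpoint(self):
        # Instance state is read through the wrapped client.
        return self._client.endpoint

    async def describe(self):
        # Only the call surface changes; the logic stays in Client.
        return self._client.describe()
```

Every class variable, class method, and property needs its own forwarding line, which is the boilerplate cost described above.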

software-dov · 2020-04-15T21:46:44Z

My preferences would be 2, then 1, then 3, then 4. @arithmetic1728 has confirmed that mTLS logic should eventually apply to http transports, and it presumably will also apply to asynchronous clients. I could be persuaded (and please do, if experiments indicate it!) that composition is a less gross hack than helper functions.
With helpers we need to be careful not to wind up duplicating the downsides of inheritance: if a function takes a Client OR an AsyncClient, with explicit type checking and separate logic paths, then that's arguably worse. If we tack on too many accessor methods to facilitate the helper functions, we've just split the implementation between 2 and 3.

I think, in light of the above, I'm leaning more towards composition as a general strategy.

lidizheng · 2020-04-20T17:34:13Z

@software-dov I have experimented with the composition pattern to unify logic between Client and AsyncClient, PTAL at commit 3cc730c.

The composition pattern doesn't work cleanly with the Transport classes because their function signatures differ. AsyncTransport stubs return Awaitable[Response] (a coroutine), while gRPC Transport stubs return the Response type directly. Also, the channel objects have different types (grpc.Channel vs. aio.Channel). The options are:

1. Have a common parent class to make the types more generic (see 3cc730c);
2. Similar to option 1, but use composition instead of inheritance to weave together the common class and the two public transport classes;
3. Loosen the type restrictions on the Transport classes, so that GrpcTransport can compose the AsyncTransport class;
4. Let the logic be copied twice (see e176020).

@software-dov WDYT?

plamut · 2020-04-21T17:41:48Z

@lidizheng If I understand correctly, we want type compatibility between the transport classes grpc.Channel and aio.Channel? Would it help if we cheat by creating an abstract Channel base class and register the two existing Channel classes as its "virtual subclasses"?
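
A rough sketch of the "virtual subclass" idea, using only the standard library. `SyncChannel` and `AsyncChannel` are stand-ins for grpc.Channel and aio.Channel (the real classes are not imported here), and `AnyChannel` is a hypothetical name.

```python
from abc import ABC


class SyncChannel:
    """Stand-in for grpc.Channel."""


class AsyncChannel:
    """Stand-in for the AsyncIO channel class."""


class AnyChannel(ABC):
    """Hypothetical umbrella type for both channel flavours."""


# Registration makes isinstance/issubclass succeed without touching
# the inheritance chain of the registered classes.
AnyChannel.register(SyncChannel)
AnyChannel.register(AsyncChannel)


def describe(channel: AnyChannel) -> str:
    if not isinstance(channel, AnyChannel):
        raise TypeError("expected a registered channel type")
    return type(channel).__name__
```

One caveat worth checking before relying on this: registration is a run-time mechanism, and static checkers such as mypy may not treat registered classes as subtypes, so the annotation alone may not silence the linter.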

lidizheng · 2020-04-21T17:50:44Z

For channels, if we loosen the type check, it is equivalent to option 3. An abstract Channel base class works. Another option is to type it with Union[grpc.Channel, aio.Channel]. That stops the linter from complaining, but it might be error-prone if a user passes an AsyncIO channel into a normal gRPC Transport.

software-dov · 2020-04-21T19:14:50Z

Possibly bad idea: is there any way we can rewrite the synchronous Transport in terms of the async and just force explicit awaits?

I'm a little lost reading through the proposed diffs and the existing aio and regular grpc code. If I understand correctly, the main problem is the return type/semantics of the transport stubs.
If we can come up with a reasonable solution for that, it looks like everything else is a workable problem.

lidizheng · 2020-04-21T23:56:03Z

I'm a little lost reading through the proposed diffs and the existing aio and regular grpc code. If I understand correctly, the main problem is the return type/semantics of the transport stubs.
If we can come up with a reasonable solution for that, it looks like everything else is a workable problem.

Yes, the composition pattern works except for the types. The complaint can be suppressed by typing.cast, if you don't think that is hacky.
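
For reference, a small sketch of what typing.cast does and does not do. The `sync_stub` function and the `AsyncStub` alias are hypothetical and only illustrate the mechanism.

```python
from typing import Awaitable, Callable, cast


def sync_stub(request: str) -> str:
    # A plain synchronous callable.
    return request.upper()


# A slot declared with the asynchronous signature.
AsyncStub = Callable[[str], Awaitable[str]]

# cast() only informs the type checker; it returns its argument unchanged
# and performs no conversion or validation at run time.
stub = cast(AsyncStub, sync_stub)
```

The checker now believes `stub` returns an awaitable, but calling it still returns a plain string, so the mismatch is hidden rather than resolved.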

Possibly bad idea: is there any way we can rewrite the synchronous Transport in terms of the async and just force explicit awaits?

While writing gRPC AsyncIO, I found that even though AsyncIO is the official asynchronous library, people still appreciate the ability to use normal synchronous Python.

codecov · 2020-06-04T21:48:04Z

Codecov Report

Merging #365 into master will not change coverage.
The diff coverage is 100.00%.

@@            Coverage Diff            @@
##            master      #365   +/-   ##
=========================================
  Coverage   100.00%   100.00%           
=========================================
  Files           26        26           
  Lines         1453      1466   +13     
  Branches       300       300           
=========================================
+ Hits          1453      1466   +13

Impacted Files	Coverage Δ
gapic/schema/wrappers.py	`100.00% <100.00%> (ø)`

Δ = absolute <relative> (impact), ø = not affected, ? = missing data

software-dov · 2020-06-08T20:50:40Z

gapic/templates/tests/unit/%name_%version/%sub/test_%service.py.j2

@@ -789,6 +785,12 @@ def test_transport_instance():
    client = {{ service.client_name }}(transport=transport)
    assert client._transport is transport

+def test_transport_instance_2():


What is this test verifying? That None credentials default to anonymous?

I wanted to test whether the transport can initialize with no arguments. I just learnt that to use an insecure channel, we need anonymous credentials. So, this case is removed.

software-dov · 2020-06-10T20:27:21Z

gapic/templates/%namespace/%name_%version/%sub/services/%service/transports/grpc.py.j2

+        if scopes is None:
+            scopes = cls.AUTH_SCOPES


Personal preference/style nit:
How about

scopes = cls.AUTH_SCOPES if scopes is None else scopes

Updated to scopes = scopes or cls.AUTH_SCOPES, same below.
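
The two spellings are not strictly equivalent, which may or may not matter here. A minimal sketch of the difference, with a hypothetical `DEFAULT_SCOPES` placeholder standing in for cls.AUTH_SCOPES:

```python
DEFAULT_SCOPES = ("default-scope",)  # placeholder for cls.AUTH_SCOPES


def pick_with_or(scopes):
    # Falls back for any falsy value: None, (), [] ...
    return scopes or DEFAULT_SCOPES


def pick_with_none_check(scopes):
    # Falls back only when the caller passed nothing at all.
    return DEFAULT_SCOPES if scopes is None else scopes
```

With `or`, an explicitly empty sequence of scopes is replaced by the defaults; with the `is None` check it is passed through unchanged.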

gapic/templates/%namespace/%name_%version/%sub/services/%service/transports/grpc_asyncio.py.j2

lidizheng · 2020-06-10T21:08:28Z

Original version of MTLS setup in Client.__init__:

        if isinstance(client_options, dict):
            client_options = ClientOptions.from_dict(client_options)
        if client_options is None:
            client_options = ClientOptions.ClientOptions()

        if transport is None and client_options.api_endpoint is None:
            use_mtls_env = os.getenv("GOOGLE_API_USE_MTLS", "never")
            if use_mtls_env == "never":
                client_options.api_endpoint = self.DEFAULT_ENDPOINT
            elif use_mtls_env == "always":
                client_options.api_endpoint = self.DEFAULT_MTLS_ENDPOINT
            elif use_mtls_env == "auto":
                has_client_cert_source = (
                    client_options.client_cert_source is not None
                    or mtls.has_default_client_cert_source()
                )
                client_options.api_endpoint = (
                    self.DEFAULT_MTLS_ENDPOINT if has_client_cert_source else self.DEFAULT_ENDPOINT
                )
            else:
                raise MutualTLSChannelError(
                    "Unsupported GOOGLE_API_USE_MTLS value. Accepted values: Never, Auto, Always"
                )

        # Save or instantiate the transport.
        # Ordinarily, we provide the transport, but allowing a custom transport
        # instance provides an extensibility point for unusual situations.
        if isinstance(transport, {{ service.name }}Transport):
            # transport is a {{ service.name }}Transport instance.
            if credentials:
                raise ValueError('When providing a transport instance, '
                                 'provide its credentials directly.')
            self._transport = transport
        elif isinstance(transport, str):
            Transport = type(self).get_transport_class(transport)
            self._transport = Transport(
                credentials=credentials, host=self.DEFAULT_ENDPOINT
            )
        else:
            self._transport = {{ service.name }}GrpcTransport(
                credentials=credentials,
                host=client_options.api_endpoint,
                api_mtls_endpoint=client_options.api_endpoint,
                client_cert_source=client_options.client_cert_source,
            )

The behavior of MTLS is kind of strange in some cases.

It assumes MTLS only works with transport=None. It behaves differently when transport=None and transport="grpc", even if they are using the same underlying transport.
It uses an unnecessary hard-coded transport class. get_transport_class(None) returns the gRPC transport by default.
Even with USE_MTLS="Never", the test case expects the transport to be initialized with a non-empty MTLS endpoint. It confused me which parameter turns the MTLS feature on and off, but I kept this behavior untouched.

In this PR, the MTLS logic is slightly updated to resolve the above cases. Now, the magical difference between transport=None and transport="grpc" is removed. All MTLS behaviors react consistently to the input client_options and GOOGLE_API_USE_MTLS.

busunkim96 · 2020-06-10T21:34:51Z

@arithmetic1728 Could you take a look at the mTLS changes?

arithmetic1728 · 2020-06-10T22:13:37Z

Original version of MTLS setup in Client.__init__:

The behavior of MTLS is kind of strange. It took me several hours to make it straight.

It assumed users won't pass transport="grpc" when using MTLS. It behaves differently when transport=None and transport="grpc", even if they are using the same underlying transport.

Yes, the behavior is supposed to be different when transport is None and transport is given. If user provides transport, mTLS related logic won't be triggered, and client_options won't be used.

It uses an unnecessary hard-coded transport class. get_transport_class(None) returns the gRPC transport by default.

I think the library assumes users could provide their own implementation of the Transport class, so transport is not necessarily grpc; it could be some user-defined transport class name.

Even with USE_MTLS="Never", the test case expects the transport to be initialized with a non-empty MTLS endpoint. This really confused me as to which parameter turns the MTLS feature on and off.

Which test is it?

lidizheng · 2020-06-10T22:33:39Z

@arithmetic1728 Thanks for the quick response. The test I mentioned is this.

    os.environ["GOOGLE_API_USE_MTLS"] = "never"
    with mock.patch('{{ (api.naming.module_namespace + (api.naming.versioned_module_name,) + service.meta.address.subpackage)|join(".") }}.services.{{ service.name|snake_case }}.transports.{{ service.name }}GrpcTransport.__init__') as grpc_transport:
        grpc_transport.return_value = None
        client = {{ service.client_name }}()
        grpc_transport.assert_called_once_with(
            api_mtls_endpoint=client.DEFAULT_ENDPOINT,
            client_cert_source=None,
            credentials=None,
            host=client.DEFAULT_ENDPOINT,
        )

This is the test case that expects api_mtls_endpoint under "never".

Yes, the behavior is supposed to be different when transport is None and transport is given.

I can see benefits both in treating them differently and in treating them the same way. By differentiating transport=None and transport="grpc", we can prevent the mTLS feature from becoming a standard feature for transport classes. On the other hand, we can delegate the decision whether to implement mTLS to our users.

If we treat both cases in the same way, the transport class will always receive mTLS arguments. Then, users can decide whether to implement mTLS feature in their customized transport. If they want to ignore those arguments, they should be free to do so.

WDYT about this approach?

arithmetic1728 · 2020-06-10T22:38:11Z

The logic for mTLS is as follows:

Transport side: mTLS is controlled by the api_mtls_endpoint argument (None=> no mTLS)

Client side:
(1) if transport is provided, no mTLS logic is used.
(2) if transport is None, mTLS logic is triggered as follows:
a) figure out the client certificate to use.
cert_to_use = client_options.client_cert_source or adc_default_cert or None
b) figure out the api endpoint to use. If client_options.api_endpoint is given, it will use the given one; otherwise, api_endpoint is determined based on GOOGLE_API_USE_MTLS env value and cert_to_use.
Note that even if GOOGLE_API_USE_MTLS=never, we still apply the client cert if it exists (The client cert will be ignored by the server). Since mTLS logic in transport constructor is controlled by the api_mtls_endpoint parameter, in this case, the regular endpoint will be passed to the transport constructor via api_mtls_endpoint parameter to make sure client cert is used if exists.

Please don't change the mTLS logic since all languages implement the same logic. Please let me know if you have any questions regarding mTLS.
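
The client-side steps (a) and (b) above can be sketched as a standalone function. This is an illustration only: `resolve_mtls`, the two endpoint constants, and the use of ValueError (instead of MutualTLSChannelError) are assumptions made to keep the example self-contained.

```python
from typing import Callable, Optional, Tuple

DEFAULT_ENDPOINT = "example.googleapis.com"            # placeholder
DEFAULT_MTLS_ENDPOINT = "example.mtls.googleapis.com"  # placeholder


def resolve_mtls(
    api_endpoint: Optional[str],
    client_cert_source: Optional[Callable],
    use_mtls_env: str = "never",
    has_default_cert: bool = False,
) -> Tuple[str, bool]:
    """Return (endpoint, cert_available) for the transport=None path."""
    # (a) A client cert is available if one was given explicitly
    #     or a default one exists.
    cert_available = client_cert_source is not None or has_default_cert

    # (b) An explicit endpoint always wins.
    if api_endpoint is not None:
        return api_endpoint, cert_available

    # Otherwise the environment value decides.
    if use_mtls_env == "never":
        return DEFAULT_ENDPOINT, cert_available
    if use_mtls_env == "always":
        return DEFAULT_MTLS_ENDPOINT, cert_available
    if use_mtls_env == "auto":
        endpoint = DEFAULT_MTLS_ENDPOINT if cert_available else DEFAULT_ENDPOINT
        return endpoint, cert_available
    raise ValueError(f"Unsupported GOOGLE_API_USE_MTLS value: {use_mtls_env!r}")
```

Note that with "never" the regular endpoint is returned but the certificate flag is still reported, matching the remark that the client cert is applied if it exists.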

arithmetic1728 · 2020-06-10T22:43:19Z

@arithmetic1728 Thanks for the quick response. The test I mentioned is this.

    os.environ["GOOGLE_API_USE_MTLS"] = "never"
    with mock.patch('{{ (api.naming.module_namespace + (api.naming.versioned_module_name,) + service.meta.address.subpackage)|join(".") }}.services.{{ service.name|snake_case }}.transports.{{ service.name }}GrpcTransport.__init__') as grpc_transport:
        grpc_transport.return_value = None
        client = {{ service.client_name }}()
        grpc_transport.assert_called_once_with(
            api_mtls_endpoint=client.DEFAULT_ENDPOINT,
            client_cert_source=None,
            credentials=None,
            host=client.DEFAULT_ENDPOINT,
        )

This is the test case that expects api_mtls_endpoint under "never".

Yes, please see my previous comments.

Let's schedule a meeting to discuss this, it might be easier that way.

lidizheng · 2020-06-10T22:49:48Z

@arithmetic1728 Thanks! See you offline (online).

gapic/templates/%namespace/%name_%version/%sub/services/%service/client.py.j2

gapic/templates/%namespace/%name_%version/%sub/services/%service/async_client.py.j2

arithmetic1728 · 2020-06-11T21:20:40Z

After discussing with Lidi, I think the mTLS change looks good to me. LGTM for the mTLS change.

arithmetic1728

LGTM for the mTLS change.

lidizheng · 2020-06-12T18:47:54Z

@software-dov @busunkim96 PTALAA.

software-dov

Looks good, nothing stands out to me. I'll file an issue for cross patching to alternative templates.
I'm taking it on faith that all the asynchronicity is correct; even after reading multiple blag posts and running local experiments, I still haven't fully internalized it.

tests/system/conftest.py

lidizheng · 2020-06-16T17:53:15Z

@software-dov Thanks for the review! I updated conftest.py to use the generalized factory function. About alternative templates, I wonder if Jinja2's include command helps? It can import text files (including another template file).

busunkim96

LGTM as well. Some questions/nits on docstrings and tests.

I think the google-api-core version needs to be bumped in the template setup.py.j2.

gapic/templates/%namespace/%name_%version/%sub/services/%service/transports/grpc_asyncio.py.j2

tests/system/test_grpc_unary.py

tests/system/test_grpc_streams.py

gapic/templates/%namespace/%name_%version/%sub/services/%service/transports/grpc.py.j2


lidizheng · 2020-06-16T21:41:24Z

@busunkim96 Thanks for reviewing. I'm ashamed that I forgot to remove the debug logging. All comments addressed. Do I need to squash all commits?

busunkim96 · 2020-06-16T23:12:58Z

@software-dov What merge style is preferred in this repo?

EDIT:
(Squashed and merged, as that seems to be what's preferred judging from the repo commit history 😄 )

googlebot added the cla: yes label Apr 9, 2020

lidizheng force-pushed the asyncio-support branch from 76fcc19 to 8d0e88c Compare April 9, 2020 00:51

software-dov reviewed Apr 9, 2020


lidizheng force-pushed the asyncio-support branch from f1482f1 to 7123809 Compare April 10, 2020 23:40

lidizheng mentioned this pull request Apr 10, 2020

[AsyncIO] Adding async client support to the Ads templates #385

Closed

lidizheng force-pushed the asyncio-support branch from 5329fe7 to 659e0fa Compare April 14, 2020 17:44

lidizheng force-pushed the asyncio-support branch from 741fce3 to 14b1760 Compare April 22, 2020 19:00

lidizheng force-pushed the asyncio-support branch from 887d2b3 to ef2267e Compare June 4, 2020 22:30

software-dov reviewed Jun 10, 2020


lidizheng force-pushed the asyncio-support branch from 94c0a99 to bdb628f Compare June 10, 2020 20:55

busunkim96 requested a review from arithmetic1728 June 10, 2020 21:35

lidizheng force-pushed the asyncio-support branch from bdb628f to 236e6a6 Compare June 10, 2020 23:01

arithmetic1728 reviewed Jun 11, 2020

gapic/templates/%namespace/%name_%version/%sub/services/%service/client.py.j2

arithmetic1728 reviewed Jun 11, 2020

gapic/templates/%namespace/%name_%version/%sub/services/%service/async_client.py.j2 Outdated

arithmetic1728 approved these changes Jun 11, 2020


lidizheng force-pushed the asyncio-support branch from 94681bf to 0c8940f Compare June 12, 2020 17:51

lidizheng marked this pull request as ready for review June 12, 2020 18:48

lidizheng changed the title ~~WIP: Provide AsyncIO support for generated code~~ feat: Provide AsyncIO support for generated code Jun 12, 2020

software-dov reviewed Jun 15, 2020

tests/system/conftest.py Outdated

software-dov approved these changes Jun 15, 2020


lidizheng force-pushed the asyncio-support branch 2 times, most recently from bc5bde8 to 833b567 Compare June 16, 2020 17:38

busunkim96 approved these changes Jun 16, 2020


lidizheng added 4 commits June 16, 2020 14:38

feat: Provide AsyncIO support for generated code

5cecccc

Address comments

e2f973f

Address comments: correct typing, remove debug logging and unused imports

a35f737

Bump required api-core version to 1.17.2

52336f8

lidizheng force-pushed the asyncio-support branch from 76906da to 52336f8 Compare June 16, 2020 21:38

busunkim96 mentioned this pull request Jun 16, 2020

feat: add credentials_file and scopes via client_options #461

Merged


busunkim96 merged commit 305ed34 into googleapis:master Jun 17, 2020

This was referenced Jan 18, 2022

chore(master): release reverted #1133

Merged

chore(master): release 0.39.1 #1146

Closed

This was referenced Jan 26, 2022

chore(main): release 0.39.1 #1160

Closed

chore(main): release 0.39.1 #1162

Closed

chore(main): release 0.61.0 #1163

Merged
