Speed up API generation #461

omus · 2021-09-27T16:11:25Z

In working on some other PRs it was necessary for me to regenerate the service definitions locally. I noticed the process was kind of slow so I looked into some minor improvements and this PR was the outcome.

I started off by adding threading support to the high-level and low-level loops. Here are the resulting timings (thread count in parentheses):

No threading: 1m 45s
Threading (1): 1m 38s
Threading (2): 60s
Threading (4): 37s
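
As a rough illustration of the kind of change described above (not the code from this PR), a generation loop can be spread across threads with `Threads.@threads`. The `generate_wrapper` function and the service names here are hypothetical stand-ins; the sketch assumes each iteration is independent and writes only to its own slot.

```julia
using Base.Threads: @threads

# Hypothetical stand-in for generating the wrapper source of one service.
generate_wrapper(name::AbstractString) = "module " * uppercasefirst(name) * " end"

function generate_all(names::Vector{String})
    # Preallocate so each iteration writes to its own index (no shared mutation).
    results = Vector{String}(undef, length(names))
    @threads for i in eachindex(names)
        results[i] = generate_wrapper(names[i])
    end
    return results
end

wrappers = generate_all(["s3", "ec2", "sqs"])
```

The loop only uses more than one thread when Julia is started with several, for example `julia --threads 4`.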

I then noticed we were downloading the service definition files twice. To fix this I introduced the `ServiceFile` type, which handles fetching the service definition JSON files and caches the results. This means we only download the service definition files once, for the low-level API generation. Additionally, this change reduces the number of GitHub API calls and so helps avoid rate limiting over multiple calls.
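
The fetch-once-and-cache idea can be sketched as follows. This is not the actual `ServiceFile` definition from the PR; `CachedServiceFile`, its `loader` field, and `contents!` are hypothetical names, and the download is simulated by a closure that counts how often it runs.

```julia
# Hypothetical sketch of a file handle that downloads its contents at most once.
mutable struct CachedServiceFile
    name::String
    loader::Function                  # zero-argument function that performs the download
    contents::Union{Nothing,String}   # `nothing` until the first fetch
end

# Convenience constructor: start with an empty cache.
CachedServiceFile(name, loader) = CachedServiceFile(name, loader, nothing)

function contents!(file::CachedServiceFile)
    if file.contents === nothing
        file.contents = file.loader()  # only reached on the first call
    end
    return file.contents
end

# Simulated download that records how many times it was invoked.
downloads = Ref(0)
file = CachedServiceFile("s3.json", () -> (downloads[] += 1; "{\"metadata\": {}}"))

first_read = contents!(file)   # triggers the simulated download
second_read = contents!(file)  # served from the cache
```

This sketch does no locking, so it assumes a given file object is not fetched from several threads at once.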

Finally, I added some logic to remove the high-level service definitions before building the new ones. This was done to handle the scenario in which AWS actually removes a service definition file.
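
A minimal sketch of clearing stale generated files before regenerating, assuming the generated services live as `.jl` files in a single directory. The `clear_generated` helper and the file names are hypothetical, and the demonstration runs in a temporary directory rather than the package's real services directory.

```julia
# Hypothetical helper: delete previously generated `.jl` files in `dir`,
# leaving any other files untouched.
function clear_generated(dir::AbstractString)
    for path in readdir(dir; join=true)
        if isfile(path) && endswith(path, ".jl")
            rm(path)
        end
    end
    return nothing
end

# Demonstrate in a throwaway directory that is deleted afterwards.
remaining = mktempdir() do dir
    write(joinpath(dir, "old_service.jl"), "# stale wrapper")
    write(joinpath(dir, "README.md"), "keep me")
    clear_generated(dir)
    readdir(dir)
end
```

Regenerating into the emptied directory means a service whose definition has disappeared upstream no longer leaves a wrapper behind.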

mattBrzezinski

LGTM! Just the one comment worth considering.

mattBrzezinski · 2021-09-28T15:16:31Z

src/api_generation/high_level.jl


-        service_blob = blob(repo_name, service["sha"]; auth=auth)
-        service = JSON.parse(String(base64decode(service_blob.content)))
+    # Remove old service files to ensure services that no longer exist are removed.


I'm down for this change. A couple of things are maybe worth commenting on: has AWS ever removed an existing service before? Also, we may need to consider how to handle this on the Julia side.

AWS' semver most likely wouldn't mark this as breaking given what they have done historically (an assumption). But I'm not sure we would want to follow that in our Julia packages.

omus · 2021-09-29

It is doubtful this would happen on the AWS side. Where this does occur with AWS.jl is if we rename the service files.

Maybe I should update this comment?

mattBrzezinski · 2021-10-20

Forgot to respond here... The existing files would not be removed. If we wanted to rename files we would need to manually empty the directory and regenerate them.

We can bring this change along, seems to make sense!

mattBrzezinski · 2021-10-20T16:18:22Z

bors r+

461: Speed up API generation r=mattBrzezinski a=omus

Co-authored-by: Curtis Vogt <curtis.vogt@gmail.com>
Co-authored-by: mattBrzezinski <matt.brzezinski@invenia.ca>

bors · 2021-10-20T16:20:49Z

Canceled.

mattBrzezinski · 2021-10-20T16:22:33Z

bors r+

bors · 2021-10-20T16:29:09Z

Build succeeded:

omus added 4 commits September 27, 2021 09:42

Use threads when generating API wrappers

6f0b7b8

Remove high-level services files

8907fa3

Use ServiceFile struct to download definition once

e90475d

Support other GitHub authorization types

66d81e1

omus requested a review from mattBrzezinski September 27, 2021 16:11

Merge branch 'master' into cv/speedup-api-generation

e7c7d9b

mattBrzezinski approved these changes Sep 28, 2021

Merge branch 'master' into cv/speedup-api-generation

b11be7e

github-actions bot reviewed Oct 20, 2021

test/AWSMetadataUtilities.jl (outdated, resolved)

Update test/AWSMetadataUtilities.jl

c356be8

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

bors bot merged commit 059aae3 into master Oct 20, 2021

bors bot deleted the cv/speedup-api-generation branch October 20, 2021 16:29

omus commented Sep 27, 2021

mattBrzezinski left a comment

mattBrzezinski Sep 28, 2021

omus Sep 29, 2021

mattBrzezinski Oct 20, 2021

mattBrzezinski commented Oct 20, 2021

bors bot commented Oct 20, 2021

mattBrzezinski commented Oct 20, 2021

bors bot commented Oct 20, 2021
