[PROD-1648] Bugfix describe volumes #97

aditya-jaishankar · 2024-02-14T20:19:21Z

Summary

There is a bug currently in the library where a call to the ec2.describe_volumes only accepts at most 199 items in the Filters argument. However, in the case of very large clusters, there can be more than 199 instance ids passed in, which causes the call to fail.

This PR splits the number of filters to chunks of size no more than 199.

Testing Performed

All unit tests pass
Dummy projects were kicked off on databricks and the resulting cluster was inspected with the functions calls in both the cases of small number of workers as well as number of workers > 200 and it was verified that the functions calls return the cluster volume data as expected.

Checklist

Before formally opening this PR, please adhere to the following standards:

Branch/PR names begin with the related Jira ticket id (ie PROD-31) for Jira integration
File names are lower_snake_case
Relevant unit tests have been added or not applicable
Relevant documentation has been added or not applicable
Mark yourself as the assignee (makes it easier to scan the PR list)

Related Jira Ticket

romainissynced · 2024-02-15T16:08:31Z

sync/awsdatabricks.py

 ) -> List[dict]:
    """Get all ebs volumes associated with a list of instance reservations"""

+    def get_chunk(instance_ids: list, chunk_size: int) -> Iterator[list]:


Isn't the return type technically a -> Generator[list, None, None]?

Might want to also type the args like so:

Suggested change

def get_chunk(instance_ids: list, chunk_size: int) -> Iterator[list]:

def get_chunk(instance_ids: List[str], chunk_size: int) -> Generator[List[str], None, None]:

Mentioning this because it can help the IDE auto-complete.

For example if you were to [item.count('a') for item in self.get_chunk(***) pycharm will autocomplete the .count() because it knows its a list of strings

Aah yes, thanks good catch.

romainissynced

Looks good, just left a nitpick on typing that doesn't need to be fixed before merge.

gorskysd · 2024-02-16T13:21:20Z

sync/__init__.py

@@ -1,4 +1,4 @@
 """Library for leveraging the power of Sync"""
-__version__ = "1.0.0"


Need to update this to 1.0.3 now

gorskysd · 2024-02-16T13:36:09Z

sync/awsdatabricks.py

+            while next_token:
+                response = ec2_client.describe_volumes(Filters=filters, NextToken=next_token)
+                volumes += response.get("Volumes", [])
+                next_token = response.get("NextToken")
+


Have you had a chance to test this out on a big cluster yet?

Yes, I tested this out by kicking off a big job with 300 .large instances and then wrote a local scipt to call describe_volumes() on that cluster_id while the cluster was running and it worked fine (also worked fine for a cluster with 10 instances)

gorskysd

LGTM

aditya-jaishankar added 2 commits February 14, 2024 12:42

add chunking to describe volumes call

948befe

bump version number

3744f6b

aditya-jaishankar self-assigned this Feb 14, 2024

aditya-jaishankar requested review from CaymanWilliams, gorskysd and romainissynced February 14, 2024 20:50

romainissynced reviewed Feb 15, 2024

View reviewed changes

romainissynced previously approved these changes Feb 15, 2024

View reviewed changes

gorskysd reviewed Feb 16, 2024

View reviewed changes

aditya-jaishankar added 2 commits February 21, 2024 09:13

update version

ab67a8e

update type hints

25a093a

aditya-jaishankar dismissed romainissynced’s stale review via 25a093a February 21, 2024 14:15

aditya-jaishankar requested a review from romainissynced February 21, 2024 14:18

gorskysd approved these changes Feb 21, 2024

View reviewed changes

aditya-jaishankar merged commit 7a00d68 into main Feb 21, 2024

aditya-jaishankar deleted the bugfix-describe-volumes branch February 21, 2024 14:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[PROD-1648] Bugfix describe volumes #97

[PROD-1648] Bugfix describe volumes #97

Uh oh!

aditya-jaishankar commented Feb 14, 2024 •

edited

Loading

Uh oh!

romainissynced Feb 15, 2024

Uh oh!

romainissynced Feb 15, 2024

Uh oh!

aditya-jaishankar Feb 21, 2024

Uh oh!

romainissynced left a comment

Uh oh!

gorskysd Feb 16, 2024

Uh oh!

gorskysd Feb 16, 2024

Uh oh!

aditya-jaishankar Feb 21, 2024

Uh oh!

gorskysd left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

	def get_chunk(instance_ids: list, chunk_size: int) -> Iterator[list]:
	def get_chunk(instance_ids: List[str], chunk_size: int) -> Generator[List[str], None, None]:

		@@ -1,4 +1,4 @@
		"""Library for leveraging the power of Sync"""
		__version__ = "1.0.0"

[PROD-1648] Bugfix describe volumes #97

[PROD-1648] Bugfix describe volumes #97

Uh oh!

Conversation

aditya-jaishankar commented Feb 14, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Testing Performed

Checklist

Uh oh!

romainissynced Feb 15, 2024

Choose a reason for hiding this comment

Uh oh!

romainissynced Feb 15, 2024

Choose a reason for hiding this comment

Uh oh!

aditya-jaishankar Feb 21, 2024

Choose a reason for hiding this comment

Uh oh!

romainissynced left a comment

Choose a reason for hiding this comment

Uh oh!

gorskysd Feb 16, 2024

Choose a reason for hiding this comment

Uh oh!

gorskysd Feb 16, 2024

Choose a reason for hiding this comment

Uh oh!

aditya-jaishankar Feb 21, 2024

Choose a reason for hiding this comment

Uh oh!

gorskysd left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

aditya-jaishankar commented Feb 14, 2024 •

edited

Loading