
[LIBCLOUD-826] [GCE]: Improve performance of list nodes by caching volume information #813

Closed
wants to merge 1 commit

Conversation


@supertom supertom commented Jun 15, 2016

[GCE] Improve performance of list nodes by caching volume information

Description

When listing nodes, the GCE driver currently calls the disk API for each disk attached to each node. This PR changes that behavior to use the aggregatedList call for disks (once per list_nodes request) and uses that information to provide disk details.

See sample performance info in LIBCLOUD-826

Implementation:

For disk information, aggregated calls are now always used, and the disk information is stored in a dictionary called 'volume_dict'. If the user would like the most current information, they may set the use_cache keyword to False; the call (and subsequent repopulation of volume_dict) will then be made before disk information is returned.
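As a rough sketch of the cache toggle described above (the CachingDriver stub and its call counter are illustrative stand-ins, not libcloud code; only the use_cache semantics mirror the description):

```python
# Hypothetical stub illustrating the cache toggle; the real GCENodeDriver
# performs an aggregatedList API call where this stub bumps a counter.
class CachingDriver:
    def __init__(self):
        self.volume_dict = None
        self.api_calls = 0

    def _populate_volume_dict(self):
        self.api_calls += 1  # stands in for the aggregated disk API call
        self.volume_dict = {"disk1": {"zone-a": {"name": "disk1"}}}

    def ex_get_volume(self, name, use_cache=False):
        # Refresh when the cache is empty or the caller opts out of it.
        if self.volume_dict is None or not use_cache:
            self._populate_volume_dict()
        return self.volume_dict[name]

driver = CachingDriver()
driver.ex_get_volume("disk1", use_cache=True)  # populates the cache once
driver.ex_get_volume("disk1", use_cache=True)  # served from the cache
```

Passing use_cache=False on a later call would force a repopulation before the lookup.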

Code was added or changed in two classes. In GCENodeDriver, two methods and an additional parameter were added to build, look up, and toggle the refresh of the volume cache. In GCEConnection, convenience and helper methods were added that support not only this performance improvement but also the longer-term goal of leveraging aggregatedList calls elsewhere.

GCEConnection
  • (new method) request_aggregated_items(self, api_name) - makes all necessary calls, handling maxResults and paging, and saves the 'items' portion of each response.
  • (new method) _merge_response_items(self, list_name, response_list) - helper method to merge the paged responses into a single dictionary.
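The merge step might look roughly like this (the response shape below is a mock-up of the aggregatedList 'items' structure, and merge_response_items is an illustrative stand-in, not the driver's actual code):

```python
def merge_response_items(list_name, response_list):
    """Merge the 'items' dicts from several paged aggregatedList
    responses into one dict keyed by scope (e.g. 'zones/us-central1-a')."""
    merged = {}
    for response in response_list:
        for scope, scoped in response.get("items", {}).items():
            # Each scope bucket holds a list under list_name, e.g. 'disks'.
            merged.setdefault(scope, {list_name: []})
            merged[scope][list_name].extend(scoped.get(list_name, []))
    return merged

# Two mock pages of an aggregated disk response:
page1 = {"items": {"zones/us-central1-a": {"disks": [{"name": "d1"}]}}}
page2 = {"items": {"zones/us-central1-a": {"disks": [{"name": "d2"}]},
                   "zones/europe-west1-b": {"disks": [{"name": "d1"}]}}}
merged = merge_response_items("disks", [page1, page2])
```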
GCENodeDriver
  • (new member) volume_dict - dictionary organized by name, then zone. A disk's name is always available to us but is not unique across zones, and the zone is only optionally supplied. By organizing by name, we remove the need to search through the entire list of disks each time: a single hash lookup gives access to all disks with that name, and a second gives the disk in a given zone. If no zone is supplied, we take the first zone key alphabetically.
  • (new method) _build_name_zone_dict(self, zone_dict) - internal method to populate volume dict
  • (new method) _ex_populate_volume_dict(self) - Void method to call API and build volume dictionary
  • (new method) _ex_lookup_volume(self, name, zone=None) - implements the actual lookup. If zone is not provided, take the disk with that name from the (alphabetically) first zone; this only matters if two or more disks share the same name.
  • (new parameter) list_nodes(self, ex_zone=None, use_disk_cache=True) - use_disk_cache parameter for list_nodes to pass through, defaulting to True. When True, at most one API call per 500 disks is made to populate disk info for all nodes.
  • (new parameter, revision) ex_get_volume(self, name, zone=None, use_cache=False) - revised to check whether volume_dict has been populated and should be used, then return the result of _ex_lookup_volume.
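The name-then-zone organization and the alphabetical-first-zone fallback can be sketched as follows (build_name_zone_dict, lookup_volume, and the plain-dict volumes are hypothetical stand-ins for the driver's internals):

```python
def build_name_zone_dict(zone_dict):
    """Reorganize {zone: [volumes]} into {name: {zone: volume}} so a
    lookup by name is a single hash access instead of a full scan."""
    by_name = {}
    for zone, volumes in zone_dict.items():
        for vol in volumes:
            by_name.setdefault(vol["name"], {})[zone] = vol
    return by_name

def lookup_volume(volume_dict, name, zone=None):
    """Return the volume for (name, zone); with no zone, fall back to
    the alphabetically first zone holding a disk of that name."""
    if name not in volume_dict:
        raise KeyError("volume %r not found" % name)
    zones = volume_dict[name]
    if zone is not None:
        return zones[zone]
    return zones[sorted(zones)[0]]

# Two zones each holding a disk named 'disk1':
vols = {"us-central1-a": [{"name": "disk1", "size": 10}],
        "europe-west1-b": [{"name": "disk1", "size": 20}]}
volume_dict = build_name_zone_dict(vols)
```

With no zone argument, the lookup returns the europe-west1-b disk here, since that zone sorts first alphabetically.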

Status

  • done, ready for review

Checklist (tick everything that applies)

  • Code linting (required, can be done after the PR checks)
  • Documentation
  • Tests
  • ICLA (required for bigger changes)

/cc @erjohnso /cc @tonybaloney

@supertom supertom changed the title [GCE] LIBCLOUD-826: Improve performance of list nodes by caching volume information [LIBCLOUD-826] [GCE]: Improve performance of list nodes by caching volume information Jun 16, 2016
    :rtype: :class:`StorageVolume` or raise ``ResourceNotFoundError``.
    """
    if volume_name not in self.volume_dict:
        raise ResourceNotFoundError(

Should this instead make an API call in case the cache is missing a recently created disk?

@@ -131,6 +132,80 @@ def request(self, *args, **kwargs):

return response

def request_aggregated_items(self, api_name):

I think this always returns only up to 500 items, as the if self.gce_params: condition few lines above will always evaluate to false.
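The pitfall described here is the usual pagination one: requests must continue as long as the service returns a page token. A generic, self-contained sketch of the corrected loop (the mock pages and the nextPageToken field follow the GCE API convention; fetch_all_items is a hypothetical name):

```python
def fetch_all_items(request_page):
    """Keep requesting pages until the service stops returning a
    nextPageToken, accumulating every page's response."""
    responses = []
    page_token = None
    while True:
        response = request_page(page_token)
        responses.append(response)
        page_token = response.get("nextPageToken")
        if not page_token:
            break
    return responses

# Mock service returning three pages of up to 500 items each:
pages = {None: {"items": list(range(500)), "nextPageToken": "t1"},
         "t1": {"items": list(range(500)), "nextPageToken": "t2"},
         "t2": {"items": list(range(200))}}
responses = fetch_all_items(lambda token: pages[token])
```

Dropping the token check (or resetting the request params too early) would silently cap the result at the first page of 500 items, which is the bug the reviewer points out.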


Thanks @sayap, fixed.

We leverage the aggregated disk call and store the result. For the list_nodes operation, we've added an extra parameter to use the cached data, which defaults to True.

Tests and fixtures updated as well.