Currently, pooling will check if a cluster is "big enough" in terms of memory, CPU and a few other aspects (e.g. EBS volume size).
We can instead compare memory and CPU needs by SSHing to the cluster's YARN resource manager and querying its metrics API for `availableMB` and `availableVirtualCores`.
Not only would this provide more useful information about a cluster that can run multiple jobs simultaneously, it would also allow us to skip querying the cluster's instances via `ListInstanceGroups`/`ListInstanceFleets`, saving an API call.
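A rough sketch of what reading those two values could look like, assuming the resource manager's REST API is reachable over HTTP (for example through an SSH tunnel to the master node). The endpoint `/ws/v1/cluster/metrics` and the `clusterMetrics` wrapper are part of YARN's REST API; the helper names and the `rm_url` parameter are made up for illustration and are not existing mrjob code.

```python
import json
from urllib.request import urlopen

# Standard YARN ResourceManager REST endpoint for cluster-wide metrics.
METRICS_PATH = "/ws/v1/cluster/metrics"


def parse_available_resources(body):
    """Extract (availableMB, availableVirtualCores) from a metrics response body."""
    metrics = json.loads(body)["clusterMetrics"]
    return metrics["availableMB"], metrics["availableVirtualCores"]


def fetch_available_resources(rm_url, timeout=10):
    """Hypothetical helper: GET the metrics from rm_url, e.g. 'http://localhost:8088'.

    Assumes the resource manager is reachable from where this runs
    (for instance through an SSH tunnel to the master node).
    """
    with urlopen(rm_url.rstrip("/") + METRICS_PATH, timeout=timeout) as resp:
        return parse_available_resources(resp.read().decode("utf-8"))
```

Keeping the parsing separate from the fetch means the same parser would work if the response were obtained by running a command over SSH rather than through a tunnel.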
We should probably call these options `min_available_mb` and `min_available_virtual_cores`. If either is set, we can bypass checking the cluster's instance information.
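One way the bypass could work, as a sketch only: the function name and the convention of returning `None` to mean "fall back to the instance-based check" are assumptions, and `metrics` is assumed to be the `clusterMetrics` dict from YARN's metrics API.

```python
def cluster_has_capacity(metrics, min_available_mb=None,
                         min_available_virtual_cores=None):
    """Check YARN metrics against the proposed options.

    Returns True or False if at least one option is set, or None if
    neither is set (the caller would then fall back to checking the
    cluster's instance information).
    """
    if min_available_mb is None and min_available_virtual_cores is None:
        return None  # neither option set: no YARN-based decision

    if (min_available_mb is not None and
            metrics["availableMB"] < min_available_mb):
        return False

    if (min_available_virtual_cores is not None and
            metrics["availableVirtualCores"] < min_available_virtual_cores):
        return False

    return True
```

For example, with `metrics = {"availableMB": 8192, "availableVirtualCores": 4}`, asking for `min_available_mb=4096` would accept the cluster, while `min_available_virtual_cores=8` would reject it.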