MAAP-HEC-AWS #40: update ADES-K8s metrics to use generalized schema #6
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This PR updates the format of the emitted job metrics from the ADES-K8s backend that is collected from the Calrissian docker_usage.json file. The format now conforms to this schema:
https://docs.google.com/document/d/1p0dYy_6NMBQrn5Qq3yXXFIk17Q0jBmlprItmzFlQ41Y/edit#heading=h.qwv42w6gqyda
In order to provide these metrics in this format, the following updates were required:
--pod-labels
option to feed in a YAML file that specifies labels that should be attached to every CWL process pod that is created by the job podget_job
ADES API to query for all pods that have a specific job_id label and to iterate over the returned pods to extract the node specific metricsdisk_space_free_gb
andmemory_gb
will be unknown from the context of pod execution because K8s abstracts away the specifics of the node hardware from the pod memory and disk requirementsThe following snippet is an example JSON payload returned by a
get_job()
call after the job has completed and contains the updated metrics: