-
Notifications
You must be signed in to change notification settings - Fork 107
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Implementation of the sites with opportunistic resources in WMAgent.secrets #11966
Conversation
Jenkins results:
|
Jenkins results:
|
@todor-ivanov @amaltaro ready to be revised |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
deploy/WMAgent.production
Outdated
RESOURCE_OPP6=([name]=T3_US_Anvil [run]=1000 [pend]=250 [state]=normal) | ||
RESOURCE_OPP7=([name]=T3_US_Lancium [run]=5000 [pend]=1250 [state]=normal) | ||
RESOURCE_OPP8=([name]=T3_ES_PIC_BSC [run]=5000 [pend]=3700 [state]=normal) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
While I do like to see all the opportunistic sites here, but since these two files (and the one below) are actually considred empty templates, I am not sure if we should put specific numbers. Maybe we can add those lines and let them sit in these two secrets templates commented out. This way we will know what is expected to be done in order to configure an opportunistic resource at the agent, but we would not hard code default values which we do not know if they will hold in the future.
@amaltaro looking forward to receiving your feedback |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@anpicci apologies for missing this review.
I do like the flexibility given by this PR, even though it comes with extra complexity that perhaps isn't really needed.
I see this PR deviates from the current number of slots defined for those opportunistic sites. Was it intentional? Did you consult Dirk and/or P&R to have adapted these numbers?
Lastly, please do mention the CMSKubernetes PR as a dependency for this PR (in the initial PR description). Thanks
@amaltaro thank you for your feedback. As extra complexity, are you referring to how the information about opportunistic resources are stored? Regarding the number of slots, I derived them with: |
Yes, the secrets file no longer look straight forward to me, making it easier to mistype things. If I were to implement it, I think I would add 3 new variables: and leave "coding" to outside the secrets file. However, given that these changes have already been implemented and tested, I don't think my argument is strong enough for a refactoring, so let's just leave it as is, if you agree. For the slots, I agree with you, let's set them according to the current deployment script (also mentioned above within this comment) and in a later stage we can revisit these. It has been working fine with no requests to update them, no need to worry with those right now. |
@amaltaro sounds good. The idea was to make the definition of these env variables as flexible as possible in bash, but I see your points. If you and @todor-ivanov it works, we can merge the PR. I will address the comments on CMSKubernetes tomorrow morning |
hi @amaltaro
this:
Is not coding. This is a standard way to define an associative array in bash. :
In addition. If we were to take this approach:
This means, we would have had only a list of opportunistic resource names and two hard values to be applied to all of them.... regardless of the fact that different opportunistic resources may have different capacity, state, etc. @anpicci I think we can merge this one here and continue in CMSKubernetes. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
looks good to me
I feel like there was still discussion on going in this PR and I don't understand why we rushed with merging these changes in. In addition, it would have been better to squash these commits before merging them in (reason why I labelled this yesterday). I am happy to chat about it today, if you want. @anpicci |
Fixes #11722
Status
Ready
Description
With this PR, HPC sites are directly reported in the WMAgent.secrets template, both for production and testbed, in order to remove the hardcoded references in the script initializing the Agent in the containerized deployment model.
For each HPC site, an associative bash array is defined to collect information about the name, number of total available running slots, and number of total available pending slots in a human-readable format.
This setting is made in the sight of fetching this information with a proper manipulation leveraging bash in this CMSKubernetes script. The modification to this script is reported in this PR
Is it backward compatible (if not, which system it affects?)
NO
Related PRs
None
External dependencies / deployment changes
Addition of the opportunistic resources in the initialization script of the Docker container: dmwm/CMSKubernetes#1462