Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

May DevOps Infrastructure and Automation (3/3) #1102

Open
Tracked by #1332
elisabettai opened this issue Sep 11, 2023 · 2 comments
Open
Tracked by #1332

May DevOps Infrastructure and Automation (3/3) #1102

elisabettai opened this issue Sep 11, 2023 · 2 comments
Assignees
Labels
y7 NIH SPARC Y7

Comments

@elisabettai
Copy link
Collaborator

elisabettai commented Sep 11, 2023

Description:
Develop DevOps infrastructure empowering users to perform more monitoring and troubleshooting tasks themselves, thus facilitating debugging and reducing the required SIM-Core manpower in the longer run.
Foreseen functionality includes:

  • Provide increased automation for the following routine DevOps tasks:

    • Better organization of docker images in registry (validation of metadata, deletion of faulty images)
    • Simplification of interaction with database to have command line tools for:
      • bulk add/remove/upgrade access rights to services for a user/group
      • bulk remove/add/upgrade services from projects
      • automate resource limits assignment for users based on service and user/group-id (also relates to API extensions & computational backend refactoring discussed above)
    • Information on which users have currently which studies open
    • Development of tools to collect resource usage (compute & storage) information for potential billing (part of sustainability plans) based on
      • services
      • user groups
    • Housekeeping: Detecting stale (old/unused) user accounts and proactively find corrupt projects/services and unused data for clean-up by DevOps
  • Web-Application Monitoring: Automatic collection of metrics upon any unexpected failures and nascent bugs, which will be used to assess the user-experience during our development cycles. This will help to assess if user experience degrades or improves and how much time for maintenance is required flexibly. There is no need to rely only on traditional, mostly slow feedback cycles (such as dissatisfied users’ email feedback)

Deliverable:
Extended DevOps infrastructure: housekeeping, simplification of interaction with database, web-application monitoring

Acceptance criteria:
Infrastructure is in place that provides increased DevOps automation and web-application monitoring

Out of scope: –

Deadline:
Q1

wrike Y7-MS 7.1.3: DevOps Infrastructure and Automation (3/3)

@elisabettai elisabettai added PO issue Created by Product owners y7 NIH SPARC Y7 labels Sep 11, 2023
@elisabettai elisabettai added this to the Enchanted Odyssey milestone Mar 26, 2024
@SCA-ZMT SCA-ZMT mentioned this issue Mar 28, 2024
@SCA-ZMT SCA-ZMT removed the PO issue Created by Product owners label Mar 28, 2024
@SCA-ZMT
Copy link
Contributor

SCA-ZMT commented May 10, 2024

Prepare for review.

@elisabettai
Copy link
Collaborator Author

I am writing this up for the NIH reporting.

I am sure there's a lot we're done for "housekeeping, simplification of interaction with database, web-application monitoring"

We can for sure mention the new dashboards on Graphana to monitor running machines, right @YuryHrytsuk @sanderegg? Do you have more details that we can share?

Something about interaction with the DB? Maybe we can mention scripts or other tools that we used to migrate TIP to AWS? @matusdrobuliak66

Anything else for housekeeping @mrnicegyu11?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
y7 NIH SPARC Y7
Projects
None yet
Development

No branches or pull requests

4 participants