PoC for HPC job execution monitoring #4

Open
rosogon opened this issue Jun 8, 2020 · 0 comments
Labels
runtime SODALITE Runtime Layer

Comments

rosogon commented Jun 8, 2020

@MarioMartReq to set up an exporter that reports job status on the HPC cluster. You can reuse the ipmi exporter, adapted to run `qstat -f {{ job_id }} | grep 'exit_status' | grep -o '.$'`
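A minimal sketch of the parsing and export step, assuming the job's `qstat -f` output contains a line of the form `exit_status = N` once the job has finished. The metric name `hpc_job_exit_status`, the `job_id` label, and the function names are hypothetical and not taken from the ipmi exporter. Unlike `grep -o '.$'`, it reads the whole integer, so a status such as 127 is not truncated to 7.

```python
import re
import subprocess
from typing import Optional

# Matches the "exit_status = N" attribute line in `qstat -f` output.
_EXIT_STATUS_RE = re.compile(r"^\s*exit_status\s*=\s*(-?\d+)\s*$", re.MULTILINE)


def parse_exit_status(qstat_output: str) -> Optional[int]:
    """Return the job's exit status, or None if the attribute is absent
    (for example, the job is still queued or running)."""
    match = _EXIT_STATUS_RE.search(qstat_output)
    return int(match.group(1)) if match else None


def render_metric(job_id: str, exit_status: Optional[int]) -> str:
    """Format the status in the Prometheus text exposition format.

    The metric name and label are assumptions made for this sketch."""
    lines = [
        "# HELP hpc_job_exit_status Exit status reported by qstat for a finished job.",
        "# TYPE hpc_job_exit_status gauge",
    ]
    if exit_status is not None:
        lines.append(f'hpc_job_exit_status{{job_id="{job_id}"}} {exit_status}')
    return "\n".join(lines) + "\n"


def query_job(job_id: str) -> str:
    """Run `qstat -f <job_id>` and return its stdout.

    Requires a qstat client on the host running the exporter."""
    result = subprocess.run(
        ["qstat", "-f", job_id], capture_output=True, text=True, check=True
    )
    return result.stdout
```

An exporter could call `render_metric(job_id, parse_exit_status(query_job(job_id)))` on each scrape and serve the result, or write it to a file for a textfile-style collector.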

Then add an alert (e.g., when the job's exit status is not 0).
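Such an alert could look roughly like the Prometheus rule below. This is only a sketch: it assumes the exporter publishes a gauge named `hpc_job_exit_status` with a `job_id` label (both names are hypothetical), and the group name, alert name, and severity are placeholders.

```yaml
groups:
  - name: hpc_jobs                        # hypothetical group name
    rules:
      - alert: HpcJobFailed               # hypothetical alert name
        expr: hpc_job_exit_status != 0    # fires for any non-zero exit status
        labels:
          severity: warning
        annotations:
          summary: "HPC job {{ $labels.job_id }} exited with status {{ $value }}"
```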

rosogon added the runtime SODALITE Runtime Layer label Jun 8, 2020