Skip to content

ReFrame gets stuck if job is OOM killed #833

@vkarak

Description

@vkarak

The problem is that sacct now reports the state of the failed job as OUT_OF_MEMORY, which wasn't the case before. It appears that in the past this state was a transient one the test could either fail or succeed. This behaviour seems to have changed recently on Slurm and is exhibited at least on version 18.08.7.

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions