Skip to content

Conversation

@teojgo
Copy link
Contributor

@teojgo teojgo commented Nov 26, 2018

  • Use scontrol to retrieve the state of the nodes reported
    as unavailable and cancel the job if they are down.

  • Add methods in_state and is_down to SlurmNode class.

  • Add unit tests for the above methods.

Fixes #303

* Use scontrol to retrieve the state of the nodes reported
  as unavailable and cancel the job if they are down.

* Add methods `in_state` and `is_down` to `SlurmNode` class.

* Add unittests for the above methods.
@teojgo teojgo added this to the Upcoming sprint milestone Nov 26, 2018
@teojgo teojgo self-assigned this Nov 26, 2018
@teojgo teojgo requested a review from vkarak November 26, 2018 15:54
self.in_state(s) for s in ['DOWN', 'DRAIN', 'MAINT', 'NO_RESPOND'])
return bool({'DOWN', 'DRAIN', 'MAINT', 'NO_RESPOND'} & self._states)

def in_state(self, state):
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't think you need this anymore. If you want to use it, the only place is the is_available() method, which could call this one.

@vkarak vkarak merged commit e870f99 into reframe-hpc:master Dec 3, 2018
@teojgo teojgo deleted the bugfix/check_ReqNodeNotAvail_list branch December 4, 2018 07:35
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants