
Nexus: Updated PBS job states for Polaris#4987

Merged
ye-luo merged 2 commits into QMCPACK:develop from kayahans:develop on May 14, 2024

Conversation

@kayahans kayahans commented May 14, 2024

Contributor

Proposed changes

This PR updates the set of job states that Nexus recognizes for the PBS scheduler. The most important change from older PBS versions is that the completed state (C) has been replaced by finished (F). Polaris runs the May 2020 version of PBS, which uses the new job state definitions. Without this update, Nexus cannot be used to submit jobs on Polaris because the finished (F) state is not defined. For compatibility with machines still running older PBS versions, I have kept both the completed and finished state tags.

For reference, please see page RG-199 at https://2021.help.altair.com/2021.1/PBSProfessional/PBSReferenceGuide2021.1.pdf.

Here is the matching information from man qstat on Polaris (select '1B' after running man qstat):

The job's state:
    B  Array job has at least one subjob running
    E  Job is exiting after having run
    F  Job is finished
    H  Job is held
    M  Job was moved to another server
    Q  Job is queued
    R  Job is running
    S  Job is suspended
    T  Job is being moved to new location
    U  Cycle-harvesting job is suspended due to keyboard activity
    W  Job is waiting for its submitter-assigned start time to be reached
    X  Subjob has completed execution or has been deleted

The state tags that are new in the newer (2020 and later) PBS version are B, F (which replaces C), M, U, and X.

Nexus only uses the 'complete' status to track jobs, so the remaining state tags are kept purely for bookkeeping:

# 'complete' is the only actively used status so far
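To illustrate the idea, here is a minimal sketch of how a state table that keeps both the old C tag and the new F tag could look. The names `PBS_JOB_STATES` and `is_complete`, as well as the status strings other than 'complete', are hypothetical and are not taken from the Nexus source; the sketch also assumes that both C and F are treated as 'complete'.

```python
# Hypothetical sketch, not the actual Nexus implementation.
# Maps single-letter PBS state codes (from `man qstat`) to status names.
PBS_JOB_STATES = {
    "B": "array_running",   # array job has at least one subjob running
    "C": "complete",        # older PBS versions: job is completed
    "E": "exiting",         # job is exiting after having run
    "F": "complete",        # newer PBS versions: job is finished (assumed equivalent to C)
    "H": "held",
    "M": "moved",           # job was moved to another server
    "Q": "queued",
    "R": "running",
    "S": "suspended",
    "T": "transferring",    # job is being moved to a new location
    "U": "user_suspended",  # cycle-harvesting job suspended by keyboard activity
    "W": "waiting",         # waiting for submitter-assigned start time
    "X": "subjob_done",     # subjob completed execution or was deleted
}


def is_complete(state_code):
    """Return True if the PBS state code maps to the 'complete' status.

    Unknown codes return False rather than raising an error.
    """
    return PBS_JOB_STATES.get(state_code) == "complete"
```

With a table like this, a job reported as F on Polaris and a job reported as C on a machine with an older PBS version would both be seen as complete, while every other code is only recorded.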

Does this introduce a breaking change?

  • No

What systems has this change been tested on?

Polaris

Checklist

Update the following with a yes where the items apply. If you're unsure about any of them, don't hesitate to ask. This is simply a reminder of what we are going to look for before merging your code.

  • Yes. This PR is up to date with the current state of 'develop'
  • No. Code added or changed in the PR has been clang-formatted
  • No. This PR adds tests to cover any new code, or to catch a bug that is being fixed
  • No. Documentation has been added (if appropriate)

@kayahans kayahans changed the title Updated PBS job states for Polaris Nexus: Updated PBS job states for Polaris May 14, 2024

ye-luo commented May 14, 2024

Contributor

Test this please

@ye-luo ye-luo enabled auto-merge May 14, 2024 18:53
@ye-luo ye-luo merged commit 338798a into QMCPACK:develop May 14, 2024