[WIP] Adding a nnpdf backend #57

scarlehoff · 2020-01-26T22:10:33Z

Nowadays I'm only using Slurm, but I'm using it on several different systems, so I might as well add it as a public backend (even if I think I will be the only one using it for now :__)

Todo:

- Check that the input works
- Generate the runfolder before sending the job
- Check the input and try to guess the necessary memory requirement

src/pyHepGrid/src/runSlurmjob.py

jcwhitehead · 2020-01-28T16:30:28Z

src/pyHepGrid/src/programs.py

+                # Fail without piety
+                raise ValueError("There are replicas in the range considered")
+
+    def include_arguments(self, current_arguments):


To check I understand - if you ever wanted to run NNPDF with Arc or DIRAC, would your include_arguments lead to command-line arguments for your executable like --memsize X?

@marianheil and I were discussing that supporting arbitrary arguments across the board might be quite useful, especially if more programs are going to be added to pyHepGrid, so we don't have to add all programs' variables to all users' headers. But it looks like they'd be ignored for SLURM, unless specifically added as a {key} to the .sh template, but for Arc/Dirac they'd be parsed and passed as additional --X Y command-line flags.

Ideally, for those programs we intend to be able to run on all the Backends, each runcard should lead to the same command-line arguments being passed regardless of which Backend it's being submitted to.

Any idea how we could both manage that, and arbitrary arguments? (Probably related to #17 )

> To check I understand - if you ever wanted to run NNPDF with Arc or DIRAC, would your include_arguments lead to command-line arguments for your executable like --memsize X?

Yes. It would currently fail, because I'm relying on some arguments that only exist for Slurm, but that's the idea.

> @marianheil and I were discussing that supporting arbitrary arguments across the board might be quite useful, especially if more programs are going to be added to pyHepGrid, so we don't have to add all programs' variables to all users' headers. But it looks like they'd be ignored for SLURM, unless specifically added as a {key} to the .sh template, but for Arc/Dirac they'd be parsed and passed as additional --X Y command-line flags.

There are a few things that are always going to be backend-dependent. In principle it might be possible to make it more general, but disentangling all of that would be quite a task. I've tried in the past and got bored before achieving anything :__

In any case, this is particularly true for Slurm. I remember we had some problems when passing options through command-line arguments, so we had to hard-code them in the scripts (which led to needing several different scripts). I don't remember whether this was on the IPPP batch.

> Any idea how we could both manage that, and arbitrary arguments? (Probably related to #17 )

I'd say #58 is not a bad compromise: each of the programs must decide/know how to treat the arguments.
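
As a rough illustration of the trade-off discussed in this thread, here is a minimal sketch. Every name in it (`ExampleProgram`, `render_cli_flags`, `render_slurm_template`, the `memsize` value and the template text) is hypothetical and is not pyHepGrid's actual interface. It only shows how one argument dictionary could become `--X Y` flags for Arc/Dirac, while a Slurm `.sh` template picks up only the keys that appear in it as `{key}`.

```python
import string


class ExampleProgram:
    """Hypothetical program class; not the real pyHepGrid interface."""

    def include_arguments(self, current_arguments):
        # The program itself decides which extra arguments it needs.
        updated = dict(current_arguments)
        updated["memsize"] = 4000  # illustrative value only
        return updated


def render_cli_flags(arguments):
    """Arc/Dirac-style: every key becomes a ``--key value`` pair."""
    flags = []
    for key, value in arguments.items():
        flags += ["--{0}".format(key), str(value)]
    return flags


def render_slurm_template(template, arguments):
    """Slurm-style: only keys present as {key} in the template are used."""
    wanted = {name for _, name, _, _ in string.Formatter().parse(template) if name}
    ignored = sorted(set(arguments) - wanted)
    return template.format(**arguments), ignored


args = ExampleProgram().include_arguments({"runcard": "fit.yml", "threads": 2})
cli_flags = render_cli_flags(args)
script, ignored = render_slurm_template(
    "#SBATCH --mem={memsize}\nrun {runcard}\n", args
)
```

In this sketch `threads` is passed as a flag on the command-line path but silently dropped on the template path, which is the inconsistency raised above; returning the list of ignored keys is one way to at least make that visible.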

add a include_arguments method to the program interface

Allow loading custom runmode without modifying `pyHepGrid` Code

This should allow submission/testing from any folder

marianheil · 2020-02-10T14:25:26Z

What is the status here? Is this ready to merge? This MR includes a few extra features which would be nice to have in master.

scarlehoff · 2020-02-10T15:39:25Z

Ah, feel free to merge. This is in working condition.
There are a few things I wanted to add, but I'm at a conference this week, so it will be at least 1-2 weeks before I work on this again. I'll push it to a new branch when I do.

jcwhitehead · 2020-02-10T16:45:43Z

src/pyHepGrid/src/test_nnlojob.py

-        util.spCall(["./{0}".format(header.runfile)] + nnlojob_args)
+        runfile = os.path.basename(header.runfile)
+        util.spCall(["chmod","+x",runfile])
+        util.spCall(["./{0}".format(runfile)] + nnlojob_args)


Question: what happens if header.runfile is not in the local directory? (or is it always?)

This is exactly what changed. Before, to run test mode you had to have the runfile in your current directory. Now you can also specify an absolute path. In the lines you highlighted we only get the file name; the copy happens in:

pyHepGrid/src/pyHepGrid/src/test_nnlojob.py

Lines 15 to 16 in 9fe0cf2

shutil.copyfile(header.runfile, os.path.join(header.sandbox_dir,
                                             os.path.basename(header.runfile)))

For the actual submission this already worked before, since we just pass header.runfile to Arc/Dirac/Slurm and they can handle absolute and relative paths.
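
The copy-then-run flow described above can be sketched in isolation. This is only an approximation under stated assumptions: `stage_and_run` is a hypothetical helper, `subprocess.run` stands in for `util.spCall`, `os.chmod` stands in for the `chmod +x` call, and the example assumes a POSIX system where a `#!/bin/sh` script can be executed.

```python
import os
import shutil
import stat
import subprocess
import tempfile


def stage_and_run(runfile, sandbox_dir, args=()):
    """Copy ``runfile`` (absolute or relative) into the sandbox and run the copy."""
    local_name = os.path.basename(runfile)
    target = os.path.join(sandbox_dir, local_name)
    shutil.copyfile(runfile, target)
    # copyfile does not copy permission bits, so set the executable bit here.
    os.chmod(target, os.stat(target).st_mode | stat.S_IXUSR)
    # cwd= runs the child inside the sandbox without changing our own directory.
    return subprocess.run(
        ["./{0}".format(local_name)] + list(args),
        cwd=sandbox_dir,
        capture_output=True,
        text=True,
        check=True,
    )


with tempfile.TemporaryDirectory() as src_dir, tempfile.TemporaryDirectory() as sandbox:
    # The runfile lives outside the sandbox and is given by absolute path.
    runfile = os.path.join(src_dir, "example_run.sh")
    with open(runfile, "w") as handle:
        handle.write('#!/bin/sh\necho "args: $@"\n')
    result = stage_and_run(runfile, sandbox, ["--demo"])
    output = result.stdout.strip()
```

Only the base name is used once the file is in the sandbox, which is why the location of the original runfile no longer matters.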

So does the os.chdir in setup() persist after it exits?

What do you mean by "persist"? Within Python, yes, but it doesn't change the "main shell" from which you are running pyHepGrid. Nothing changes in this regard. We also don't have to change back in pyHepGrid after run_test, since pyHepGrid finishes afterwards.

Yes, within python - I assumed it would basically work like a local variable in the setup scope, but it looks like it does persist.

jcwhitehead

Looks good to me

first working commit

5d1a674

scarlehoff force-pushed the nnpdfrun branch from 0f8dde3 to 5d1a674 on January 27, 2020 11:38

scarlehoff added 3 commits January 27, 2020 16:33

add initialization

d5d2925

add memory guess

8abf678

management should not need a runcard

21b7f44

marianheil reviewed Jan 28, 2020

scarlehoff added 2 commits January 28, 2020 16:08

remove call to ipdb

0420794

add a include_arguments method to the program interface

0a58cc4

jcwhitehead reviewed Jan 28, 2020

marianheil added 2 commits January 28, 2020 19:30

Document runmodes with new setting

4087d67

Merge pull request #58 from scarlehoff/ask_program_for_arguments

771d3a4

add a include_arguments method to the program interface

marianheil approved these changes Jan 28, 2020

marianheil added 3 commits January 30, 2020 16:57

Allow loading custom runmode without modifying pyHepGrid Code

d7cbd63

Merge pull request #59 from scarlehoff/load_runmode

ba0cdce

Allow loading custom runmode without modifying `pyHepGrid` Code

Allow use of absolute path for runfile

9fe0cf2

This should allow submission/testing from any folder

jcwhitehead reviewed Feb 10, 2020

jcwhitehead approved these changes Feb 10, 2020

jcwhitehead merged commit 9aca48c into master Feb 10, 2020

jcwhitehead deleted the nnpdfrun branch February 10, 2020 18:09

marianheil added the enhancement label Feb 10, 2020
