looper documentation; complete(?) todo list #254

stolarczyk · 2020-04-29T01:30:52Z

looper has changed a lot. This is a list of new features that we will need to describe in the documentation before the release:

--settings arg can be used to point to YAML file with template variables.
compute settings priority order:

CLI (--settings YAML file specification < --compute itemized specification)
config
pipeline interface
divcfg

.looper.yaml in the project directory can be used to pre-set CLI options
arguments priotity order:

CLI
looper dotfile
argparser defaults

already opened issues:

The text was updated successfully, but these errors were encountered:

nsheff · 2020-05-04T14:38:06Z

can you clarify compute settings priority order? Shouldn't it be:

Looper CLI (--compute)
PEP config, project.looper.compute section
Pipeline interface, pipeline.compute section
Activated divvy compute package (--settings or --package)

stolarczyk · 2020-05-04T15:16:37Z

your order reflects current implementation, except for --settings.

I thought that --settings is a YAML version of --compute. If that statement is correct, it does not make sense to me to make it not overwrite settings specified in the config files (2. and 3.).

nsheff · 2020-05-04T15:41:59Z

Ok, you could think of it that way. the other way to think of it is that --settings and --package are both divvy modifiers; so we let divvy handle them however it does. We then do what we're going to do after that.

nsheff · 2020-05-04T15:42:39Z

but wait, I guess --compute is also a divvy modifier, isn't it? So, yeah.

nsheff · 2020-05-04T15:43:55Z

the question is: is settings more like package or more like compute. I guess I thought of it as similar to package.

stolarczyk · 2020-05-04T15:53:40Z

I implemented this based on this sentence from #245 :

-s SETTINGS, --settings SETTINGS -- YAML file with job settings to populate the template.

so it was just my interpretation. But I think I still don't understand your intent; why would you to need to specify --package in a YAML file? It does not spare any typing since you'd need to provide a path to the file anyway.

Is this the YAML file you'd use?

package: local

I thought we want:

mem: 10000
cores: 2
jobname: test

nsheff · 2020-05-04T16:01:32Z

the second example is correct.

the settings is just between package and compute. It's legitimate to put it in either place, I think. I just don't think I would use settings for on-the-fly stuff... I'd use it more like an ad-hoc package. that's why I put it lower priority. you think of it as on-the-fly, so lumped it with compute for priority. it can be used in either way.

I think the way you did it is fine.

nsheff · 2020-05-05T21:32:57Z

For the input schema section --

the input schema format is based on the extended PEP JSON-schema validation framework, but adds some additional looper-specific capabilities. The available extended sections are:

which is true of input schemas:

Looper uses exactly eido's functionality for validating input schema
the input schema extends eido's functionality

Then, I have the same question for output schemas -- but this, I'm pretty sure all the extra stuff extends outside the scope of eido, right? Because it's not used for validation, but for summarizer stuff.

stolarczyk · 2020-05-05T21:40:49Z

2 is true

eido is not aware of required_intput_attrs and input_attrs

Then, I have the same question for output schemas -- but this, I'm pretty sure all the extra stuff extends outside the scope of eido, right? Because it's not used for validation, but for summarizer stuff.

yes, eido is not used at all with respect to output schema

nsheff · 2020-05-05T22:43:45Z

eido is not aware of required_intput_attrs and input_attrs

Wait -- that's not what we said here:

nsheff · 2020-05-05T23:39:54Z

Ok, I guess the question I have is this: Does it makes sense to move required_intput_attrs and input_attrs into eido? That's how I thought it was implemented; it makes sense to me that way.

If the other way makes more sense, why?

stolarczyk · 2020-05-06T02:59:46Z

that's not what we said here:

I confused eido functionality with general PEP schema functionality, sorry..

inputs validation is currently a Sample class method, so it is implemented in peppy. This location is relic of the previous input handling with was a looper.Sample method. Even though the looper.Sample no longer exists and the method itself has been completely rewritten it stayed a Sample method. Maybe because it sticks the determined input sizes as a Sample attribute..

I agree, it makes much more sense to move it to eido.

nsheff · 2020-05-08T13:00:13Z

I agree, it makes much more sense to move it to eido.

Ok I've updated the docs to reflect this; looper now just refers to the eido docs, and the required_intput_attrs and input_attrs stuff is all described in the eido docs. I agree it makes more sense this way.

stolarczyk · 2020-05-08T13:04:37Z

great, I'll move the input validation to eido today

nsheff · 2020-05-11T14:15:41Z

if you have a moment, can you look through the latest dev version of the docs and see if we're missing anything?

stolarczyk · 2020-05-11T14:46:50Z

found an inconsistency in the docs in https://github.com/pepkit/looper/blob/dev/docs/defining-a-project.md#2-add-a-looper-section-to-your-pep related to #253

in the current implementationcommand_extra key in Project.looper will not be picked up by looper unless a dotfile is used. According to your suggestion the dotfile is a "project config" importing the main project config. Subsequently, the values from the dofile (enriched with the Project.looper section data) are used to pre set the CLI options.

The rest of the keys mentioned in the docs are valid, because I made the system backwards compatible. So for example the following scenario works: no dotfile, but Project.output_dir set in the config

stolarczyk · 2020-05-11T14:49:53Z

I guess we could make it possible, but this would require some retooling

nsheff · 2020-05-20T19:24:32Z

@stolarczyk do you consider the docs complete now? can close this issue?

stolarczyk · 2020-05-20T19:27:45Z

after recent round of changes related to looper section in the PEP the pipeline interaface selection strategy has changed. I don't know what is the current status of the docs regarding that.

The current implementation reflects what I wrote in the last comment in #244 minus the "behind the scenes" sample-based project pipeline interface selection.

Additionally, the command_template namespaces might be out of date

stolarczyk · 2020-05-20T19:28:28Z

I can read through the docs tonight and see if they reflect the reality ;)

nsheff · 2020-05-20T19:33:06Z

I think I finished updating the pipeline interface docs...

stolarczyk · 2020-05-21T00:46:08Z

I've read through the docs and made some changes, I think they are ready. But after mkdocs serve I get:

WARNING -  A relative path to 'hello-world.md' is included in the 'nav' configuration, which is not found in the documentation files 
WARNING -  Documentation file 'README.md' contains a link to 'hello-world.md' which is not found in the documentation files.

The hello world jupyter notebook is missing so the markdown file is no longer created, but linked in 2 places. We lost the entire docs_jupyter directory from the repo. Was that intentional?

nsheff · 2020-05-21T12:04:08Z

Was that intentional?

No, let me look into that.

nsheff · 2020-05-21T12:12:37Z

I deleted them in this commit somehow: 27ca124

I got it back with:

git checkout b99edb46a056c7021b01edeae3b1aa0c20f2bcf9 docs_jupyter/hello-world.ipynb
git checkout b99edb46a056c7021b01edeae3b1aa0c20f2bcf9 docs_jupyter/build/.gitignore

Now the docs build again.

nsheff · 2020-05-21T16:54:14Z

I'll declare these docs complete enough, then, and close this issue.

stolarczyk added the docs label Apr 29, 2020

stolarczyk added this to the 1.1 milestone Apr 29, 2020

stolarczyk mentioned this issue Apr 29, 2020

v1.2.0 #247

Merged

14 tasks

nsheff added a commit that referenced this issue May 4, 2020

correct compute order docs, See #254

fd8fdca

stolarczyk added a commit to pepkit/eido that referenced this issue May 8, 2020

implement inputs validation; pepkit/looper#254 (comment)

774d041

stolarczyk added a commit that referenced this issue May 8, 2020

use eido for input validation; #254 (comment)

566b062

stolarczyk added a commit that referenced this issue May 8, 2020

update changelog; #254

02d166d

stolarczyk added the likely-solved label May 8, 2020

nsheff closed this as completed May 21, 2020

nsheff pushed a commit to pepkit/eido that referenced this issue Jun 21, 2023

implement inputs validation; pepkit/looper#254 (comment)

ae92c4c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

looper documentation; complete(?) todo list #254

looper documentation; complete(?) todo list #254

stolarczyk commented Apr 29, 2020 •

edited

Loading

nsheff commented May 4, 2020 •

edited

Loading

stolarczyk commented May 4, 2020

nsheff commented May 4, 2020

nsheff commented May 4, 2020

nsheff commented May 4, 2020

stolarczyk commented May 4, 2020

nsheff commented May 4, 2020

nsheff commented May 5, 2020

stolarczyk commented May 5, 2020

nsheff commented May 5, 2020

nsheff commented May 5, 2020

stolarczyk commented May 6, 2020

nsheff commented May 8, 2020

stolarczyk commented May 8, 2020

nsheff commented May 11, 2020

stolarczyk commented May 11, 2020

stolarczyk commented May 11, 2020

nsheff commented May 20, 2020

stolarczyk commented May 20, 2020 •

edited

Loading

stolarczyk commented May 20, 2020

nsheff commented May 20, 2020

stolarczyk commented May 21, 2020 •

edited

Loading

nsheff commented May 21, 2020

nsheff commented May 21, 2020

nsheff commented May 21, 2020

looper documentation; complete(?) todo list #254

looper documentation; complete(?) todo list #254

Comments

stolarczyk commented Apr 29, 2020 • edited Loading

nsheff commented May 4, 2020 • edited Loading

stolarczyk commented May 4, 2020

nsheff commented May 4, 2020

nsheff commented May 4, 2020

nsheff commented May 4, 2020

stolarczyk commented May 4, 2020

nsheff commented May 4, 2020

nsheff commented May 5, 2020

stolarczyk commented May 5, 2020

nsheff commented May 5, 2020

nsheff commented May 5, 2020

stolarczyk commented May 6, 2020

nsheff commented May 8, 2020

stolarczyk commented May 8, 2020

nsheff commented May 11, 2020

stolarczyk commented May 11, 2020

stolarczyk commented May 11, 2020

nsheff commented May 20, 2020

stolarczyk commented May 20, 2020 • edited Loading

stolarczyk commented May 20, 2020

nsheff commented May 20, 2020

stolarczyk commented May 21, 2020 • edited Loading

nsheff commented May 21, 2020

nsheff commented May 21, 2020

nsheff commented May 21, 2020

stolarczyk commented Apr 29, 2020 •

edited

Loading

nsheff commented May 4, 2020 •

edited

Loading

stolarczyk commented May 20, 2020 •

edited

Loading

stolarczyk commented May 21, 2020 •

edited

Loading