Customizing location of flag files in the results_pipeline dir #34

nsheff · 2020-06-29T12:46:59Z

Right now, project.results_pipeline is used as the subdir within the project.output_dir. Looper then adds the sample name on to the end of this folder automatically, and then looks there for flags and summarizer files

The refgenie pipeline shows that we need more flexibility. We're setting up the pipeline to place flags and outputs in a much deeper folder (so we can exclude it from archive checksumming, etc). But there's no way to tell looper about this, and hence, flag checks and summarizers will never work with this pipeline.

This could be solved by making the 'results_pipeline' + {sample.name} hard-coding into a template that can be populated with sample properties, which should point directly to the output folder.

Related to pepkit/looper#242

The text was updated successfully, but these errors were encountered:

nsheff · 2020-08-31T20:37:17Z

a thought: for cloud-based job execution, maybe it should even be possible to read flags on s3.

nsheff · 2020-09-17T15:55:08Z

This may actually be solvable now using the new paths section in pepkit/looper#285.

if the results pipeline is a templatable path, then it can be specified in the pipeline interface and looper will know where to look. I guess it would just be a reserved path key, like "looper_flag_path" or "pipestat_path" or something, and looper would know where to look if that path was provided.

nsheff · 2023-04-26T17:32:32Z

With the spinoff of the status flag system into pipestat, this is now a pipestat issue, really; the question is, how will pipestat status flags be saved? will the pipeline interface provide a way to specify the location of the flags?

nsheff · 2023-04-26T17:34:30Z

I think this might make sense to add as an attribute in the pipestat schema, maybe under the status components?

Or wait, would it make more sense to add to the looper pipeline interface (maybe I moved it from looper to pipestat prematurely...)

donaldcampbelljr · 2023-06-14T20:03:26Z

Currently, the user can pass flag_file_dir to the PipestatManager class during initialization or provide the directory in the config file.

Then, pipestat determines:

  flag_file_dir = self[CONFIG_KEY].priority_get(
      "flag_file_dir", override=flag_file_dir, default=os.path.dirname(self.file)
  )
  self[STATUS_FILE_DIR] = mk_abs_via_cfg(flag_file_dir, self.config_path)

nsheff added the priority-high label Jul 1, 2020

stolarczyk self-assigned this Jul 2, 2020

nsheff mentioned this issue Aug 31, 2020

Looper summarizer specification and spin-off pepkit/looper#242

Closed

nsheff mentioned this issue Mar 18, 2021

requirement of pipestat refgenie/refgenomes.databio.org#7

Closed

nsheff changed the title ~~results_pipeline flexibility~~ Customizing location of flag files in the results_pipeline dir Apr 26, 2023

nsheff unassigned stolarczyk Apr 26, 2023

nsheff transferred this issue from pepkit/looper Apr 26, 2023

nsheff added this to the v0.4.0 milestone Apr 26, 2023

nsheff assigned vreuter Apr 26, 2023

nsheff removed the priority-high label Jun 13, 2023

donaldcampbelljr added the likely-solved label Jun 15, 2023

donaldcampbelljr linked a pull request Jun 29, 2023 that will close this issue

v0.4.0 Pipestat Release #61

Merged

1 task

donaldcampbelljr closed this as completed in #61 Jun 29, 2023

donaldcampbelljr mentioned this issue Jul 10, 2023

No job status flags in results folder pepkit/looper#326

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Customizing location of flag files in the results_pipeline dir #34

Customizing location of flag files in the results_pipeline dir #34

nsheff commented Jun 29, 2020

nsheff commented Aug 31, 2020

nsheff commented Sep 17, 2020

nsheff commented Apr 26, 2023

nsheff commented Apr 26, 2023

donaldcampbelljr commented Jun 14, 2023

Customizing location of flag files in the results_pipeline dir #34

Customizing location of flag files in the results_pipeline dir #34

Comments

nsheff commented Jun 29, 2020

nsheff commented Aug 31, 2020

nsheff commented Sep 17, 2020

nsheff commented Apr 26, 2023

nsheff commented Apr 26, 2023

donaldcampbelljr commented Jun 14, 2023