-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Nextflow pipeline removes leftover folders/files #23
Conversation
- If the pipeline is stopped while Ks estimate processes are still ongoing, there might be tmp folders and incomplete file leftovers. They must be deteleted, and now the pipeline does it automatically as closure step. - Both paralog and ortholog folders are cleaned - If BLAST tmp folder is found, delete also the associated and incomplete BLAST TSV file - If Ks tmp folder is found, delete it
Shall we also delete any core dumps? And delete any empty ortholog_distributions/wgd_species_species/ and paralog_distributions/wgd_species/ folders? Could there be any temporary i-adhore files/folders that need to be cleaned up? If the pipeline stops with an error, may it help in any way to keep any of these files to figure out what went wrong? If so, maybe introduce an optional parameter that prevents any removal of temporary files. |
> Shall we also delete any core dumps? > And delete any empty ortholog_distributions/wgd_species_species/ and paralog_distributions/wgd_species/ folders? > Could there be any temporary i-adhore files/folders that need to be cleaned up? > If the pipeline stops with an error, may it help in any way to keep any of these files to figure out what went wrong? If so, maybe introduce an optional parameter that prevents any removal of temporary files. |
If we make the pattern to detect them specific enough it's unlikely to have some other similarly-named file deleted.
Yes, delete by default, but keep when parameter is set. But use a Nextflow parameter for this, and not one in the ksrates config file, since this is specific to Nextflow (at least for now). |
- Nextflow parameter added to the main.nf pipeline that switches on/off the automatic deletion of wgd and i-ADHoRe leftover folders such as ks_tmp. - Default true: it is automatically deleted when the pipeline crashes. - Can be turned off either from the NF configuration file or from the command line. - Note: configuration files must still be updated
UPDATE:
Note: if we are going to remove the |
I don't think we need to put this parameter in the config files, we can document it and use it in the Nextflow command line. We should use the same notation for these parameter names, so far we have used camelCase notation ( |
Thanks for pointing that out!
|
Thanks! Don't forget to look at my comment on the code that defines |
Update:
|
- Remove wgd sub-directories (i.e. wgd_species and wgd_species_species) if they end up being empty after removing all leftover temporary directories.
…B/ksrates into nextflow_delete_leftovers
Thanks, looks all good! |
UPDATE:
|
processes are still ongoing, there might be
tmp folders and incomplete file leftovers.
They must be deteleted, and now the pipeline
does it automatically as closure step.
associated and incomplete BLAST TSV file