Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Resolve input from all tasks in join #35

Closed
thejmazz opened this issue Jul 11, 2016 · 3 comments
Closed

Resolve input from all tasks in join #35

thejmazz opened this issue Jul 11, 2016 · 3 comments
Assignees
Labels

Comments

@thejmazz
Copy link
Member

thejmazz commented Jul 11, 2016

mpileupandcall needs the reference and bam, but as the per the join lineage:

const call = join(alignAndSort, samtoolsIndex, mpileupandcall)

it will only receive whichever samtoolsIndex provides as resolved output, which will just be the bai.
alignAndSort produces the bam, and receives the reference.

Current fix is to add

skipPassed: true

to the task so that it resolves input glob patterns from fs and not the output of the previous task.

So perhaps store all input and outputs of all tasks in a join lineage, and resolve inputs for tasks later on.

Nextflow does this by declaring "channels", pushing files into them, and pulling files out of them, explicitly, in separate tasks. Problem with that was need to manually duplicate channels to use the same file in other downstream tasks.

@thejmazz thejmazz mentioned this issue Jul 11, 2016
38 tasks
@thejmazz
Copy link
Member Author

  • some sort of state maintained (key, value) to access previously maintained files
  • needs to have some sort of uniqueness so that there are no overlaps

@thejmazz
Copy link
Member Author

thejmazz commented Jul 11, 2016

@yannickwurm

flexibility

  1. rna sras (ants, flies, etc.)
  2. filter
  3. transcriptome assembly (trinity -> trinity.fasta)
  4. pull out longest transcript
  5. take sra reads that contributed to assembly, map them to assembly (e.g. blast)
  6. fastq that only contains reads that contributed to this longest transcript

(Made it its own issue: #37)

@thejmazz
Copy link
Member Author

uses DAG

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
Development

No branches or pull requests

2 participants