(Preserve old feature) - use of Channel.create() to separate Channel logic from process logic #1269

micans · 2019-08-09T09:59:55Z

Channel.create() allows a different way of organising the code, separating all channel code from all process code (see below). I am not 100% convinced by the argument that Channel.create() is misused; there are also cases where it is useful, and I am now running a Pipeline That Shall Not Be Named where the authors do all sorts of hell-raising tricks with most of the NF language, e.g.

    beforeScript "rsync -az ${outdir}/* ."
    afterScript "rsync -az ./* ${outdir}/"

(Bad code can be written in any language). I'll end with a small example of the channel/process separation; would this still be supported in DSL2 in some other way? create() can be avoided by rearranging the code but the point is that by grouping all the channel logic together the flow logic of the program is easier to grasp (imagine this scaling to 10 processes with as least as many channels between them).

This use case is quite possibly not on anyone else's radar and I don't want to make it into a huge thing. I encountered it only very recently and think it's cool. I'll try to link to a more fully fledge example ASAP.

// This is a variant on http://nextflow-io.github.io/patterns/index.html#_problem_19
params.includeB = false

ch_input = Channel.create()
ch_BC = Channel.create()

ch_input.flatMap().map { f -> [f.text.trim(), f] }.view()
  .tap { ch_AC }
  .until { !params.includeB }
  .set { ch_AB }

ch_AC.until { params.includeB }.mix(ch_BC).set{ ch_C }

// Above is all the channel plumbing, below are all the process definitions.

process processA {        // Create a bunch of files, each with its ow sample ID.
  output: file('*.txt') into ch_input
  script: 'for i in {1..7}; do echo "sample_$i" > f$i.txt; done'
}

process processB {
  input:  set val(sampleid), file(thefile) from ch_AB
  output: set val(sampleid), file('out.txt') into ch_BC
  script: "(echo 'B process'; cat $thefile; md5sum $thefile) > out.txt"
}

process processC {
  publishDir "results"
  input: set val(sampleid), file(a) from ch_C.view()
  output: file('*.out')
  script: "(echo 'C process'; cat $a) > ${sampleid}.out"
}

The text was updated successfully, but these errors were encountered:

The deprecation message for the channel create and close operation has been removed to not annoy users of pipeline which want to stay with the current syntax. However the deprecation remains for those methods when using DSL2. Related: #1269

pditommaso · 2019-08-19T09:05:53Z

In the above example, it's enough to move the snippet

ch_input.flatMap().map { f -> [f.text.trim(), f] }.view()
  .tap { ch_AC }
  .until { !params.includeB }
  .set { ch_AB }

after processA and the ch_input = Channel.create() is not needed any more.

The main problem with create is that with the new syntax the channel emission needs to be deferred after the script evaluation to infer the complete full datatflow network and allow the implicit forking of multi-read channels.

The create would give the user the ability to break this model causing the execution to stall.

However, the idea is to not break existing code without bothering users who do not want to move to the new syntax. Therefore I've removed the deprecation messages for create and close methods 454e78f.

micans · 2019-08-19T09:15:54Z

Well this particular example is too small to be convincing, but anyway, it's great if these methods will not be deprecated!

pditommaso · 2020-01-29T14:06:46Z

@JohnHadish can you provide an example of how you are Channel.create() and why the code could not be refactored without using it?

pditommaso added the lang/dsl2 label Aug 17, 2019

pditommaso added a commit that referenced this issue Aug 18, 2019

Show channel.create deprecation only with DSL2 #1269

fd11f8d

pditommaso mentioned this issue Aug 19, 2019

Fork operator [was separate] #1281

Closed

JohnHadish mentioned this issue Sep 9, 2019

Nextflow Version 19.07.0 depreciates "create" SystemsGenetics/GEMmaker#129

Closed

pditommaso closed this as completed Feb 2, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

(Preserve old feature) - use of Channel.create() to separate Channel logic from process logic #1269

(Preserve old feature) - use of Channel.create() to separate Channel logic from process logic #1269

micans commented Aug 9, 2019

pditommaso commented Aug 19, 2019

micans commented Aug 19, 2019

pditommaso commented Jan 29, 2020

(Preserve old feature) - use of Channel.create() to separate Channel logic from process logic #1269

(Preserve old feature) - use of Channel.create() to separate Channel logic from process logic #1269

Comments

micans commented Aug 9, 2019

pditommaso commented Aug 19, 2019

micans commented Aug 19, 2019

pditommaso commented Jan 29, 2020