Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

"assembly best practice" -- additional flye option and SPAdes trusted contigs option #10

Open
hoelzer opened this issue Dec 14, 2019 · 4 comments
Assignees
Labels
enhancement New feature or request

Comments

@hoelzer
Copy link
Collaborator

hoelzer commented Dec 14, 2019

@sandraTriebel see here a nice guide for Nanopore/ hybrid genome assembly:

https://achri.blogspot.com/2019/12/nanopore-bacterial-genome-assemblies.html?m=1

Whereas here the focus is fast execution using a GPU, the tools and pipeline are interesting and not so different from what you already implemented.

[1]
flye --nano-raw barcode06.fastq --threads 8 --iterations 2 --plasmids -g 3m --out-dir barcode06
This is the flye command used here. Interesting: --iterations parameter that already seems to do some kind of polishing. Maybe we also want to have this.

[2]
The other really interesting part in my eyes:

Not only using the short reads for polishing but instead, integrate them again into the assembly process while using the long-read-only assembly as a real backbone. For this, the author use SPAdes with the --trusted-contigs option and passes the long-read polished contigs as a trusted set of sequences. Then they use pilon for polishing the SPAdes result using the short reads. I think you also tried pilon at some point?

spades.py -o spades --trusted-contigs medaka/consensus.fasta -1 /path/to/illumina/sample_R1_001.fastq.gz  -2 /path/to/illumina/sample_R2_001.fastq.gz

The question is: do we really need this in our case? Or: how difficult would it be for you to also implement a SPAdes rule that uses the Nanopore assembly with the error-corrected short reads as an input? So that we can compare?

@hoelzer hoelzer added the enhancement New feature or request label Dec 14, 2019
@sandraTriebel
Copy link
Member

It would not be that complicated.
SPAdes is already installed and I can write the rule with the command you mentioned above.

@hoelzer
Copy link
Collaborator Author

hoelzer commented Dec 16, 2019

Ok that's great! Then please add the spades rule using the polished long reads and the error corrected short reads as an input

@sandraTriebel
Copy link
Member

So, now we'll use the pipeline:
flye (w/ default polishing - 1 iteration) -> 4x Racon LR -> 1x medaka LR -> 4x Racon SR -> SPAdes w/ trusted contig option

or flye with 2 iterations?

@hoelzer
Copy link
Collaborator Author

hoelzer commented Dec 16, 2019 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants