Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

atlas ,how to not bin? #549

Closed
luozhy88 opened this issue Sep 6, 2022 · 15 comments
Closed

atlas ,how to not bin? #549

luozhy88 opened this issue Sep 6, 2022 · 15 comments

Comments

@luozhy88
Copy link

luozhy88 commented Sep 6, 2022

I need get a taxnomy and otu table which the input from contings , but the atlas create a taxnomy which contain binning ID.

atlas run genomes --skip-binning --resources mem=600 -j 130 is error!

@SilasK
Copy link
Member

SilasK commented Sep 6, 2022

Hmm, I'm happy to help ou but I don't understand what you need.
What microbiome do you have?

Atlas is designed to assemble and bin genomes from your reads and then use them for quantification.

Depending on the microbiome you want you could get reference genomes that are already assembled. In which case you don't need to assemble.

You can also taxonomically annotate your contigs but don't get an OTU.

What do you exactly mean by OTU, usually the term is used for 16S sequencing.

@luozhy88
Copy link
Author

luozhy88 commented Sep 6, 2022

Thanks,I want to annotate my contigs by atlas.How?

atlas run genomes --skip-binning

image

@SilasK
Copy link
Member

SilasK commented Sep 7, 2022

Set "SemiBin" as final_binner in the config file. And run atlas

atlas run binning --omit-from semibin_train

Run this first with the --dryrun option to check what it does.

It should filter contigs <1500bp and annotate each sample with the taxonomy

@SilasK SilasK closed this as completed Sep 8, 2022
@luozhy88
Copy link
Author

luozhy88 commented Sep 8, 2022

Thanks! it works!
image
but I have a problem about raw count.I need a table which the row names are contings and the cloumns are SampleID.How?
image

@SilasK
Copy link
Member

SilasK commented Sep 9, 2022

For each sample you have the coverage in "{sample}/assembly/contig_stats/postfilter_coverage_stats.txt" in the same folder you even find it by base and by 1kb block if this is of interest to you.

@luozhy88
Copy link
Author

luozhy88 commented Sep 9, 2022

if we want to analyze different samples for difference analysis for contings, then we can make a unified table. The columns are SampleID and the rownames are cotings. How can I get the table ?

when I use "atlas run binning --omit-from semibin_train", the output is the table which contain only one sample.

@SilasK
Copy link
Member

SilasK commented Sep 9, 2022

Can you explain what you want to do? I do not do a coassembly. each sample get's assembled, separately. also, the taxonomic annotation is per sample.

I create a unified genecatalog and a unified set of MAGs for quantification. However, I do not know how to combine all the contigsas there are many partially similar contigs in different samples.

@luozhy88
Copy link
Author

luozhy88 commented Sep 9, 2022

Thanks, Yes,I want to combine all the contigs in different samples in one table. Maybe I need use other method

@luozhy88
Copy link
Author

Because I changed the samples.tsv file, and save as samles_new.tsv . can I specify its absolute path?
atlas run qc --samples /home/zhiyu/atlas/samles_new.tsv ,right?

@SilasK
Copy link
Member

SilasK commented Sep 18, 2022

No you cannot specify the path to the samples.tsv. You should rename the old to samples_old.tsv and the new to samples.tsv. By the way if you have questions about the command line interface simply write --help .

@SilasK
Copy link
Member

SilasK commented Sep 18, 2022

Thanks, Yes,I want to combine all the contigs in different samples in one table. Maybe I need use other method

By the way there is a way to do what you want with atlas:

run:

atlas run None "Cobinning/vamb/coverage.tsv" <other params>

This uses minimap to map all reads to the combined filtered contigs, which you have taxonomically annotated before.
However I'm not sure how this approach handles the multi mapping.

It's uses in theory for vamb binning, but you are not interested in binding as I understood.

@luozhy88
Copy link
Author

luozhy88 commented Sep 18, 2022

#549 (comment)

I often run a lot of batch data by atlas if rename is troublesome. if sample.tsv is specified, it is convenient for me to record.

@SilasK
Copy link
Member

SilasK commented Sep 20, 2022

I don't understand.

@luozhy88
Copy link
Author

If I have 3 batches running in the same directory for sample.tsv, I will change my name three times. So I want to specify the name. Example: sample _ batch1.tsv sample _ batch2.tsv sample _ batch3.tsv.

@SilasK
Copy link
Member

SilasK commented Sep 20, 2022

Why would you run three batches in the same directory?

Maybe you don't know that atlas runs already everything in parallel and can be effiently be deployed on a cluster. See the docs.

Once the qc step passed the sample.tsv is no longer altered and you could run batches. If really necessary.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants