Question: Coverage not calculated for single_easy_bin with less than 5 .bam files #167

Open
Sebastien-Raguideau opened this issue Jul 24, 2024 · 2 comments

Comments

@Sebastien-Raguideau

Hello,

I saw in your code that, when using single_easy_bin, coverage is only used if there are 5 or more .bam files.

So, just curious about it. Is there a reason why coverage is not used when there are fewer than 5 samples and a single assembly?

Do I need to use multi_easy_bin if I want to use coverage anyway, and will multi_easy_bin work with a single assembly? Should I even use coverage information with fewer than 5 samples?

Best,
Seb

@luispedro
Member

Coverage is always used; it is just processed differently.

@Sebastien-Raguideau
Author

Hi,

Thanks for the swift answer.

I spent more time reading the code. I think I understand that coverage is not used for training on the must-link part. For these, data_split consists of only kmer_split rather than the combined data including coverage for the split contigs.

So, I suppose I would just like to know more about this. Is using this information detrimental to SemiBin results when the number of samples is below 5?
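
To make the reading above concrete, here is a minimal sketch of the behaviour as described in this comment. It is not SemiBin's actual code: the function `build_split_features`, the name `cov_split`, and the constant `MIN_SAMPLES_FOR_COVERAGE` are hypothetical, and the threshold of 5 is simply the number mentioned in this issue. Only `data_split` and `kmer_split` are names taken from the discussion.

```python
from typing import List, Sequence

# Assumption: the threshold of 5 samples comes from this issue, not from
# a verified reading of the SemiBin source.
MIN_SAMPLES_FOR_COVERAGE = 5


def build_split_features(
    kmer_split: Sequence[Sequence[float]],
    cov_split: Sequence[Sequence[float]],
    n_samples: int,
    min_samples: int = MIN_SAMPLES_FOR_COVERAGE,
) -> List[List[float]]:
    """Hypothetical helper: build must-link training features for split contigs.

    Each row corresponds to one split contig. With fewer than `min_samples`
    samples, only k-mer features are returned; otherwise k-mer and coverage
    features are concatenated row by row.
    """
    if len(kmer_split) != len(cov_split):
        raise ValueError("kmer_split and cov_split must have the same number of rows")

    if n_samples < min_samples:
        # Few samples: data_split consists of kmer_split only.
        return [list(row) for row in kmer_split]

    # Enough samples: data_split combines k-mer and coverage features.
    return [list(k) + list(c) for k, c in zip(kmer_split, cov_split)]
```

With two split contigs, two k-mer features each, and three samples, the returned rows would keep only the two k-mer columns; with five samples, each row would carry the k-mer columns followed by the five coverage columns.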

Best,
Seb
