-
Notifications
You must be signed in to change notification settings - Fork 37
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Is profile-based design with custom genomes possible? #177
Comments
Actually this is possible, yes! You will need to use the |
I am not interested in gsa or pooled_gsa. Do I still need to find a unique NCBI ID or can I just use 2? For example in the reference_file.tsv can an entry look like this? |
You don't need to provide these NCBI IDs then, so yes it could look like this, though just to be safe I'd advise using absolute paths to your genomes. |
What does the biom file need to contain if an entry in the reference file looks like the one above? |
Actually, looking at the code right now I was mistaken. For CAMISIM to work, every entry in the reference file needs to have a "correct" NCBI ID and scientific name, if you choose |
I do not have the NCBI ID for all my custom genomes. Is there another way to make the profile-based design work, or is de-novo design (which I am able to run) be the best option? |
Since you do not use CAMISIM's option to download genomes the de novo design might actually be best (and more accurate). To use the abundances from the input profile you would need to use the |
Do I need to change the phase in the config file if I am using the |
No, you should not need to change the phase, CAMISIM will automatically use the files if they are provided. Note that for multiple samples these need to be absolute paths and comma-separated without whitespace: |
Should the tsv files include headers? |
No, these do not need a header, just genome_ID and abundance tab-separated |
I want to use my own fasta files as input genomes to the simulation. I am wondering if profile-based community design is possible with custom genomes. i.e. I do not want to use genomes from NCBI. Would it be an option to provide my own genome sequence collection in the profile based design, like I can do in the de-novo based design?
The text was updated successfully, but these errors were encountered: