Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to correctly add Source Modifiers for genbank submisson? #8

Closed
ammaraziz opened this issue Jun 21, 2023 · 1 comment
Closed

How to correctly add Source Modifiers for genbank submisson? #8

ammaraziz opened this issue Jun 21, 2023 · 1 comment

Comments

@ammaraziz
Copy link

ammaraziz commented Jun 21, 2023

Hi,

Thanks for creating and maintaining this very useful program.

I am trying to include patient metadata for the NCBI submission part of the process. Similar to gisaid which allows gender, passage etc, NCBI allows source modifiers such as sex.

I am not 100% sure how to include such information for the NCBI part of the metadata in the config file.

genbank_src_metadata:
  column_names:
    isolate: genbank_name
    host: host
    country: location
    isolation-source: isolation_source

Lets say I wanted to include the ncbi source modifier Sex (assuming I have a column in my metadata called gender), would I add the following:

genbank_src_metadata:
  column_names:
  ....
  Sex: gender

Is that correct?

A related question, for the structured data section eg:

COMMENT     ##Assembly-Data-START##
            Assembly Method       :: CLC Genomics
            Sequencing Technology :: PacBio Sequel II
            ##Assembly-Data-END##

How can I add more information than?

My main aim is to match all the metadata that is required in gisaid to ncbi submission.

Thanks,

Ammar

@dthoward96
Copy link
Collaborator

That is correct. You can add as many extra columns to those fields in the config file as you have available. It works the same way for editing the structured comment file. You just need to make sure you add the correct field names that match your data's header to the structured comment section of the config file. For available fields for the structured comment this provides some of that information: https://www.ncbi.nlm.nih.gov/genbank/structuredcomment/

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants