Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ITAG gene naming for novel genes #60

Closed
phosmani opened this issue Jan 6, 2017 · 5 comments
Closed

ITAG gene naming for novel genes #60

phosmani opened this issue Jan 6, 2017 · 5 comments
Assignees

Comments

@phosmani
Copy link
Member

phosmani commented Jan 6, 2017

Naming ITAG genes with Solyc ID convention.
In ITAG3.0 there are several novel genes compared to ITAG2.40 which needs Solyc ID. In general, the ID should reflect chromosome number and genomic position.

Conditions
-conflict with any previous Solyc ID

@phosmani phosmani changed the title ITAG gene naming ITAG gene naming for novel genes Jan 6, 2017
suryasaha added a commit that referenced this issue Jan 9, 2017
suryasaha added a commit that referenced this issue Jan 13, 2017
suryasaha added a commit that referenced this issue Jan 14, 2017
@suryasaha
Copy link
Member

Does not handle Solyc ids in cases where > 9 genes are added between 2 Solyc ids. Just puts a placeholder for now.

@suryasaha
Copy link
Member

Novel genes should have ids starting in the middle of the interval

Nof genes Number
1 15
2 13,17
3 13,15,17
4 12,14,16,18
5 11,12,14,16,18
6 11,12,13,14,16,18
7 11,12,13,14,15,16,18
8 11,12,13,14,15,16,17,18
9 11,12,13,14,15,16,17,18,19

@suryasaha suryasaha reopened this Jan 25, 2017
@phosmani
Copy link
Member Author

chromosome 7 issue. Novel gene ID assignment for first genes at the start of chromosome. There are 3 novel genes before old Solyc ID comes up. First one got correct assignment (Alias=Solyc07g004991). But next two genes gives error.
Error:
Parsing GFF3 line 15 of 33349 to count novel genesUse of uninitialized value $prev_mRNA_ITAG24_Solycid in exists at /home/prashant/tools/Bio-GenomeUpdate/scripts/update_maker_names_gff_hard_coded.pl line 113.
Use of uninitialized value $prev_mRNA_ITAG24_Solycid in hash element at /home/prashant/tools/Bio-GenomeUpdate/scripts/update_maker_names_gff_hard_coded.pl line 118.
Parsing GFF3 line 25 of 33349 to count novel genesUse of uninitialized value $prev_mRNA_ITAG24_Solycid in exists at /home/prashant/tools/Bio-GenomeUpdate/scripts/update_maker_names_gff_hard_coded.pl line 113.
Use of uninitialized value $prev_mRNA_ITAG24_Solycid in hash element at /home/prashant/tools/Bio-GenomeUpdate/scripts/update_maker_names_gff_hard_coded.pl line 115.

@suryasaha
Copy link
Member

Using hard coded intervals. Need to automate the namespace distribution
39c2d19

@suryasaha
Copy link
Member

9964e0b

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants