Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

FlyBase to supply file of PMID, genes, alleles #1743

Closed
ValWood opened this issue Jan 22, 2019 · 6 comments
Closed

FlyBase to supply file of PMID, genes, alleles #1743

ValWood opened this issue Jan 22, 2019 · 6 comments
Assignees
Labels

Comments

@ValWood
Copy link
Member

ValWood commented Jan 22, 2019

File of PMID, genes, alleles
PomBase has a JSON file that we export each night.
Make the same format with minimal info about genes and alleles.

@kimrutherford will supply a snippet of our JSON output

@ValWood
Copy link
Member Author

ValWood commented Jan 22, 2019

Notes:

  • Will probably proceed by loading all alleles as type “other” (alleles are already described in FB)
    • “Name” and “description are available
  • Expression part not required (creation, editing or display)

kimrutherford added a commit that referenced this issue Feb 18, 2019
Current status: the JSON is parsed, the publication details are fetched
from PubMed and stored and a new session is created for the publication.

Refs #1743
Refs #1779
kimrutherford added a commit that referenced this issue Feb 18, 2019
The "canto_add.pl --sessions-from-json ..." can now add genes from the
JSON file to the new session.

#1743
@kimrutherford
Copy link
Member

I've made a separate issue about the file format: #1779

As a first step, I've add a new flag to the canto_add.pl script: --sessions-from-json. So far the code will:

  • parse a JSON file
  • lookup the PubMed ID(s) using the PubMed web service
  • store the publication details (title, abstract, authors)
  • create an empty session for the pub
  • add genes to the session using their systematic ID / uniquename

It can only parse files that look like this:

{
   "PMID:2120045": {
       "genes": ["FBgn0004107", "FBgn0016131"]
    },
    "PMID:1234567": {
       "genes": [...]
    }
}

I haven't done any work on loading alleles or genotypes. I wait until after the next Skype call since there are still many things to decide.

The changes are on a branch for now: https://github.com/pombase/canto/tree/fly-canto-dev

@kimrutherford
Copy link
Member

PMID, genes, alleles

We can support genes, alleles and aberrations now. Is there anything else from FlyBase that needs be loaded into Canto?

@gm119
Copy link
Collaborator

gm119 commented Mar 22, 2019

We can support genes, alleles and aberrations now. Is there anything else from FlyBase that needs be loaded into Canto?

well thats exciting !

I will double check with @vmt25 if we need any other type of object (I don't think we do,but will be good to double check !)

@vmt25
Copy link

vmt25 commented Mar 29, 2019

Yes, that is all we need to go forward

@kimrutherford
Copy link
Member

I'll close this issue because I think we're done. If changes or additions are needed we can open new issues.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

4 participants