Compute statistics & changes for both the GO ontology and annotations

geneontology/go-stats

GO-stats for the GO release pipeline

Compute statistics and changes for both the GO ontology and annotations at every release and snapshot. The libraries/go-stats/ folder contains the scripts that are cloned and executed in the GO pipeline from go-site/scripts.

Changes to Gene Ontology Statistics

The changes to the statistics computed at each release are reported in the CHANGES.md file of this repository.

Details of libraries contained in this repository

go-stats

This is the Python package used to compute statistics over GO annotations. You can read more here.

General GO stats file access for the current release

General GO-CAM files for the current release

GO-stats for the experimental AWS pipeline

GO Notification System

The code checks the release date of the main pipeline (http://current.geneontology.org/metadata/release-date.json). When the date changes, it triggers a secondary pipeline by publishing a message to a dedicated SNS topic and updates the release date stored on the secondary pipeline.
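The check-and-notify step can be sketched as follows. This is a minimal illustration, not the code used in this repository: the function names are hypothetical, the "date" key in release-date.json is an assumption, and the publish step is passed in as a callable so the sketch runs without AWS credentials. In a real deployment that callable could wrap an SNS client's publish call.

```python
import json
from typing import Callable


def extract_release_date(payload: str, key: str = "date") -> str:
    """Read the release date from the metadata JSON (the key name is an assumption)."""
    return json.loads(payload)[key]


def check_release(payload: str, last_seen: str, publish: Callable[[str], None]) -> str:
    """Publish a message only when the release date differs from the stored one.

    Returns the date that should now be stored on the secondary pipeline.
    """
    current = extract_release_date(payload)
    if current != last_seen:
        publish(json.dumps({"release_date": current}))
    return current


# Simulated run: the first check sees a new date, the second sees no change.
sent = []
stored = check_release('{"date": "2024-01-17"}', "2023-12-04", sent.append)
stored = check_release('{"date": "2024-01-17"}', stored, sent.append)
print(stored, len(sent))  # 2024-01-17 1
```

Keeping the comparison separate from the transport makes the trigger logic easy to test without touching the network.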

GO Store Changes

The code loads the GO OBO file (http://purl.obolibrary.org/obo/go.obo) and compares the terms of the new release with those of the most recent previous release.
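A comparison of this kind can be sketched with a small [Term] stanza parser. This is an illustration under stated assumptions, not the repository's implementation: the function names, the categories of change reported, and the toy term IDs below are all hypothetical, and only the id, name and is_obsolete tags are read.

```python
def parse_terms(obo_text: str) -> dict:
    """Collect id -> {name, is_obsolete} for every [Term] stanza."""
    terms = {}
    current = None  # None while in the header or in a non-Term stanza
    for raw in obo_text.splitlines():
        line = raw.strip()
        if line.startswith("["):
            current = {} if line == "[Term]" else None
            continue
        if current is None or ": " not in line:
            continue
        tag, value = line.split(": ", 1)
        if tag == "id":
            terms[value] = current
        elif tag in ("name", "is_obsolete"):
            current[tag] = value
    return terms


def diff_terms(old: dict, new: dict) -> dict:
    """Report terms added, removed, renamed, or newly obsoleted between releases."""
    shared = old.keys() & new.keys()
    return {
        "added": sorted(new.keys() - old.keys()),
        "removed": sorted(old.keys() - new.keys()),
        "renamed": sorted(t for t in shared if old[t].get("name") != new[t].get("name")),
        "obsoleted": sorted(
            t for t in shared
            if new[t].get("is_obsolete") == "true" and old[t].get("is_obsolete") != "true"
        ),
    }


# Toy data with made-up IDs; real input would be two go.obo releases.
OLD = """format-version: 1.2

[Term]
id: GO:9000001
name: toy term A

[Term]
id: GO:9000002
name: toy term B

[Term]
id: GO:9000003
name: toy term C
"""

NEW = """format-version: 1.2

[Term]
id: GO:9000001
name: toy term A (renamed)

[Term]
id: GO:9000002
name: toy term B
is_obsolete: true

[Term]
id: GO:9000004
name: toy term D

[Typedef]
id: part_of
name: part of
"""

changes = diff_terms(parse_terms(OLD), parse_terms(NEW))
print(changes)
```

The [Typedef] stanza in the toy data shows that relation definitions are skipped, so only terms are compared.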

GO Update Statistics

The code sends queries to GOLr to fetch statistics about the GO annotations (e.g. per aspect, per species, per group).
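Since GOLr is a Solr index, per-category counts like these can be obtained with facet queries. The sketch below is an assumption-laden illustration rather than the repository's code: the endpoint URL is a placeholder, the field names document_category and aspect are assumed to exist in the GOLr schema, and the function names are hypothetical. The response parsing relies on Solr's default JSON layout, where a facet field is a flat list alternating values and counts.

```python
from urllib.parse import urlencode


def build_facet_query(select_url: str, facet_field: str) -> str:
    """Build a Solr query that returns only facet counts for one field."""
    params = [
        ("q", "*:*"),
        ("fq", "document_category:annotation"),  # assumed GOLr filter
        ("rows", "0"),                            # counts only, no documents
        ("wt", "json"),
        ("facet", "true"),
        ("facet.field", facet_field),
        ("facet.limit", "-1"),                    # no cap on facet values
    ]
    return select_url + "?" + urlencode(params)


def parse_facet_counts(response: dict, facet_field: str) -> dict:
    """Turn Solr's flat [value, count, value, count, ...] list into a dict."""
    flat = response["facet_counts"]["facet_fields"][facet_field]
    return dict(zip(flat[0::2], flat[1::2]))


# Placeholder endpoint and made-up counts, for illustration only.
url = build_facet_query("https://golr.example.org/solr/select", "aspect")
sample_response = {
    "facet_counts": {"facet_fields": {"aspect": ["P", 10, "F", 5, "C", 3]}}
}
per_aspect = parse_facet_counts(sample_response, "aspect")
print(url)
print(per_aspect)  # {'P': 10, 'F': 5, 'C': 3}
```

Changing the facet field (for example to a species or group field, whatever the schema calls them) would give the other breakdowns mentioned above.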

GO Update GO-CAMs

The code computes a number of views over the GO-CAM data (e.g. models, gene products, GO terms) using the GO SPARQL endpoint.
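One such view could be sketched as below. Everything specific here is an assumption: the endpoint URL is a placeholder, the query presumes each GO-CAM model is a named graph typed as owl:Ontology, and the function names are hypothetical. The response parsing follows the standard SPARQL JSON results format, in which each binding maps a variable name to an object with a "value" field.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Hypothetical query: counts models, assuming one owl:Ontology per named graph.
MODEL_COUNT_QUERY = """
PREFIX owl: <http://www.w3.org/2002/07/owl#>
SELECT (COUNT(DISTINCT ?model) AS ?models)
WHERE { GRAPH ?model { ?model a owl:Ontology } }
"""


def build_sparql_request(endpoint: str, query: str) -> Request:
    """Prepare (but do not send) a GET request asking for JSON results."""
    url = endpoint + "?" + urlencode({"query": query})
    return Request(url, headers={"Accept": "application/sparql-results+json"})


def parse_bindings(results: dict) -> list:
    """Flatten SPARQL JSON results into a list of {variable: value} rows."""
    return [
        {var: cell["value"] for var, cell in binding.items()}
        for binding in results["results"]["bindings"]
    ]


# Placeholder endpoint and a made-up response, for illustration only.
request = build_sparql_request("https://sparql.example.org/sparql", MODEL_COUNT_QUERY)
sample_results = {
    "head": {"vars": ["models"]},
    "results": {
        "bindings": [
            {
                "models": {
                    "type": "literal",
                    "datatype": "http://www.w3.org/2001/XMLSchema#integer",
                    "value": "42",
                }
            }
        ]
    },
}
rows = parse_bindings(sample_results)
print(rows)  # [{'models': '42'}]
```

Values come back as strings regardless of datatype, so numeric views would need an explicit conversion such as int(rows[0]["models"]).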
