Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

File dump-related datacheck updates. #388

Merged
merged 4 commits into from
Jul 7, 2021

Conversation

james-monkeyshines
Copy link
Contributor

Added a check for Rapid Release display name.

Split the existing check on INSDC names into two, because only part of it is needed for exporting GFF3 formatted for ENA submission, and we want to be able to run just that bit. The INDSC name check requires all top-level sequences to have an INSDC name/synonym (rather than a set of named coord_systems), which matches the ENA submission use case.

Both of these datachecks are intended to be run within file dump pipelines, to check that the core databases contain all the information required for dumping (Ensembl/ensembl-production#576).

James Allen added 3 commits June 29, 2021 12:35
In order to embded the datacheck in a pipeline for exporting
GFF3 formatted for ENA submission, split the existing check
on INSDC names into two, because part of it is not relevant.
Tweak the INDSC name check to be a bit more general, and to
match the ENA submission use case, which requires all top-level
sequences to have an INSDC name/synonym.
Added a check for Rapid Release display name.
Gave the ENA submission group a more generic name.

Both of these datachecks are intended to be run within
file dump pipelines, to check that the core databases
contain all the information required for dumping.
@coveralls
Copy link

coveralls commented Jul 1, 2021

Pull Request Test Coverage Report for Build 1799

  • 0 of 0 changed or added relevant lines in 0 files are covered.
  • No unchanged relevant lines lost coverage.
  • Overall coverage remained the same at 98.647%

Totals Coverage Status
Change from base Build 1786: 0.0%
Covered Lines: 2261
Relevant Lines: 2292

💛 - Coveralls

@james-monkeyshines james-monkeyshines marked this pull request as ready for review July 2, 2021 08:27
@james-monkeyshines james-monkeyshines merged commit 5958f52 into release/105 Jul 7, 2021
@james-monkeyshines james-monkeyshines deleted the feature/ena_format_gff3 branch July 9, 2021 12:40
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants