Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add datacheck CactusMetadataConsistency #560

Merged
merged 2 commits into from
Sep 13, 2023

Conversation

twalsh-ebi
Copy link
Contributor

Description of the problem

Issues have arisen in recent releases with Cactus genomic alignment metadata consistency: in one case a retired GenomeDB was present in the HAL mapping which links a Compara database to its configured Cactus alignments, and in another case a species in the HAL mapping was not present in the corresponding species tree. Tickets have been created proposing to develop a datatcheck for each of these issues (ENSCOMPARASW-5088 and ENSCOMPARASW-6834, respectively).

The datacheck which would be added by this PR tests both cases.

Scope of the pull request

This PR adds a Compara datacheck (CactusMetadataConsistency) which tests that for each Cactus MLSS in a Compara database:

  • a HAL mapping is present in the database;
  • a corresponding species tree is present in the database;
  • each genome_db_id in the HAL mapping is present in the database, is current, and can be found in the species tree.

WARNING: like #557, this PR moves the CompareVariationRows datacheck index entry so that it is placed alphabetically.

Testing

The datacheck was tested successfully on 3 example Metazoa Compara databases. For further information on testing, please see ENSCOMPARASW-6834.

@thiagogenez thiagogenez self-requested a review September 1, 2023 09:36
@vinay-ebi vinay-ebi self-requested a review September 12, 2023 10:11
@vinay-ebi vinay-ebi merged commit db44364 into Ensembl:main Sep 13, 2023
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants