Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merge release/102 branch to master #299

Merged
merged 101 commits into from
Sep 3, 2020

Conversation

james-monkeyshines
Copy link
Contributor

No description provided.

vinay-ebi and others added 30 commits June 10, 2020 14:52
…o json to display as html through datacheck service
…ies, the SQL in one test needs to be updated to count distinct versions, not seq_region names.
…ns of a particular coord_system are components, and others are unassembled top-level regions.
remove known phenotype descriptions
Fixes for two seq_region-related datachecks
…t is not explicitly specified. This was being supplied in one part of this module (used by most datachecks), but not in another, when the registry is reloaded by a datacheck (as, for example, in a 'Compare*' datacheck that connect to the metadata database).
Fix for 'Compare*' datachecks in databases without species_id=1
…ayXrefExists for non-vertebrates. Reduce threshold for reporting differences with previous version, from 80% to 66% - too noisy otherwise, which makes it hard to spot significant problems.
Co-authored-by: Matthieu Muffato <muffato@ebi.ac.uk>
Co-authored-by: Matthieu Muffato <muffato@ebi.ac.uk>
Create new datacheck groups for xref-related pipelines.
species.db_name must be all lower-case
…eed to force the core database to be loaded into the registry (it's lazy-loaded, but for reasons that aren't clear, it isn't triggered as expected by the calls to the gene adaptor). This is not an issue when running within a pipeline, because the registry will automatically have been loaded in that case.
Force load of core db in GeneBiotypes, when run for core-like dbs
muffato and others added 25 commits August 20, 2020 14:51
length() is only useful to correct the length of circular slices when start
is greater than end. Here start is guaranteed to be lower or equal than end.

length() involves additional queries to get the seq_region_id and check
whether the slice is circular.
… low memory footprint

- Dnafrags too can be batched together.
- This works for polyploid components too.
Cross-database compara datachecks + conflict resolution
@james-monkeyshines james-monkeyshines merged commit 4ec7541 into master Sep 3, 2020
@james-monkeyshines james-monkeyshines deleted the feature/merge_102_to_master branch September 3, 2020 13:13
@coveralls
Copy link

Pull Request Test Coverage Report for Build 1400

  • 26 of 27 (96.3%) changed or added relevant lines in 3 files are covered.
  • No unchanged relevant lines lost coverage.
  • Overall coverage decreased (-0.02%) to 95.959%

Changes Missing Coverage Covered Lines Changed/Added Lines %
lib/Bio/EnsEMBL/DataCheck/DbCheck.pm 9 10 90.0%
Totals Coverage Status
Change from base Build 1382: -0.02%
Covered Lines: 1852
Relevant Lines: 1930

💛 - Coveralls

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

9 participants