Merge release/102 branch to master #299

james-monkeyshines · 2020-09-03T13:03:56Z

No description provided.

…t is not explicitly specified. This was being supplied in one part of this module (used by most datachecks), but not in another, when the registry is reloaded by a datacheck (as, for example, in a 'Compare*' datacheck that connects to the metadata database).

Fix for 'Compare*' datachecks in databases without species_id=1

…eed to force the core database to be loaded into the registry (it's lazy-loaded, but for reasons that aren't clear, it isn't triggered as expected by the calls to the gene adaptor). This is not an issue when running within a pipeline, because the registry will automatically have been loaded in that case.

Force load of core db in GeneBiotypes, when run for core-like dbs

length() is only useful to correct the length of circular slices when start is greater than end. Here start is guaranteed to be less than or equal to end, and length() involves additional queries to get the seq_region_id and check whether the slice is circular.
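The reasoning in this commit message can be illustrated with a small sketch. It is not the Ensembl Perl code: `slice_length` and `simple_length` are hypothetical names, and the wrap-around correction (adding the region length when start is greater than end on a circular region) is an assumption about how such a correction typically works. It only shows why the plain subtraction is sufficient once start is known not to exceed end.

```python
def slice_length(start: int, end: int, seq_region_length: int,
                 is_circular: bool = False) -> int:
    """Length of a 1-based, inclusive slice (hypothetical helper)."""
    length = end - start + 1
    # Only a circular slice that wraps past the origin has start > end;
    # adding the region length corrects the negative difference.
    if start > end and is_circular:
        length += seq_region_length
    return length


def simple_length(start: int, end: int) -> int:
    """Shortcut that is valid only when start <= end (hypothetical helper)."""
    if start > end:
        raise ValueError("start must not exceed end")
    return end - start + 1


if __name__ == "__main__":
    # Same answer for an ordinary slice, with no circularity lookup needed.
    print(slice_length(10, 20, 100), simple_length(10, 20))
    # A wrapping slice on a circular region of length 100.
    print(slice_length(95, 5, 100, is_circular=True))
```

In this sketch the shortcut avoids needing `seq_region_length` and `is_circular` at all, which mirrors the saving in queries that the commit describes.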

coveralls · 2020-09-03T13:17:07Z

Pull Request Test Coverage Report for Build 1400

26 of 27 (96.3%) changed or added relevant lines in 3 files are covered.
No unchanged relevant lines lost coverage.
Overall coverage decreased (-0.02%) to 95.959%

Changes Missing Coverage	Covered Lines	Changed/Added Lines	%
lib/Bio/EnsEMBL/DataCheck/DbCheck.pm	9	10	90.0%

Totals
Change from base Build 1382:	-0.02%
Covered Lines:	1852
Relevant Lines:	1930

💛 - Coveralls
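As a quick sanity check, the percentages in the report above can be recomputed from its covered and relevant line counts. This is only an arithmetic sketch; `coverage_percent` is a hypothetical helper, not part of Coveralls.

```python
def coverage_percent(covered: int, relevant: int, digits: int) -> float:
    """Percentage of relevant lines that are covered, rounded to `digits` places."""
    return round(100 * covered / relevant, digits)


# Figures taken from the report: overall totals, changed lines, and DbCheck.pm.
overall = coverage_percent(1852, 1930, 3)
changed = coverage_percent(26, 27, 1)
dbcheck = coverage_percent(9, 10, 1)

if __name__ == "__main__":
    print(overall, changed, dbcheck)
```

The three values should match the 95.959%, 96.3% and 90.0% figures quoted in the report.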

vinay-ebi and others added 30 commits June 10, 2020 14:52

Added new module DataCheckTapToJson to convert datacheck tap output t…

494caee

…o json to display as html through datacheck service

parse_result.pl code move to DatacheckTapToJson module

7915339

Updating test dbs to latest schema

dd6c1a7

Fix typo in method name

87e66de

Merge pull request #263 from Ensembl/bugfix/method_name_fix

c588924

Fix typo in method name

Remove skeleton datachecks for unimplemented functionality.

11b364b

remove known descriptions

a44b219

rephrase sql check

669e44b

To account for assembly mappings that involve old contig-only assembl…

053d6dd

…ies, the SQL in one test needs to be updated to count distinct versions, not seq_region names.

New compara dc to check species_trees

e5a6a52

Handle partial population of the assembly table, where some seq_regio…

c581851

…ns of a particular coord_system are components, and others are unassembled top-level regions.

Merge pull request #265 from ima23/remove_pheno

bdbe1a8

remove known phenotype descriptions

Restore left outer join on assembly, needed for lrg/patch regions.

7a4a8d6

Merge pull request #268 from Ensembl/bugfix/seqlevel_toplevel

da6d46b

Fixes for two seq_region-related datachecks

Compatible to embed in compara pipelines & move species_tree fk

f978713

is_ehive_db test added

97dab41

New datacheck to detect assembly changes that are not accompanied by …

a26f61a

…an updated meta_key

Exclude LRG regions; fix typo

c74d9b6

Merge pull request #270 from Ensembl/feature/assembly_geneset_change

3698596

Detect assembly/geneset change

Merge pull request #271 from Ensembl/bugfix/species_id

8326bae

Fix for 'Compare*' datachecks in databases without species_id=1

Create new datacheck groups for xref-related pipelines. Disable Displ…

ea3cc84

…ayXrefExists for non-vertebrates. Reduce threshold for reporting differences with previous version, from 80% to 66% - too noisy otherwise, which makes it hard to spot significant problems.
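A minimal sketch of what lowering the threshold means in practice. It assumes the threshold is the minimum acceptable ratio of the current count to the previous version's count. That interpretation, and the `should_report` name, are illustrative assumptions rather than the datacheck's actual code.

```python
def should_report(previous: int, current: int, threshold: float = 0.66) -> bool:
    """Return True when the current count has dropped below threshold * previous."""
    if previous == 0:
        # Nothing to compare against, so no difference is reported.
        return False
    return current / previous < threshold


if __name__ == "__main__":
    # A drop to 70% of the previous count was reported at 0.80,
    # but is tolerated at 0.66.
    print(should_report(1000, 700, threshold=0.80))
    print(should_report(1000, 700, threshold=0.66))
```

Under this assumption, the lower threshold suppresses moderate fluctuations so that only larger drops are flagged.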

species.db_name must be all lower-case

3ff2b31

cmp_rows to is_rows

7b21af0

Co-authored-by: Matthieu Muffato <muffato@ebi.ac.uk>

Remove incorrect fk check

f01d7ef

Co-authored-by: Matthieu Muffato <muffato@ebi.ac.uk>

Merge pull request #272 from Ensembl/feature/pipeline_xref_datacheck

291776e

Create new datacheck groups for xref-related pipelines.

Merge pull request #273 from Ensembl/bugfix/db_name_format

f0f081f

species.db_name must be all lower-case

Rename is_ehive_db to is_compara_ehive_db

ae72a71

Merge pull request #275 from Ensembl/bugfix/gene_biotype_of

f9563a8

Force load of core db in GeneBiotypes, when run for core-like dbs

muffato and others added 25 commits August 20, 2020 14:51

It's the species_set_header that holds the auto-increment id of speci…

1b1cdfa

…es-sets

The species-set table has to be identical, not a subset

5510316

The _tag tables work the other way: everything that is in the master …

da2e3f9

…database must be in the tested one

Be explicit about the DC being critical

02391fd

List the one table it needs, so that it can be appropriately skipped

7bc10ab

This is a critical check

e462bf6

Added more groups

c90a910

Missing homology MLSS check from PR #283

7e045d6

Merge pull request #292 from CristiGuijarro/fix/compara_homology_mlss

d23352f

Missing homology MLSS check from PR #283

Missing MLSS checks from PR #292

2080d93

Merge pull request #293 from CristiGuijarro/fix/compara_homology_mlss

efeea80

Missing MLSS checks from PR #292

Use conventional compara_master name

3ae697d

Merge compara_datachecks branch to resolve conflicts

a90f277

Added more groups

05c4b3a

Be explicit about the DC being critical

e372c7f

bugfix: POLYPLOID should be included here, like in skip_tests above

39a8a93

Use the new Compara method to fetch the slices efficiently and with a…

0890c91

… low memory footprint - Dnafrags too can be batched together. - This works for polyploid components too.

Disconnect from this species' database to avoid reaching hundreds of …

03e830a

…connections

Better documentation

878f7bb

Merge pull request #294 from Ensembl/feature/fix_conflicts

f532e30

Cross-database compara datachecks + conflict resolution

CheckHomologyMLSS to include ENSEMBL_PROJECTIONS

6d0dbb8

Temporary comment out of families check since not in >e102 compara

914b82a

Merge pull request #297 from CristiGuijarro/fix/compara_dcs_e102

07e3098

Fix/compara dcs for e102

Resolve merge conflicts

51fd1bc

james-monkeyshines requested a review from marcoooo September 3, 2020 13:04

marcoooo approved these changes Sep 3, 2020

james-monkeyshines merged commit 4ec7541 into master Sep 3, 2020

james-monkeyshines deleted the feature/merge_102_to_master branch September 3, 2020 13:13
