Added a check to test the UNIQUE KEYs #317

muffato · 2020-10-07T21:29:00Z

See https://www.ebi.ac.uk/panda/jira/browse/ENSCORESW-3564 : UNIQUE KEYs that comprise a column that is NULL-able allow duplicated rows if they contain one NULL.
This is not something we want in the Compara schema, and it's caused some issues with the theobroma_cacao renaming this release, so here is a new DC to check those. It's structured the same way as ForeignKeysCompara and is quite generic. It could probably be applied onto other database types, should people need to.

coveralls · 2020-10-07T21:35:01Z

Pull Request Test Coverage Report for Build 1471

0 of 0 changed or added relevant lines in 0 files are covered.
No unchanged relevant lines lost coverage.
Overall coverage remained the same at 95.867%

Totals
Change from base Build 1469:	0.0%
Covered Lines:	1879
Relevant Lines:	1960

💛 - Coveralls

lib/Bio/EnsEMBL/DataCheck/Checks/UniqueKeysCompara.pm

james-monkeyshines

This looks fine, and I take you point about having methods to parse SQL. However, was there a reason why you didn't query the mysql schema to get the unique keys? I think that'd tend to be a bit more robust, e.g.

select
  table_name,
  index_name,
  group_concat(column_name order by seq_in_index) as index_columns,
  index_type
from
  information_schema.statistics
where
  table_schema = database() and
  non_unique = 0 and
  index_name <> 'PRIMARY'
group by table_name, index_name, index_type
order by index_name;

muffato · 2020-10-26T09:49:59Z

Yes, good point, I could have queried the list of unique keys from the schema itself. I think I was worried the database wouldn't have the correct schema in some situations. We had an issue last release with MySQLTransfer because it copies the data and the schema, and our source database was on an older version of the schema, so I preferred taking table.sql as the truth.

Added a check to test the UNIQUE KEYs

de45c6b

muffato requested a review from a team October 7, 2020 21:29

CristiGuijarro reviewed Oct 8, 2020

View reviewed changes

lib/Bio/EnsEMBL/DataCheck/Checks/UniqueKeysCompara.pm Show resolved Hide resolved

CristiGuijarro approved these changes Oct 8, 2020

View reviewed changes

lib/Bio/EnsEMBL/DataCheck/Checks/UniqueKeysCompara.pm Show resolved Hide resolved

james-monkeyshines self-requested a review October 21, 2020 11:13

james-monkeyshines approved these changes Oct 21, 2020

View reviewed changes

james-monkeyshines merged commit f4df3d1 into release/103 Oct 23, 2020

james-monkeyshines deleted the feature/check_unique_keys branch October 23, 2020 12:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added a check to test the UNIQUE KEYs #317

Added a check to test the UNIQUE KEYs #317

muffato commented Oct 7, 2020

coveralls commented Oct 7, 2020

james-monkeyshines left a comment

muffato commented Oct 26, 2020

Added a check to test the UNIQUE KEYs #317

Added a check to test the UNIQUE KEYs #317

Conversation

muffato commented Oct 7, 2020

coveralls commented Oct 7, 2020

Pull Request Test Coverage Report for Build 1471

💛 - Coveralls

james-monkeyshines left a comment

Choose a reason for hiding this comment

muffato commented Oct 26, 2020