
Update filter examples and validate optional cases #227

Merged: 8 commits merged into master on Mar 19, 2020

Conversation

ml-evs (Member) commented Mar 13, 2020

This PR completes the automation of scraping the spec for examples, since they are now all parseable as of Materials-Consortia/OPTIMADE#263. It adds the optional filters as tests in the validator that are printed with less emphasis, and do not cause the validator to return a non-zero exit code.
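A minimal sketch of how reporting optional tests without affecting the exit code could work. The function and result shape below are illustrative assumptions, not the validator's actual API: the key point is that failures of optional filter tests are printed as warnings only, while mandatory failures drive the non-zero exit code.

```python
def summarize(results):
    """Aggregate validator test results into (exit_code, report_lines).

    `results` is a list of (description, passed, optional) tuples.
    Optional failures are reported with less emphasis and never
    change the exit code; mandatory failures set it to 1.
    (Hypothetical sketch, not the actual optimade validator code.)
    """
    exit_code = 0
    lines = []
    for description, passed, optional in results:
        if passed:
            lines.append(f"PASS  {description}")
        elif optional:
            # Printed with less emphasis; does not affect the exit code.
            lines.append(f"WARN  (optional) {description}")
        else:
            lines.append(f"FAIL  {description}")
            exit_code = 1
    return exit_code, lines
```

With this split, a CI job running the validator only fails on mandatory-filter breakage, while optional-filter gaps remain visible in the log.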

@ml-evs ml-evs requested a review from CasperWA March 13, 2020 14:25
codecov bot commented Mar 13, 2020

Codecov Report

Merging #227 into master will decrease coverage by 0.23%.
The diff coverage is 69.81%.

@@            Coverage Diff             @@
##           master     #227      +/-   ##
==========================================
- Coverage   87.57%   87.33%   -0.24%     
==========================================
  Files          43       43              
  Lines        1916     1943      +27     
==========================================
+ Hits         1678     1697      +19     
- Misses        238      246       +8     
Flag                                     Coverage Δ
#unittests                               87.33% <69.81%> (-0.24%) ⬇️

Impacted Files                           Coverage Δ
optimade/validator/validator.py          67.81% <60.97%> (+0.43%) ⬆️
optimade/validator/data/__init__.py      100.00% <100.00%> (ø)

Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 16917b2...6715bfc.

@ml-evs ml-evs force-pushed the ml-evs/optional_filter_tasks branch from 304241e to 7598a8b Compare March 13, 2020 19:44
CasperWA (Member) left a comment:

Good work, thanks @ml-evs !

I have some minor suggested changes.
The code is generally very practical, which is fine by me. But the number of if-else statements is increasing rapidly, which tells me there may be a better design for the whole thing. However, that is definitely for a later point, when we're all bored and want to do something "fun" 😅

Review comments (now resolved) on: optimade/validator/data/__init__.py, optimade/validator/validator.py, tasks.py, tests/server/test_server_validation.py
@ml-evs ml-evs force-pushed the ml-evs/optional_filter_tasks branch from 7598a8b to ad5c62d Compare March 17, 2020 12:57
ml-evs and others added 7 commits March 17, 2020 13:05
Co-authored-by: Casper Welzel Andersen <CasperWA@users.noreply.github.com> (×4)
@ml-evs ml-evs force-pushed the ml-evs/optional_filter_tasks branch from bb41c65 to 6715bfc Compare March 17, 2020 13:06
ml-evs (Member, Author) commented Mar 17, 2020

> Good work, thanks @ml-evs !
>
> I have some minor suggested changes.
> The code is generally very practical, which is fine by me. But the number of if-else statements is increasing rapidly, which tells me there may be a better design for the whole thing. However, that is definitely for a later point, when we're all bored and want to do something "fun" 😅

Yes, the important thing to consider now is validating the results properly (beyond checking that the response is in the right format)... this has been useful to catch some problems with the filterparser and transformer, but it's pretty meaningless if all we're doing is querying non-existent fields!

I'm not entirely sure how we should go about doing this. We could potentially just find one structure in the API and use that to construct a load of queries that should return at least that structure, and kind of trawl and validate that way. Hopefully this can be guided more by other implementations!

@ml-evs ml-evs requested a review from CasperWA March 17, 2020 13:10
CasperWA (Member) commented:
> Yes, the important thing to consider now is validating the results properly (beyond checking that the response is in the right format)... this has been useful to catch some problems with the filterparser and transformer, but it's pretty meaningless if all we're doing is querying non-existent fields!

True. We need more real-world implementations as testbeds.
I think we can and should use providers.optimade.org as a testbed for the index meta-database, and then I am drawing a blank for the regular server. The best approach is probably still our own setup? Either running a server locally or querying heroku.

> I'm not entirely sure how we should go about doing this. We could potentially just find one structure in the API and use that to construct a load of queries that should return at least that structure, and kind of trawl and validate that way. Hopefully this can be guided more by other implementations!

I don't think scraping the spec further will end in a fruitful result. Rather, we should either re-use some queries/model structures from the other OPTIMADE repository tests and/or use real-world implementations and/or have the consortium come up with a list of queries/structures that can be used. The latter would also solidify this validator as the "official" OPTIMADE implementation validator to be used.

I don't know if I misunderstood your comment in answering here, but I think my comments are still valid in their own right 😅

ml-evs (Member, Author) commented Mar 17, 2020

> I think we can and should use providers.optimade.org as a testbed for the index meta-database, and then I am drawing a blank for the regular server. The best approach is probably still our own setup? Either running a server locally or querying heroku.

100% agree

> I don't think scraping the spec further will end in a fruitful result. Rather, we should either re-use some queries/model structures from the other OPTIMADE repository tests and/or use real-world implementations and/or have the consortium come up with a list of queries/structures that can be used. The latter would also solidify this validator as the "official" OPTIMADE implementation validator to be used.
>
> I don't know if I misunderstood your comment in answering here, but I think my comments are still valid in their own right 😅

It's not the point I was making, but I do agree! In order to avoid getting people to put random data into their databases so that it can be queried, my suggestion was more to find any structure in the database (i.e. /structures?page_limit=1), then perform queries that should definitely return that structure, i.e. query for its ID, query for formula, query on its relationships etc. in order to check that the underlying mechanisms are working.

The spec scraping stuff is only really useful to check what the database says it has implemented, and what causes backend errors.
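The "find any structure, then query for it" idea above can be sketched as a small helper that derives filter strings from whatever entry the endpoint returns first. The function name, the chosen attributes, and the entry shape are illustrative assumptions for a JSON:API-style OPTIMADE resource object, not the validator's actual implementation:

```python
def filters_from_entry(entry):
    """Build OPTIMADE filter strings that should each match `entry`.

    `entry` is one resource object, e.g. the first result of
    /structures?page_limit=1. Each returned filter queries a value
    taken directly from the entry, so a correct server must return
    at least that entry. (Hypothetical sketch for illustration.)
    """
    attrs = entry.get("attributes", {})
    filters = [f'id="{entry["id"]}"']
    if "chemical_formula_reduced" in attrs:
        filters.append(
            f'chemical_formula_reduced="{attrs["chemical_formula_reduced"]}"'
        )
    if "nelements" in attrs:
        filters.append(f'nelements={attrs["nelements"]}')
    return filters
```

Each filter would then be sent back to the implementation, checking that the known structure appears in the response, which exercises the filter parser and backend query machinery against fields the database actually has.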

ml-evs (Member, Author) commented Mar 17, 2020

We should leave this to an issue discussing the future of the validator though :)

CasperWA (Member) commented:

> (...) In order to avoid getting people to put random data into their databases so that it can be queried, my suggestion was more to find any structure in the database (i.e. /structures?page_limit=1), then perform queries that should definitely return that structure, i.e. query for its ID, query for formula, query on its relationships etc. in order to check that the underlying mechanisms are working.

Ah. That would be a good check of the filter working, I guess, especially if using NOT to check that a structure is not returned, e.g., in case an implementation just disregards the filter and returns everything regardless of input.
However, it also seems incomplete and not completely rigorous as a testing method. I think we'll have to come up with a robust algorithm/logic for testing this.
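The NOT check suggested here can be sketched as a single sanity test: a server that ignores filters and returns everything would still include the known entry under a `NOT id=...` filter. The callable `ids_matching` is a hypothetical stand-in for however the validator fetches the ids matched by a filter:

```python
def not_filter_check(entry, ids_matching):
    """Check that the server actually applies filters.

    `entry` is a known resource object (with an "id" key) and
    `ids_matching(filter_str)` returns the ids the server matches
    for a given OPTIMADE filter (assumed helper, for illustration).
    A query for NOT id="<known id>" must not return that entry;
    an implementation that disregards the filter would fail here.
    """
    returned_ids = ids_matching(f'NOT id="{entry["id"]}"')
    return entry["id"] not in returned_ids
```

As the comment above notes, this only catches the grossest failure mode (a filter that is silently ignored); a rigorous test suite would need a more systematic set of positive and negative queries.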

I would even go so far as to say that it's the implementation's own responsibility to ensure their system works properly... However, I definitely see the benefit of having a central "official" tester to make sure it works generally as expected.

CasperWA (Member) left a comment:

Very good.
All of the txt files are already added to the MANIFEST in some way, right?

ml-evs (Member, Author) commented Mar 19, 2020

> All of the txt files are already added to the MANIFEST in some way, right?

Yep!

@ml-evs ml-evs merged commit 098aa32 into master Mar 19, 2020
@ml-evs ml-evs deleted the ml-evs/optional_filter_tasks branch March 19, 2020 01:52
@ml-evs ml-evs added this to In progress in Road to optimade-python-tools 1.0 via automation Mar 31, 2020
@ml-evs ml-evs moved this from In progress to Done in Road to optimade-python-tools 1.0 Mar 31, 2020
@CasperWA CasperWA mentioned this pull request Apr 22, 2020