Add database schema and validation for simulation logs #2048

jonrkarr · 2021-01-26T22:35:35Z

An endpoint analogous to the simulator/validate endpoint would be useful. Then we can use this to create a test case within the simulators test suite which checks that simulation tools produce valid logs.

bilalshaikh42 · 2021-01-26T23:10:44Z

Currently the database is not validating the structure of the logs at all, assuming that the simulators will be producing a compliant output. This is similar to the results endpoints.

We could have the database perform this validation, but we would then need to define the database schemas for the log components

jonrkarr · 2021-01-26T23:25:21Z

I agree with assuming that simulators produce valid results. HDF5 is quite different than JSON/YAML, and the results are heavily validated by the test suite. Results are the basis for most of the test cases.

I could also put validation for logs in biosimulators-test-suite. This could be done using the JSON schema version of the schema for the logs. But I think its easier to implement that in this repo, and treat this repo as holding the primary definition of the log format. This consolidates the definition of all JSON schemas and their validation into this repo.

bilalshaikh42 · 2021-01-28T22:02:50Z

Should this be closed/ moved to the simulators repo?

jonrkarr · 2021-02-26T15:39:52Z

I think it would be best to implement a database schema for this. The test suite can check if simulation tools produce valid logs for a few examples, but its difficult to verify that a simulation tool will always produce valid logs.

I can make the test suite use the JSONSchema description of the schema to check that simulation tools produce valid logs (for a small set of examples)

jonrkarr · 2021-02-28T05:02:34Z

I looked into using the JSONSchema version of the OpenAPI spec to validate that simulators produce valid logs within the simulators test suite. This could be done, but the JSONSchema doesn't recognize nullable. Another way to look at it is that the OpenAPI spec isn't being translated into a 100% compatible JSONSchema due to the fact that the OpenAPI spec is broader than JSONschema.

One option is for us to explicitly define null as a valid type everywhere we use nullable = true in the definition of our NestJS/Swagger API schemas. Then our OpenAPI spec could likely be translated into a functionally equalivalent JSONSchema, which we could use to validate simulators in the simulators test suite.

I think the better path is:

Define a database schema for execution logs
Use this schema to provide a validation endpoint similar to the simulator specs validation endpoint

bilalshaikh42 · 2021-02-28T13:54:36Z

I remember there were quite a few discussion about this on the open api repo a few months ago. As far as I remember, open api and Jason schema are now fully compatible as of the latest versions. I will look into this more, but I seem to remember null being added to json schema at some point. Perhaps the libraries we are using do not have the latest versions implemented.

…

On Sun, Feb 28, 2021, 12:02 AM Jonathan Karr ***@***.***> wrote: I looked into using the JSONSchema version of the OpenAPI spec to validate that simulators produce valid logs within the simulators test suite. This could be done, but the JSONSchema doesn't recognize nullable. Another way to look at it is that the OpenAPI spec isn't being translated into a 100% compatible JSONSchema due to the fact that the OpenAPI spec is broader than JSONschema. One option is for us to explicitly define null as a valid type everywhere we use nullable = true in the definition of our NestJS/Swagger API schemas. Then our OpenAPI spec could likely be translated into a functionally equalivalent JSONSchema, which we could use to validate simulators in the simulators test suite. I think the better path is: - Define a database schema for execution logs - Use this schema to provide a validation endpoint similar to the simulator specs validation endpoint — You are receiving this because you modified the open/close state. Reply to this email directly, view it on GitHub <#2048 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AHX4FIFG4NGMTWFGKPMTLC3TBHE7LANCNFSM4WUHFYDQ> .

jonrkarr · 2021-02-28T15:49:04Z

I think the next versions of OpenAPI (3.1) and JSON Schema (Draft 4) are supposed to be compatible.

The current version of JSON Schema supports null type, but it doesn't recognize nullable. The library we're using to convert OpenAPI to JSON Schema (https://github.com/openapi-contrib/openapi-schema-to-json-schema) is supposed to convert nullable to oneOf(..., {"type": "null"}), but this doesn't happen. I haven't inspected carefully. It could be that the OpenAPI specification doesn't have nullable everywhere we expect, which prevents the OpenAPI spec from being automatically converted to a JSON Schema as we expect.

If we explored this, we could probably figure out how to get our dependencies to generate an OpenAPI spec with nullable=True where we expect and then hopefully this would get converted to JSON Schema as we expect.

jonrkarr added the enhancement New feature or request label Jan 26, 2021

jonrkarr assigned bilalshaikh42 Jan 26, 2021

bilalshaikh42 closed this as completed Feb 16, 2021

jonrkarr reopened this Feb 26, 2021

jonrkarr closed this as completed Feb 26, 2021

jonrkarr reopened this Feb 26, 2021

jonrkarr mentioned this issue Feb 26, 2021

Use JSON schema for logs to check that logs are formatted correctly biosimulators/Biosimulators_test_suite#30

Open

jonrkarr changed the title ~~Add an endpoint to the dispatch API to validate a simulation log~~ Add database schema for simulation logs Feb 26, 2021

jonrkarr added future Features to implement in future and removed enhancement New feature or request labels Sep 28, 2021

jonrkarr mentioned this issue Oct 16, 2021

Simulation run logs should contain proper KISAO ids virtualcell/vcell#108

Closed

jonrkarr added enhancement New feature or request and removed future Features to implement in future labels Oct 22, 2021

jonrkarr assigned jonrkarr and unassigned bilalshaikh42 Oct 22, 2021

jonrkarr changed the title ~~Add database schema for simulation logs~~ Add database schema and validation for simulation logs Oct 22, 2021

jonrkarr mentioned this issue Oct 22, 2021

feat(api): added database model and validation endpoint for simulation run logs #3361

Merged

jonrkarr closed this as completed in #3361 Oct 26, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add database schema and validation for simulation logs #2048

Add database schema and validation for simulation logs #2048

jonrkarr commented Jan 26, 2021

bilalshaikh42 commented Jan 26, 2021

jonrkarr commented Jan 26, 2021

bilalshaikh42 commented Jan 28, 2021

jonrkarr commented Feb 26, 2021

jonrkarr commented Feb 28, 2021

bilalshaikh42 commented Feb 28, 2021 via email

jonrkarr commented Feb 28, 2021

Add database schema and validation for simulation logs #2048

Add database schema and validation for simulation logs #2048

Comments

jonrkarr commented Jan 26, 2021

bilalshaikh42 commented Jan 26, 2021

jonrkarr commented Jan 26, 2021

bilalshaikh42 commented Jan 28, 2021

jonrkarr commented Feb 26, 2021

jonrkarr commented Feb 28, 2021

bilalshaikh42 commented Feb 28, 2021 via email

jonrkarr commented Feb 28, 2021