Do the benchmark on a more realistic schema #11
Comments
I was thinking of machine-generating a bigger schema from the schemas in the official tests, so that the benchmark touches as many parts of the spec as possible and hits any troublesome parts of a validator.
OK, so the bad news is I don't have as much time as I'd like, but the JSCK benchmarks use five schemas, and you're more than welcome to use them. Here's a categorization of those five schemas:
So you can use them to benchmark a range of complexities and draft support.
Realistic benchmark idea: validating the draft 4 schema with the draft 4 schema.
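The self-validation idea above can be illustrated with a minimal sketch. Everything here is hypothetical: a trimmed fragment standing in for the real draft-4 meta-schema, and a toy validator that supports only the "type" and "properties" keywords, nowhere near a full draft-4 implementation.

```python
# Illustrative only: META_FRAGMENT is a hand-trimmed stand-in for the
# draft-4 meta-schema, and validate() supports just two keywords.

META_FRAGMENT = {
    "type": "object",
    "properties": {
        "type": {},
        "properties": {"type": "object"},
        "title": {"type": "string"},
    },
}

TYPE_CHECKS = {
    "object": lambda v: isinstance(v, dict),
    "string": lambda v: isinstance(v, str),
    "array": lambda v: isinstance(v, list),
}

def validate(instance, schema):
    # Check the "type" keyword, if present.
    t = schema.get("type")
    if t is not None and not TYPE_CHECKS[t](instance):
        return False
    # Recurse into declared properties that the instance actually has.
    for key, subschema in schema.get("properties", {}).items():
        if isinstance(instance, dict) and key in instance:
            if not validate(instance[key], subschema):
                return False
    return True

# The fragment is an object whose "properties" value is an object,
# so it passes validation against itself.
print(validate(META_FRAGMENT, META_FRAGMENT))  # → True
```

The appeal of this benchmark is that the meta-schema is a realistic, nested, production-grade document, and every validator already ships with it.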
@ebdrup The more I think about it, the more I believe that using the test suite for performance benchmarking is completely wrong. With the exception of some really old and slow libraries, where slow validation may affect UX in the browser, validation performance only really matters for server-side validation. This is where the performance advantage achieved by compiling validators becomes important. Server-side validation passes 99+% of the time (excluding DOS attacks and/or broken API consumers, but performance is not going to help much in those cases). So I think that, to be relevant, the benchmark should only test passing cases on data samples of small/average/large size. The tests can still be used to benchmark compliance, separately from performance. What do you think?
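The proposal above (compile once, then time only passing cases on small/average/large payloads) could be sketched roughly like this. The schema, the "compilation" step, and the payload sizes are all made-up stand-ins, not any particular library's API.

```python
# Sketch of a passing-cases-only benchmark. compile_validator() is a
# toy stand-in for a real compiling validator: it does its setup work
# once, up front, and returns a closure that only runs the checks.
import timeit

def compile_validator(schema):
    required = schema.get("required", [])
    def validate(doc):
        return all(key in doc for key in required)
    return validate

SCHEMA = {"required": ["id", "name"]}
validate = compile_validator(SCHEMA)

# Hypothetical small/average/large payloads; all of them pass.
for label, n in [("small", 10), ("average", 1_000), ("large", 100_000)]:
    doc = {"id": 1, "name": "x", "payload": list(range(n))}
    assert validate(doc)  # benchmark passing cases only
    seconds = timeit.timeit(lambda: validate(doc), number=10_000)
    print(f"{label}: {seconds:.4f}s for 10k validations")
```

Note that with a compiled validator the cost per call barely depends on payload size unless the schema actually constrains the payload, which is part of the argument for benchmarking on realistic data.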
The current benchmark is run on a subset of the tests in the official JSON-schema test suite.
The results seem very much in line with what other stable benchmarks produce (like themis), and exercising as much of the standard as possible is good for catching slow corners in a validator. Still, it might be a good idea to run a benchmark on data that is more like what you see in production.
Perhaps a performance benchmark on one simple and one advanced schema.
To keep things simple for anyone wanting to choose a validator, the benchmark should still produce a single number for each validator, as it does now. The nitty-gritty performance details are for schema-validator authors to dig into in their own efforts.
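One way to collapse per-schema results into the single headline number proposed above is a geometric mean over ops/sec, which keeps one very fast or very slow schema from dominating the score. The validator names and throughput figures below are invented for illustration.

```python
# Sketch: one headline number per validator from per-schema ops/sec.
# All figures are made-up illustrative values, not real measurements.
from math import prod

results = {
    "validator-a": [120_000, 15_000, 4_000],
    "validator-b": [90_000, 22_000, 6_500],
}

def score(ops_per_schema):
    # Geometric mean: robust to one schema dominating the total.
    return prod(ops_per_schema) ** (1 / len(ops_per_schema))

for name, ops in results.items():
    print(f"{name}: {score(ops):,.0f} ops/sec (geometric mean)")
```

A plain sum of timings would instead let the slowest schema decide the ranking, which is one reason benchmark suites often prefer a geometric mean for composite scores.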