Performance analysis for tests #4988
I completely agree. Fixed timeouts don't adjust for hardware performance differences, so even if there isn't noise in the system, they'll be too long on fast machines and too short on slow machines. I suspect that there will be too much variance for test suite timing to be useful. Also, we have a lot of sleeps in our tests that hide the true cost. Ideally we would have a benchmark suite that we could use for before/after comparisons on each pull request, but that would take a lot of work to create and maintain. If Etherpad is too slow, it's likely easier and more useful to add sharding, split into microservices, or just throw more hardware at the problem.
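For illustration, a before/after comparison on a pull request could be as simple as timing a few representative workloads on both the base and the PR checkout and flagging large regressions. A minimal sketch, assuming a hypothetical `BenchResult` shape and a 20% regression threshold (none of this exists in Etherpad today):

```ts
// Minimal before/after comparison sketch. The results would come from some
// hypothetical benchmark runner that measures named workloads (e.g.
// "append 1000 lines to a pad") and reports a median duration in ms.
type BenchResult = {name: string; medianMs: number};

function compare(before: BenchResult[], after: BenchResult[], maxRegression = 0.2): boolean {
  let ok = true;
  for (const b of before) {
    const a = after.find((r) => r.name === b.name);
    if (!a) continue;
    const change = (a.medianMs - b.medianMs) / b.medianMs;
    console.log(`${b.name}: ${b.medianMs}ms -> ${a.medianMs}ms (${(change * 100).toFixed(1)}%)`);
    if (change > maxRegression) ok = false; // more than 20% slower: flag it
  }
  return ok;
}
```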
With 4ad80d4 (from PR #4987) a lot of short timeouts were removed from the frontend tests, which helps with test stability. However, in the past, at least that's my impression, those timeouts were thought of as a kind of guard against performance degradations, i.e. if something in the editor caused slowness, it was hoped that the short timeouts would catch it.
I don't think this assumption was entirely correct. Very little code is guaranteed to be processed immediately, and when a test fails due to a timeout we cannot be sure whether we introduced some slowness or whether a browser, Selenium, or Mocha change is responsible. Besides, timeouts are set for a whole test or suite, and a test runs many lines of code involving the DOM, events, network I/O, etc., so we cannot tell which code is running too slowly.
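To illustrate why a timeout can't pinpoint the slow step: a Mocha timeout applies to a whole test (or, when set on a suite, to each test in it), so a failure only says the test as a whole exceeded its budget. A sketch in the style of a frontend test (the `openPad`, `typeText`, and `waitForSync` helpers are made up for illustration):

```ts
describe('pad editing', function () {
  // Applies to every test in this suite; a timeout failure does not say
  // whether the page load, the DOM interaction, or the network round trip
  // was the slow part.
  this.timeout(5000);

  it('inserts text', async function () {
    this.timeout(2000); // overrides the suite timeout for this test only
    await openPad();         // page load + socket connection
    await typeText('hello'); // DOM events
    await waitForSync();     // network I/O
    // If the 2s budget is exceeded, Mocha reports a single timeout error
    // with no indication of which of the three steps regressed.
  });
});
```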
Often this led to commits that simply increased timeouts, because the code turned out to be fast in some browsers and slightly slower in others, which again contradicts the assumption that timeout-induced test failures tell us something about performance regressions.
I think we should instead start analyzing the build logs on a per-browser basis and visualize the duration of every single test and/or use some simple average. It would be great if this analysis were done as a separate job, so that instead of a test failing due to a timeout, the performance analysis job would fail. For a start, however, it would be enough to somehow visualize the deviation from the average/expectation.
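As a sketch of what such a separate analysis job could look like: Mocha's JSON reporter (`--reporter json`, redirected to a file) emits per-test durations, which a small script could compare against a stored per-browser baseline of average durations and fail the job when a test deviates too far. The baseline file and the 50% threshold here are assumptions, not an existing Etherpad setup:

```ts
import * as fs from 'fs';

// Shape of the relevant parts of Mocha's JSON reporter output.
interface MochaJson {
  tests: {fullTitle: string; duration?: number}[];
}

// Hypothetical baseline: average duration per test for one browser,
// e.g. aggregated from previous CI runs and stored as a build artifact.
type Baseline = Record<string, number>;

function analyze(reportPath: string, baselinePath: string, maxDeviation = 0.5): boolean {
  const report: MochaJson = JSON.parse(fs.readFileSync(reportPath, 'utf8'));
  const baseline: Baseline = JSON.parse(fs.readFileSync(baselinePath, 'utf8'));
  let ok = true;
  for (const test of report.tests) {
    const avg = baseline[test.fullTitle];
    if (avg === undefined || test.duration === undefined) continue;
    const deviation = (test.duration - avg) / avg;
    if (deviation > maxDeviation) {
      console.log(`SLOW: ${test.fullTitle}: ${test.duration}ms vs avg ${avg}ms ` +
          `(+${(deviation * 100).toFixed(0)}%)`);
      ok = false;
    }
  }
  return ok;
}

// Fail the analysis job (not the test run itself) when regressions are found.
if (!analyze('mocha-report.json', 'baseline-chrome.json')) process.exit(1);
```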
Update:
IMO this is useful for both backend and frontend tests.