Performance regression testing using K6. Perfkit tool #9553

Merged
merged 50 commits into develop from ro/regression-performance-tests on Aug 15, 2025

Conversation

@orazvaliev (Contributor) commented Jun 20, 2025

Motivation and context

This PR introduces performance regression testing infrastructure using K6 and a custom Python CLI tool called "perfkit" for managing and comparing K6 performance baselines. The implementation includes Docker-based test execution, metrics monitoring, and baseline comparison capabilities.

Key changes:

  • Adds a perfkit Python package with CLI commands for running golden baseline tests and regression comparisons (a rough sketch of such a CLI follows this list)
  • Implements K6 test scripts for performance testing, with warmup and task regression scenarios
  • Provides a Docker Compose configuration for K6, Prometheus, and Grafana integration
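
As context for reviewers, a rough, hypothetical sketch of what such a CLI could look like is shown below. This is not the actual perfkit implementation: the command names (golden, compare), the use of k6 run --summary-export, the single http_req_duration p(95) check, and the 10% tolerance are assumptions for illustration; only the general run-store-compare idea comes from the description above. Typer is assumed because the review discussion further down refers to it.

# Hypothetical sketch only; not the actual perfkit code. Command names,
# the k6 invocation, and the 10% tolerance are illustrative assumptions.
import json
import subprocess
from pathlib import Path

import typer

app = typer.Typer(help="Run K6 tests and compare results against a golden baseline.")


@app.command()
def golden(script: Path, baseline: Path = Path("baseline.json")) -> None:
    """Run a K6 script and save its end-of-test summary as the golden baseline."""
    subprocess.run(["k6", "run", "--summary-export", str(baseline), str(script)], check=True)
    typer.echo(f"Baseline written to {baseline}")


@app.command()
def compare(script: Path, baseline: Path = Path("baseline.json")) -> None:
    """Run a K6 script and compare its summary against the stored baseline."""
    current = Path("current.json")
    subprocess.run(["k6", "run", "--summary-export", str(current), str(script)], check=True)
    old = json.loads(baseline.read_text())["metrics"]["http_req_duration"]["p(95)"]
    new = json.loads(current.read_text())["metrics"]["http_req_duration"]["p(95)"]
    if new > old * 1.10:  # flag regressions beyond a 10% slowdown (arbitrary threshold)
        typer.echo(f"Regression: http_req_duration p(95) {old:.1f} ms -> {new:.1f} ms")
        raise typer.Exit(code=1)
    typer.echo("No regression detected.")


if __name__ == "__main__":
    app()

The real tool layers Docker-based execution and Prometheus/Grafana monitoring on top of this basic run-and-compare loop.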

How has this been tested?

Checklist

  • I submit my changes into the develop branch
  • I have created a changelog fragment
  • I have updated the documentation accordingly
  • I have added tests to cover my changes
  • I have linked related issues (see GitHub docs)

License

  • I submit my code changes under the same MIT License that covers the project.
    Feel free to contact the maintainers if that's a concern.

@codecov-commenter commented Jun 20, 2025

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 82.32%. Comparing base (0c3b8b6) to head (aeba08d).

Additional details and impacted files
@@             Coverage Diff             @@
##           develop    #9553      +/-   ##
===========================================
+ Coverage    73.57%   82.32%   +8.75%     
===========================================
  Files          409      464      +55     
  Lines        44882    47851    +2969     
  Branches      4056     4056              
===========================================
+ Hits         33022    39394    +6372     
+ Misses       11860     8457    -3403     
Components      Coverage Δ
cvat-ui         77.29% <ø> (+<0.01%) ⬆️
cvat-server     86.20% <ø> (+15.85%) ⬆️

@archibald1418 (Contributor)

@orazvaliev you are invited to fix the remaining linting errors locally

@nmanovic requested a review from Copilot on August 6, 2025 20:59
@Copilot (Copilot AI) left a comment

Reviewed Changes

Copilot reviewed 23 out of 26 changed files in this pull request and generated 11 comments.

Summary per file:
  • tests/perf/setup.py: Package setup for perfkit CLI tool
  • tests/perf/perfkit/*.py: Core perfkit modules for test execution, metrics handling, and baseline management
  • tests/perf/scripts/: K6 test scripts and API libraries for performance testing
  • tests/perf/docker-compose*.yml: Docker configuration for performance testing stack
  • tests/perf/README.md: Documentation for machine preparation and tool usage
  • tests/package.json: Added K6 types dependency and linting configuration

if metric_name in ALLOWED_DELTAS and stat_name in ALLOWED_DELTAS[metric_name]:
    allowed_delta = ALLOWED_DELTAS[metric_name][stat_name]
else:
    # add_report_row(metric_stat, baseline_value, stat_value)

Copilot AI commented on Aug 6, 2025

Commented-out code should be removed to improve code cleanliness.

Suggested change
# add_report_row(metric_stat, baseline_value, stat_value)

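For readers skimming the thread, the snippet quoted above is part of the baseline comparison: each metric/statistic pair is checked against a per-metric tolerance table. Below is a minimal sketch of that pattern, assuming a relative-difference check; ALLOWED_DELTAS mirrors the name in the quoted code, while the table contents, the DEFAULT_DELTA fallback, and the comparison itself are illustrative assumptions, not perfkit's actual logic.

# Sketch of the tolerance lookup shown in the quoted snippet; values and the
# fallback behaviour are assumptions for illustration, not perfkit's real config.
ALLOWED_DELTAS = {
    "http_req_duration": {"p(95)": 0.10, "avg": 0.05},  # relative tolerances per metric/stat
}
DEFAULT_DELTA = 0.05  # assumed fallback when no explicit tolerance is configured


def exceeds_allowed_delta(metric_name: str, stat_name: str,
                          baseline_value: float, stat_value: float) -> bool:
    """Return True when the measured value regresses past the allowed delta."""
    if metric_name in ALLOWED_DELTAS and stat_name in ALLOWED_DELTAS[metric_name]:
        allowed_delta = ALLOWED_DELTAS[metric_name][stat_name]
    else:
        allowed_delta = DEFAULT_DELTA
    if baseline_value == 0:
        return stat_value > 0
    return (stat_value - baseline_value) / baseline_value > allowed_delta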

@archibald1418 (Contributor) commented Aug 8, 2025

@orazvaliev Food for thought: wouldn't it be nicer to just let the exceptions propagate to the entrypoint instead of exit()-ing? What is your take on this?

@orazvaliev (Contributor, Author) replied:
Typer's exit() is basically a raised exception, just with fancy formatting.
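
For reference, a minimal illustration of the two styles under discussion (the command and its error handling are invented for this example, not taken from perfkit): typer.Exit is itself an exception that the Typer entrypoint catches and turns into a process exit code, so both approaches end up raising; the difference is how the failure is reported to the user.

import typer

app = typer.Typer()


@app.command()
def check(threshold: float = 1.0) -> None:
    if threshold < 0:
        # Style 1: report cleanly and set a status code; typer.Exit is an
        # exception that the Typer entrypoint catches and converts to sys.exit().
        typer.echo("threshold must be non-negative", err=True)
        raise typer.Exit(code=2)
    # Style 2: let an ordinary exception propagate; the process still exits
    # non-zero, but the user sees a traceback instead of a short message.
    result = 1.0 / threshold  # raises ZeroDivisionError when threshold == 0
    typer.echo(f"result={result}")


if __name__ == "__main__":
    app()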

@@ -0,0 +1,41 @@
// Copyright (C) CVAT.ai Corporation
//
// SPDX-License-Identifier: MIT

Suggested change
// SPDX-License-Identifier: MIT
// SPDX-License-Identifier: MIT

Quality Gate failed

Failed conditions
1 Security Hotspot
5.4% Duplication on New Code (required ≤ 3%)
C Reliability Rating on New Code (required ≥ A)

See analysis details on SonarQube Cloud


@archibald1418 self-requested a review on August 15, 2025 09:45
@orazvaliev merged commit aa994c2 into develop on Aug 15, 2025
37 of 38 checks passed
@orazvaliev deleted the ro/regression-performance-tests branch on August 15, 2025 09:49
4 participants