Lightweight backups for cloud storage based tasks #9535

Eldies · 2025-06-16T12:01:16Z

License

I submit my code changes under the same MIT License that covers the project.
Feel free to contact the maintainers if that's a concern.

…ange cloud storage

Marishka17

I do not think that we need to mark a data storage as "missing" when a user does not select a cloud storage from the list but clicks next to the field. It looks better to me to show the original cloud storage.
We should check on the server that the newly attached storage has "available" status (bucket still exists and credentials are valid)
If I open a task created from a lightweight backup file, then there will be an exception notification:
I think now we need to change the exception raised on the server when preparing a data chunk with unlinked CS. Now there will be a 500 status code when fetching a chunk or a task preview - let's return more appropriate codes (probably a 409 status code or at least a better message)

Marishka17 · 2025-06-17T10:16:16Z

cvat/apps/engine/background.py

@@ -214,6 +229,7 @@ def _init_callback_with_params(self):
            Exporter,
            logger,
            self.job_result_ttl,
+            self.export_args.make_lightweight_backup,


This change affects already created rq jobs. We need to remove them or add a redis migration. Additionally, I do not really like this approach because this option can only be used when task data is linked with a cloud storage; for other storage types, this option is useless. Probably we can use the rq job meta or introduce another exporter class that inherits TaskExporter.
@SpecLad, what do you think?

It seems to me that this parameter has a similar role to the format/save_images parameters of dataset exports, so it should be handled in a similar way.

It would only make sense for me if:

this option is handled for all storage types

there is an option in the UI to attach some data to the created empty tasks

this option is handled for all storage types

Well... couldn't it be? I don't think there's anything actually preventing us from creating lightweight backups regardless of the current data storage type. We don't have to actually implement this right now, but we might as well structure the code so that it could be implemented in the future.

there is an option in the UI to attach some data to the created empty tasks

Sorry, I don't understand. This part of the code creates the backup, whereas this sounds like something you'd do when restoring the backup. How is it relevant?

Made the lightweight parameter a kwarg for create_backup. That way, existing rq backup jobs still can execute

That way, existing rq backup jobs still can execute

But with another behavior since it's now True :)
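
The trade-off in this thread can be sketched as follows. This is a minimal illustration, not CVAT's code: `create_backup_default_true` and `create_backup_default_false` are hypothetical stand-ins that differ only in the default of the keyword-only `lightweight` argument, and `old_job_args` mimics the arguments that an rq job enqueued before the change would carry.

```python
from datetime import timedelta


def create_backup_default_true(instance_id, cache_ttl, *, lightweight=True):
    """Hypothetical stand-in: the keyword-only argument defaults to True."""
    return {"instance_id": instance_id, "lightweight": lightweight}


def create_backup_default_false(instance_id, cache_ttl, *, lightweight=False):
    """Hypothetical stand-in: same signature, but the default keeps the old behavior."""
    return {"instance_id": instance_id, "lightweight": lightweight}


# Arguments as an rq job enqueued before the change would have stored them:
# positional only, with no knowledge of the new keyword.
old_job_args = (42, timedelta(hours=1))
old_job_kwargs = {}

# Both variants accept the old call, so already queued jobs still execute ...
changed = create_backup_default_true(*old_job_args, **old_job_kwargs)
preserved = create_backup_default_false(*old_job_args, **old_job_kwargs)

# ... but only the second one produces the kind of backup the job was queued for.
print(changed["lightweight"], preserved["lightweight"])
```

A keyword-only argument with a default keeps old jobs runnable either way; the value of the default decides whether their output changes.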

SpecLad · 2025-06-18T14:23:14Z

cvat/schema.yml

@@ -9466,6 +9476,10 @@ components:
          allOf:
          - $ref: '#/components/schemas/StorageRequest'
          nullable: true
+        data_storage:
+          allOf:
+          - $ref: '#/components/schemas/TaskDataStorageRequest'


Why not use the same type as source_storage/target_storage?

They have the id field, which is not needed here. Removed it; the data storage is now changed in another way.

SpecLad · 2025-06-18T14:28:53Z

cvat/schema.yml

@@ -10791,6 +10824,10 @@ components:
          description: |
            The number of consensus replica jobs for each annotation job.
            Configured at task creation
+        data_storage:


Does this field need to be here? AFAIK, TaskWriteRequest is only used to create blank tasks, and the data storage for those is only set via the POST /api/tasks/<id>/data request.

got rid of this in favor of using GET/PATCH /api/tasks/<id>/data/meta

SpecLad · 2025-06-18T15:45:04Z

cvat/schema.yml

+        name: lightweight
+        schema:
+          type: boolean
+        description: Make lightweight backup for cloud based tasks


It would be useful to explain what "lightweight" means.

I only skimmed the PR so I could've missed it, but I didn't see any code that would reject this parameter when the task is not cloud based. Please add it if it's not there.

Tried to write a better description.

I don't quite understand why it should be explicitly rejected instead of just doing nothing

codecov-commenter · 2025-06-30T09:36:43Z

Codecov Report

Attention: Patch coverage is 71.02804% with 31 lines in your changes missing coverage. Please review.

Project coverage is 71.84%. Comparing base (ae32938) to head (bfd14f8).
Report is 4 commits behind head on develop.

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #9535      +/-   ##
===========================================
+ Coverage    71.74%   71.84%   +0.10%     
===========================================
  Files          441      441              
  Lines        46246    46326      +80     
  Branches      3946     3952       +6     
===========================================
+ Hits         33180    33284     +104     
+ Misses       13066    13042      -24

Components	Coverage Δ
cvat-ui	`77.68% <43.58%> (-0.07%)`	⬇️
cvat-server	`67.22% <86.76%> (+0.22%)`	⬆️

cvat/apps/engine/views.py

@@ -586,6 +587,11 @@
                data=str(ex),
                status=status.HTTP_500_INTERNAL_SERVER_ERROR,
            )
+        except CloudStorageMissingError as ex:
+            return Response(
+                data=str(ex),


To fix the issue, the code should replace the detailed exception message (str(ex)) with a generic error message when responding to external users. The detailed exception message can still be logged on the server for debugging purposes. This approach ensures that sensitive information is not exposed while retaining the ability to diagnose issues internally.

Steps to implement the fix:

Replace str(ex) in the response with a generic error message, such as "An internal error has occurred."

Log the detailed exception message using the slogger instance (ServerLogManager) for internal debugging.

cvat/apps/engine/views.py

@@ -694,6 +700,11 @@
                    data=str(ex),
                    status=status.HTTP_500_INTERNAL_SERVER_ERROR,
                )
+            except CloudStorageMissingError as ex:
+                return Response(
+                    data=str(ex),


To fix the issue, the code should be modified to ensure that the exception message is not directly exposed to the user. Instead, a generic error message should be returned, while the detailed exception information is logged for debugging purposes. This approach ensures that sensitive information is not leaked to external users while still allowing developers to diagnose issues.

The changes required are:

Replace str(ex) in the HTTP response with a generic error message, such as "An internal error occurred."

Log the detailed exception message using the slogger logger for internal debugging.
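
A minimal, framework-free sketch of the pattern described in these two suggestions, combined with the 409 status proposed earlier in the review. `CloudStorageMissingError` is named in the diff, but its definition here, the `load_chunk` and `get_chunk_response` helpers, the logger name, and the message text are all assumptions made for illustration; the real views return a DRF `Response` and log through `slogger`.

```python
import logging
from http import HTTPStatus

logger = logging.getLogger("backup_sketch")  # assumption: stands in for slogger


class CloudStorageMissingError(Exception):
    """Placeholder for the exception named in the diff."""


def load_chunk(task_id):
    # Hypothetical loader that always fails the way an unlinked storage would.
    raise CloudStorageMissingError(f"bucket for task {task_id} is not attached")


def get_chunk_response(task_id):
    """Return (status, body); details go to the log, not to the client."""
    try:
        return HTTPStatus.OK, load_chunk(task_id)
    except CloudStorageMissingError:
        # Log the full exception server-side for debugging.
        logger.exception("Cloud storage is missing for task %s", task_id)
        # Respond with a fixed message and a 409 instead of str(ex) and a 500.
        return HTTPStatus.CONFLICT, "The cloud storage linked to this task is not available."


status, body = get_chunk_response(7)
```

The fixed message still tells the user what to fix, while anything derived from the exception text stays in the server log.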

Eldies

I do not think that we need to mark a data storage as "missing" when a user does not select a cloud storage from the list but clicks next to the field. It looks better to me to show the original cloud storage.

Fixed

We should check on the server that the newly attached storage has "available" status (bucket still exists and credentials are valid)

doing it now

I think now we need to change the exception raised on the server when preparing a data chunk with unlinked CS. Now there will be a 500 status code when fetching a chunk or a task preview - let's return more appropriate codes (probably a 409 status code or at least a better message)

Made a better message and also 409 code.

Eldies · 2025-06-30T11:08:52Z

cvat/schema.yml

+        name: lightweight
+        schema:
+          type: boolean
+        description: Make lightweight backup for cloud based tasks


Tried to write a better description.

I don't quite understand why it should be explicitly rejected instead of just doing nothing

Eldies · 2025-06-30T11:10:11Z

cvat/schema.yml

@@ -10791,6 +10824,10 @@ components:
          description: |
            The number of consensus replica jobs for each annotation job.
            Configured at task creation
+        data_storage:


got rid of this in favor of using GET/PATCH /api/tasks/<id>/data/meta

Eldies · 2025-06-30T11:12:37Z

cvat/schema.yml

@@ -9466,6 +9476,10 @@ components:
          allOf:
          - $ref: '#/components/schemas/StorageRequest'
          nullable: true
+        data_storage:
+          allOf:
+          - $ref: '#/components/schemas/TaskDataStorageRequest'


They have the id field, which is not needed here. Removed it; the data storage is now changed in another way.

sonarqubecloud · 2025-07-01T10:25:42Z

Quality Gate passed

Issues
9 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

Marishka17 · 2025-07-16T16:22:24Z

cvat/apps/engine/background.py

+
+    def init_request_args(self) -> None:
+        super().init_request_args()
+        lightweight = to_bool(self.request.query_params.get("lightweight", True))


Defaulting to True may be unexpected for users: it changes the output even though their usage of the API parameters (for example, in existing scripts) has not changed.

Marishka17 · 2025-07-16T16:27:16Z

cvat/apps/engine/background.py

@@ -216,6 +229,9 @@ def _init_callback_with_params(self):
            logger,
            self.job_result_ttl,
        )
+        self.callback_kwargs = dict(


Consider using a dictionary literal.
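
For reference, a short sketch of the two forms. The key name `lightweight` is taken from the surrounding diff hunks and is only an assumption about what `callback_kwargs` holds.

```python
lightweight = True  # example value

# Call form: goes through the dict() builtin; keys must be valid identifiers.
kwargs_call = dict(lightweight=lightweight)

# Literal form: same resulting dictionary, without the function call.
kwargs_literal = {"lightweight": lightweight}
```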

Marishka17 · 2025-07-16T16:49:20Z

cvat/apps/engine/backup.py

+            if data['client_files'] != [self.MEDIA_MANIFEST_FILENAME]:
+                raise ValidationError(f"Expected {self.MEDIA_MANIFEST_FILENAME} in backup files")
+
+            manifest = ImageManifestManager(os.path.join(self._db_task.data.get_upload_dirname(), self.MEDIA_MANIFEST_FILENAME))
+            data['server_files'] = list(manifest.data)
+


Two file types (client_files and server_files) are passed to the task creation logic here. I don't recall us using more than one data type anywhere when creating tasks, and I think this should not be allowed.

Marishka17 · 2025-07-16T16:59:25Z

cvat/apps/engine/backup.py

@@ -1100,6 +1118,8 @@ def create_backup(
    Exporter: Type[ProjectExporter | TaskExporter],
    logger: Logger,
    cache_ttl: timedelta,
+    *,
+    lightweight: bool = True,


I also don't think lightweight should be True by default, because that changes the behavior of RQ jobs created earlier.

One more disadvantage of setting the default value in this function: the default for this argument is now defined in two places, here and where request.query_params is read. IMHO, it's better to define the default in only one place (when handling query params). But I'm okay with it if others are.

Marishka17 · 2025-07-16T17:09:23Z

cvat/apps/engine/backup.py

@@ -971,13 +984,14 @@ def _prepare_project_meta(self, project):
 class ProjectExporter(_ExporterBase, _ProjectBackupBase):
    ModelClass: ClassVar[models.Project] = models.Project

-    def __init__(self, pk, version=Version.V1):
+    def __init__(self, pk, version=Version.V1, *, lightweight: bool = True):


I suggest not defining a default value for the lightweight argument here (the same comment applies to the TaskExporter class).

Suggested change

-    def __init__(self, pk, version=Version.V1, *, lightweight: bool = True):
+    def __init__(self, pk, *, lightweight: bool, version: Version = Version.V1):
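
One way to read this suggestion together with the earlier remark about duplicated defaults: the exporter takes `lightweight` as a required keyword-only argument, and the default lives only where the query parameters are parsed. The sketch below is an illustration under assumptions; `DEFAULT_LIGHTWEIGHT`, `parse_lightweight`, and the simplified `ProjectExporter` are hypothetical and do not mirror CVAT's actual classes or its `to_bool` helper. The value `False` follows the reviewer's concern and is not what the PR currently does.

```python
DEFAULT_LIGHTWEIGHT = False  # assumption: the single place where the default is defined


def parse_lightweight(query_params):
    """Read the optional 'lightweight' query parameter from a plain dict."""
    raw = query_params.get("lightweight")
    if raw is None:
        return DEFAULT_LIGHTWEIGHT
    return str(raw).strip().lower() in {"1", "true", "yes"}


class ProjectExporter:
    """Simplified stand-in: 'lightweight' is keyword-only and has no default."""

    def __init__(self, pk, *, lightweight, version=1):
        self.pk = pk
        self.lightweight = lightweight
        self.version = version


explicit = ProjectExporter(1, lightweight=parse_lightweight({"lightweight": "true"}))
implicit = ProjectExporter(2, lightweight=parse_lightweight({}))

# Forgetting the argument now fails loudly instead of silently picking a default.
try:
    ProjectExporter(3)
    missing_is_error = False
except TypeError:
    missing_is_error = True
```

With this layout, every caller has to state which kind of backup it wants, and changing the default later touches one line.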

Marishka17 · 2025-07-16T17:40:33Z

cvat-core/src/server-proxy.ts

 ): Promise<string | void> {
    const { backendAPI } = config;
    const params: Params = {
        ...enableOrganization(),
        ...configureStorage(targetStorage, useDefaultSettings),
        ...(fileName ? { filename: fileName } : {}),
+        ...(!lightweight ? { lightweight } : {}),


It looks strange that you send the lightweight query param only when it's False.

The lightweight param should probably be optional, and we should send it only if it's specified:

...(typeof lightweight === 'boolean' ? { lightweight } : {}),

Marishka17 · 2025-07-16T17:42:21Z

cvat-core/src/server-response-types.ts

@@ -483,6 +483,8 @@ export interface SerializedFramesMetaData {
    size: number;
    start_frame: number;
    stop_frame: number;
+    storage: StorageLocation;


In fact, it can also be of the "share" type.

Marishka17 · 2025-07-16T17:46:23Z

cvat-core/src/session.ts

+        targetStorage: Storage,
+        useDefaultSettings: boolean,
+        fileName?: string,
+        lightweight: bool,


Suggested change

-        lightweight: bool,
+        lightweight: boolean,

But I also think it should be optional

Marishka17 · 2025-07-16T17:50:47Z

cvat-ui/src/components/export-backup/export-backup-modal.tsx

+                            checked={lightweight}
+                            onChange={setLightweight}
+                        />
+                        <Text strong>Make light-weight backup</Text>


Suggested change

-                        <Text strong>Make light-weight backup</Text>
+                        <Text strong>Make a lightweight backup</Text>

Marishka17 · 2025-07-16T17:54:25Z

cvat-ui/src/components/export-backup/export-backup-modal.tsx

@@ -35,6 +38,7 @@ const initialValues: FormValues = {
        cloudStorageId: undefined,
    },
    useProjectTargetStorage: true,
+    lightweight: true,


I still think we shouldn't show this option at least in cases where we know that it's useless (e.g. when a task has no cloud data). Was it agreed upon with others?

I agree: if the task has no cloud storage, we shouldn't even show the form item for this option.

bsekachev · 2025-07-22T10:37:09Z

I do not think it makes sense to keep this switch for tasks that were not created from a cloud storage.

Let's add a tooltip (like the one "Use default settings" has), as it is not obvious to a user what the difference is between a lightweight backup and a regular one.

bsekachev · 2025-07-22T10:40:27Z

When I export both a regular and a lightweight backup, I can't see both results on the requests page; they seem to overwrite each other, and only the latest one is visible. I believe it should work the same way as exports with different formats.

bsekachev · 2025-07-22T10:48:35Z

The layout here does not look good.

Is there a reason why the field is only rendered after the corresponding cloud storage has been fetched?
Could we show a loading indicator while fetching instead?

bsekachev · 2025-07-22T10:51:55Z

The UI for viewing and editing differs. Could we always show the selector and make this field similar to "Assigned to"? (Maybe even align it to the right.)

In this form, "Select cloud storage" has a "*" mark, which makes the field look required; however, I can leave it empty. I do not think we need this mark.

bsekachev · 2025-07-22T11:03:09Z

For a project that mixes cloud storage and non-cloud storage data, the "Make lightweight backup" switch seems a little ambiguous. I would suggest renaming it, maybe to something like "Use lightweight backup whenever possible" (with a corresponding hint). I'm not sure about the wording; let's think of a better message here.

bsekachev · 2025-07-22T11:14:41Z

@klakhov Could you please look at client changes here?

klakhov

I agree that we should update the style of the cloud storage selector to match the assignee selector.
We should also move it to the right to align with the assignee selector.

klakhov · 2025-07-23T13:15:13Z

cvat-core/src/frames.ts

@@ -701,6 +715,22 @@ function saveJobMeta(meta: FramesMetaData, jobID: number): Promise<FramesMetaDat
    return frameMetaCache[jobID];
 }

+function saveTaskMeta(meta: FramesMetaData, taskID: number): Promise<FramesMetaData> {


This method is a complete duplicate of saveJobMeta. I think we can rewrite it so that we have only one function.

klakhov · 2025-07-23T13:15:34Z

cvat-core/src/frames.ts

@@ -949,6 +979,14 @@ export async function patchMeta(jobID: number): Promise<FramesMetaData> {
    return newMeta;
 }

+export async function patchTaskMeta(taskID: number, meta: FramesMetaData): Promise<FramesMetaData> {


Same comment as for saveTaskMeta

klakhov · 2025-07-23T13:17:08Z

cvat-core/src/project.ts

+        targetStorage: Storage,
+        useDefaultSettings: boolean,
+        fileName?: string,
+        lightweight: bool,


Suggested change

-        lightweight: bool,
+        lightweight: boolean,

Also, we can't have a required parameter follow the optional parameter fileName.

klakhov · 2025-07-23T13:17:34Z

cvat-core/src/server-proxy.ts

@@ -995,12 +995,14 @@ async function backupTask(
    targetStorage: Storage,
    useDefaultSettings: boolean,
    fileName?: string,
+    lightweight: boolean,


Same comment here: we can't have a required param follow an optional one.

klakhov · 2025-07-23T13:20:06Z

cvat-core/src/server-proxy.ts

 ): Promise<string | void> {
    const { backendAPI } = config;
    const params: Params = {
        ...enableOrganization(),
        ...configureStorage(targetStorage, useDefaultSettings),
        ...(fileName ? { filename: fileName } : {}),
+        ...(!lightweight ? { lightweight } : {}),


The lightweight param should probably be optional, and we should send it only if it's specified:

...(typeof lightweight === 'boolean' ? { lightweight } : {}),

klakhov · 2025-07-23T13:27:17Z