Skip to content

Enhancement: Dataverse files are downloaded one-by-one, not concurrently #61

@ross-spencer

Description

@ross-spencer

Please describe the problem you'd like to be solved.

I would like to see this portion of the code corrected to enable concurrent downloading of a Dataverse payload: https://github.com/artefactual/archivematica-storage-service/blob/9e6f97392042997bfd7ee251308e0708f514860e/storage_service/locations/models/dataverse.py#L253-L275

The speed at which one-by-one downloading happens at present impacts user efficacy or at least the speed of.

Describe the solution you'd like to see implemented.

The python requests library supports concurrent downloads, we could try a solution such as this outlined on Stack Overflow: https://stackoverflow.com/a/9189249

Describe alternatives you've considered.

Alternative libraries or mechanisms which we are already doing this in Archivematica might exist.

Additional Context.

Async requests: http://docs.python-requests.org/en/v0.10.6/user/advanced/#asynchronous-requests

Metadata

Metadata

Assignees

No one assigned

    Labels

    OCUL: AM-DataverseOCUL: AM-Dataverseissue-closure-2025This issue was not addressed in a release, but was closed as part of a blanket issue triage in 2025.

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions