Skip to content

Use an increased release threshold for the default async pool when using DeviceMemoryResource #771

@shwina

Description

@shwina

As reported in #756 (comment), allocation via DeviceMemoryResource can be very slow when there are no active allocations. This is due to the default pool being configured with a release threshold of 0.

A fix for this could be for DeviceMemoryResource to configure the default pool to have a larger (or maximum) release threshold via cuMemPoolSetAttribute.

Metadata

Metadata

Labels

P0High priority - Must do!cuda.coreEverything related to the cuda.core moduleenhancementAny code-related improvements

Type

No type

Projects

Status

Done

Relationships

None yet

Development

No branches or pull requests

Issue actions