As reported in #756 (comment), allocation via DeviceMemoryResource can be very slow when there are no active allocations. This is due to the default pool being configured with a release threshold of 0.
A fix for this could be for DeviceMemoryResource to configure the default pool to have a larger (or maximum) release threshold via cuMemPoolSetAttribute.