Implement Partial Memory Flush #22

osanj · 2020-01-07T23:54:08Z

As soon as a cache is dirty and a related shader is run again, the entire buffer will be flushed. This can become quite expensive for large buffers and for cpu-gpu communication. The vulkan specification allows to do partial copies using buffer regions: https://www.khronos.org/registry/vulkan/specs/1.1-extensions/man/html/vkCmdCopyBuffer.html

Currently, only a single region (the entire buffer) is provided:

lava/lava/api/pipeline.py

Lines 201 to 202 in e82f6d3

    
           region = vk.VkBufferCopy(0, 0, src_buffer.get_size()) 
        
           vk.vkCmdCopyBuffer(self.command_buffer_handle, src_buffer.handle, dst_buffer.handle, 1, [region])

The specific usecase which should benefit from this, is partial invalidation of large buffers. Ideally the regions can be computed agonistic, e.g. by comparing current bytes with new bytes.

Tasks:

experiment with the regions
~~implement more finegrained "dirty byte detection"~~ (already in ByteCache, except scalar arrays)
allow for manual dirty setting?
make configurable?
figure out if the memory mapping for host memory to cpu buffer is a bottleneck as well or if mapping the entire memory can be kept

lava/lava/buffer.py

Line 155 in e82f6d3

self.vulkan_buffer.map(bytez)
write tests

osanj · 2020-01-19T22:56:29Z

In the first implementation the new buffer bytes are compared with the existing ones, a mask is created and converted into regions (offset & size).

In the associated test, floats are changed from 1 to 2. On the byte level this only changes 2 out of 4 bytes per float, resulting in a lot of small regions being copyied:

lava/test/buffer.py

Lines 262 to 267 in e110d18

    
           buffer_in["arrayIn"] = np.ones(length, dtype=np.float32) 
        
           stage.run_and_wait() 
        
           np.testing.assert_equal(buffer_in["arrayIn"].unwrap(), buffer_out["arrayOut"].unwrap()) 
        
           buffer_in["arrayIn"][length // 2:] = 2 
        
           stage.run_and_wait()

Some other tests showed that this leads to increased instead of reduced upload duration...

osanj added the enhancement New feature or request label Jan 7, 2020

osanj modified the milestones: 0.4.0, 0.5.0 Jan 7, 2020

osanj modified the milestones: 0.4.0, 0.5.0 Jun 29, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement Partial Memory Flush #22

Implement Partial Memory Flush #22

osanj commented Jan 7, 2020 •

edited

Loading

osanj commented Jan 19, 2020 •

edited

Loading

Implement Partial Memory Flush #22

Implement Partial Memory Flush #22

Comments

osanj commented Jan 7, 2020 • edited Loading

osanj commented Jan 19, 2020 • edited Loading

osanj commented Jan 7, 2020 •

edited

Loading

osanj commented Jan 19, 2020 •

edited

Loading