Unaligned copies on 8/16 bpp formats on DX11 #2318
The copy shaders at
Currently, the dispatch calls for these smaller formats are scaled down so that they fit the "1 load per thread" model, reading and storing 4 texels (8 bpp) or 2 texels (16 bpp) at a time. This only works when the copy offset and size are aligned to 32 bits. A potential solution would be to not scale at all, "waste" up to 24 bits of bandwidth per load, and do 1 load per texel, like the other format copies.
Note that raw buffers support unaligned 32-bit loads at byte granularity.
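To make the trade-off concrete, here is a rough CPU-side sketch in Python of the two strategies. It is not the actual copy shader: `load_u32`, `copy_packed` and `copy_per_texel` are hypothetical names, each loop iteration stands in for one GPU thread, and only the source-side loads are modelled. To stay conservative it rounds every load down to a 4-byte boundary instead of relying on unaligned loads.

```python
WORD_BYTES = 4  # one 32-bit load per thread


def load_u32(buf, byte_offset):
    """Aligned 32-bit little-endian load, zero-padded past the end of the buffer."""
    if byte_offset % WORD_BYTES:
        raise ValueError("32-bit load must be 4-byte aligned")
    chunk = bytes(buf[byte_offset:byte_offset + WORD_BYTES])
    return int.from_bytes(chunk.ljust(WORD_BYTES, b"\0"), "little")


def copy_packed(src, src_offset, texel_count, bytes_per_texel):
    """Scaled dispatch: each thread unpacks 4 (8 bpp) or 2 (16 bpp) texels from one word."""
    per_word = WORD_BYTES // bytes_per_texel
    if src_offset % WORD_BYTES or texel_count % per_word:
        raise ValueError("scaled copy needs a 4-byte aligned offset and size")
    mask = (1 << (8 * bytes_per_texel)) - 1
    out = []
    for thread in range(texel_count // per_word):
        word = load_u32(src, src_offset + thread * WORD_BYTES)
        out.extend((word >> (8 * bytes_per_texel * i)) & mask for i in range(per_word))
    return out


def copy_per_texel(src, src_offset, texel_count, bytes_per_texel):
    """Unscaled dispatch: each thread loads a whole word but keeps only one texel."""
    if src_offset % bytes_per_texel:
        raise ValueError("offset must be texel-aligned")  # assumption of this sketch
    mask = (1 << (8 * bytes_per_texel)) - 1
    out = []
    for thread in range(texel_count):
        byte_offset = src_offset + thread * bytes_per_texel
        word = load_u32(src, byte_offset & ~3)   # round down to the containing word
        shift = 8 * (byte_offset & 3)            # position of the texel inside the word
        out.append((word >> shift) & mask)       # the remaining bits are "wasted"
    return out
```

With `src = bytes(range(16))`, `copy_packed(src, 5, 4, 1)` raises because the offset is not 4-byte aligned, while `copy_per_texel(src, 5, 4, 1)` returns `[5, 6, 7, 8]` at the cost of four times as many loads.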