Extend the scope of render target allocation strategy acrosss layers #3374

kvark · 2018-11-29T20:55:47Z

Fixes https://bugzilla.mozilla.org/show_bug.cgi?id=1509672

Technically consists of 3 parts:

cleanup bits, don't affect performance
spread the dynamic target allocation bins across the layers. Previously, we had each layer with it's own allocator, and we were only ever trying to allocate from the last one. This scheme was weak against cases where we reach the ideal maximum target size, since guillotine allocation of small bits forced us to spawn more and more layers... reaching almost 150 in the target case. New scheme uses the layers more efficiently, reducing the layers to just 3 (.. 50x reduction :P). Pending try push.
when we reach the ideal max size, also round up the requested size to 256. This makes allocations within a layer to be more robust against small inputs. With this change, the number of layers drops to just 2 (.. 75x reduction lol), which appears to be minimal for this case. Pending try push.

The page scrolls smoothly with those changes, on my GTX TI 1050 at least.

r? @gw3583

Note: the bugzilla issue also suffers from poor batching, I'm going to look at it separately.

This change is

kvark · 2018-11-29T22:18:55Z

Both tries look green. Appveyor used an older Rust version, which is updated in the last commit.
OSX Release TC bots are being updated to 1.30 by @staktrace as we speak.
This is ready for reviews!

gw3583

Reviewed 5 of 5 files at r1.
Reviewable status: complete! all files reviewed, all discussions resolved

gw3583 · 2018-11-29T22:48:18Z

Seems sane! Do we need to do any profiling / testing before merging?

kvark · 2018-11-30T01:53:03Z

@gw3583 I visited a few websites with RT debug display enabled, and the improvement is real. On cnn.com we are down from 3 large (~3K x 2K) slices to just one. On HN, similarly, we only have a single slice with everything instead of 3.

Looks like this is going to be a major win on common sites, not just extreme cases. 🎉

gw3583 · 2018-11-30T02:01:59Z

@kvark Cool, ship it! 🚀

staktrace · 2018-11-30T02:33:24Z

The mac builders should have 1.30 now

zptan · 2018-11-30T03:46:20Z

OS X debug tests failed:

...
sccache --stop-server || true
Stopping sccache server...
error: couldn't connect to server
caused by: Connection refused (os error 61)
mkdir -p ../artifacts
RUST_LOG=sccache=trace SCCACHE_ERROR_LOG=$PWD/../artifacts/sccache.log sccache --start-server
TRACE:sccache::cmdline: parse
TRACE:sccache::commands: Command::StartServer
Starting sccache server...
TRACE:sccache::commands: run_server_process
TRACE:sccache::cmdline: parse
TRACE:sccache::commands: Command::InternalStartServer
error: Timed out waiting for server startup
...

kvark · 2018-11-30T04:34:01Z

@bors-servo try

bors-servo · 2018-11-30T04:34:04Z

⌛ Trying commit c2f04cd with merge d99dedb...

@gw3583

Extend the scope of render target allocation strategy acrosss layers Fixes https://bugzilla.mozilla.org/show_bug.cgi?id=1509672 Technically consists of 3 parts: - cleanup bits, don't affect performance - spread the dynamic target allocation bins across the layers. Previously, we had each layer with it's own allocator, and we were only ever trying to allocate from the last one. This scheme was weak against cases where we reach the ideal maximum target size, since guillotine allocation of small bits forced us to spawn more and more layers... reaching almost 150 in the target case. New scheme uses the layers more efficiently, reducing the layers to just 3 (.. 50x reduction :P). Pending [try push](https://treeherder.mozilla.org/#/jobs?repo=try&revision=27464c7d62f0f05a0bc96a7133b55e9706d3d449). - when we reach the ideal max size, also round up the requested size to 256. This makes allocations *within a layer* to be more robust against small inputs. With this change, the number of layers drops to just 2 (.. 75x reduction lol), which appears to be minimal for this case. Pending [try push](https://treeherder.mozilla.org/#/jobs?repo=try&revision=1f4593fd68455842a7b12f396d0abbdf887a11a0). The page scrolls smoothly with those changes, on my GTX TI 1050 at least. r? @gw3583 Note: the bugzilla issue also suffers from poor batching, I'm going to look at it separately.  --- This change is [<img src="https://reviewable.io/review_button.svg" height="34" align="absmiddle" alt="Reviewable"/>](https://reviewable.io/reviews/servo/webrender/3374)

bors-servo · 2018-11-30T05:35:17Z

☀️ Test successful - status-appveyor, status-taskcluster
State: approved= try=True

kvark · 2018-11-30T14:46:28Z

@gw3583 I've added a bit of code for validating texture allocator correctness, at least at test time.
@bors-servo r=gw3583

bors-servo · 2018-11-30T14:46:28Z

📌 Commit 3362a5e has been approved by gw3583

bors-servo · 2018-11-30T14:46:32Z

⌛ Testing commit 3362a5e with merge dbaa109...

@gw3583

Extend the scope of render target allocation strategy acrosss layers Fixes https://bugzilla.mozilla.org/show_bug.cgi?id=1509672 Technically consists of 3 parts: - cleanup bits, don't affect performance - spread the dynamic target allocation bins across the layers. Previously, we had each layer with it's own allocator, and we were only ever trying to allocate from the last one. This scheme was weak against cases where we reach the ideal maximum target size, since guillotine allocation of small bits forced us to spawn more and more layers... reaching almost 150 in the target case. New scheme uses the layers more efficiently, reducing the layers to just 3 (.. 50x reduction :P). Pending [try push](https://treeherder.mozilla.org/#/jobs?repo=try&revision=27464c7d62f0f05a0bc96a7133b55e9706d3d449). - when we reach the ideal max size, also round up the requested size to 256. This makes allocations *within a layer* to be more robust against small inputs. With this change, the number of layers drops to just 2 (.. 75x reduction lol), which appears to be minimal for this case. Pending [try push](https://treeherder.mozilla.org/#/jobs?repo=try&revision=1f4593fd68455842a7b12f396d0abbdf887a11a0). The page scrolls smoothly with those changes, on my GTX TI 1050 at least. r? @gw3583 Note: the bugzilla issue also suffers from poor batching, I'm going to look at it separately.  --- This change is [<img src="https://reviewable.io/review_button.svg" height="34" align="absmiddle" alt="Reviewable"/>](https://reviewable.io/reviews/servo/webrender/3374)

bors-servo · 2018-11-30T15:58:18Z

☀️ Test successful - status-appveyor, status-taskcluster
Approved by: gw3583
Pushing dbaa109 to master...

heftig · 2018-11-30T17:44:41Z

With the latter try build the issue is gone for me. Thanks! 👍

…ab3e7ace77d6 (WR PR #3374). r=kats servo/webrender#3374 Differential Revision: https://phabricator.services.mozilla.com/D13626 --HG-- extra : moz-landing-system : lando

…ab3e7ace77d6 (WR PR #3374). r=kats servo/webrender#3374 Differential Revision: https://phabricator.services.mozilla.com/D13626

…ab3e7ace77d6 (WR PR #3374). r=kats servo/webrender#3374 Differential Revision: https://phabricator.services.mozilla.com/D13626 UltraBlame original commit: be65d09ef059e78daf60df9737503cfcb6dc8a86

gw3583 approved these changes Nov 29, 2018

View reviewed changes

kvark added 7 commits November 30, 2018 09:44

Minor refactor for the texture allocator

9e7d615

Refactor the bin selection in texture allocator

0930c49

Remove the dirty flag and FitsInside helper from texture allocator

8683e9a

Rewrite texture allocator to spread bins across slices

cde51e1

Fix RT layer index, round up the size

f951f75

Update appveyor rustc version to 1.30

0781d9d

Automated texture allocator testing

3362a5e

kvark force-pushed the rt-alloc branch from c2f04cd to 3362a5e Compare November 30, 2018 14:45

bors-servo merged commit 3362a5e into servo:master Nov 30, 2018

kvark deleted the rt-alloc branch December 12, 2018 01:48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extend the scope of render target allocation strategy acrosss layers #3374

Extend the scope of render target allocation strategy acrosss layers #3374

kvark commented Nov 29, 2018 •

edited by larsbergstrom

Loading

kvark commented Nov 29, 2018

gw3583 left a comment

gw3583 commented Nov 29, 2018

kvark commented Nov 30, 2018

gw3583 commented Nov 30, 2018

staktrace commented Nov 30, 2018

zptan commented Nov 30, 2018

kvark commented Nov 30, 2018

bors-servo commented Nov 30, 2018

bors-servo commented Nov 30, 2018

kvark commented Nov 30, 2018

bors-servo commented Nov 30, 2018

bors-servo commented Nov 30, 2018

bors-servo commented Nov 30, 2018

heftig commented Nov 30, 2018

Extend the scope of render target allocation strategy acrosss layers #3374

Extend the scope of render target allocation strategy acrosss layers #3374

Conversation

kvark commented Nov 29, 2018 • edited by larsbergstrom Loading

kvark commented Nov 29, 2018

gw3583 left a comment

Choose a reason for hiding this comment

gw3583 commented Nov 29, 2018

kvark commented Nov 30, 2018

gw3583 commented Nov 30, 2018

staktrace commented Nov 30, 2018

zptan commented Nov 30, 2018

kvark commented Nov 30, 2018

bors-servo commented Nov 30, 2018

bors-servo commented Nov 30, 2018

kvark commented Nov 30, 2018

bors-servo commented Nov 30, 2018

bors-servo commented Nov 30, 2018

bors-servo commented Nov 30, 2018

heftig commented Nov 30, 2018

kvark commented Nov 29, 2018 •

edited by larsbergstrom

Loading