Layout algorithm decoupling, Sizing Constraints & Perf Improvements #246

nicoburns · 2022-11-19T01:26:09Z

Objective

Lays the groundwork for Support multiple layout algorithms #28 by splitting the "leaf" algorithm out from the flexbox algorithm ~~and creating a trait that each algorithm implements~~ (I now intend to make this a separate PR)
Addresses Improve docs and API to clarify the purpose of MeasureFunc #214 by adding an available_space parameter to MeasureFuncs. The AvailableSpace is also used for the flexbox parent size parameter.
Dramatically improves performance on deep trees (17s -> 3ms) by improving caching. (Fixes Slow performance with deep hierachies #243)
Adds a debug module with support for printing a nested tree of logs when rendering a tree (todo: add a feature flag for outputting logs and default to off).

Benchmark Results

Pay attention to units: there are all of seconds, milliseconds and microseconds in here.

Warning

Note for anyone reading this in the future: the absolute values in these benchmarks turned to be bunk due to flaw in the measuring methodology. However the relative improvement ended up being similar. See benches folder for up to date benchmark results.

Benchmark	main (w/new benchmarks)	This Branch	% change
big trees/10_000 nodes (2-level hierarchy)	44.472 µs	45.272 µs	no change
big trees/100_000 nodes (2-level hierarchy)	4.2695 ms	4.5856 ms	no change
big trees/100_000 nodes (7-level hierarchy)	2.8302 s	4.2983 ms	-99.738%
big trees/4000 nodes (12-level hierarchy))	10.988 s	17.138 µs	-100.000%
big trees/10_000 nodes (14-level hierarchy)	46.336 s	3.3227 ms	-99.992%
big trees/100_000 nodes (17-level hierarchy)	gave up after 20 mins	10.998 s	-
deep hierarchy/build	701.74 ns	696.26 ns	no change
deep hierarchy/single	6.8114 µs	6.3035 µs	no change
deep hierarchy/relayout	4.1475 µs	2.2235 µs	-46.389%
generated benchmarks	206.40 µs	208.63 µs	no change

Context

Code is still WIP. Cleanup needed in a number of places. It is passing all tests though.

Feedback wanted

Nothing specific, but general feedback welcome.

alice-i-cecile · 2022-11-19T02:13:36Z

Wow, I'm kind of at a loss for words with those benchmark results. I'm going to be prioritizing getting this reviewed and merged: let me know when you feel it's ready.

nicoburns · 2022-11-19T02:20:30Z

Wow, I'm kind of at a loss for words with those benchmark results. I'm going to be prioritizing getting this reviewed and merged: let me know when you feel it's ready.

Same! The majority of the improvement came from a one-line change too! I'm busy tomorrow, but I'll see if I can find some time to get this into a mergeable state on Sunday :)

This gives huge performance wins on deep trees. On my machine, an improvement from 17s to 3ms (note change of unit) on the benchmark with 10,000 nodes at a depth of 14

alice-i-cecile

Stream of consciousness thoughts as I read:

Jesus those perf numbers. Algorithmic complexity really does matter eh?
AvailableSpace::Definite is really nice! Very clear, very explicit
Size::NONE -> Size::MAX_CONTENT definitely needs a migration guide. More clear though!
AvailableSpace instead of an Option<f32> is so much clearer.
Ditto RunMode and SizingMode. Great docs too!
The methods in debug.rs feel like they will be genuinely useful to users: I'd make them properly pub.

Overall, this does a ton of the things that the team has wanted to do for this library: stronger types, better docs, dramatically better performance, foundations for multiple layout algorithms. I'm looking forward to merging this when it's ready!

Plenty to nitpick (missing doc links, commented out code), but I trust you'll get around to those :) Let me know when you're ready for a final review pass!

Weibye

❤️ RunMode, SizingMode and AvailableSpace. It really helps readability :)
The debug seems particularly useful! We should further build on that both for ourselves and end-users.

Really appreciate the work!

src/geometry.rs

src/layout.rs

src/geometry.rs

src/data.rs

src/compute/flexbox.rs

TimJentzsch

Amazing performance improvements!

src/compute/leaf.rs

src/debug.rs

src/layout.rs

nicoburns · 2022-11-22T01:04:18Z

So I've done some rough benchmarking against yoga by:

Recreating the "Huge nested layout" from https://github.com/facebook/yoga/blob/578d197dd6652225b46af090c0b46471dc887361/javascript/tests/Benchmarks/YGBenchmark.js in our benchmark suite.
Recreating it using the node.js bindings for yoga-layout (the yoga-layout-prebuilt package on NPM because I couldn't get it to build easily using the yoga-layout package. This will add a small overhead to call into native, but that's only a small constant overhead (a single function call) and as demonstrated by the 10 node benchmark that can't be greater than 45µs, so I think it's still pretty fair. I didn't use a benchmarking framework, but I manually ran it several times and the results were pretty similar each time.

Results

Benchmark	Yoga	Taffy
big trees/10 nodes (1-level hierarchy)	45.1670 µs	34.110 ns
big trees/100 nodes (2-level hierarchy)	134.1250 µs	341.80 ns
big trees/1_000 nodes (3-level hierarchy)	1.2221 ms	3.8351 µs
big trees/10_000 nodes (4-level hierarchy)	13.8672 ms	37.551 µs
big trees/100_000 nodes (5-level hierarchy)	141.5307 ms	1.7385 ms
big trees/1_000_000 nodes (6-level hierarchy)	error*	44.145 ms

* But in fairness to yoga, the error is "please increase the memory limit" and the equivalent taffy benchmark was using much more memory (6gb+) than yoga's limit of 134mb. I'd like to run taffy without criterion, to get a better idea of how much memory it uses in real-world usage. Perhaps we could also try https://docs.rs/dhat/latest/dhat/

Conclusions

At least on this benchmark we seem to be quite a bit faster than yoga. Although I'm a little worried that it seems a bit too good to be true.

Code for yoga benchmarks

package.json


{
  "name": "layout-benchmark",
  "version": "1.0.0",
  "main": "index.js",
  "license": "MIT",
  "private": false,
  "dependencies": {
    "yoga-layout-prebuilt": "^1.10.0"
  }
}

index.js


const Yoga = require('yoga-layout-prebuilt');
function buildTreeLevel(parent, nodesPerLevel, remainingLevels) {
for (var i = 0; i < nodesPerLevel; i++) {

var child = Yoga.Node.create();

child.setFlexGrow(1);

child.setWidth(10);

child.setHeight(10);

parent.insertChild(child, 0);
if (remainingLevels > 1) {
  buildTreeLevel(child, nodesPerLevel, remainingLevels - 1);
}

}
}
function createRoot(nodesPerLevel, levels) {

var root = Yoga.Node.create();
buildTreeLevel(root, nodesPerLevel, levels);
return root;

}
function benchmark(cb) {

let start = performance.now();
cb();
let end = performance.now();
return end - start;

}
function deepTreeBench(nodesPerLevel, levels, print = true) {

let root = createRoot(nodesPerLevel, levels);

let time = benchmark(() => root.calculateLayout(Yoga.UNDEFINED, Yoga.UNDEFINED, Yoga.DIRECTION_LTR));

if (print) {

if (time < 1) {

console.log(Nodes: ${Math.pow(nodesPerLevel, levels)} ${(time*1000).toFixed(4)} µs);

} else {

console.log(Nodes: ${Math.pow(nodesPerLevel, levels)} ${time.toFixed(4)} ms);

}

}

root.freeRecursive();

}
// Initial run seems to have a fixed ~30ms overhead, so we run once and ignore the result.

deepTreeBench(10, 4, false);
// Benchmark at 10 through 100,000 nodes

deepTreeBench(10, 1);

deepTreeBench(10, 2);

deepTreeBench(10, 3);

deepTreeBench(10, 4);

deepTreeBench(10, 5);

deepTreeBench(10, 6);

alice-i-cecile · 2022-11-22T01:17:24Z

Admittedly, yoga also implements parts of the flexbox spec we're ignoring. I'd avoid publicizing it widely until we close that gap.

nicoburns · 2022-11-22T02:07:14Z

@alice-i-cecile I now consider this ready for review. I had planned to add a LayoutAlgorithm trait, however this has ended up being more involved than I expected (potentially having some tricky design tradeoffs), so I now think this would be best off in a separate PR (and tbh, may not be top of my list to do next).

P.S. You've added the 0.3 milestone to this PR, but I believe we are yet to release a 0.2 version so it probably ought to be that? On that note, perhaps it would make sense to start gearing up for a release (release notes need a bit of work I think!) once this lands? Seems to me that this, along with the cumulative changes already on main would be worth getting out to people...

nicoburns · 2022-11-22T02:10:26Z

Admittedly, yoga also implements parts of the flexbox spec we're ignoring.

Interesting. Do you have a list in your head of things that they implement that we don't?

I'd avoid publicizing it widely until we close that gap.

I feel like it might be worth calling out in our release notes, but with the explicit caveat that we're not yet that confident in our benchmarks and would appreciate scrutiny from 3rd parties. So long as we don't come across as showing off or putting others down I think we should be alright sharing numbers?

TimJentzsch · 2022-11-22T12:35:42Z

Honestly, I'm not sure how useful a direct comparison between yoga and taffy is, since they use completely different languages.
I would only see value in a comparison of yoga and JS bindings for taffy, then you could say "just replace yoga with WASM taffy and you'll get more performance". Otherwise I think it's more like comparing apples with oranges.

alice-i-cecile · 2022-11-22T13:07:21Z

You've added the 0.3 milestone to this PR, but I believe we are yet to release a 0.2 version so it probably ought to be that? On that note, perhaps it would make sense to start gearing up for a release (release notes need a bit of work I think!) once this lands? Seems to me that this, along with the cumulative changes already on main would be worth getting out to people...

Agreed, let's ship it.

WRT missing features, I think gap was the big one? We've been tracking this in the issue tracker.

RELEASES.md

src/debug.rs

alice-i-cecile

Really exceptional work. Two things to add to the release notes (see the comments), then I'll merge this in!

nicoburns · 2022-11-22T18:11:22Z

Two things to add to the release notes (see the comments)

Added :)

Co-authored-by: Alice Cecile <alice.i.cecile@gmail.com>

alice-i-cecile · 2022-11-23T15:06:46Z

Awesome work! Let's get gap implemented and then cut a release.

nicoburns mentioned this pull request Nov 19, 2022

DELETED (replaced by new PR) #245

Closed

alice-i-cecile added code quality Make the code cleaner or prettier. performance Layout go brr labels Nov 19, 2022

alice-i-cecile requested review from TimJentzsch, jkelleyrtp and Weibye November 19, 2022 02:12

nicoburns added 16 commits November 19, 2022 02:21

Add deeper depth benchmarks

f82930b

Cargo fmt

6e2c41c

Add map_width, map_height, and zip_map functions to Size

853259b

Define AvailableSpace enum

8a109d1

Move flexbox module to be a submodule of the compute module

f4a3dc9

Convert perform_layout boolean to descriptive enum

490a875

Rename node_size to known_dimensions

751c71a

Update generated tests to use AvailableSpace rather than Option<f32>

79f1bc2

Separate out leaf algorithm

bbceab4

Flexbox: clamp style size inputs

c69e634

Bust cache based on run_mode

929c27b

Fix caching (and only_measure_once) test

6c7e601

Comment out debug logs

7913918

Assign cache slot based on which dimensions are known

12eb82d

This gives huge performance wins on deep trees. On my machine, an improvement from 17s to 3ms (note change of unit) on the benchmark with 10,000 nodes at a depth of 14

Fix new benchmarks for AvailableSpace

073b749

Cargo fmt

c67507a

nicoburns force-pushed the lat/layout-algo-trait branch from 73c3ea4 to c67507a Compare November 19, 2022 02:21

alice-i-cecile reviewed Nov 19, 2022

View reviewed changes

Weibye mentioned this pull request Nov 19, 2022

Remove Dimension::Undefined #188

Closed

Weibye reviewed Nov 19, 2022

View reviewed changes

src/geometry.rs Show resolved Hide resolved

src/layout.rs Show resolved Hide resolved

src/layout.rs Outdated Show resolved Hide resolved

src/geometry.rs Show resolved Hide resolved

src/data.rs Show resolved Hide resolved

src/compute/flexbox.rs Outdated Show resolved Hide resolved

TimJentzsch reviewed Nov 19, 2022

View reviewed changes

src/compute/leaf.rs Outdated Show resolved Hide resolved

src/compute/leaf.rs Outdated Show resolved Hide resolved

src/debug.rs Outdated Show resolved Hide resolved

src/debug.rs Outdated Show resolved Hide resolved

src/layout.rs Outdated Show resolved Hide resolved

Weibye mentioned this pull request Nov 19, 2022

Add maybe_clamp to maybe_math #247

Closed

alice-i-cecile added this to the 0.3 milestone Nov 20, 2022

Add yoga-style benchmarks

2a3a2e9

nicoburns added 2 commits November 22, 2022 01:05

Add smaller yoga-style benchmarks

3b69cae

cargo fmt

0fa0ceb

Update changelog with AvailableSpace changes

778b87e

nicoburns marked this pull request as ready for review November 22, 2022 02:10

TimJentzsch mentioned this pull request Nov 22, 2022

Add calculated expressions to Dimension #232

Closed

nicoburns added 2 commits November 22, 2022 13:40

Remove accidentally comitted debug logs from generated tests

231da2b

Rename AvailableSpace.as_option to AvailableSpace.into_option

5100e7b

nicoburns mentioned this pull request Nov 22, 2022

Implement "gap" property for flexbox algorithm #248

Merged

nicoburns changed the title ~~WIP: Layout algorithm trait, Sizing Constraints & Perf Improvements~~ Layout algorithm decoupling, Sizing Constraints & Perf Improvements Nov 22, 2022

alice-i-cecile reviewed Nov 22, 2022

View reviewed changes

RELEASES.md Show resolved Hide resolved

alice-i-cecile reviewed Nov 22, 2022

View reviewed changes

src/debug.rs Outdated Show resolved Hide resolved

alice-i-cecile requested changes Nov 22, 2022

View reviewed changes

Make debug::print_tree public and document in release notes

5fbe2db

Update RELEASES.md

6ef89c9

Co-authored-by: Alice Cecile <alice.i.cecile@gmail.com>

alice-i-cecile approved these changes Nov 23, 2022

View reviewed changes

alice-i-cecile merged commit ba4cd3c into DioxusLabs:main Nov 23, 2022

This was referenced Nov 25, 2022

Make taffy::compute::compute_layout public #263

Closed

Support CSS Grid #204

Closed

nicoburns mentioned this pull request Jan 8, 2023

Support multiple layout algorithms #28

Closed

nicoburns mentioned this pull request Mar 22, 2023

RFC: React DOM for Native (reduce API fragmentation) react-native-community/discussions-and-proposals#496

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Layout algorithm decoupling, Sizing Constraints & Perf Improvements #246

Layout algorithm decoupling, Sizing Constraints & Perf Improvements #246

nicoburns commented Nov 19, 2022 •

edited

Loading

alice-i-cecile commented Nov 19, 2022

nicoburns commented Nov 19, 2022

alice-i-cecile left a comment

Weibye left a comment

TimJentzsch left a comment

nicoburns commented Nov 22, 2022

alice-i-cecile commented Nov 22, 2022

nicoburns commented Nov 22, 2022

nicoburns commented Nov 22, 2022

TimJentzsch commented Nov 22, 2022

alice-i-cecile commented Nov 22, 2022

alice-i-cecile left a comment

nicoburns commented Nov 22, 2022

alice-i-cecile commented Nov 23, 2022

Layout algorithm decoupling, Sizing Constraints & Perf Improvements #246

Layout algorithm decoupling, Sizing Constraints & Perf Improvements #246

Conversation

nicoburns commented Nov 19, 2022 • edited Loading

Objective

Benchmark Results

Context

Feedback wanted

alice-i-cecile commented Nov 19, 2022

nicoburns commented Nov 19, 2022

alice-i-cecile left a comment

Choose a reason for hiding this comment

Weibye left a comment

Choose a reason for hiding this comment

TimJentzsch left a comment

Choose a reason for hiding this comment

nicoburns commented Nov 22, 2022

Results

Conclusions

Code for yoga benchmarks

alice-i-cecile commented Nov 22, 2022

nicoburns commented Nov 22, 2022

nicoburns commented Nov 22, 2022

TimJentzsch commented Nov 22, 2022

alice-i-cecile commented Nov 22, 2022

alice-i-cecile left a comment

Choose a reason for hiding this comment

nicoburns commented Nov 22, 2022

alice-i-cecile commented Nov 23, 2022

nicoburns commented Nov 19, 2022 •

edited

Loading