[Experiment] Replace HashMap with OrderMap #45282

ghost · 2017-10-14T14:40:24Z

Do not merge.

This is just a simple experiment where I modified FxHashMap to use OrderMap instead of HashMap, as explained in #45273. Don't expect the code to look good. :)

cc @Mark-Simulacrum - Can we please run performance tests to see how this PR impacts compile times?
cc @bluss @kennytm

@eddyb said on IRC that we shouldn't blindly swap the implementation just yet - let's investigate a bit further. If changing the hash map implementation affects performance a lot, then we can probably gain even more by using a different data structure.

rust-highfive · 2017-10-14T14:40:27Z

r? @aturon

(rust_highfive has picked a reviewer for you, use r? to override)

eddyb · 2017-10-14T14:41:19Z

cc @rust-lang/compiler

kennytm · 2017-10-14T14:44:52Z

@bors try

Why are you copying the whole package instead of depending on the crates.io version though?

bors · 2017-10-14T14:45:02Z

⌛ Trying commit 0f01c3b with merge 41825985ccde0f03bf3ea37052b1ad4679c82355...

ghost · 2017-10-14T14:52:12Z

@kennytm No good reason. I wasn't sure how easy or hard it'd be to add ordermap as a dependency from crates.io, so I just copy-pasted the whole crate and called it a day. Last time I worked on rustc it wasn't easy to add crates.io dependencies. But that was long time ago...

Edit: I've pushed a commit that uses ordermap from crates.io.

estebank · 2017-10-14T15:27:54Z

@bors try

(with the crates.io dep now)

bors · 2017-10-14T15:28:04Z

⌛ Trying commit 9da5d99 with merge ddc21f8b46d56234290379ed208d9b58a9e1ea42...

kennytm · 2017-10-14T15:28:30Z

@bors try

The previous try build failed due to Cargo.lock being outdated. Together with the new commit being pushed, the error is not returned. Let's try again.

EDIT: Oh no

bors · 2017-10-14T15:45:02Z

⌛ Trying commit 5dbe2e4 with merge 43308d4...

@Mark-Simulacrum

[Experiment] Replace HashMap with OrderMap Do not merge. This is just a simple experiment where I modified `FxHashMap` to use `OrderMap` instead of `HashMap`, as explained in #45273. Don't expect the code to look good. :) cc @Mark-Simulacrum - Can we please run performance tests to see how this PR impacts compile times? cc @bluss @kennytm @eddyb said on IRC that we shouldn't blindly swap the implementation just yet - let's investigate a bit further. If changing the hash map implementation affects performance a lot, then we can probably gain even more by using a different data structure.

kennytm · 2017-10-14T17:49:24Z

@bors r- try- retry clean

Try build is successful but bors is not commenting.

Travis log: https://travis-ci.org/rust-lang/rust/builds/287955789
Commit = 43308d48de7bbeb014fcd9f4caa1e82e56e394c2.

cc @Mark-Simulacrum

kennytm

The run-make/reproducible-build test failed in the CI.

[00:56:24] ---- [run-make] run-make/reproducible-build stdout ----
[00:56:24] 	
[00:56:24] error: make failed
[00:56:24] status: exit code: 2
[00:56:24] command: "make"
[00:56:24] stdout:
[00:56:24] ------------------------------------------
[00:56:24] make[1]: Entering directory '/checkout/src/test/run-make/reproducible-build'
[00:56:24] LD_LIBRARY_PATH="/checkout/obj/build/x86_64-unknown-linux-gnu/test/run-make/reproducible-build.stage2-x86_64-unknown-linux-gnu:/checkout/obj/build/x86_64-unknown-linux-gnu/stage2/lib:/checkout/obj/build/x86_64-unknown-linux-gnu/stage0-tools/x86_64-unknown-linux-gnu/release/deps:/checkout/obj/build/x86_64-unknown-linux-gnu/stage0-sysroot/lib/rustlib/x86_64-unknown-linux-gnu/lib:" '/checkout/obj/build/x86_64-unknown-linux-gnu/stage2/bin/rustc' --out-dir /checkout/obj/build/x86_64-unknown-linux-gnu/test/run-make/reproducible-build.stage2-x86_64-unknown-linux-gnu -L /checkout/obj/build/x86_64-unknown-linux-gnu/test/run-make/reproducible-build.stage2-x86_64-unknown-linux-gnu  reproducible-build-aux.rs
[00:56:24] LD_LIBRARY_PATH="/checkout/obj/build/x86_64-unknown-linux-gnu/test/run-make/reproducible-build.stage2-x86_64-unknown-linux-gnu:/checkout/obj/build/x86_64-unknown-linux-gnu/stage2/lib:/checkout/obj/build/x86_64-unknown-linux-gnu/stage0-tools/x86_64-unknown-linux-gnu/release/deps:/checkout/obj/build/x86_64-unknown-linux-gnu/stage0-sysroot/lib/rustlib/x86_64-unknown-linux-gnu/lib:" '/checkout/obj/build/x86_64-unknown-linux-gnu/stage2/bin/rustc' --out-dir /checkout/obj/build/x86_64-unknown-linux-gnu/test/run-make/reproducible-build.stage2-x86_64-unknown-linux-gnu -L /checkout/obj/build/x86_64-unknown-linux-gnu/test/run-make/reproducible-build.stage2-x86_64-unknown-linux-gnu  reproducible-build.rs -o"/checkout/obj/build/x86_64-unknown-linux-gnu/test/run-make/reproducible-build.stage2-x86_64-unknown-linux-gnu/reproducible-build1"
[00:56:24] LD_LIBRARY_PATH="/checkout/obj/build/x86_64-unknown-linux-gnu/test/run-make/reproducible-build.stage2-x86_64-unknown-linux-gnu:/checkout/obj/build/x86_64-unknown-linux-gnu/stage2/lib:/checkout/obj/build/x86_64-unknown-linux-gnu/stage0-tools/x86_64-unknown-linux-gnu/release/deps:/checkout/obj/build/x86_64-unknown-linux-gnu/stage0-sysroot/lib/rustlib/x86_64-unknown-linux-gnu/lib:" '/checkout/obj/build/x86_64-unknown-linux-gnu/stage2/bin/rustc' --out-dir /checkout/obj/build/x86_64-unknown-linux-gnu/test/run-make/reproducible-build.stage2-x86_64-unknown-linux-gnu -L /checkout/obj/build/x86_64-unknown-linux-gnu/test/run-make/reproducible-build.stage2-x86_64-unknown-linux-gnu  reproducible-build.rs -o"/checkout/obj/build/x86_64-unknown-linux-gnu/test/run-make/reproducible-build.stage2-x86_64-unknown-linux-gnu/reproducible-build2"
[00:56:24] nm "/checkout/obj/build/x86_64-unknown-linux-gnu/test/run-make/reproducible-build.stage2-x86_64-unknown-linux-gnu/reproducible-build1" | sort > "/checkout/obj/build/x86_64-unknown-linux-gnu/test/run-make/reproducible-build.stage2-x86_64-unknown-linux-gnu/reproducible-build1.nm"
[00:56:24] nm "/checkout/obj/build/x86_64-unknown-linux-gnu/test/run-make/reproducible-build.stage2-x86_64-unknown-linux-gnu/reproducible-build2" | sort > "/checkout/obj/build/x86_64-unknown-linux-gnu/test/run-make/reproducible-build.stage2-x86_64-unknown-linux-gnu/reproducible-build2.nm"
[00:56:24] cmp "/checkout/obj/build/x86_64-unknown-linux-gnu/test/run-make/reproducible-build.stage2-x86_64-unknown-linux-gnu/reproducible-build1.nm" "/checkout/obj/build/x86_64-unknown-linux-gnu/test/run-make/reproducible-build.stage2-x86_64-unknown-linux-gnu/reproducible-build2.nm" || exit 1
[00:56:24] /checkout/obj/build/x86_64-unknown-linux-gnu/test/run-make/reproducible-build.stage2-x86_64-unknown-linux-gnu/reproducible-build1.nm /checkout/obj/build/x86_64-unknown-linux-gnu/test/run-make/reproducible-build.stage2-x86_64-unknown-linux-gnu/reproducible-build2.nm differ: char 5265, line 108
[00:56:24] Makefile:3: recipe for target 'all' failed
[00:56:24] make[1]: Leaving directory '/checkout/src/test/run-make/reproducible-build'
[00:56:24] 
[00:56:24] ------------------------------------------
[00:56:24] stderr:
[00:56:24] ------------------------------------------
[00:56:24] make[1]: warning: jobserver unavailable: using -j1.  Add '+' to parent make rule.
[00:56:24] warning: ignoring --out-dir flag due to -o flag.
[00:56:24] 
[00:56:24] warning: unused variable: `dropped`
[00:56:24]   --> reproducible-build.rs:80:9
[00:56:24]    |
[00:56:24] 80 |     let dropped = Struct {
[00:56:24]    |         ^^^^^^^
[00:56:24]    |
[00:56:24]    = note: #[warn(unused_variables)] on by default
[00:56:24]    = note: to avoid this warning, consider using `_dropped` instead
[00:56:24] 
[00:56:24] warning: unused variable: `pointer_shim`
[00:56:24]    --> reproducible-build.rs:123:9
[00:56:24]     |
[00:56:24] 123 |     let pointer_shim: &Fn(i32) = &regular_fn;
[00:56:24]     |         ^^^^^^^^^^^^
[00:56:24]     |
[00:56:24]     = note: to avoid this warning, consider using `_pointer_shim` instead
[00:56:24] 
[00:56:24] warning: ignoring --out-dir flag due to -o flag.
[00:56:24] 
[00:56:24] warning: unused variable: `dropped`
[00:56:24]   --> reproducible-build.rs:80:9
[00:56:24]    |
[00:56:24] 80 |     let dropped = Struct {
[00:56:24]    |         ^^^^^^^
[00:56:24]    |
[00:56:24]    = note: #[warn(unused_variables)] on by default
[00:56:24]    = note: to avoid this warning, consider using `_dropped` instead
[00:56:24] 
[00:56:24] warning: unused variable: `pointer_shim`
[00:56:24]    --> reproducible-build.rs:123:9
[00:56:24]     |
[00:56:24] 123 |     let pointer_shim: &Fn(i32) = &regular_fn;
[00:56:24]     |         ^^^^^^^^^^^^
[00:56:24]     |
[00:56:24]     = note: to avoid this warning, consider using `_pointer_shim` instead
[00:56:24] 
[00:56:24] make[1]: *** [all] Error 1
[00:56:24] 
[00:56:24] ------------------------------------------
[00:56:24] 
[00:56:24] thread '[run-make] run-make/reproducible-build' panicked at 'explicit panic', /checkout/src/tools/compiletest/src/runtest.rs:2467:8
[00:56:24] note: Run with `RUST_BACKTRACE=1` for a backtrace.
[00:56:24] 
[00:56:24] 
[00:56:24] failures:
[00:56:24]     [run-make] run-make/reproducible-build
[00:56:24] 
[00:56:24] test result: FAILED. 160 passed; 1 failed; 0 ignored; 0 measured; 0 filtered out
[00:56:24] 
[00:56:24] thread 'main' panicked at 'Some tests failed', /checkout/src/tools/compiletest/src/main.rs:323:21

Mark-Simulacrum · 2017-10-14T20:04:12Z

http://perf.rust-lang.org/compare.html?start=af7de7b6774b061b7809ce9aa6db31ea29df33c8&end=43308d48de7bbeb014fcd9f4caa1e82e56e394c2

kennytm · 2017-10-14T20:25:28Z

The numbers don’t look good. Most of them become slower.

ghost · 2017-10-15T09:54:32Z

This is just too bad. :(
So I think the difference in performance was simply the difference of using the nightly rustc from rustup and using my own compiled rustc.

kennytm · 2017-10-15T10:07:22Z

@Mark-Simulacrum I want to ensure I'm reading the report correctly, as different measures give conflicting results.

measure	before (total)	after (total)	change
cpu-clock	372185.29	369019.14	-0.85%
cycles:u	1421498864061	1409937857765	-0.81%
faults	1511256	1510959	-0.02%
instructions:u	1699836885596	1723213735874	+1.38%
max-rss	16735048	16739924	+0.03%
task-clock	370849.03	367449.54	-0.92%
wall-time	202.36	203.21	+0.42%

If we judge by "instructions", there is a significant regression, but if we judge by "cpu-clock" or "cycles", there is a signficant improvement.

Mark-Simulacrum · 2017-10-15T15:56:12Z

Yes, instructions is usually a good indicator that it's worth looking at timing data. I don't see any significant improvement though (usually anything <1% isn't too major -- the timing looks like it changed by ~0.5 seconds which isn't too meaningful).

ghost · 2017-10-15T18:43:32Z

Hmm, so what is it that makes my locally compiled rustc 5% to 20% faster than the one that came from rustup? Does ./x.py build --stage 1 optimize for the local machine or something?

My compile times:

crate	rustc provided by rustup	locally compiled rustc
crossbeam	31.94 secs	26.44 secs
rayon	9.40 secs	8.20 secs
pdqsort	0.23 secs	0.18 secs
chan	6.16 secs	5.26 secs
string-cache	10.13 secs	9.71 secs
parking_lot	5.45 secs	4.92 secs
timely-dataflow	19.69 secs	16.55 secs

Mark-Simulacrum · 2017-10-15T18:54:33Z

What is your config.toml like? On most (all?) platforms nightly rustc is shipped with LLVM assertions enabled, whereas by default they are disabled in locally compiled builds.

ghost · 2017-10-15T19:02:50Z

I assume you meant .rustup/toolchains/nightly-x86_64-unknown-linux-gnu/lib/rustlib/multirust-config.toml:

config_version = "1"

[[components]]
pkg = "rustc"
target = "x86_64-unknown-linux-gnu"

[[components]]
pkg = "rust-std"
target = "x86_64-unknown-linux-gnu"

[[components]]
pkg = "cargo"
target = "x86_64-unknown-linux-gnu"

[[components]]
pkg = "rust-docs"
target = "x86_64-unknown-linux-gnu"

[[components]]
pkg = "rust-analysis"
target = "x86_64-unknown-linux-gnu"

[[components]]
pkg = "rust-src"
target = "*"

Is it out of the question to disable LLVM assertions in shipped rustc? I mean, this is a pretty nice speedup... :)

bluss · 2017-10-21T15:25:31Z

I just want to learn the thinking behind it, why is it needed to look at the instructions stat before cpu-clock or cycles?

Mark-Simulacrum · 2017-10-21T15:35:20Z

LLVM assertions being disabled in nightly builds is actually a topic of ongoing discussion (and has been for a long time, to an extent). By config.toml, I meant the file in your locally compiled rustc -- or did you just run ./x.py build (without copying config.toml.example or running configure?) Do you perhaps have a config.mk still?

@bluss It's not required, but generally we see instructions as a far more stable measure of performance than wall time or cycles. If we see a regression or improvement in the number of instructions, then a look at wall time for actual impact on compile times is warranted. It's not necessarily clear that this is the best way of doing things, but it's how we approach the process for now. I'd be happy to hear suggestions if you have them!

bluss · 2017-10-21T15:44:47Z

@Mark-Simulacrum I haven't used the perf comparison website much in the past, I just saw the different measures now and seeing the reduced cpu-clock times certainly changed my perception of this benchmark.

On one hand, it's not surprising ordermap would use more instructions — its implementation doesn't have much micro optimizations, we have bounds checked vector accesses and other kinds of code that means more instructions are executed for the equivalent operations. So to see that it still runs fast makes us imagine something about the wonders of branch prediction :-)

Effectively straight line code with less branch prediction failures, can manage to run through more instructions in less time. (Like in this article and the 2x instructions vs 7x runtime difference; their benchmark is irrelevant to this discussion otherwise though.)

ghost · 2017-10-21T15:53:17Z

By config.toml, I meant the file in your locally compiled rustc -- or did you just run ./x.py build (without copying config.toml.example or running configure?) Do you perhaps have a config.mk still?

I just ran ./x.py build --stage 1 src/libtest without any additional fiddling. I don't have a config.mk.

Mark-Simulacrum · 2017-10-21T16:04:21Z

Yep, that'll mean that you have a build without llvm assertions, which is expected to be faster.

@bluss Regarding instructions vs wall time, it's certainly true that more instructions may not mean worse results timing wise, but that's why we look at wall time as well. While it seems like OrderMap might be a win there, it's a small one, and may not even exist. It's interesting to know that it isn't a loss, but I don't think the results show enough impact to consider adding OrderMap at this point. I'd be happy to rerun the benchmarks later if OrderMap is optimized and we think it's worth another look.

bluss · 2017-10-21T16:08:06Z

It certainly warms my heart that it isn't even a loss. I didn't see those numbers until today. Mind you, I'm pretty convinced that OrderMap has slower lookup just like I'm convinced it has faster iteration.

[Experiment] Replace HashMap with OrderMap

0f01c3b

rust-highfive assigned aturon Oct 14, 2017

kennytm added the S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. label Oct 14, 2017

Use ordermap from crates.io

9da5d99

Add changes to Cargo.lock

5dbe2e4

kennytm requested changes Oct 14, 2017

View reviewed changes

ghost closed this Oct 15, 2017

ghost deleted the experiment-ordermap branch October 15, 2017 09:54

ghost mentioned this pull request Oct 15, 2017

Optimizing rustc: replace HashMap with OrderMap? #45273

Closed

bluss mentioned this pull request Oct 18, 2017

DOC: Comparison with HashMap indexmap-rs/indexmap#44

Open

This pull request was closed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Experiment] Replace HashMap with OrderMap #45282

[Experiment] Replace HashMap with OrderMap #45282

ghost commented Oct 14, 2017

rust-highfive commented Oct 14, 2017

eddyb commented Oct 14, 2017

kennytm commented Oct 14, 2017

bors commented Oct 14, 2017

ghost commented Oct 14, 2017 •

edited by ghost

Loading

estebank commented Oct 14, 2017

bors commented Oct 14, 2017

kennytm commented Oct 14, 2017 •

edited

Loading

bors commented Oct 14, 2017

kennytm commented Oct 14, 2017 •

edited

Loading

kennytm left a comment

Mark-Simulacrum commented Oct 14, 2017

kennytm commented Oct 14, 2017

ghost commented Oct 15, 2017

kennytm commented Oct 15, 2017

Mark-Simulacrum commented Oct 15, 2017

ghost commented Oct 15, 2017 •

edited by ghost

Loading

Mark-Simulacrum commented Oct 15, 2017

ghost commented Oct 15, 2017 •

edited by ghost

Loading

bluss commented Oct 21, 2017

Mark-Simulacrum commented Oct 21, 2017

bluss commented Oct 21, 2017 •

edited

Loading

ghost commented Oct 21, 2017

Mark-Simulacrum commented Oct 21, 2017

bluss commented Oct 21, 2017

[Experiment] Replace HashMap with OrderMap #45282

[Experiment] Replace HashMap with OrderMap #45282

Conversation

ghost commented Oct 14, 2017

rust-highfive commented Oct 14, 2017

eddyb commented Oct 14, 2017

kennytm commented Oct 14, 2017

bors commented Oct 14, 2017

ghost commented Oct 14, 2017 • edited by ghost Loading

estebank commented Oct 14, 2017

bors commented Oct 14, 2017

kennytm commented Oct 14, 2017 • edited Loading

bors commented Oct 14, 2017

kennytm commented Oct 14, 2017 • edited Loading

kennytm left a comment

Choose a reason for hiding this comment

Mark-Simulacrum commented Oct 14, 2017

kennytm commented Oct 14, 2017

ghost commented Oct 15, 2017

kennytm commented Oct 15, 2017

Mark-Simulacrum commented Oct 15, 2017

ghost commented Oct 15, 2017 • edited by ghost Loading

Mark-Simulacrum commented Oct 15, 2017

ghost commented Oct 15, 2017 • edited by ghost Loading

bluss commented Oct 21, 2017

Mark-Simulacrum commented Oct 21, 2017

bluss commented Oct 21, 2017 • edited Loading

ghost commented Oct 21, 2017

Mark-Simulacrum commented Oct 21, 2017

bluss commented Oct 21, 2017

ghost commented Oct 14, 2017 •

edited by ghost

Loading

kennytm commented Oct 14, 2017 •

edited

Loading

kennytm commented Oct 14, 2017 •

edited

Loading

ghost commented Oct 15, 2017 •

edited by ghost

Loading

ghost commented Oct 15, 2017 •

edited by ghost

Loading

bluss commented Oct 21, 2017 •

edited

Loading