Static Object Optimizations #197

ahirreddy · 2024-01-04T01:46:42Z

This PR uses the Parser & StaticOptimizer thread local string interner for keys in static objects
Similarly, we deduplicate the String -> Boolean map used to determine if a field is static.
For static objects we also use immutable.VectorMap (with a JavaWrapper) for the field set.
Lastly for the value cache, we size it according to the number of keys in the object this reduces unnecessary up-sizing for large objects, and more importantly removes the large number of sparse maps we previously had for small objects (the default was 16 elements)

Before: 855MB for the parsed file

After: 425MB

szeiger

What's the performance impact of these changes? Are we trading memory for speed?

szeiger · 2024-01-08T12:28:37Z

bench/src/main/scala/sjsonnet/MainBenchmark.scala

+// This is a dummy benchmark to see how much memory is used by the interpreter.
+// You're meant to execute it, and once it prints "sleeping" you can attach yourkit and take a heap
+// dump. Because we store the cache, the parsed objects will have strong references - and thus will
+// be in the heap dump.


Could we turn this into a simple command line program that does a gc before and after parsing and then prints the memory usage diff to the console so you can easily retest this for future changes without having to attach a profiler?

Done and updated readme.

szeiger · 2024-01-08T12:30:38Z

sjsonnet/src/sjsonnet/StaticOptimizer.scala

+  // HashMap to deduplicate strings.
+  private[this] val strings = new mutable.HashMap[String, String]
+
+  private[this] val fieldSet = new mutable.HashMap[Val.StaticObjectFieldSet, java.util.LinkedHashMap[String, java.lang.Boolean]]
+


Why is this cache separate from the Parser's? Should it be a single cache at the Interpreter level?

Changed, there's now a single cache at the interpreter level.

szeiger · 2024-01-08T12:42:40Z

sjsonnet/src/sjsonnet/Val.scala

@@ -297,15 +298,45 @@ object Val{
    }
  }

-  def staticObject(pos: Position, fields: Array[Expr.Member.Field]): Obj = {
+  final case class StaticObjectFieldSet(keys: Array[String]) {


This shouldn't be a case class. Array has no useful equality or toString and you're overriding equals and hashCode.

…hashmaps2

ahirreddy · 2024-01-09T01:32:34Z

Sorry I missed your most important question. The performance impact here was undetectable in the benchmark. I think the fact that the object interning is thread-local and unsynchronized makes it pretty fast.

szeiger · 2024-01-09T12:30:50Z

bench/src/main/scala/sjsonnet/OptimizerBenchmark.scala

@@ -3,6 +3,8 @@ package sjsonnet
 import java.io.StringWriter
 import java.util.concurrent.TimeUnit

+import scala.collection.mutable.HashMap


I'd prefer to keep mutable types qualified (i.e. only import scala.collection.mutable) everywhere for consistency.

ahirreddy added 6 commits January 3, 2024 16:16

update

dfca0a5

local string intern

e579729

field set builder

cc70b77

fix idx

4c7cc17

reduce further

8ea91ea

revert

7119af0

ahirreddy requested review from lihaoyi, lihaoyi-databricks and szeiger January 4, 2024 02:08

ahirreddy mentioned this pull request Jan 4, 2024

Compact HashMaps #196

Closed

ahirreddy added 4 commits January 3, 2024 18:42

java hashmap

003877e

remove unused bitset

256ac70

remove unused bitset

76b6448

fix merge conflicts

f76e704

lihaoyi removed their request for review January 5, 2024 07:37

szeiger reviewed Jan 8, 2024

View reviewed changes

ahirreddy added 6 commits January 8, 2024 10:27

share

7fd6802

remove

55b51ec

Merge remote-tracking branch 'origin/compact-hashmaps2' into compact-…

d57fe8b

…hashmaps2

fix compile

f66284b

memory benchmark

475fc90

update readme

3a16d4e

szeiger approved these changes Jan 9, 2024

View reviewed changes

lihaoyi-databricks added 2 commits January 9, 2024 05:34

Update ParserTests.scala

b96a4f6

Update OptimizerBenchmark.scala

28b1cd3

lihaoyi-databricks merged commit c462833 into master Jan 9, 2024
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Static Object Optimizations #197

Static Object Optimizations #197

ahirreddy commented Jan 4, 2024 •

edited

szeiger left a comment

szeiger Jan 8, 2024

ahirreddy Jan 8, 2024

szeiger Jan 8, 2024

ahirreddy Jan 8, 2024

szeiger Jan 8, 2024

ahirreddy Jan 8, 2024

ahirreddy commented Jan 9, 2024

szeiger Jan 9, 2024

Static Object Optimizations #197

Static Object Optimizations #197

Conversation

ahirreddy commented Jan 4, 2024 • edited

szeiger left a comment

Choose a reason for hiding this comment

szeiger Jan 8, 2024

Choose a reason for hiding this comment

ahirreddy Jan 8, 2024

Choose a reason for hiding this comment

szeiger Jan 8, 2024

Choose a reason for hiding this comment

ahirreddy Jan 8, 2024

Choose a reason for hiding this comment

szeiger Jan 8, 2024

Choose a reason for hiding this comment

ahirreddy Jan 8, 2024

Choose a reason for hiding this comment

ahirreddy commented Jan 9, 2024

szeiger Jan 9, 2024

Choose a reason for hiding this comment

ahirreddy commented Jan 4, 2024 •

edited