
Improve encoding/decoding speed #1500

Merged: 5 commits into master on Nov 1, 2019

Conversation


@Gabriella439 commented on Oct 31, 2019


... by not going through a `Term` intermediate

This gives a ~28% performance improvement for decoding, which means that
cache lookups are now faster.

Here are the new decoding benchmarks before and after this change:

Before:

```
benchmarked Issue #108/Binary
time                 266.5 μs   (265.7 μs .. 267.4 μs)
                     1.000 R²   (1.000 R² .. 1.000 R²)
mean                 266.3 μs   (265.6 μs .. 267.1 μs)
std dev              2.418 μs   (1.891 μs .. 3.436 μs)

benchmarking Kubernetes/Binary ... took 36.94 s, total 56 iterations
benchmarked Kubernetes/Binary
time                 641.3 ms   (623.0 ms .. 655.4 ms)
                     0.999 R²   (0.997 R² .. 1.000 R²)
mean                 679.7 ms   (665.5 ms .. 702.6 ms)
std dev              29.48 ms   (14.15 ms .. 39.05 ms)
```

After:

```
benchmarked Issue #108/Binary
time                 282.2 μs   (279.6 μs .. 284.7 μs)
                     1.000 R²   (0.999 R² .. 1.000 R²)
mean                 281.9 μs   (280.7 μs .. 287.7 μs)
std dev              7.089 μs   (2.550 μs .. 15.44 μs)
variance introduced by outliers: 11% (moderately inflated)

benchmarking Kubernetes/Binary ... took 27.57 s, total 56 iterations
benchmarked Kubernetes/Binary
time                 499.1 ms   (488.1 ms .. 506.6 ms)
                     0.999 R²   (0.998 R² .. 1.000 R²)
mean                 498.9 ms   (494.4 ms .. 503.9 ms)
std dev              8.539 ms   (6.236 ms .. 12.56 ms)
```

There's a slight performance regression for the decoding microbenchmark, but
in practice my testing on real examples matches performance improvements seen
in the larger benchmark based on an example cache product from
`dhall-kubernetes`.

Note that this is a breaking change because:

* There is no longer a `FromTerm` nor a `ToTerm` class.  Now we use the
  `Serialise` class, and `{encode,decode}Expression` work on `ByteString`s
  instead of `Term`s (see the sketch after this list)

* I further narrowed the types of several encoding/decoding utilities to expect
  `Void` for the first type parameter of `Expr`

* This is a regression with respect to stripping 55799 CBOR tags (CBOR's
  "self-describe" tag), mainly because properly handling the tags at every
  possible point in the syntax tree would considerably complicate the code
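
For migrators, here is a minimal sketch of what the new `Serialise`-based API implies: a round trip straight between expressions and bytes. The instance head (`Expr Void Import`) and the use of `Codec.Serialise` directly are my assumptions based on the description above, not code from this PR:

```haskell
import Codec.Serialise (DeserialiseFailure, deserialiseOrFail, serialise)
import Data.ByteString.Lazy (ByteString)
import Data.Void (Void)
import Dhall.Core (Expr, Import)

-- Encoding now goes straight from the expression to bytes; the first
-- type parameter of `Expr` is pinned to `Void`, and no `Term` value
-- is materialized in between.
encodeBytes :: Expr Void Import -> ByteString
encodeBytes = serialise

decodeBytes :: ByteString -> Either DeserialiseFailure (Expr Void Import)
decodeBytes = deserialiseOrFail
```
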
@sjakobi left a comment


The speedup is great, but it seems to me that especially the decoding got more low-level and a bit trickier. Would you mind checking that the acceptance test suite gives us good test coverage there? Otherwise I can do that too!


There's also a build failure in `dhall-lsp-server`:

```
C:\projects\dhall-haskell\dhall-lsp-server\src\Dhall\LSP\Backend\Dhall.hs:168:30: error:
    * Couldn't match type `Src' with `Void'
      Expected type: Expr Void Void
        Actual type: Expr Src Void
    * In the first argument of `Dhall.hashExpressionToCode', namely
        `alphaNormal'
      In the expression: Dhall.hashExpressionToCode alphaNormal
      In an equation for `hashNormalToCode':
          hashNormalToCode (Normal expr)
            = Dhall.hashExpressionToCode alphaNormal
            where
                alphaNormal = Dhall.alphaNormalize expr
    |
168 |   Dhall.hashExpressionToCode alphaNormal
    |                              ^^^^^^^^^^^
Command exited with code 1
```
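
For context, this error is the `Void`-narrowing bullet above biting `dhall-lsp-server`: `alphaNormal` still carries `Src` annotations. A hypothetical fix sketch using `Dhall.Core.denote` to erase them; whether the code was ultimately fixed this way is an assumption, and the `Normal` newtype's shape is inferred from the error message:

```haskell
import Data.Text (Text)
import Data.Void (Void)
import Dhall.Core (Expr, denote)
import Dhall.Parser (Src)

-- Assumes the relevant functions are in scope under the same qualified
-- alias used in the build log above.
import qualified Dhall.Core as Dhall
import qualified Dhall.Import as Dhall

-- Shape inferred from the error message.
newtype Normal = Normal (Expr Src Void)

hashNormalToCode :: Normal -> Text
hashNormalToCode (Normal expr) =
    Dhall.hashExpressionToCode (denote alphaNormal)
  where
    -- `alphaNormal :: Expr Src Void`, but the narrowed
    -- `hashExpressionToCode` wants `Expr Void Void`, so `denote`
    -- strips the source annotations first.
    alphaNormal = Dhall.alphaNormalize expr
```
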

`dhall/benchmark/parser/Main.hs` (review thread outdated, resolved)
```haskell
    return (BoolLit b)

TypeString -> do
    s <- Decoding.decodeString
```
@sjakobi:

Can you add a type annotation that shows that this is a `Text`? `decodeString` is pretty confusing! :/
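
Concretely, the ask amounts to making the result type visible near the call, along these lines. A sketch only: `Decoding` is assumed to be cborg's `Codec.CBOR.Decoding`, and `decodeLabel` is a hypothetical name:

```haskell
import Codec.CBOR.Decoding (Decoder)
import qualified Codec.CBOR.Decoding as Decoding
import Data.Text (Text)

-- A local signature documents that `decodeString` yields `Text`,
-- not a `String` or a `ByteString`.
decodeLabel :: Decoder s Text
decodeLabel = Decoding.decodeString
```
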

@Gabriella439:

There are a lot of `decodeString`s that would need to be changed if we did that

@sjakobi:

Right. Never mind then!

@Gabriella439:

@sjakobi: I did a code coverage check and the main things that are not exercised by the test suite are:

* Decoding the `IntegerToDouble`/`ListFold` built-ins
* Decoding large `Natural` numbers
* Encoding an empty list without a type annotation
* Decoding an integrity check
* Decoding an import beginning with `../` or `~/`
* Error messages

Only one of these concerns me: I believe the code is not correctly handling `Natural` numbers greater than `maxBound :: Word64`, which I'll fix
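
For background on why that case is easy to miss: CBOR stores integers that fit in 64 bits in a compact fixed-width form and anything larger as a tagged bignum, so a decoder that only handles the fixed-width case breaks on large `Natural`s. A minimal illustration of the split using cborg (my own sketch, not the PR's code):

```haskell
import Codec.CBOR.Encoding (Encoding, encodeInteger, encodeWord64)
import Data.Word (Word64)
import Numeric.Natural (Natural)

-- Values up to `maxBound :: Word64` fit CBOR's fixed-width integer
-- form; anything above needs the bignum encoding (CBOR tag 2), which
-- `encodeInteger` selects automatically.
encodeNatural :: Natural -> Encoding
encodeNatural n
    | n <= fromIntegral (maxBound :: Word64) = encodeWord64 (fromIntegral n)
    | otherwise                              = encodeInteger (toInteger n)
```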

@sjakobi commented on Oct 31, 2019

Thanks for checking! :) 👍

I can take care of adding test cases for the currently uncovered code to the test suite.

@sjakobi left a comment


👍


@Gabriella439 merged commit b843cae into master on Nov 1, 2019
@Gabriella439 deleted the `gabriel/fast_serialize_2` branch on November 1, 2019
sjakobi added a commit to dhall-lang/dhall-lang that referenced this pull request Nov 12, 2019
@Gabriel439 had discovered in
dhall-lang/dhall-haskell#1500
that we were missing some test coverage here.
sjakobi added a commit to dhall-lang/dhall-lang that referenced this pull request Nov 14, 2019
@Gabriel439 had discovered in
dhall-lang/dhall-haskell#1500
that we were missing some test coverage here.