
Fix #5 by using as many BigInt in-place ops as possible to avoid allocations #20

Merged — 4 commits merged into master from jq/floatperf on Mar 8, 2019

Conversation

quinnj (Member) commented Mar 7, 2019

Fix #5 by using as many BigInt in-place ops as possible to avoid allocations.

codecov-io commented Mar 7, 2019

Codecov Report

Merging #20 into master will increase coverage by 8.72%.
The diff coverage is 95%.


@@            Coverage Diff             @@
##           master      #20      +/-   ##
==========================================
+ Coverage   75.29%   84.02%   +8.72%     
==========================================
  Files           5        5              
  Lines         757      726      -31     
==========================================
+ Hits          570      610      +40     
+ Misses        187      116      -71
Impacted Files Coverage Δ
src/Parsers.jl 80.06% <100%> (+5.72%) ⬆️
src/floats.jl 88.31% <90%> (+11.03%) ⬆️
src/dates.jl 95.45% <0%> (+4.15%) ⬆️
src/tries.jl 85.71% <0%> (+10.71%) ⬆️
src/strings.jl 85.1% <0%> (+11.93%) ⬆️

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 53a41c3...bb0931e.

codecov-io commented

Codecov Report

Merging #20 into master will increase coverage by 0.15%.
The diff coverage is 88.46%.


@@            Coverage Diff             @@
##           master      #20      +/-   ##
==========================================
+ Coverage   75.29%   75.45%   +0.15%     
==========================================
  Files           5        5              
  Lines         757      774      +17     
==========================================
+ Hits          570      584      +14     
- Misses        187      190       +3
Impacted Files Coverage Δ
src/floats.jl 77.77% <88.46%> (+0.5%) ⬆️

Last update 53a41c3...7bc0567.

quinnj (Member, Author) commented Mar 7, 2019

@simonbyrne, just pinging you here if you wouldn't mind taking a look; I know we've talked strategy here a bit and I think this is looking pretty good. Here's a taste of the perf improvements by pre-allocating a few BigInts and doing in-place operations:

Before:

julia> run(@benchmarkable Parsers.defaultparser(io, r) setup=(io = IOBuffer("0.0017138347201173243"); r = Parsers.Result($T)))
BenchmarkTools.Trial:
  memory estimate:  408 bytes
  allocs estimate:  24
  --------------
  minimum time:     1.098 μs (0.00% GC)
  median time:      1.277 μs (0.00% GC)
  mean time:        1.363 μs (0.00% GC)
  maximum time:     29.360 μs (0.00% GC)
  --------------
  samples:          10000
  evals/sample:     1

After:

julia> run(@benchmarkable Parsers.defaultparser(io, r) setup=(io = IOBuffer("0.0017138347201173243"); r = Parsers.Result($T)))
BenchmarkTools.Trial:
  memory estimate:  0 bytes
  allocs estimate:  0
  --------------
  minimum time:     216.000 ns (0.00% GC)
  median time:      232.000 ns (0.00% GC)
  mean time:        244.993 ns (0.00% GC)
  maximum time:     36.176 μs (0.00% GC)
  --------------
  samples:          10000
  evals/sample:     1

And compared to Base.parse:

julia> @benchmark parse(Float64, "0.0017138347201173243")
BenchmarkTools.Trial:
  memory estimate:  0 bytes
  allocs estimate:  0
  --------------
  minimum time:     299.788 ns (0.00% GC)
  median time:      310.359 ns (0.00% GC)
  mean time:        329.445 ns (0.00% GC)
  maximum time:     928.122 ns (0.00% GC)
  --------------
  samples:          10000
  evals/sample:     245
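
For readers less familiar with the trick, here is a minimal, hypothetical sketch of the pattern this relies on: allocate a BigInt scratch value once, then route hot-path arithmetic through the in-place wrappers in Base.GMP.MPZ so that no new BigInt is created per call. The function names below are illustrative only and are not taken from the PR.

using Base.GMP: MPZ

# Allocating version: each call builds fresh BigInt temporaries.
accum(acc::BigInt, base::BigInt, digit::BigInt) = acc * base + digit

# In-place version: results are written back into `acc`, so a digit-by-digit
# accumulation loop performs no BigInt allocations at all.
function accum!(acc::BigInt, base::BigInt, digit::BigInt)
    MPZ.mul!(acc, base)     # acc *= base
    MPZ.add!(acc, digit)    # acc += digit
    return acc
end

The trade-off is that results now alias shared, mutable state, which is exactly what the multithreading discussion below is about.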

Review comment on src/floats.jl (outdated):
const QUO = BigInt()
const REM = BigInt()
const SCL = BigInt()

const BIG_E = UInt8('E')
const LITTLE_E = UInt8('e')

const bipows5 = [big(5)^x for x = 0:325]

function roundQuotient(num, den)


I assume this is doing something like round(num/den), but round-to-nearest?
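
For context, such a round-to-nearest (ties-to-even) quotient can be written against the same preallocated scratch values. The following is only a sketch of what a helper like that might look like, assuming non-negative operands; it is not necessarily the body of roundQuotient in this PR:

using Base.GMP: MPZ

const QUO = BigInt()   # scratch quotient (mirrors the constant in the snippet above)
const REM = BigInt()   # scratch remainder

function round_quotient_sketch(num::BigInt, den::BigInt)
    MPZ.tdiv_qr!(QUO, REM, num, den)   # QUO = trunc(num/den), REM = num - QUO*den
    q = Int64(QUO)                     # assumes the quotient fits in Int64
    MPZ.mul_2exp!(REM, 1)              # REM <<= 1, i.e. 2 * remainder
    c = cmp(REM, den)                  # compare 2*remainder against the divisor
    return (c < 0 || (c == 0 && iseven(q))) ? q : q + 1
end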

simonbyrne commented

Nice. It generally looks okay. The main problem is handling multithreading.

For worst-case test values, see https://www.icir.org/vern/papers/testbase-report.pdf

Review comment on src/floats.jl (outdated):
  bex = bitlength(num) - significantbits(T)
  bex <= 0 && return ldexp(T(num), exp)
- quo = roundQuotient(num, big(1) << bex)
+ MPZ.mul_2exp!(MPZ.set_si!(ONE, 1), bex)
+ quo = roundQuotient(num, ONE)


Just so I understand: this is essentially implementing Float64(x::BigInt)?

Man, we need to improve the performance of the one in Base.
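
To spell out the change under review: MPZ.set_si! resets a preallocated BigInt to a small value and MPZ.mul_2exp! shifts it left in place, so together they reproduce the value of the allocating expression big(1) << bex without creating a new BigInt. A small self-contained check (the ONE constant here simply mirrors the one used in the diff):

using Base.GMP: MPZ

const ONE = BigInt()   # preallocated scratch

bex = 40
MPZ.mul_2exp!(MPZ.set_si!(ONE, 1), bex)   # ONE = 1, then ONE <<= bex, in place
@assert ONE == big(1) << bex              # same value as the allocating form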


quinnj (Member, Author) commented Mar 8, 2019

@simonbyrne thanks for taking a look; good call on the multithreaded case, we should be good now. I also added a bunch of tests from the paper you linked; great stuff!
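
For readers wondering what "the multithreaded case" refers to: a single set of global scratch BigInts (QUO, REM, SCL, ONE) would be mutated concurrently if parsing runs on multiple threads. One common remedy, sketched below as an illustration rather than as the fix actually adopted in this PR, is to keep one set of scratch values per thread:

# One scratch pair per thread; a task uses the pair belonging to the thread
# it runs on. This assumes the task does not migrate threads mid-parse.
const QUOS = [BigInt() for _ in 1:Threads.nthreads()]
const REMS = [BigInt() for _ in 1:Threads.nthreads()]

@inline function scratch_pair()
    tid = Threads.threadid()
    return QUOS[tid], REMS[tid]
end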

@quinnj quinnj merged commit ab8ef26 into master Mar 8, 2019
@quinnj quinnj deleted the jq/floatperf branch March 8, 2019 22:57