Remove INLINE Pragma on indices #219

Tarmean · 2018-02-21T02:09:10Z

For context, here is a recent Reddit discussion about a program that runs faster with optimizations disabled.

This is caused by an interaction between the INLINE pragma on indices and the INLINE [1] pragma on isInfixOf. For some reason this combination keeps buildTable 0 0 (nlen-2) from being floated out. As a result, buildTable is run in every iteration of scan, which makes for some impressively slow searches.

There are other ways to solve this issue, but indices doesn't participate in fusion anyway, and removing the pragma seems like it would impact code readability the least.

This fixes an interaction with {-# INLINE [1] isInfixOf #-} that made buildTable run once for each scan iteration
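To make the floating issue described above concrete, here is a minimal sketch of the two shapes the search loop can end up in. It is not the code from the text package: buildSkipTable, indicesShared and indicesRebuilt are hypothetical names, and the Horspool-style table over String is an assumption chosen only to keep the example short. Whether GHC floats the table out of the second version depends on the optimization level and the surrounding pragmas.

```haskell
import qualified Data.Map.Strict as M

-- Hypothetical stand-in for the table built from the needle: for every
-- character except the last one, the distance from its last occurrence
-- to the end of the needle (a Horspool-style skip table).
buildSkipTable :: String -> M.Map Char Int
buildSkipTable needle = M.fromList (zip (init needle) [n - 1, n - 2 .. 1])
  where
    n = length needle

-- Desired shape: the table is bound once, outside the scan loop,
-- and shared by every iteration.
indicesShared :: String -> String -> [Int]
indicesShared needle haystack
  | null needle = []
  | otherwise = scan 0
  where
    n = length needle
    h = length haystack
    table = buildSkipTable needle
    scan i
      | i + n > h = []
      | window == needle = i : scan (i + 1)
      | otherwise = scan (i + M.findWithDefault n (last window) table)
      where
        window = take n (drop i haystack)

-- Problematic shape: the table expression sits inside the loop, so unless
-- the compiler floats it out, it is rebuilt on every mismatching step.
indicesRebuilt :: String -> String -> [Int]
indicesRebuilt needle haystack
  | null needle = []
  | otherwise = scan 0
  where
    n = length needle
    h = length haystack
    scan i
      | i + n > h = []
      | window == needle = i : scan (i + 1)
      | otherwise =
          scan (i + M.findWithDefault n (last window) (buildSkipTable needle))
      where
        window = take n (drop i haystack)
```

Both functions return the same (possibly overlapping) match positions; they differ only in how often the table is constructed, which is the cost this pull request is about.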

hvr · 2018-02-21T13:09:24Z

I'd like @nomeata to give this a look, as he has also recently been looking at the fusion framework rules...

nomeata

This is independent of fusion, right?

But if leaving the inlining decision to GHC yields better performance, then that’s great of course.

@Tarmean, did you run benchmarks to test this?

Tarmean · 2018-02-21T23:51:41Z

...I really should have done that before opening the pull request, sorry. I am running into some problems when trying to run the benchmarks, though:

8.0.2:

 text/benchmarks > dist/build/text-benchmarks/text-benchmarks
[1]    5174 segmentation fault (core dumped)  dist/build/text-benchmarks/text-benchmarks

8.2.2:

text-benchmarks: internal error: Unable to commit 1214251008 bytes of memory
    (GHC version 8.2.2 for x86_64_unknown_linux)
    Please report this as a GHC bug:  http://www.haskell.org/ghc/reportabug
[1]    5201 abort (core dumped)  .stack-work/install/x86_64-linux-nopie/lts-10.6/8.2.2/bin/text-benchmarks

hvr · 2018-02-22T19:01:07Z

@Tarmean Did the benchmark executable fail right away, or did it produce more output than you showed us? I can reproduce neither the segfault nor the GHC panic with a text-benchmarks executable built using cabal new-build (with a GHC distro specifically built for Ubuntu). I suspect that either your GHC installation has a problem, there is some package-db corruption, or you're running this on a memory-scarce machine.

Tarmean · 2018-02-22T22:05:22Z

The executable fails right away. I tried nuking everything GHC-related and reinstalling, but that didn't fix it.

The issue does seem to be memory-related: the error changes when I specify a maximum heap size with -M. How much memory are the benchmarks supposed to use?

  ~/Projects/text/benchmarks  ➦ ab90c65 ✚  ~/Projects/text/benchmarks/.stack-work/install/x86_64-linux-nopie/lts-10.6/8.2.2/bin/text-benchmarks +RTS -M9G
text-benchmarks: Heap exhausted;
text-benchmarks: Current maximum heap size is 9663676416 bytes (9216 MB).
text-benchmarks: Use `+RTS -M<size>' to increase it.

hvr · 2018-02-22T23:40:29Z

@Tarmean OK, now that you mention it, I see it too... I have a machine with 32 GiB of RAM and didn't realise how much memory is used shortly after startup (though it seems to get GC'd quickly). In any case, the lowest value I was able to get it working with was +RTS -M20G. I need to look into when this regressed, as I don't think the benchmark suite always had such a huge memory spike.

PS: #204 is related

sjakobi · 2021-03-17T00:04:18Z

I've frequently seen buildTable in the midst of Core dumps from pandoc. The indices code seemed to significantly contribute to pandoc's terrible compile times (https://gitlab.haskell.org/ghc/ghc/-/issues/18010). At least I've been able to reduce the compile times substantially by actively preventing splitOn (which uses indices) from being inlined on overloaded string literals. See jgm/doclayout#1 and jgm/doclayout#2.

I strongly suspect that the terrible string matching performance observed in haskell/bytestring#307 (comment) is also related to this issue.
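As an illustration of the workaround mentioned above, one way to keep splitOn from being inlined at every call site is to route calls through a wrapper marked NOINLINE. This is only a sketch under assumptions: splitOnNoInline is a hypothetical name, and it is not necessarily how the doclayout changes were implemented.

```haskell
{-# LANGUAGE OverloadedStrings #-}

import Data.Text (Text)
import qualified Data.Text as T

-- Hypothetical wrapper: call sites see only this opaque function, so any
-- inlining of splitOn (and the indices code behind it) happens at most
-- once, here, instead of once per call site.
splitOnNoInline :: Text -> Text -> [Text]
splitOnNoInline = T.splitOn
{-# NOINLINE splitOnNoInline #-}

-- Example call with overloaded string literals.
exampleFields :: [Text]
exampleFields = splitOnNoInline "," "a,b,c"
```

The trade-off is that call sites lose any benefit from specialising on a known separator, in exchange for less generated Core per use.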

sjakobi · 2021-03-17T14:47:41Z

> This is caused by an interaction between the INLINE pragma on indices and the INLINE [1] pragma on isInfixOf. For some reason this combination keeps buildTable 0 0 (nlen-2) from being floated out.

It might be good to report this on GHC's issue tracker. There might be a compiler bug hiding here.

Tarmean · 2021-03-17T23:02:00Z

I still think that removing the INLINE pragma should improve performance in most cases. I'm no longer at all certain that the nested INLINE pragmas are at fault, though.

For some reason, the interaction between the bang pattern on buildTable and the INLINE pragma does seem to prevent buildTable from being floated out when indices is called in a loop. But buildTable also isn't floated out when indices isn't inlined, so that doesn't explain the weird performance regression at -O2.

I'll check tomorrow whether this can still be reproduced with profiling builds.

Remove INLINE Pragma on indices

1e2c44c

This fixes an interaction with {-# INLINE [1] isInfixOf #-} that made buildTable run once for each scan iteration

hvr requested a review from nomeata February 21, 2018 13:07

nomeata reviewed Feb 21, 2018


Lysxia added the internal No API-level changes label Mar 7, 2021

Bodigrim marked this pull request as draft February 28, 2023 18:54
