Allow to turn off stacktrace bookkeeping #96

yonromai · 2019-04-24T20:02:07Z

Hi,

The stack = traceback.extract_stack()[:-1] instruction makes inserts too slow for my use case.

This PR adds a flag to disable this behavior if needed.

According to some basic testing, it speeds up inserts by ~20x:

Test code:

from sqlitedict import SqliteDict

def insert(outer_stack):
    d = SqliteDict(outer_stack=outer_stack)
    for i in range(10000):
        d["key_{}".format(i)] = "value_{}".format(i)
    d.close()

%%timeit
insert(outer_stack=True)

=> 2.68 s ± 42.8 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

%%timeit
insert(outer_stack=False)

=> 121 ms ± 3.04 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)
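To separate the cost of the stack capture from SQLite itself, the expression quoted above can be timed on its own. This is only a minimal sketch: `capture_stack`, `no_capture` and `time_per_call` are hypothetical helper names, not part of sqlitedict, and the absolute numbers depend on call depth and on the machine.

```python
import timeit
import traceback


def capture_stack():
    # The expression quoted in the PR description: the current stack,
    # minus the frame of this helper itself.
    return traceback.extract_stack()[:-1]


def no_capture():
    # Baseline: a call of the same shape that records nothing.
    return None


def time_per_call(func, number=1000):
    # Average wall-clock seconds per call over `number` invocations.
    return timeit.timeit(func, number=number) / number


if __name__ == "__main__":
    print("extract_stack: {:.2e} s/call".format(time_per_call(capture_stack)))
    print("no capture:    {:.2e} s/call".format(time_per_call(no_capture)))
```

Comparing the two figures gives a rough per-insert overhead for the bookkeeping alone, independent of the database write.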

piskvorky · 2019-04-25T07:00:52Z

> makes inserts too slow for my use case

Can you post some benchmarks? before / after your PR.

Intuitively, I'd expect debugging features that significantly slow down normal execution to be an opt-in, not opt-out. But if the difference is "slight", as per PR #28, it's OK as opt-out for power-users. So the benchmark numbers matter. CC @jquast .

yonromai · 2019-04-25T14:29:16Z

@mpenkov Thanks for the review - Amended the commit based on your comments, PTAL

yonromai · 2019-04-25T14:36:28Z

@piskvorky Although I cannot share my code (not open source), I observed an improvement similar to the one shown above: about 23x faster with the change than without it. I didn't measure the variance in the real use case, unlike in the benchmark above.

FWIW I am evaluating this map compared to a leveldb index and sqlitedict is still ~4x slower than leveldb in my use case.

piskvorky · 2019-04-25T18:42:48Z

Yeah, that wouldn't surprise me. Sqlitedict aims to be a simple-to-use wrapper for SQLite; speed is not one of our primary objectives.

yonromai · 2019-04-26T17:49:48Z

@piskvorky LMK if you think I should make this the default behavior

piskvorky · 2019-04-26T18:02:39Z

@yonromai I'm not very familiar with this part of the code. Can you summarize the pros/cons?

What would we (the users, the developers) gain / lose by making it the default?

yonromai · 2019-04-26T18:15:46Z

Pro: Inserts are 23x faster.
Con: If an error occurs, the stack trace will mostly get swallowed, and 'No outer stack to display. Enable it using outer_stack=True' will be displayed in place of the stacktrace.
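The trade-off can be sketched with a toy class that records the submitting stack only when asked to. This is an illustration under assumptions, not sqlitedict's code: `Recorder`, `submit`, `run_all` and `boom` are hypothetical names. Only the `outer_stack` flag and the fallback message are taken from this thread.

```python
import traceback


class Recorder:
    """Hypothetical stand-in for a deferred writer; not sqlitedict's class."""

    def __init__(self, outer_stack=True):
        self._outer_stack = outer_stack
        self._pending = []

    def submit(self, func):
        # Capture the caller's stack only when the flag is on; this is the
        # expensive step that the PR makes optional.
        stack = traceback.extract_stack()[:-1] if self._outer_stack else None
        self._pending.append((func, stack))

    def run_all(self):
        """Run queued callables and return one report string per failure."""
        reports = []
        for func, stack in self._pending:
            try:
                func()
            except Exception as exc:
                if stack is not None:
                    origin = "".join(traceback.format_list(stack))
                else:
                    origin = ("No outer stack to display. "
                              "Enable it using outer_stack=True\n")
                reports.append("{!r} was submitted from:\n{}".format(exc, origin))
        self._pending = []
        return reports


def boom():
    raise ValueError("bad insert")


if __name__ == "__main__":
    for flag in (True, False):
        recorder = Recorder(outer_stack=flag)
        recorder.submit(boom)
        print(recorder.run_all()[0])
```

With the flag on, the report points at the line that called `submit`. With it off, only the exception and the fallback message remain, which is the "con" described above.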

piskvorky · 2019-04-26T19:22:19Z

I'm not sure. I'm leaning slightly toward more intelligible messages… but damn, 23x is a lot! How often / under what conditions can these errors happen? In our code, in user code…?

@mpenkov WDYT?

> 'No outer stack to display. Enable it using outer_stack=True' will be displayed

That message is too vague: where should I (as a user) enter this outer_stack=True? Can you please make it more specific and actionable.

mpenkov · 2019-06-05T10:05:34Z

@yonromai why did you close this PR?

yonromai · 2019-06-05T14:45:07Z

@mpenkov I closed the PR because it's been stale and I'm trying to keep my github.com/pulls clean since I use it a lot.

I kept the fork alive in case you want to cherry pick the changes.

mpenkov · 2019-06-05T15:13:09Z

Are you able to complete the PR? You received feedback from @piskvorky. From what I can see:

- Document the use case (most importantly, the source of exceptions)
- Improve the error message

yonromai · 2019-06-05T16:11:53Z

@mpenkov Sorry but I don't have much more bandwidth to dedicate to this (we decided to not use sqlitedict).

mpenkov · 2019-06-06T01:31:27Z

OK, that's fair enough. Thanks for the changes in your PR. We'll take it from here.

purplesyringa · 2022-02-21T17:16:35Z

Has there been any progress on this PR? traceback.extract_stack() stat(2)'s all Python source files in the traceback--making more syscalls than sqlite itself. No wonder this is so slow.
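One way to keep some bookkeeping while avoiding file-system access at insert time is to walk the frame objects directly and defer any source lookup until an error is actually reported. The sketch below is an assumption about how that could look, not something proposed in this PR: `cheap_stack` and `format_cheap_stack` are hypothetical names, and `sys._getframe` is a CPython implementation detail that other interpreters may not provide.

```python
import sys
import traceback


def cheap_stack(skip=1):
    """Return [(filename, lineno, function), ...], outermost frame first.

    Reads only attributes already held in memory on the frame objects, so
    the capture itself does not open or stat any source file.
    """
    frame = sys._getframe(skip)  # skip=1 starts at the caller of cheap_stack
    entries = []
    while frame is not None:
        code = frame.f_code
        entries.append((code.co_filename, frame.f_lineno, code.co_name))
        frame = frame.f_back
    entries.reverse()
    return entries


def format_cheap_stack(entries):
    # Source lines are looked up only here, when a report is being built.
    frames = [traceback.FrameSummary(filename, lineno, name)
              for filename, lineno, name in entries]
    return "".join(traceback.format_list(frames))
```

The capture then costs a handful of attribute reads per frame, and the formatting cost is paid only on the error path.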

mpenkov · 2022-02-22T00:56:34Z

@imachug No. Are you interested in fixing this?

purplesyringa · 2022-02-22T11:18:19Z

I thought this PR is more or less ready to merge, except maybe

> That message is too vague: where should I (as a user) enter this outer_stack=True? Can you please make it more specific and actionable.

Do you have any other issues with this?

mpenkov · 2022-02-22T12:31:57Z

No, TBH, I think that's the only thing left. Let me go clean up the error message and we can merge this.

piskvorky · 2022-02-22T12:50:09Z

sqlitedict.py

@@ -140,6 +141,8 @@ def __init__(self, filename=None, tablename='unnamed', flag='c',
        object.
        The default is to use pickle.

+        If you disable `outer_stack`, the stacktrace at insert time won't be saved. This


More context / info please. As a user, what is "the stacktrace at insert time", what are the implications of "not saving" it? Why would I want / not want that?

mpenkov reviewed Apr 25, 2019

mpenkov requested a review from piskvorky April 25, 2019 06:34

allow to turn off stacktrace bookkeeping

d6dd45b

yonromai closed this Jun 3, 2019

mpenkov self-assigned this Feb 22, 2022

mpenkov added this to the 1.8.0 milestone Feb 22, 2022

piskvorky reviewed Feb 22, 2022

mpenkov mentioned this pull request Feb 25, 2022

make outer_stack a parameter #148

Merged
