Question will be deleted from Stackoverflow by the Community Bot
I have a large Postgres database. The SQL dump has a size of 59 GiB and contains hundreds of millions of entries (Etherpad). How can I split the database by padIDs, groupIDs, etc.?
Postgres is very slow and needs 4 seconds per request, and tools like ripgrep need more than 10 hours to print all searched entries. Are there any ideas on how I can fix this problem? Any other tools or workarounds?
Current workaround
I have a list with all needed pad and group IDs. Then I merge all IDs into a regex string (
Now ripgrep comes with:
Technical setup
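For the workaround described above, one alternative to a multi-megabyte regex is to keep the wanted IDs in a hash set and test each dump line against it. The following is only a minimal sketch under stated assumptions, not a tested solution for this dump: it assumes each record sits on one line and carries a key of the form pad:<padID> or group:<groupID> (optionally followed by suffixes such as :revs:0), and the names KEY_RE, filter_lines and filter_dump are hypothetical.

```python
import re
from typing import Iterable, Iterator, Set

# Assumption: keys look like "pad:<id>", "pad:<id>:revs:12" or "group:<id>".
# Adjust the pattern to whatever the real dump contains.
KEY_RE = re.compile(r"\b(?:pad|group):([^:'\"\s\\]+)")


def filter_lines(lines: Iterable[str], wanted_ids: Set[str]) -> Iterator[str]:
    """Yield only the lines whose first pad or group ID is in wanted_ids."""
    for line in lines:
        match = KEY_RE.search(line)
        # Set membership is a constant-time lookup on average, so the cost
        # per line does not grow with the number of wanted IDs.
        if match and match.group(1) in wanted_ids:
            yield line


def filter_dump(dump_path: str, out_path: str, wanted_ids: Set[str]) -> int:
    """Stream dump_path line by line and copy matching lines to out_path."""
    kept = 0
    with open(dump_path, encoding="utf-8", errors="replace") as src, \
            open(out_path, "w", encoding="utf-8") as dst:
        for line in filter_lines(src, wanted_ids):
            dst.write(line)
            kept += 1
    return kept
```

Because the file is streamed, memory use is bounded by the ID set plus one line, regardless of the dump size. Whether this beats ripgrep on the real data would have to be measured.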
Comments
You could try increasing --dfa-size-limit.
Also, ten hours to just run a simple query on 59GB sounds too long to me personally. So something seems a little fishy here personally. But hard to say anything specific without a way to try it myself.
(I'm the author of ripgrep btw. Full disclosure.)
I voted to reopen this. This question could use a little more detail, but it is absolutely on topic!
Or next time, just ask a question using GitHub Discussions on the ripgrep repo.
The OOM killer factoid indeed indicates that your regex is just too massive. So I think my previous statement about you having unreasonable expectations holds. :)
In terms of details, I would want a way to reproduce it myself. So you would likely need to find a way to reproduce the problem on data you can share, e.g., by scrubbing the data so that it doesn't contain anything you don't want to share.
Comments
You could try increasing --dfa-size-limit to something very large, but given the size of your regex, you'll almost certainly have a bad time. A >2MB regex is huge, and your expectations are not particularly reasonable. The best case is building a DFA from that, which is what increasing the DFA size limit will hopefully do. –
BurntSushi5
Mar 14 at 22:43
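As a rough illustration of the suggestion above, the sketch below writes the IDs to a pattern file and assembles a ripgrep invocation with a raised --dfa-size-limit. Everything beyond that flag is an assumption added for illustration: the helper names are hypothetical, the "1G" value is arbitrary, and passing the IDs as literals via --fixed-strings and --file is a swapped-in idea that the comment does not mention. Whether the DFA limit still matters once the patterns are literals is something to measure rather than assume.

```python
import subprocess
from typing import Iterable, List


def write_patterns(ids: Iterable[str], pattern_path: str) -> None:
    """Write one ID per line, the layout that rg --file reads."""
    with open(pattern_path, "w", encoding="utf-8") as fh:
        for item in ids:
            fh.write(item + "\n")


def build_rg_command(pattern_path: str, dump_path: str,
                     dfa_size_limit: str = "1G",
                     fixed_strings: bool = True) -> List[str]:
    """Assemble the ripgrep argument list without running anything."""
    cmd = ["rg", "--dfa-size-limit", dfa_size_limit]
    if fixed_strings:
        # Treat every pattern as a literal string instead of a regex.
        cmd.append("--fixed-strings")
    cmd += ["--file", pattern_path, dump_path]
    return cmd


def run_rg(cmd: List[str], out_path: str) -> int:
    """Run ripgrep, sending matches to out_path; returns the exit code.

    ripgrep exits with 0 on a match, 1 on no match and 2 on error.
    """
    with open(out_path, "wb") as out:
        return subprocess.run(cmd, stdout=out, check=False).returncode
```

Keeping the IDs in a file also avoids passing a multi-megabyte pattern on the command line.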
Also, ten hours to just run a simple query on 59GB sounds too long to me personally. So something seems a little fishy here personally. But hard to say anything specific without a way to try it myself. –
BurntSushi5
Mar 14 at 22:44
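To put the "ten hours" remark in numbers, a back-of-the-envelope calculation gives the implied scan rate. This is only arithmetic on the figures quoted in the question (59 GiB, 10 hours); the function name is hypothetical.

```python
GIB = 1024 ** 3
MIB = 1024 ** 2


def throughput_mib_per_s(size_gib: float, hours: float) -> float:
    """Average scan rate in MiB/s for size_gib processed in the given hours."""
    return size_gib * GIB / MIB / (hours * 3600)


rate = throughput_mib_per_s(59, 10)
print(f"{rate:.2f} MiB/s")  # prints "1.68 MiB/s"
```

A rate under 2 MiB/s is well below what sequential reads from ordinary storage usually deliver, which is consistent with the suspicion that the time goes into matching the huge regex rather than into reading the file.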
(I'm the author of ripgrep btw. Full disclosure.) –
BurntSushi5
Mar 14 at 22:44
I voted to reopen thi…