Fix batch processing #26
Hi, I tested your changes against a database of ~1.6 GB whose tables hold at most ~370k entries each. The anonymization breaks at some point with an ... I am running the anonymization with Python 3.9 in a virtualenv on Ubuntu Linux 21.04 with 24 GB of RAM.

No, I'm not getting such issues. I assume everything works fine without the proposed changes?

Yes, as soon as I switch to the development branch the anonymization works fine and completes after a couple of minutes. I haven't taken a deeper look into the ...

I'll fix this issue.

Thanks. I can support you with additional debug / system information if that helps you.

Hm, I can't reproduce it on my test DB.

Superseded by #27

Sure, I will check it with PR 27. Thanks!
This PR fixes an issue with processing huge tables. It calls `import_data` inside the loop that calls `fetchmany`, and clears `data` and the StringIO buffers after each iteration.

Depends on #25
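
The batching pattern described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the code from this PR: it uses the standard library's `sqlite3` so it runs without a PostgreSQL server, the `import_data` here is a hypothetical stand-in that just collects rows into a list, and `process_in_batches`, `sink`, and `chunk_size` are names made up for the example.

```python
import csv
import sqlite3
from io import StringIO


def import_data(buffer, sink):
    """Hypothetical stand-in for the project's import_data: consume one batch."""
    buffer.seek(0)
    sink.extend(csv.reader(buffer))


def process_in_batches(cursor, sink, chunk_size=2):
    """Fetch rows with fetchmany and import each batch before fetching the next."""
    batches = 0
    while True:
        rows = cursor.fetchmany(chunk_size)
        if not rows:
            break
        # A fresh buffer per batch, so memory use is bounded by chunk_size
        # rather than by the size of the table.
        data = StringIO()
        csv.writer(data).writerows(rows)
        import_data(data, sink)
        data.close()  # release the buffer after each iteration
        batches += 1
    return batches


def demo():
    """Run the loop against a small in-memory table (assumed sample data)."""
    connection = sqlite3.connect(":memory:")
    connection.execute("CREATE TABLE users (id INTEGER, email TEXT)")
    connection.executemany(
        "INSERT INTO users VALUES (?, ?)",
        [(i, f"user{i}@example.com") for i in range(5)],
    )
    cursor = connection.execute("SELECT id, email FROM users ORDER BY id")
    sink = []
    batches = process_in_batches(cursor, sink, chunk_size=2)
    connection.close()
    return batches, sink
```

Calling `demo()` processes the five sample rows in batches of at most two. The point of the design is that the import happens inside the fetch loop, so only one batch and one buffer are alive at a time.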