Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Anonymized dump #364

Closed
carlosmuller opened this issue Jun 18, 2021 · 9 comments · Fixed by #427 or #590
Closed

Anonymized dump #364

carlosmuller opened this issue Jun 18, 2021 · 9 comments · Fixed by #427 or #590

Comments

@carlosmuller
Copy link
Contributor

Hi thanks for the super software!
What do you guys think about a option to anonymze some columns and tables?

@davidducos
Copy link
Member

Hi @carlosmuller,
Interesting... in mydumper (export stage)? myloader (import stage)? both? How we determine the columns? filtering options? hooks?

@carlosmuller
Copy link
Contributor Author

I am thinking in the export stage so the real data never "escape" but may be a little difficult.
I was thing in some config file with tables and columns an some options like auto generate, empty, and other generators, but for initial release could be random_unique, random, empty and fixed, something like this:

User:
   name: random
   username: random_unique
   document: empty
   createdAt: '2021-06-18'

@davidducos
Copy link
Member

We could do something similar of what I'm doing in https://github.com/maxbube/mydumper/tree/config_file
The file content will be like:

[user]
name = random
username = random_unique
document = empty
createdAt = '2021-06-18'

@davidducos davidducos added this to the Release 0.10.11 milestone Jun 22, 2021
@druud
Copy link

druud commented Aug 5, 2021

See also #84, nr 6.

@davidducos davidducos linked a pull request Oct 8, 2021 that will close this issue
@davidducos
Copy link
Member

Phase II:
We need to create the function to load the tables from a configuration file.

@davidducos
Copy link
Member

We could load the configuration from the sections of the default file .my.cnf but only the sections that has a . (dot) as the format will be [<schema_name>.<table_name>]. Complete match will be mandatory to identify the table. Then, each key will be the column name and the value the function name.

@davidducos
Copy link
Member

We need to merge/complete #440 before continue with this

@davidducos
Copy link
Member

@carlosmuller another phase has been completed.

@carlosmuller
Copy link
Contributor Author

@davidducos thanks will take a look as soon I have time

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment