Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

What's a data drier? #365

Open
pointyointment opened this issue Feb 17, 2019 · 1 comment
Open

What's a data drier? #365

pointyointment opened this issue Feb 17, 2019 · 1 comment

Comments

@pointyointment
Copy link

I would like to suggest that the readme should briefly explain what a "data drier" is. I have no idea, and got no relevant results from Google for that term (or "data dryer").

@naxxfish
Copy link
Contributor

naxxfish commented Feb 18, 2019

It's this: https://github.com/southlondonmakerspace/membership-dryer (which you probably can't see).

tl;dr: this quote mostly sums it up

This software is to give a "dry" environment for testing the South London Makerspace membership database, which is a Node JS web-based system which integrates with Discourse, a discussion web system.

We have a pair of exporting programs (exportmembership and exportdiscourse) which export data on the live system, suitable for backups. However, this data is considered confidential as it contains for example names, home addresses, and phone numbers.

We therefore have a "dryer" or "anonymiser" which takes those datasets and produces isomorphic data which is identical in structure but without personal information. There are two programs (drymembership and drydiscourse) which dry the two datasets. Drying the discourse data requires access to the raw membership data, in order to get matching information.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants