A beast of burden tasked with masking the whereabouts of all who have walked before it.
When dealing with sensitive population data, it's best to scrub out names, locations, organizations, and other named entities. This data anonymizer parses text passed into its tusks and obfuscates any sensitive information it finds. As a result, a reader can understand the context and situation in which the strings were written without compromising the identity of the users.
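The scrubbing idea can be sketched roughly as follows. Note this is a simplified stand-in: it uses a hardcoded entity table instead of a real NER tagger, and the `scrub` function and placeholder labels are illustrative, not the project's actual API.

```python
import re

# Toy entity table standing in for NER output; the real tool
# detects entities with an NER tagger rather than a fixed list.
ENTITIES = {
    "Alice Smith": "PERSON",
    "Springfield": "LOCATION",
    "Acme Corp": "ORGANIZATION",
}

def scrub(text):
    """Replace each known entity with a generic placeholder tag."""
    for name, label in ENTITIES.items():
        text = re.sub(re.escape(name), f"[{label}]", text)
    return text

print(scrub("Alice Smith works at Acme Corp in Springfield."))
# -> [PERSON] works at [ORGANIZATION] in [LOCATION].
```

The replaced text stays readable (you can still tell a person did something at an organization) while the identifying strings themselves are gone.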
Setup may depend on your machine. I recommend creating a virtual environment and using pip to install packages within it.
- Install nltk and numpy through pip
- Use the woolyAnonymizer() function to anonymize text
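The setup steps above might look something like this on a Unix-like machine (the `venv` directory name is just an example):

```shell
# Create and activate a virtual environment
python3 -m venv venv
source venv/bin/activate

# Install the required packages inside it
pip install nltk numpy
```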
Chuck Dishmon's guest post on Stanford NER Taggers helped formulate much of the structure of early versions of this prototype.