tdiprima/redaction

🕵️ PII Redaction Tool

Wipes personal info (PII, personally identifiable information) out of text files using Microsoft Presidio.

Built for devs who want something lightweight, fast, and Torch-free — a no-drama way to clean text.

🚀 Installation

pip install redaction

🧹 Usage

redact file.txt
redact file.txt -o output.txt
redact --help
  • By default, the redacted version lands in the same directory as your input file.
  • Use the -o flag to specify your own output file.
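If you'd rather drive the CLI from a script, something like the sketch below should work. It's a minimal, hypothetical wrapper: `build_redact_command` and `redact_file` are illustrative names, not part of the package, and it assumes the `redact` executable is on your PATH after installation. It only uses the two invocation forms shown above.

```python
import subprocess
from pathlib import Path
from typing import List, Optional, Union

PathLike = Union[str, Path]


def build_redact_command(src: PathLike, dst: Optional[PathLike] = None) -> List[str]:
    """Build the argument list for `redact file.txt [-o output.txt]`."""
    cmd = ["redact", str(src)]
    if dst is not None:
        # -o is the output flag documented in the usage section.
        cmd += ["-o", str(dst)]
    return cmd


def redact_file(src: PathLike, dst: Optional[PathLike] = None) -> None:
    """Run the CLI on one file (hypothetical helper).

    Raises subprocess.CalledProcessError if the process exits non-zero,
    and FileNotFoundError if `redact` is not on PATH.
    """
    subprocess.run(build_redact_command(src, dst), check=True)
```

Calling `redact_file("file.txt", "output.txt")` is then equivalent to running `redact file.txt -o output.txt` in a shell.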

💡 Example

Input:

My name is John Doe and my email is john@example.com.

Output:

My name is <PERSON> and my email is <EMAIL_ADDRESS>.

🧠 Tech Notes

  • Built on top of Microsoft Presidio's Analyzer + Anonymizer.
  • No GPU, no Torch — just clean text ops.
  • Designed for quick local use or integration in your data pipeline.
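For the curious, the Analyzer + Anonymizer pairing looks roughly like this in Presidio's Python API. This is a sketch of the general pattern, not this tool's actual source: `build_engines` and `redact_text` are hypothetical names, and it assumes `presidio-analyzer`, `presidio-anonymizer`, and a spaCy English model are installed.

```python
def build_engines():
    """Create Presidio's default engines.

    Imports are done lazily so this module still loads when Presidio
    is not installed (an assumption made for this sketch).
    """
    from presidio_analyzer import AnalyzerEngine
    from presidio_anonymizer import AnonymizerEngine

    return AnalyzerEngine(), AnonymizerEngine()


def redact_text(text, analyzer, anonymizer, language="en"):
    """Detect PII spans, then replace them with placeholders."""
    # Step 1: the analyzer returns detected spans (entity type, start, end, score).
    results = analyzer.analyze(text=text, language=language)
    # Step 2: the anonymizer rewrites those spans; with no custom operators,
    # each span is replaced by its entity type in angle brackets.
    return anonymizer.anonymize(text=text, analyzer_results=results).text
```

With Presidio installed, `redact_text(text, *build_engines())` should give placeholder-style output like the example above, though exactly which entities get caught depends on the model and recognizers in use.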

GitHub: https://github.com/tdiprima/redaction


⚠️ Heads up:

No PII tool is perfect — always sanity-check the output before using it in production.
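
One cheap way to sanity-check is to scan the redacted text for anything that still looks like PII. The sketch below is a hypothetical helper, not part of this tool. The regexes are deliberately rough and only cover email-like and US-style phone-like strings, so treat a clean result as a hint, not proof.

```python
import re

# Rough, illustrative patterns only; real PII takes many more forms.
LEFTOVER_PATTERNS = {
    "email": re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+"),
    "phone": re.compile(r"\b\d{3}[-.\s]\d{3}[-.\s]\d{4}\b"),
}


def find_leftovers(redacted_text):
    """Return (label, matched_text) pairs for strings that still look like PII."""
    hits = []
    for label, pattern in LEFTOVER_PATTERNS.items():
        hits.extend((label, m.group(0)) for m in pattern.finditer(redacted_text))
    return hits
```

An empty list for a redacted file is a good sign. Any hit is worth a manual look before the output goes anywhere near production.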
