This package takes a source document and a collection of detected PII instances, and transforms the document by replacing each PII instance with a different representation.
The type of substitution is defined by transformation policies.
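The idea of policy-driven substitution can be illustrated with a minimal, self-contained sketch. This is not the pii-transform API: the PiiSpan structure, the transform function, and the policy names "redact", "label" and "hash" are all hypothetical, chosen only to show how a policy determines the replacement text. It also assumes the detected spans do not overlap.

```python
import hashlib
from typing import Callable, Dict, List, NamedTuple


class PiiSpan(NamedTuple):
    """A detected PII instance (hypothetical structure, for illustration only)."""
    start: int  # offset of the first character of the instance
    end: int    # offset one past the last character
    kind: str   # PII type, e.g. "EMAIL"


# Illustrative policies (assumed names, not the package's own definitions).
# Each one maps (PII type, original value) to the replacement string.
POLICIES: Dict[str, Callable[[str, str], str]] = {
    "redact": lambda kind, value: "<PII>",
    "label": lambda kind, value: f"<{kind}>",
    "hash": lambda kind, value: hashlib.sha256(value.encode("utf-8")).hexdigest()[:12],
}


def transform(text: str, spans: List[PiiSpan], policy: str) -> str:
    """Replace every span in `text` according to the named policy.

    Spans are assumed not to overlap.
    """
    replace = POLICIES[policy]
    pieces = []
    pos = 0
    for span in sorted(spans, key=lambda s: s.start):
        pieces.append(text[pos:span.start])  # untouched text before the span
        pieces.append(replace(span.kind, text[span.start:span.end]))
        pos = span.end
    pieces.append(text[pos:])  # untouched text after the last span
    return "".join(pieces)


text = "Contact Jane at jane@example.com"
spans = [PiiSpan(8, 12, "PERSON"), PiiSpan(16, 32, "EMAIL")]
print(transform(text, spans, "label"))  # prints: Contact <PERSON> at <EMAIL>
```

Switching the policy argument to "redact" or "hash" changes only the replacement strings; the detection results and the surrounding text stay the same, which is the separation between detection and transformation that the package description implies.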
Note: pii-transform does not implement or use Transformer models for PII purposes (for the extraction of PII instances using Transformer models, see pii-extract-plg-transformers or pii-extract-plg-presidio).
The package provides a console script, pii-transform, which loads a source document and a collection of already-detected PII, and produces a transformed document following the specified policies.
The same functionality provided by the command-line script can also be accessed via a Python API.
The end-to-end scripts and APIs have been migrated; they are now in the pii-process package. That package is also the one to use to process generic documents, since this package supports only the PIISA Source Document format, in which the document is written as a YAML file.