On This Page
Overview
The Personally Identifiable Information (PPI) Extraction Module is a pre-trained artificial intelligence (AI) model that can reliably identify and extract PII elements contained in unstructured data. You can include this entity type in the policies you create to identify PII information in content for a data source and specify rules to classify files based on the results. For example, if a Person Name or Address is found in a file in a “public” folder, it can be set to be classified as “Restricted.” You can also upload individual documents to the classifier, and it will identify any PII found in it and provide a confidence score.
Using the Classifier
Select Content.
Select Entity Types.
Click the ellipses (…) at the end of the PII Extraction Module row.
Select Upload sample.
Select Upload on the PII Extraction Module - Upload sample file modal that appears.
Use the dialog box that appears to select the file.
SkySync displays the document type and confidence rating for the match.
Personally Identifiable Information
The classifier will identify the following information.
Label | Name | Description |
---|---|---|
person_name | Person name | Identifies a person name |
entity_name | Entity name | Identifies an entity, business, or organization name |
location | Location | Identifies an place or location |
address | Address | Identifies an address |
misc | Miscellaneous | Identifies miscellaneous items of interest |