Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 2 Next »

On This Page

Overview

An entity type is what SkySync is detecting. SkySync comes with pre-installed entity types that cover a wide range of data, but you also have the option to create custom entity types. Entity types are assigned to a category that identifies the type of data being detected and a match type that identifies how the data is being detected. This page provides a high-level overview to help you understand entity types. To learn more about a specific match type, refer to the Help Center page for that specific match type.

Categories

Categories identify the type of data being detected. Entity types fall into five categories.

Category

Description

1

General

This is a catch-all category for any entity type that doesn’t fall into one of the other four categories. Custom entity types are assigned to this category.

2

Financial

These entity types are for financial information such as

  • ABA routing numbers,

  • credit card numbers, and

  • bank account numbers.

3

Privacy

This category is for any entity type pertaining to personally identifiable information (PII). Examples of this type of information include

  • driver license numbers,

  • passport numbers,

  • phone numbers,

  • social security numbers,

  • national numbers, and

  • other national ID numbers.

4

Regulation

Regulation entity types rely on related keywords to find entity types in different areas such as

  • travel,

  • banking/finance,

  • personal, and

  • government.

The built-in Document Type Classifier and Standardized Form Matcher also fall under this category.

5

Technology

This category includes entity types related to the technology field such as domain names and IP addresses.

Match Type

Match type is how the data is being detected. SkySync supports four match type methods.

Match Type

Description

1

Regular Expression

Regular expression is a pattern used to identify text. It allows you to have very fine-grained control over what content SkySync detects.

2

Block List

A block list allows matching on a given list of terms within a piece of content. 

3

Classification

This is a custom classification model that is plugged into the system.  SkySync has four pre-installed classification entity types:

  1. Document Type Classifier,

  2. Standardized Form Matcher,

  3. Language Detection Module, and

  4. PII Extraction Module.

4

Transformation

A transformation entity type allows for applying a custom metadata expression against any prior matches to generate new value.

  • No labels