Adjust Classification Tags
Only available with Snowflake integrations.
After Immuta Detect has run SDD and the Immuta DSF, it may be necessary to adjust the output tags based on your organization's data, security, and compliance needs. Your Immuta representative will work with you during the preview to customize SDD and Immuta DSF to output the desired tags and classification of data sources.
Assess your queries
Required Immuta permission: AUDIT
Use the Detect dashboards to review queries at different sensitivity levels and review the tags that have been applied to your data source columns to understand the tags that Immuta has applied there:
- Have an Immuta user who is subscribed to a data source make multiple queries to a data source in Snowflake. The user should query both non-sensitive and sensitive data.
- Navigate to the Audit page and click ↻Native Query Audit to pull in queries that were made in Snowflake.
- Navigate to the Events (Beta) page. Note that Snowflake has a 15-minute data latency for all audit events.
- Select the Event Id of one of the queries. Click the Columns tab.
-
The Column tab lists the columns in the query organized from highest to lowest sensitivity and the tags applied to each column. Check that the columns you know to be sensitive are here.
For example, if the query has a column with last names, you should see a minimum of the following tags:
Discovered.PII
,Discovered.Identifier Indirect
,Immuta DSF.Sensitive Personal
, andImmuta DSF.Personal
. -
Note any sensitive columns that are not labeled as sensitive.
- Complete steps 2-5 for as many queries as you want.
Assess your data source tags
Required Immuta permission: AUDIT
Target some data sources to manually review tags. At a minimum, select one data source that you think has sensitive data and one data source that has minimal or no presence of sensitive data. Then assess those data sources. However, you can complete this process for as many data sources as you want:
- To generate query audit events,
- Choose an Immuta user who is subscribed to the data source.
- Issue queries against a subscribed data source in Snowflake.
- Make at least a query of non-sensitive data and a query of sensitive data.
- Navigate to the Data Dictionary for the queried data source by opening the Data Sources page. Then select the data source that was queried. Click the Data Dictionary tab to open the Data Dictionary.
-
The Data Dictionary lists the data source columns, with details about the name, data type, and a list of the tags on each column. Assess whether the tags are accurate to your data; however, know that some framework rules will tag all the columns within a data source if there is the presence of one particular data content.
For example, if SDD tags a column:
Discovered.Entity.Date of Birth
, then the Immuta DFS will classify that and tag it asImmuta DFS.Personal
. Once that single column is taggedImmuta DSF.Personal
, the Immuta DFS will tag all the columns within that data source asImmuta DSF.Personal
because they are associated with a person’s date of birth and by association a person.
If you find that too many tags are applied
It is possible that the tags are unexpected, but still accurate to your data. You should consult your Immuta representative with the following information to help you determine whether the data is over tagged or tagged appropriately:
- The columns’ content: Do not send the representative a sample of the data that you believe is sensitive. Describe the types of values in the columns, their names, and what form they are in.
- The tags that were applied to the columns: A list of the tags that were applied to the columns with the unexpected tags specified.
If you find that tags are missing
If you were expecting some sensitive data to be tagged and it is not, you should consult your Immuta representative with the following information:
- The columns’ content: Do not send the representative a sample of the data that you believe is sensitive. Describe the types of values in the columns, their names, and what form they are in.
- The tags that were applied to the columns: A list of the tags that were applied to the columns.
- The tags that you think are missing from the columns
Tune your data dictionaries
Required Immuta permission: GOVERNOR and AUDIT
Tags can be edited on an individual basis on each data source. If broad changes to the classification framework are necessary in order to re-tag your data, please consult your Immuta representative.
- Navigate to the Data Sources page and select the data sources that you assessed and noted issues.
- Click the Data Dictionary tab.
- Delete unnecessary tags by clicking the delete(X) icon on the tag you want to remove from the column.
- To add tags,
- Click Add Tags in the Actions column.
- Begin typing the name of the tag you want to add in the Search by Name field and select the tag from the drop down list.
- Click Add.