Discover Scans
On This Page
Overview
The All Scans page provides you an overview of the scans added to Discover. The Scan name column displays the name assigned to the scan when it was created and lists the Discover categories that were selected for the scan. The Connection column identifies the platform, connection, and account used to access the platform. The Path column identifies the location selected for the scan (the target directory). The Scan status column identifies the information for when the scan last ran.
Viewing Child Scans
Only primary scans display on the All Scans page. When adding a Discover scan, you have the option of using a document set or making the location a mapped location. In both instances, all scanning takes place a child level rather than at the primary scan level. The primary scan runs only to determine the child scans that need to be created based. For mapped locations, a child scan is created for each folder one level under the the path selected for the scan. For document sets, a child scan is created for each path saved in the document set.
Mapped locations are identified by a folder icon in front of the path and “Mapped location” under the path. Document sets will not have a connection listed and will have “Document set” as the path. You can click the New Window icon in the Path column for these scans to view the child scans. Refer to Child Scans for more information.
Searching Scans
The Refine feature allows you to search the scans based on specific criteria. The search works as an “AND” statement when more than one option is applied. Search criteria is case sensitive, and the search text must match the content exactly.
The Search field in the menu is used to filter the search options only; it does not perform a search on the text entered.
Path: Search for a scan based on the exact path. This is the information displayed in the Location column. This is an exact search, so you must must enter the path exactly in order to return results.
Storage platform: Search for scans on a specific storage platform. You will select the platform from a list of storage platforms available to you.
Impersonation User: Search for a scan based on the user being impersonated. Note that not all platforms support impersonation. This is an exact search, so the entry must match the user exactly.
Connection name: Search for a scan based on the connection name. This is the first line of text that displays in the Connection column. This is an exact search, so the entry must match the connection name exactly.
Connection user: Search for a scan based on the user account used to create the connection. This is the second line of text that displays in the Connection column. This is an exact search, so the entry must match the user exactly.
Scan Status: Search for a scan based on the scan status. Note that if you apply a scan filter and scans within that status are missing, you may need to run the scans again. This is a known issue with how the scan execution status is calculated in the DryvIQ backend. The scan status options are:
Cancelled: The scan was manually triggered and then manually stopped by a user.
Completed: The scan has run to its scheduled stop policy. (Stop policy configuration is only available through the REST API for Discover scans.)
Failed: The scan encountered a failure causing it to stop processing.
Idle: The scan is not actively running. It is either between scheduled runs or requires manual execution.
Paused: The scan was triggered by a schedule but was stopped manually by a user.
Running: The scan is in progress currently.
Queued: A scan is triggered to run, but there is no available runner or the max concurrent job setting has been met.
Warning: The scan completed but contains warnings or item failures.
Scan Date: Search for a scan based on the date the scan last ran. Select the mode (between, on, after, or before) and the date you want to use.
Has Executions?: View only scans that have been executed. Select True to view scans that have been had at least one scan executed. Select False to view scans that have not been executed yet.
Has child scans?: View only scans that have child scans. These are scans for mapped locations and document sets. Select True to view just these scans. Select False to exclude these scans.
Is a Document set?: View only scans for document sets. Select True to view just document set scans. Select False to exclude document set scans.
Manually Starting Scans
If you did not set a schedule for scan when you created it, you will need to manually start the scan when you want it to run. Even if you assigned a schedule to a scan, you can manually start a scan whenever you want a scan to run outside of the scheduled time. You have the option of starting individual scans or selecting multiple scans.
Starting Individual Scans
Use the Scan button to manually start a scan. The scan information will identify the percentage of the scan that has completed and how many files have been scanned so you can monitor the progress. Once a scan is running, the button will become a Pause button. You can use this button to stop the scan as needed.
Starting Multiple Scans
You can also start multiple scans at once if needed. If scans are set to run on a schedule, manually starting them will override the schedule and start the scan. The scan will continue to run as scheduled when the scan is complete.
Select the box that displays when you hover on the row.
As you select scans you want to start, the label for the Start scans button will change to reflect the number of selected scans. Once you finish making your selections, click the Start # scans button.
You are prompted to confirm you want to start the selected number of scans. Click OK to continue (or Cancel to cancel the action).
DryvIQ begin starting the selected scans and display the progress. Do not refresh the page or close the browser while the scans are starting as this will interrupt the process and cause failures.
You will see a message indicating all the scans have started. Click Close to close the modal.
Understanding the Scan Progress
The Status column displays when the last scan ran for each location as well as when the next scan is due to run. If a scan is currently running, this column will display a progress bar to indicate the scan progress. If scans are running for multiple locations, each location will display its own progress bar. The progress will update every minute until the scan is complete.
Some platforms report an accurate content estimate DryvIQ can use to calculate the scan progress. For these platforms, the scan progress will reflect the true percentage of the scan progress. These scan percentages are approximations and may fluctuate based on the information provided by the platform (where applicable) or the amount of content being scanned. For example, if DryvIQ determines an estimate is wrong, it will recalculate the progress, which may decrease the progress from its previous level. Progress will then begin to increase again as the scan continues.
While SharePoint Online does provide a count estimate DryvIQ leverages to determine the scan progress, this estimate is not accurate if there are subdirectories in the source path. In this instance, the scan progress may recalculate.
For platforms that don’t provide content estimates, DryvIQ will display the following percentages based on the scan phase currently being executed:
20%: The scan is initializing and preparing to run.
30%: The scan is determining changes to the content.
40%: The scan is executing. This is when the scan is actively reading content and compiling the statistics for the reporting. During this phase, DryvIQ will update the number of files scanned, if any.
80%: The scan is finalizing.
100%: The scan is complete. The Scan status will display 100% for one minute after the scan completes. Then, the last scan date will display.
For failed scans, you can click the View Activity log link to go to the Activity tab to review the logged activity for the selected scan. If the scan is a part of a mapped location or data set, the Activity tab will display activity for all child scans included in the mapped location or data set. A pill displays the scan ID for the selected scan or parent scan for your reference. You can use the Reset button to clear the selected scan and view all scan activity again as needed. The activity may help you understand what caused the scan to fail so you can correct any issues that can be addressed before attempting to run the scan again.
Viewing Results
You can quickly view the results for any location. Click the ellipses (…) at the end of the row and click View Results from the menu that displays. You will be taken to the Results page, and the results will be filtered to display just the contents for the selected location. For mapped locations, the results will include all child locations.
Viewing Insights
You can quickly view the insights for any location. Note that the insights page that displays depends on the last insights you viewed, so you should go to the insights and select the the page first before coming here to select the location. Click the ellipses (…) at the end of the row and click View Insights from the menu that displays. You will be taken to the last insights page you viewed, and the results will be filtered to display the insights for the selected location. This option is not available for the primary scan for mapped locations since there is no information to view for these scans. Insights can be viewed through the child scans.
Duplicating a Scan
Duplicating a scan allows you to create a new scan based on the settings for an existing scan. Click the ellipses (…) at the end of the row and click Duplicate Scan from the menu that displays. The Discover Scan Setup wizard will open and automatically populate with the settings from the duplicated scan. All you have to do is select the data source you want to use for the scan and adjust any settings if needed.
When duplicating the primary scan for a document set or mapped location, the schedule will be set to scan every 6 hours since this is the default schedule used for the primary scans for these scan types; custom schedules are applied only to the child scans. If you prefer the custom schedule used for the child scans for a document set or mapped location, you should duplicate one of the child scans instead of the primary scan so you do not need to edit the scan schedule.
Renaming a Scan
You can edit the scan name assigned to a scan at any time. Click the ellipses (…) at the end of the row and click Rename Scan from the menu that displays. This will open the Rename Scan modal. Edit the scan name and click OK to save the changes.
Editing Categories
You can update the categories assigned to the scan. This allows you to add, remove, or adjust the templates the scan uses. Click the ellipses (…) at the end of the row and click Edit Categories from the menu that displays. This will open the Edit Categories modal. The options on this modal are the same options available when creating a Discover scan. Select the templates you want to add and clear the templates you want to remove. Refer to the Discover Categories section of Adding a Discover Scan for more information about customizing the individual templates.
The Reset scan option determines if a reset is performed on the next scan. A reset only resets tracking data and allows files to be rescanned. It does not remove any results from the data catalog. Once the edits are saved, the new categories will be applied to the next scan. Note that scans will behave as follows based on the data source type selected for the scan.
Edited scans With the Reset Scan Option Enabled
Location Scan: The reset forces a rescan of the entire location and publishes all entities to the data catalog again.
Mapped Location: The reset will rescan the primary location to create child scans as needed. The child scans will be reset and rescanned so the new entities can be published for all items.
Document Set: The reset will rescan to create child scans as needed.
Edited scans Without the Reset Scan Option Enabled
Location Scan: The scan only looks for new files. Existing results will not be rescanned or updated. Only new items that are identified will be scanned using the updated categories.
Mapped Location: The scan will continue to create child scans as needed. Existing items for child scans will not be rescanned or updated. Only new items that are identified will be scanned using the updated categories.
Document Set: The scan will continue to create child scans as needed. Existing items for child scans will not be rescanned or updated. Only new items that are identified will be scanned using the updated categories.
Managing Scan Schedules
You can update a scan schedule or add a schedule to a scan at any time. Click the ellipses (…) at the end of the row and click Manage Schedule from the menu that displays. This will open the Manage Schedule modal. The scheduling options on this modal are the same options available when adding a schedule when creating a Discover scan. Refer to the Schedule section of Adding a Discover Scan for more information.
Removing a Scan
You can remove a data source from Discover if you no longer want it to be evaluated. Click the ellipses (…) at the end of the row and click Remove Scan from the menu that displays. Click Yes when prompted to verify you want to remove the scan.
Sorting the Grid
By default, the locations are listed in the order they were assigned to Discover. You can sort the locations based on any column in the grid. Clicking a column header will sort that column in ascending order. An up arrow will appear on the column to indicate the ascending sort order. Clicking the column header again will sort the column in descending order. A down arrow will appear on the column to indicate the descending sort order. Clicking the column header a third time will use the default sort order. The selected sort will persist until you change the sort order or select to sort a different column.