Data Lake OvalSight
Data Lake OvalSight streamlines complex data lake management through a single interface. Analyze data composition and characteristics at both folder and cumulative levels to prioritize tasks and optimize folder structure.
Access Data Lake & Folder OvalSight
Admin and Author: (Application Access to File Manager)
Data Lake OvalSight: Access the main analysis through the Data Lake OvalSight sub-module within the File Manager. This sub-module displays only supported Data Lakes such as Amazon S3, Azure Data Lake, CIFS, NFS, and Google Drive.
File Explorer: Access Data Lake analysis directly from the File Explorer's landing page. Click the OvalSight icon in the Data Lake OvalSight column to be redirected to the sub-module.
Folder OvalSight: Analyze a specific folder within a supported Data Lake by clicking the OvalSight icon in the Folder OvalSight column for that folder.
Viewer:
Folder OvalSight: Access the analysis of a cataloged folder in the Data Catalog. Administrators can enable the Folder OvalSight tab for folders by setting the "enable.folder.ovalsight" key value to 'True' in the System Settings (Others tab). Once enabled, the Folder OvalSight tab will appear next to the Lineage tab for each folder.
Run Folder OvalSight
The OvalEdge Default Admin can run Folder OvalSight on a selected folder in two ways:
File Explorer: From the File Explorer's 9-dot menu, choose "Run Folder OvalSight."
Data Catalog: Within the Data Catalog's File Summary 9-dot menu, select "Run Folder OvalSight."
Folder OvalSight Scheduler
In the File Explorer, clicking the Folder OvalSight icon schedules a Folder OvalSight scan for the entire connector. This scan retrieves data on total files, empty subfolders, total subfolders, the last folder level, and other details within the connector.
Only users with Admin privileges can schedule Folder OvalSight jobs.
Folder OvalSight Scheduler consists of three tabs:
Upcoming Section: Displays the next scheduled Folder OvalSight job. Manage schedules through Schedule Analysis.
Frequency: Set as Bi-weekly, Monthly, Custom, or None.
Next Analysis On: Define the job's date and time.
Include/Exclude Buckets: Select specific buckets, include all, or exclude certain ones.
Analysis Details Section: Shows job status, completion date, running/failed status, analyzed and scheduled bucket counts, total bucket count, and the scheduler's username.
History Section: Logs schedule creation, updates, and deletions with username, date, and time.
Upcoming Analysis
Displays the next scheduled Folder OvalSight job. Manage schedules through Schedule Analysis.
Schedule Analysis lets the admin manage Folder OvalSight jobs through three main fields: Frequency, Next Analysis On, and Include/Exclude Buckets. The admin can schedule a new analysis or update an existing one at any point before its scheduled time.
Frequency:
Schedule the folder analysis job by choosing one of four options: Bi-weekly, Monthly, Custom, or None.
"Bi-weekly" runs the job every two weeks.
"Monthly" runs the job once a month.
"Custom" runs the job on specific dates and times that you set.
"None" removes the existing schedule entirely.
Next Analysis On:
After selecting a frequency option (Bi-weekly, Monthly, or Custom), specify the exact date and time for the next folder analysis.
For example, with Monthly selected, the analysis can be set to run on the 1st of every month at 10:00 AM. With Custom selected, you can set specific dates and times, such as October 5th at 3:00 PM and October 20th at 4:00 PM.
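The frequency options above can be modeled with a short sketch. This is an illustration only, not OvalEdge's scheduler: the function name next_run, its parameters, and the use of the frequency labels as plain strings are all assumptions made for the example.

```python
from datetime import datetime, timedelta
from typing import Optional, Sequence


def next_run(frequency: str, last_run: datetime,
             custom_times: Optional[Sequence[datetime]] = None) -> Optional[datetime]:
    """Return the next run time after last_run, or None if nothing is scheduled.

    Hypothetical helper: the frequency labels mirror the UI options,
    but the logic is an assumption, not the product's implementation.
    """
    if frequency == "None":
        # "None" removes the schedule, so there is no next run.
        return None
    if frequency == "Bi-weekly":
        return last_run + timedelta(weeks=2)
    if frequency == "Monthly":
        # Same day and time in the following month. This simple version
        # assumes the day exists in every month (for example, the 1st).
        year = last_run.year + (last_run.month // 12)
        month = last_run.month % 12 + 1
        return last_run.replace(year=year, month=month)
    if frequency == "Custom":
        # Pick the earliest custom date that is still in the future.
        upcoming = sorted(t for t in (custom_times or []) if t > last_run)
        return upcoming[0] if upcoming else None
    raise ValueError(f"unknown frequency: {frequency}")
```

With a Monthly schedule whose last run was on the 1st at 10:00 AM, the sketch returns the 1st of the following month at 10:00 AM. With Custom, it returns the earliest listed date that falls after the last run.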
Include/Exclude Buckets:
Control which buckets are analyzed using Include and Exclude options.
Include: Specify buckets to include in the analysis.
Exclude: Prevent selected buckets from being analyzed.
Include All: Select every bucket in the connector for analysis.
Bucket Selection:
Select specific buckets for analysis. All selected buckets are displayed in the designated field.
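The effect of the Include, Exclude, and Include All options on the scope of an analysis can be sketched as follows. The function buckets_in_scope and the mode labels "include", "exclude", and "include_all" are hypothetical names chosen for illustration; they are not part of the OvalEdge product or its API.

```python
def buckets_in_scope(all_buckets, mode, selected=()):
    """Return the buckets that would be analyzed, in connector order.

    mode is one of "include", "exclude", or "include_all"
    (hypothetical labels standing in for the UI options).
    """
    chosen = set(selected)
    if mode == "include_all":
        # Every bucket in the connector is analyzed.
        return list(all_buckets)
    if mode == "include":
        # Only the selected buckets are analyzed.
        return [b for b in all_buckets if b in chosen]
    if mode == "exclude":
        # Everything except the selected buckets is analyzed.
        return [b for b in all_buckets if b not in chosen]
    raise ValueError(f"unknown mode: {mode}")
```

For a connector with buckets raw, curated, and archive, excluding archive leaves raw and curated in scope, while including only raw leaves a single bucket in scope.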
Last Analysis
Displays the job status, completion date, running/failed status, analyzed and scheduled bucket counts, total bucket count, and the scheduler's username.
The Analysis Details section provides the following information about the folder analysis job:
Analysis Date: Displays the date and time when the folder analysis job completed successfully. While the job is running, the field shows “The data will be shown after the job.” If the job fails, the field remains blank.
Analysis Status: Indicates the current state of the folder analysis job. If the job is currently running, the status is displayed as "Running."
Job Status: Indicates the current state of the folder analysis job. If the job is currently running, the status is displayed as "Running."
Analyzed Buckets: Displays the number of buckets for which the folder analysis has been completed.
Scheduled Buckets: Displays the number of buckets selected for the OvalSight job.
Total Buckets: Displays the total number of buckets present in the connector.
Scheduled By: Displays the username of the person who scheduled the job.
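The Analysis Date behavior described above (a date on success, a placeholder while running, blank on failure) can be summarized in a small sketch. Only the "Running" status and the placeholder text come from this page; the "Failed" and "Completed" labels, the function name, and the date format are assumptions for illustration.

```python
from datetime import datetime
from typing import Optional

RUNNING_PLACEHOLDER = "The data will be shown after the job."


def analysis_date_text(status: str, completed_at: Optional[datetime]) -> str:
    """Return the text shown in the Analysis Date field.

    Hypothetical helper: "Failed" and "Completed" are assumed status
    labels, and the date format is chosen only for this example.
    """
    if status == "Running":
        # No completion time exists yet, so a placeholder is shown.
        return RUNNING_PLACEHOLDER
    if status == "Failed" or completed_at is None:
        # A failed job leaves the field blank.
        return ""
    return completed_at.strftime("%Y-%m-%d %H:%M")
```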
History
The History section logs basic actions related to Folder OvalSight analysis, including creating, updating, and deleting schedules. Each entry displays the corresponding username and the action's date and time.
Select Data Lake OvalSight
The Data Lake OvalSight sub-module lists all available file connections and shows the following details for each:
Connector Name
Data Source Type (NFS, S3, etc.)
Created By (username)
Last Modified On
Last OvalSight Scan
Authors can search for specific connections by name or filter by type using the icons in the respective columns.
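As a rough model of the search and filter behavior, the sketch below narrows a list of connections by name and by data source type. The Connection class, the find_connections function, and the case-insensitive substring match are assumptions for illustration; the page does not specify how the product matches search text.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Connection:
    # Field names mirror the columns listed above (hypothetical model).
    connector_name: str
    data_source_type: str
    created_by: str


def find_connections(connections: List[Connection],
                     name_contains: str = "",
                     source_type: Optional[str] = None) -> List[Connection]:
    """Filter connections by name (case-insensitive substring) and type."""
    needle = name_contains.lower()
    return [
        c for c in connections
        if needle in c.connector_name.lower()
        and (source_type is None or c.data_source_type == source_type)
    ]
```

Calling find_connections with only a name narrows the list by connector name, while passing only source_type (for example, "S3") keeps every connection of that type.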
Copyright © 2025, OvalEdge LLC, Peachtree Corners, GA USA

