# Amazon S3

OvalEdge uses the AWS S3 SDK to connect to the data source, which allows users to crawl and profile data objects (files, file columns, etc.).

<figure><img src="https://lh7-rt.googleusercontent.com/docsz/AD_4nXe6GM6AXHrrDkSxziHSMgPEzUKv6WdEngCLWxr_VLtXaymWxzavKouv5Sd9ig4Iiyx0Pzp4bxqTL9BHZvP6IA8oSWes7x0i5HaYoLUccTGJUN6ZHxtedUJBaTBMMZpROt9DhmHvcKI-BGkBr2-lU33xaCWT?key=g6K_x7ht0cA8dS3HDGOt3A" alt=""><figcaption></figcaption></figure>

| Important: This document focuses on specific information relevant to this particular connector. Details about Establishing Connections, Connector Parameters, and Connector Settings will be found in the "[Generic Features of Connectors](https://docs.google.com/document/d/1p-qODOxHwnfLO-M1lag4TLpO2S6cZvq7oz3f-v7DyZU/edit#heading=h.a068g7zhum9t)" document. |
| ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |

### Connector Characteristics

| Connector Category                                                                                                                                    | File System                |
| ----------------------------------------------------------------------------------------------------------------------------------------------------- | -------------------------- |
| Connectivity                                                                                                                                          | AWS S3 SDK for JAVA 2.26.7 |
| <p>Connector Version</p><p>\[OvalEdge Connector Version Information]</p>                                                                              | 7.0                        |
| <p>Amazon S3 Source Versions Supported </p><p>\[Data Source Version from which OvalEdge started providing support]</p>                                | 1.12.661                   |
| <p>OvalEdge Releases Supported (Available from)</p><p>\[OvalEdge Release Version in which the connector was included in the OvalEdge Application]</p> | 4.3 Onwards                |

### Supported Features

| <p>Crawling of Metadata Objects </p><p><br></p>                                                                                                                                                                                                                                                                                               | <p>Buckets</p><p>File Columns</p><p>Folders</p><p>Files</p>                                                         |
| --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | ------------------------------------------------------------------------------------------------------------------- |
| Metadata Source                                                                                                                                                                                                                                                                                                                               | Amazon S3 Commands                                                                                                  |
| Supported File Types:                                                                                                                                                                                                                                                                                                                         | CSV, XLS, XLSX, JSON, AVRO, PARQUET, ORC, GZ                                                                        |
| Profiling                                                                                                                                                                                                                                                                                                                                     | <p>File Profiling: Row count, Columns count, and View sample data</p><p><br></p><p>Sample Profiling : Supported</p> |
| Data Preview                                                                                                                                                                                                                                                                                                                                  | Supported                                                                                                           |
| <p>Data Quality Module</p><p>More info: Data Quality</p>                                                                                                                                                                                                                                                                                      | Supported                                                                                                           |
| <p>DAM (Data Access Management) Support</p><p>More info: RDAM</p>                                                                                                                                                                                                                                                                             | Supported                                                                                                           |
| <p>Bridge Support</p><p>\[The Bridge component is used as a Tunnel to Transfer Data from the Client-Hosted Cloud or Client-on-Premises Data Center. The handshake between the Bridge Server and Client is established with a Security Tool Kit (these differ from client to client and are generated based on the IP address/DNS, etc.).]</p> | Supported                                                                                                           |
| <p>Crawl of Usage Statistics (Source System)</p><p>\[This involves systematically collecting, analyzing, and reporting on data related to user activities, including details on which users are accessing the system, their actions, and the frequency of their access.]</p>                                                                  | Not Supported                                                                                                       |
| <p>Certifications at Source (Source System)<br>\[This refers to endorsing metadata objects applied within the source system. These certifications are then fetched and displayed in OvalEdge, enabling more in-depth analysis.]</p>                                                                                                           | Not Supported                                                                                                       |

### Prerequisites

The following prerequisites are required to establish a connection:

1. API Details

Add the Amazon S3 connector drivers to the OvalEdge Jar Path (/home/ovaledge/jarpath) so that OvalEdge can communicate with the Amazon S3 data source.

| API        | Version  | Details                                                                                                                        |
| ---------- | -------- | ------------------------------------------------------------------------------------------------------------------------------ |
| AWS S3 SDK | 1.12.232 | <p><https://mvnrepository.com/artifact/com.amazonaws/aws-java-sdk-s3/1.12.232></p><p>Note: The latest version is 1.12.244.</p> |
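
As a quick sanity check before establishing the connection, the following minimal Python sketch lists the S3 SDK jars present in the jar path. It assumes the jar keeps its standard Maven file name (`aws-java-sdk-s3-<version>.jar`, derived from the artifact in the link above); the function name `find_s3_sdk_jars` is hypothetical and not part of OvalEdge.

```python
from pathlib import Path

# Artifact name taken from the Maven link in the table above.
ARTIFACT = "aws-java-sdk-s3"
# OvalEdge Jar Path as stated in this section.
DEFAULT_JAR_PATH = "/home/ovaledge/jarpath"


def find_s3_sdk_jars(jar_dir=DEFAULT_JAR_PATH, artifact=ARTIFACT):
    """Return the sorted names of files in jar_dir matching '<artifact>-*.jar'.

    Returns an empty list when jar_dir does not exist or is not a directory.
    """
    directory = Path(jar_dir)
    if not directory.is_dir():
        return []
    return sorted(
        p.name for p in directory.glob(f"{artifact}-*.jar") if p.is_file()
    )


if __name__ == "__main__":
    jars = find_s3_sdk_jars()
    if jars:
        print("Found S3 SDK jar(s):", ", ".join(jars))
    else:
        print(f"No {ARTIFACT} jar found in {DEFAULT_JAR_PATH}")
```

If the list is empty, the driver has most likely not been copied into the jar path yet.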

2. Service Account with Minimum Permissions

Create an Amazon S3 service account for OvalEdge and grant it the following minimum permissions to crawl metadata.

| Operation             | Minimum Access Permission                                                                                                                                                                                                                    |
| --------------------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| Connection Validation | LIST, GET permission on Connection Validation                                                                                                                                                                                                |
| Crawling              | <p>s3:GetBucketTagging</p><p>s3:GetBucketLocation</p><p>s3:GetEncryptionConfiguration</p><p>s3:ListBucket</p><p>s3:ListAllMyBuckets</p><p>iam:ListPolicyTags</p><p>iam:ListRoleTags</p><p>Lambda:GetFunction</p><p>Lambda:InvokeFunction</p> |
| Profiling             | <p>s3:GetObject</p><p>kms:Decrypt</p><p>kms:DescribeKey</p>                                                                                                                                                                                  |
| Access Management     | <p>iam:ListRoles</p><p>iam:GetRole</p><p><br></p>                                                                                                                                                                                            |
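
The permissions in the table could be expressed as an IAM policy document. The sketch below builds one with Python's standard library, one statement per operation. The `Sid` values, the function name `build_policy`, and the wildcard `Resource` are assumptions made for illustration; in practice you would likely scope `Resource` to specific bucket, key, and role ARNs. The Connection Validation row is omitted because the table does not name specific actions for it.

```python
import json

# Actions copied from the permissions table above, grouped by operation.
PERMISSIONS = {
    "Crawling": [
        "s3:GetBucketTagging",
        "s3:GetBucketLocation",
        "s3:GetEncryptionConfiguration",
        "s3:ListBucket",
        "s3:ListAllMyBuckets",
        "iam:ListPolicyTags",
        "iam:ListRoleTags",
        "Lambda:GetFunction",
        "Lambda:InvokeFunction",
    ],
    "Profiling": ["s3:GetObject", "kms:Decrypt", "kms:DescribeKey"],
    "AccessManagement": ["iam:ListRoles", "iam:GetRole"],
}


def build_policy(permissions=PERMISSIONS, resource="*"):
    """Build an IAM policy dict with one Allow statement per operation.

    resource="*" is an illustrative assumption, not an OvalEdge requirement.
    """
    statements = [
        {
            "Sid": f"OvalEdge{operation}",  # hypothetical statement IDs
            "Effect": "Allow",
            "Action": list(actions),
            "Resource": resource,
        }
        for operation, actions in permissions.items()
    ]
    return {"Version": "2012-10-17", "Statement": statements}


if __name__ == "__main__":
    # Print the policy as JSON so it can be reviewed before use.
    print(json.dumps(build_policy(), indent=2))
```

Grouping the actions by operation keeps it easy to drop a statement, for example Profiling, when that feature is not used.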

3. Configure Environment Variables (Optional)

This section describes the settings or instructions you should know before establishing a connection. If your environment is already configured, skip this step.

For more information, refer to the "[Generic Features of Connectors](https://docs.google.com/document/d/1MEkP-GCznqJHLlV6mxrBd-dOMyBNa5cT_-fHDAglFsc/edit#bookmark=id.ilq7aziwen7l)" document.

### Establish Connection

{% tabs %}
{% tab title="IAM User Authentication" %}

* SSO Connection ID: The SSO connection ID is used to authenticate and authorize access to Amazon S3.
* SSO Application ID: The ID of the specific application within the SSO system that connects to Amazon S3.
  {% endtab %}

{% tab title="Role-Based Authentication" %}

* Access key<mark style="color:red;">\*</mark>: Enter a unique identifier that is part of the credential pair, like a username.
* Secret key<mark style="color:red;">\*</mark>: A secret, like a password, that is used to sign requests to AWS.
* SSO Connection ID: The SSO connection ID is used to authenticate and authorize access to Amazon S3.
  {% endtab %}
  {% endtabs %}

***

Copyright © 2025, OvalEdge LLC, Peachtree Corners GA USA
