Profiling

Profiling generates statistical insights about data objects such as tables or files from a data source. These insights include row count, top values, number of nulls, distinct values, minimum and maximum values, etc.

Profiling analyzes and compares these statistics before using them for data quality assessment, data understanding, analysis, and reporting, etc.

Profiling requires metadata to be collected first through crawling.

Profiling works at both the table and column levels. Use table-level profiling to profile specific tables and avoid profiling the entire data source, which helps reduce system load.

The profile status appears at the column level as well. This gives a more accurate view, especially when individual columns are excluded from profiling using regex.

To profile a table, go to the Table Summary page > click the nine dots menu, and select the Profile option.

Profiling gathers the following statistics:

  • Top Values (Table Values)

  • Row Count for tables

  • Null Count for columns

  • Distinct Count for columns

  • Min Value for columns

  • Max Value for columns

  • Top 50 values (Column Values)

In this example, the SQL Server data source is used to fetch product information statistics.


Copyright © 2025, OvalEdge LLC, Peachtree Corners GA USA

Last updated

Was this helpful?