Skip to main content

Browsing the Data Catalog

Navigate and explore your complete data catalog hierarchy, from data sources down to individual table columns, with detailed metadata, profiling statistics, and volume tracking.

VIDEO TUTORIAL⏱️ 2.5 mins

📹 Exploring Your Data Catalog

Learn how to navigate through your data catalog hierarchy and explore detailed metadata about your database entities.

Overview

The data catalog provides a comprehensive view of your database structure, allowing you to drill down from data sources through databases and schemas to individual tables and their columns. Each level provides relevant metadata and insights about your data assets.

Catalog Navigation Hierarchy

The catalog follows a hierarchical structure:

Data Source
└── Database
└── Schema
└── Table
└── Columns

Accessing Table Details

  1. Start from your data source
  2. Navigate through: Databases → Schemas → Tables
  3. Click on any table name to view its detailed metadata

Table Metadata View

When you select a table, the page displays:

Header Section

  • Table Name: Large heading with the table name
  • Database Info: Type, host, and last update timestamp in small text below

Metadata Cards

Four summary cards display key information:

  1. Table Card: Table name and schema
  2. Columns Card: Column count and primary key name
  3. Constraints Card: Count of unique constraints and indexes
  4. Last Updated Card: Date and time of last metadata update

Information Tabs

The table detail view is organized into multiple tabs for easy navigation:

1. Columns Tab

The default view displays all columns in the table with comprehensive metadata:

InformationDescription
Column NameThe name of the column
DescriptionBusiness description (editable)
Data TypeDatabase data type (VARCHAR, INTEGER, etc.)
NullableWhether the column allows NULL values
Primary KeyIndicates if column is part of primary key
Default ValueDefault value if specified
Enum ValuesList of allowed values for enum types
Sensitivity LevelData sensitivity classification
Data CategoryType of data (PII, PCI, etc.)

Editing Column Metadata

To edit column information:

  1. Click the Edit button (pencil icon) next to any column
  2. The Edit Column dialog opens with three tabs:

Basic Information Tab:

  • Column Name (required)
  • Data Type (required)
  • Position (ordinal position)
  • Description
  • Nullable checkbox
  • Primary Key checkbox

Technical Details Tab:

  • Default Value
  • Data Length
  • Precision
  • Scale
  • Python Type
  • Enum Values (add/remove values)

Data Compliance Tab:

  • Sensitivity Level dropdown (High, Medium, Low, None, Unassigned)
  • Sensitivity Reason (shown when level is set)
  • Data Category dropdown (PII, PHI, PCI, HCI, General, Unassigned)
  1. Navigate between tabs to edit different aspects
  2. Click Save to apply all changes

2. Constraints Tab

View all database constraints for the table:

Primary Keys

  • Constraint name
  • Column(s) involved
  • Automatically enforced uniqueness

Foreign Keys

  • Relationship definitions
  • Referenced tables
  • Column mappings

Unique Constraints

  • Columns with unique value requirements
  • Constraint names and definitions

3. Indexes Tab

Database indexes defined on the table:

  • Index name
  • Indexed columns
  • Index type (B-tree, Hash, etc.)
  • Uniqueness indicator

4. Profile Tab

When the data profiler has been run, this tab displays statistical analysis:

Column Statistics

For each column, view:

  • Completeness: Percentage of non-null values
  • Uniqueness: Percentage of unique values
  • Data Type Distribution: Breakdown of data types found
  • Value Distribution: Common values and their frequencies
  • Statistical Summary: Min, max, mean, median for numeric columns

5. Volume Tab

Track data growth and changes over time using the TableVolumeChart component:

Volume Display

  • Current Row Count: Shows total records in the table
  • Growth Visualization: Interactive chart showing volume changes
  • Time-based Analysis: View historical data growth patterns

Data Sensitivity Classification

The catalog tracks and displays data sensitivity information:

Sensitivity Levels

  • High: Highly sensitive data requiring maximum protection
  • Medium: Moderately sensitive data
  • Low: Low sensitivity data
  • None: Non-sensitive data
  • Unassigned: Not yet classified

Data Categories

  • PII (Personally Identifiable Information)
  • PHI (Protected Health Information)
  • PCI (Payment Card Industry)
  • HCI (Highly Confidential Information)
  • General: General business data
  • Unassigned: Not yet categorized

Each column displays badges indicating its sensitivity level and category, helping you quickly identify sensitive data requiring special handling.

Additional Features

Column Actions

Each column row in the Columns tab includes:

  • Edit Button (Pencil icon): Opens the edit dialog
  • Delete Button (Trash icon): Removes the column (with confirmation dialog)

Best Practices

Regular Metadata Updates

  • Review and update column descriptions regularly
  • Ensure sensitivity classifications are accurate
  • Document business rules and data meanings

Data Profiling

  • Run profiling on critical tables periodically
  • Use statistics to identify data quality issues
  • Monitor completeness and uniqueness metrics

Volume Monitoring

  • Track unexpected growth patterns
  • Identify tables requiring archival
  • Plan capacity based on growth trends

Permissions

Access to catalog browsing requires:

  • Read permissions on the data source
  • Team membership with appropriate role
  • Successful catalog scan completion