What does Atlan crawl from Iceberg
Atlan crawls comprehensive metadata from your Iceberg catalog, including catalog information, namespaces (with nested namespace support), tables with their metadata, and column details.
Lineage
Atlan establishes the following lineage between Iceberg assets:
- Catalog → Namespaces: Each catalog contains multiple namespaces
- Namespace → Tables: Each namespace contains multiple tables
- Namespace → Namespaces: Nested namespaces have parent-child relationships
- Table → Columns: Each table contains multiple columns
- Column → Columns: Nested columns have parent-child relationships (for STRUCT, LIST, and MAP types)
Assets
Atlan crawls the following Iceberg assets, each with specific metadata fields.
IcebergCatalog
Iceberg catalogs represent the top-level catalog instances that contain namespaces and tables.
| Source field | Atlan field | Description |
|---|---|---|
catalog_name | name | Catalog name |
catalog_type | icebergCatalogType | Type of catalog (for example, rest) |
uri | icebergUri | REST catalog URI |
warehouse | icebergWarehouse | Warehouse identifier |
scope | icebergScope | Access scope configuration |
total_namespaces | icebergTotalNamespaces | Total number of namespaces in the catalog |
total_tables | icebergTotalTables | Total number of tables in the catalog |
properties | icebergCatalogProperties | Catalog configuration properties |
IcebergNamespace
Namespaces represent logical containers for organizing tables within a catalog. Iceberg supports nested namespaces.
| Source field | Atlan field | Description |
|---|---|---|
namespace_str | name | Namespace name (supports nested namespaces with dot notation) |
namespace_str | icebergNamespace | Full namespace identifier |
namespace_hierarchy | icebergNamespaceHierarchy | Ordered array representing the complete namespace hierarchy path |
parent_namespace | icebergParentNamespaceQualifiedName | Parent namespace qualified name (for nested namespaces) |
IcebergTable
Iceberg tables represent the table assets with comprehensive metadata including snapshots, partitions, and properties.
| Source field | Atlan field | Description |
|---|---|---|
table_name | name | Table name |
table_uuid | icebergTableUuid | Unique identifier for the table |
location | icebergTableLocation | Storage location of the table |
current_snapshot_id | icebergCurrentSnapshotId | Current snapshot identifier |
last_updated_ms | icebergLastUpdatedMs | Last update timestamp in milliseconds |
format_version | icebergFormatVersion | Iceberg format version |
properties | icebergTableProperties | Table-level properties and configuration |
partitions | icebergPartitions | Partition specification for the table |
snapshots | icebergSnapshots | Snapshot metadata including time travel information |
IcebergColumn
Columns represent individual fields within Iceberg tables with data types, nullability, and descriptions.
| Source field | Atlan field | Description |
|---|---|---|
column_name | name | Column name |
data_type | dataType | Column data type |
nullable | isNullable | Whether the column permits null values |
description | description | Column description |
table_name | icebergTableName | Name of the table containing this column |
namespace | icebergNamespace | Namespace containing the table |
table_uuid | icebergTableUuid | Unique identifier of the table |