Skip to main content

What does Atlan crawl from Iceberg

Atlan crawls comprehensive metadata from your Iceberg catalog, including catalog information, namespaces (with nested namespace support), tables with their metadata, and column details.

Lineage

Atlan establishes the following lineage between Iceberg assets:

  • Catalog → Namespaces: Each catalog contains multiple namespaces
  • Namespace → Tables: Each namespace contains multiple tables
  • Namespace → Namespaces: Nested namespaces have parent-child relationships
  • Table → Columns: Each table contains multiple columns
  • Column → Columns: Nested columns have parent-child relationships (for STRUCT, LIST, and MAP types)

Assets

Atlan crawls the following Iceberg assets, each with specific metadata fields.

IcebergCatalog

Iceberg catalogs represent the top-level catalog instances that contain namespaces and tables.

Source fieldAtlan fieldDescription
catalog_namenameCatalog name
catalog_typeicebergCatalogTypeType of catalog (for example, rest)
uriicebergUriREST catalog URI
warehouseicebergWarehouseWarehouse identifier
scopeicebergScopeAccess scope configuration
total_namespacesicebergTotalNamespacesTotal number of namespaces in the catalog
total_tablesicebergTotalTablesTotal number of tables in the catalog
propertiesicebergCatalogPropertiesCatalog configuration properties

IcebergNamespace

Namespaces represent logical containers for organizing tables within a catalog. Iceberg supports nested namespaces.

Source fieldAtlan fieldDescription
namespace_strnameNamespace name (supports nested namespaces with dot notation)
namespace_stricebergNamespaceFull namespace identifier
namespace_hierarchyicebergNamespaceHierarchyOrdered array representing the complete namespace hierarchy path
parent_namespaceicebergParentNamespaceQualifiedNameParent namespace qualified name (for nested namespaces)

IcebergTable

Iceberg tables represent the table assets with comprehensive metadata including snapshots, partitions, and properties.

Source fieldAtlan fieldDescription
table_namenameTable name
table_uuidicebergTableUuidUnique identifier for the table
locationicebergTableLocationStorage location of the table
current_snapshot_idicebergCurrentSnapshotIdCurrent snapshot identifier
last_updated_msicebergLastUpdatedMsLast update timestamp in milliseconds
format_versionicebergFormatVersionIceberg format version
propertiesicebergTablePropertiesTable-level properties and configuration
partitionsicebergPartitionsPartition specification for the table
snapshotsicebergSnapshotsSnapshot metadata including time travel information

IcebergColumn

Columns represent individual fields within Iceberg tables with data types, nullability, and descriptions.

Source fieldAtlan fieldDescription
column_namenameColumn name
data_typedataTypeColumn data type
nullableisNullableWhether the column permits null values
descriptiondescriptionColumn description
table_nameicebergTableNameName of the table containing this column
namespaceicebergNamespaceNamespace containing the table
table_uuidicebergTableUuidUnique identifier of the table