What does Atlan crawl from Dremio?

Atlan crawls metadata from your Dremio data lakehouse, covering spaces, sources, folders, physical datasets, virtual datasets, and columns.

Assets

Atlan crawls the following Dremio assets, each with specific metadata fields.

DremioSpace

Dremio Spaces represent logical workspaces where users create and organize virtual datasets.

Source field | Atlan field | Description
name | name | Space name
description | description | Space description
tags | dremioLabels | Labels/tags associated with the space
id | dremioId | Unique identifier for the space in Dremio

DremioSource

Dremio Sources represent external data connections providing access to various systems.

Source field | Atlan field | Description
name | name | Source name
description | description | Source description
tags | dremioLabels | Labels/tags associated with the source
type | dremioSourceType | Type of external source connection
config | dremioSourceConnectionConfigs | Configuration parameters for connecting to the external source
accelerationGracePeriodMs, accelerationRefreshPeriodMs, accelerationRefreshSchedule, accelerationActivePolicyType, accelerationNeverExpire, accelerationNeverRefresh | dremioSourceAccelerationSettings | Acceleration settings, including grace period, refresh settings, and policy configuration
metadataPolicy | dremioSourceMetadataPolicies | Metadata policy settings for the source
id | dremioId | Unique identifier for the source in Dremio
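
The table above maps six source-side acceleration fields onto the single Atlan field dremioSourceAccelerationSettings. The sketch below illustrates that many-to-one grouping. It is not Atlan's crawler code: the function name, the sample payload, and the choice of a plain dictionary as the stored shape are all assumptions made for illustration, since this page does not specify how the combined value is stored.

```python
# Source-side acceleration fields, as listed in the DremioSource table.
ACCELERATION_FIELDS = (
    "accelerationGracePeriodMs",
    "accelerationRefreshPeriodMs",
    "accelerationRefreshSchedule",
    "accelerationActivePolicyType",
    "accelerationNeverExpire",
    "accelerationNeverRefresh",
)


def group_acceleration_settings(source: dict) -> dict:
    """Collect whichever acceleration fields are present into one dict.

    Hypothetical helper: the real storage format is not documented here.
    """
    return {key: source[key] for key in ACCELERATION_FIELDS if key in source}


# Hypothetical source payload; the values are made up for illustration.
example_source = {
    "name": "lake",
    "type": "S3",
    "accelerationGracePeriodMs": 10800000,
    "accelerationNeverExpire": False,
}

atlan_fields = {
    "dremioSourceAccelerationSettings": group_acceleration_settings(example_source)
}
```

Fields that are absent from the source payload are simply omitted from the grouped value rather than filled with defaults.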

DremioFolder

Dremio Folders represent organizational containers within spaces or sources.

Source field | Atlan field | Description
name | name | Folder name
description | description | Folder description
tags | dremioLabels | Labels/tags associated with the folder
id | dremioId | Unique identifier for the folder in Dremio

Physical Dataset

Physical datasets represent tables from connected data sources available in Dremio.

Source field | Atlan field | Description
name | name | Physical dataset name
description | description | Physical dataset description
tags | dremioLabels | Labels/tags associated with the dataset
rowCount | rowCount | Number of rows in the dataset
columnCount | columnCount | Number of columns in the dataset
createdAt | sourceCreatedAt | When the physical dataset was created
id | dremioId | Unique identifier for the dataset in Dremio
owner | sourceOwners | Owner of the dataset

Virtual Dataset

Virtual datasets represent views created in Dremio, each defined by a SQL query rather than stored as a physical table.

Source field | Atlan field | Description
name | name | Virtual dataset name
sql | definition | SQL definition of the virtual dataset
description | description | Virtual dataset description
tags | dremioLabels | Labels/tags associated with the dataset
createdAt | sourceCreatedAt | When the virtual dataset was created
id | dremioId | Unique identifier for the dataset in Dremio
rowCount | rowCount | Number of rows in the dataset
columnCount | columnCount | Number of columns in the dataset
owner | sourceOwners | Owner of the dataset
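
Several virtual dataset fields change name between Dremio and Atlan (for example, sql becomes definition and owner becomes sourceOwners). As a rough illustration of the table above, the mapping can be written as a lookup dictionary. The helper name, the sample record, and the pass-through handling of values are assumptions; this page does not describe how Atlan transforms the values themselves.

```python
# Source field -> Atlan field, copied from the Virtual Dataset table.
VIRTUAL_DATASET_FIELD_MAP = {
    "name": "name",
    "sql": "definition",
    "description": "description",
    "tags": "dremioLabels",
    "createdAt": "sourceCreatedAt",
    "id": "dremioId",
    "rowCount": "rowCount",
    "columnCount": "columnCount",
    "owner": "sourceOwners",
}


def to_atlan_fields(record: dict, field_map: dict) -> dict:
    """Rename known source fields to their Atlan names.

    Hypothetical helper: values pass through unchanged and fields that
    are not in the map are dropped.
    """
    return {field_map[key]: value for key, value in record.items() if key in field_map}


# Hypothetical record; the values are made up for illustration.
example_record = {
    "name": "orders_view",
    "sql": "SELECT * FROM orders",
    "owner": "analyst",
    "unmappedField": 1,
}

mapped = to_atlan_fields(example_record, VIRTUAL_DATASET_FIELD_MAP)
```

The same pattern applies to the other asset types by swapping in the rows of their respective tables.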

Column

Columns represent individual fields within physical and virtual datasets.

Source field | Atlan field | Description
name | name | Column name
type | dataType | Column data type
isNullable | isNullable | Whether the column permits null values
description | description | Column description
tags | dremioLabels | Labels/tags associated with the column
id | dremioId | Unique identifier for the column in Dremio
isPrimaryKey | isPrimaryKey | Whether the column is a primary key
isPartitionColumn | isPartitionColumn | Whether the column is a partition column
ordinalPosition | ordinalPosition | Position of the column in the table
precision | precision | Precision of the column
scale | scale | Scale of the column
owner | sourceOwners | Owner of the column
createdAt | sourceCreatedAt | When the column was created