What does Atlan crawl from Amazon MSK?

Once you've crawled Amazon MSK, you can use connector-specific filters for quick asset discovery. The following filters are currently supported for these assets:

  • Topics - Message count, size (MB), partition count, and cleanup policy filters
  • Consumer groups - Member count and topic name filters

Atlan crawls and maps the following assets and properties from Amazon MSK.

Atlan currently supports only asset-level lineage between topics and consumer groups. Upstream, downstream, and column-level lineage are not supported.

Topics

Atlan maps topics from Amazon MSK to its KafkaTopic asset type.

Source property      Atlan property
Topic                name
PartitionCount       kafkaTopicPartitionsCount
ReplicationFactor    kafkaTopicReplicationFactor
segment.bytes        kafkaTopicSegmentBytes
compression.type     kafkaTopicCompressionType
cleanup.policy       kafkaLogTopicCleanupPolicy
isInternal           kafkaTopicIsInternal
sizeInBytes          kafkaTopicSizeInBytes
recordCount          kafkaTopicRecordCount
retention.ms         kafkaTopicRetentionTimeInMs
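
The topic mapping above can be expressed as a simple lookup. The following is a minimal sketch, not Atlan's implementation: the function `to_atlan_attributes` and the input dictionary are hypothetical, and only the property names come from the table.

```python
# Source-to-Atlan property names for topics, taken from the mapping table.
TOPIC_PROPERTY_MAP = {
    "Topic": "name",
    "PartitionCount": "kafkaTopicPartitionsCount",
    "ReplicationFactor": "kafkaTopicReplicationFactor",
    "segment.bytes": "kafkaTopicSegmentBytes",  # Kafka's topic-level config key
    "compression.type": "kafkaTopicCompressionType",
    "cleanup.policy": "kafkaLogTopicCleanupPolicy",
    "isInternal": "kafkaTopicIsInternal",
    "sizeInBytes": "kafkaTopicSizeInBytes",
    "recordCount": "kafkaTopicRecordCount",
    "retention.ms": "kafkaTopicRetentionTimeInMs",
}


def to_atlan_attributes(source_topic: dict) -> dict:
    """Rename known source keys to Atlan property names; ignore unknown keys."""
    return {
        TOPIC_PROPERTY_MAP[key]: value
        for key, value in source_topic.items()
        if key in TOPIC_PROPERTY_MAP
    }


# Hypothetical topic description, for illustration only.
example_topic = {
    "Topic": "orders",
    "PartitionCount": 6,
    "cleanup.policy": "compact",
    "unmapped.setting": "ignored",
}
attrs = to_atlan_attributes(example_topic)
print(attrs)
```

Keys that do not appear in the table are dropped rather than passed through, so the result contains only properties that Atlan is documented to map.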

Consumer groups

Atlan maps consumer groups from Amazon MSK to its KafkaConsumerGroup asset type.

Did you know?

Consumer groups typically appear only in streaming scenarios. If a topic is not being actively consumed, Amazon MSK purges the consumer group. As a result, a consumer group that is inactive while the workflow runs in Atlan is not cataloged as an asset.

Source property    Atlan property
GROUP              name
memberCount        kafkaConsumerGroupMemberCount
ReplicationFactor  kafkaTopicReplicationFactor
topic_names        kafkaTopicNames
TOPIC              kafkaConsumerGroupTopicConsumptionProperties.topicName
PARTITION          kafkaConsumerGroupTopicConsumptionProperties.topicPartition
LAG                kafkaConsumerGroupTopicConsumptionProperties.topicLag
CURRENT-OFFSET     kafkaConsumerGroupTopicConsumptionProperties.topicCurrentOffset
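
The uppercase source properties (GROUP, TOPIC, PARTITION, LAG, CURRENT-OFFSET) match the column headers printed by Kafka's consumer group describe command. As a rough illustration of how such rows relate to the nested consumption properties, here is a minimal sketch. It assumes whitespace-separated text with a header row; the sample data and the function `parse_describe_output` are hypothetical, and this is not a description of how Atlan actually reads from Amazon MSK.

```python
# Column header -> field inside kafkaConsumerGroupTopicConsumptionProperties,
# taken from the mapping table.
CONSUMPTION_FIELD_MAP = {
    "TOPIC": "topicName",
    "PARTITION": "topicPartition",
    "LAG": "topicLag",
    "CURRENT-OFFSET": "topicCurrentOffset",
}


def parse_describe_output(text: str) -> list:
    """Turn header-plus-rows text into one consumption-properties dict per row."""
    lines = [line for line in text.strip().splitlines() if line.strip()]
    header = lines[0].split()
    rows = []
    for line in lines[1:]:
        values = dict(zip(header, line.split()))
        # Values are kept as strings; real output may contain "-" for missing data.
        rows.append(
            {atlan: values[source] for source, atlan in CONSUMPTION_FIELD_MAP.items()}
        )
    return rows


# Made-up sample output, for illustration only.
SAMPLE = """
GROUP    TOPIC   PARTITION  CURRENT-OFFSET  LOG-END-OFFSET  LAG
billing  orders  0          120             125             5
billing  orders  1          98              98              0
"""
rows = parse_describe_output(SAMPLE)
print(rows)
```

Each row yields one entry, so a consumer group reading two partitions of a topic produces two sets of consumption properties.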