Microsoft Dataverse crawler App
The Microsoft Dataverse Crawler app creates or reuses a Dataverse connection in Atlan, fetches entities and their attributes, and publishes them for discovery. It also fetches Dataverse relationships to publish lineage between entities and attributes, and archives assets in Atlan when they're deleted or archived in Dataverse.
This reference provides complete configuration details for the Microsoft Dataverse Crawler app. Use this page to look up field definitions, valid inputs, and behavior.
Access
The Microsoft Dataverse Crawler app isn't enabled by default. To use this app, contact Atlan support and request it be added to your tenant. Once enabled, admins or users with workflow permissions can configure and run the crawler.
Client ID
Use the application (client) ID from your Azure app registration. This value identifies the app that the crawler uses for OAuth. Find it in Azure Portal under App registrations > Overview.
Example:
123456ab-987c-12a1-1234-ab12345a1234
Client secret
Use the client secret value created for the Azure app registration. The crawler uses this value to request tokens for the app. Use the secret value itself, not the secret ID.
Example:
Abc1Z~abCD._a1CD2EF~A1-AbcDEfgHIj1kLmnop
Tenant ID
Use the tenant (directory) ID for the Microsoft Entra tenant that owns the Dataverse environment. The crawler uses it to target the correct identity provider. Find it in Azure Portal under Microsoft Entra ID > Overview.
Example:
123abc1a-1ab2-1234-1234-123456a12ab1
Dataverse URI
Provide the full Dynamics 365 URL for the Dataverse environment. Use the base environment URL that ends with .crm.dynamics.com. The crawler uses this to reach Dataverse APIs.
Example:
https://bag-mpd365-crm-sandbox.crm.dynamics.com
Connection
Choose the name of the Atlan connection created or reused by this app. This name appears in Atlan and groups all assets from the Dataverse source. The name must be unique across Dataverse connections.
Example:
dataverse-prod
Connection admins
Controls who has admin permissions on the Atlan connection. The default value adds all workspace admins. Update this to scope admin access to the right owners.
Example:
All_admins
Include entities by regex
Use a regex filter to ingest only entities that match the pattern. This filter limits the crawl scope so only matching entities and their attributes are processed.
Example:
^(Account|Contact|Lead)$
Exclude entities by regex
Use a regex filter to exclude entities that match the pattern. This filter removes matching entities and their attributes from the crawl scope.
Example:
^Test_.*